
octopus: rgw: resolve empty ordered bucket listing results w/ CLS filtering *and* bucket index list produces incorrect result when non-ascii entries#45088

Merged
yuriw merged 7 commits into ceph:octopus from dvanders:wip-52076-octopus on May 9, 2022

Conversation

@dvanders
Contributor

@dvanders dvanders commented Feb 18, 2022

A recent PR that helped address the issue of non-ascii plain entries
didn't cover all the bases, allowing I/O errors to be produced in some
circumstances during a bucket index list (i.e., `radosgw-admin bi list
...`).

This fixes those issues and does some additional clean-up.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit e714f0d)
(cherry picked from commit d3d8df7)
When using asynchronous (concurrent) IO for bucket index requests,
two int ids are in use that must be kept separate -- the shard id and
the request id. In many cases they're the same -- shard 0 gets request
0, and so forth.

But in preparation for re-requests, those ids can diverge, so that,
say, request 13 maps to shard 2. The existing code maintained the OIDs
that went with each request. This PR maintains the shard id as
well. Documentation has been beefed up to help future developers
navigate this.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 9606346)

Conflicts:
	src/cls/rgw/cls_rgw_client.cc
	src/cls/rgw/cls_rgw_client.h

In all cases I took the patch code without modification. The conflicts
were all cases of std:: qualification or auto types.
When doing an asynchronous/concurrent bucket index operation against
multiple bucket index shards, a special error code is set aside to
indicate that an "advancing" retry of a/some shard(s) is necessary. In
that case another asynchronous call is made on the indicated shard(s)
from the client (i.e., CLSRGWConcurrentIO).  It is up to the subclass
of CLSRGWConcurrentIO to handle the retry such that it "advances" and
simply doesn't get stuck, looping forever.

The retry functionality only works when the "need_multiple_rounds"
functionality is not in use.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 5d28307)

Conflicts:
	src/cls/rgw/cls_rgw_client.cc
	src/cls/rgw/cls_rgw_client.h

Resolved by taking the patch version -- all conflicts were cases of auto types and std:: qualification.
A previous PR moved much of the filtering that's part of bucket
listing to the CLS layer. One unanticipated result was that it is now
possible for a call to return 0 entries. In such a case we want to
retry the call with the marker moved forward (i.e., advanced),
repeatedly if necessary, in order to either retrieve some entries or
to hit the end of the entries. This PR adds that functionality.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 423c183)

Conflicts:
	src/cls/rgw/cls_rgw_ops.h
s/ceph::buffer::list/bufferlist/g
@dvanders dvanders added this to the octopus milestone Feb 18, 2022
@dvanders dvanders added the rgw label Feb 18, 2022
@dvanders
Contributor Author

This includes the single commit from #44858.

@dvanders dvanders requested a review from ivancich February 18, 2022 21:33
When "bucket index list" traverses the different regions in the bucket
index assembling the output, it miscalculates how many entries to ask
for at one point. This fixes that.

This fixes the previous commit, "rgw: bucket index list can produce I/O errors".

Credit for finding this bug goes to Soumya Koduri <skoduri@redhat.com>.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit aa76051)
@dvanders
Contributor Author

@ivancich as mentioned in https://tracker.ceph.com/issues/51462#note-7 we started to get bucket listing problems after upgrading to octopus. So I'm trying to collect here the backports needed to bring octopus up to the state of the art in cls_rgw bucket listing. Does this look complete and safe to try?

@dvanders dvanders changed the title octopus: rgw: resolve empty ordered bucket listing results w/ CLS filtering octopus: rgw: resolve empty ordered bucket listing results w/ CLS filtering *and* bucket index list produces incorrect result when non-ascii entries Feb 23, 2022
@dvanders
Contributor Author

> @ivancich as mentioned in https://tracker.ceph.com/issues/51462#note-7 we started to get bucket listing problems after upgrading to octopus. So I'm trying to collect here the backports needed to bring octopus up to the state of the art in cls_rgw bucket listing. Does this look complete and safe to try?

We tested this PR on our cluster with bucket listing. It fixes the problem!

@cbodley
Contributor

cbodley commented Feb 28, 2022

https://pulpito.ceph.com/yuriw-2022-02-24_22:51:58-rgw-wip-yuri10-testing-2022-02-24-1329-octopus-distro-default-smithi/ shows 3 jobs failing in ceph_test_cls_rgw:

```
[ RUN      ] cls_rgw.bi_list
unknown file: Failure
C++ exception with description "std::bad_alloc" thrown in the test body.
[  FAILED  ] cls_rgw.bi_list (83 ms)
```

the exception is thrown in the cls_rgw_client code (not the cls_rgw part running in the osd). in the past, i've seen bad_alloc errors from logic errors in decode, like when it tries to decode a string, finds a really big length prefix, and tries to allocate that much memory. but i don't see any obvious mistakes in rgw_cls_list_ret here

@cbodley
Contributor

cbodley commented Feb 28, 2022

so far it's unclear whether this also happens in the pacific backport, we didn't get a clean run there due to issues around centos 8 eol https://pulpito.ceph.com/yuriw-2022-02-24_20:49:23-rgw-wip-yuri2-testing-2022-02-23-1524-pacific-distro-default-smithi/

@dvanders
Contributor Author

dvanders commented Feb 28, 2022

@cbodley I think we're missing this: 1bf0581

Can we re-qa it with that?
I'll push it to the pacific PR too.

Make sure marker is cleared. Put end-of-list check inside the
conditional with the rest of the test. Add some additional testing.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 1bf0581)
@dvanders
Contributor Author

dvanders commented Mar 1, 2022

jenkins retest this please

@cbodley
Contributor

cbodley commented Mar 1, 2022

thanks @dvanders, worth a shot!

i tried reproducing this in an old focal vm, but the test passes consistently there. so i guess it's somehow specific to centos?

@dvanders
Contributor Author

dvanders commented Mar 1, 2022

jenkins test api

@GillesMocellin

Argh ! v15.2.16 is just released, and this PR is not in ;-(

@cbodley
Contributor

cbodley commented Mar 3, 2022

> Argh ! v15.2.16 is just released, and this PR is not in ;-(

@GillesMocellin i'm afraid that Ceph no longer has much of a backport team, and rgw has not been getting timely backports as a result. if you're interested in getting involved, you can find some resources in https://tracker.ceph.com/projects/ceph-releases/wiki and https://github.com/ceph/ceph/blob/master/SubmittingPatches-backports.rst

(and thanks to @dvanders for working on this one)

Fix bugs surrounding calculation of number of entries returned and
whether the end of a listing range has been reached.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
@dvanders
Contributor Author

dvanders commented Mar 9, 2022

@ivancich I have added the missing commit from #42836 here now too.
The logging changes don't apply cleanly so I'll leave them out unless you insist.

@cbodley
Contributor

cbodley commented Mar 9, 2022

@yuriw sorry, i had requested a rerun of this PR but a new commit was added since then. this applies to #45087 also

@ivancich
Member

ivancich commented Mar 9, 2022

> @ivancich I have added the missing commit from #42836 here now too. The logging changes don't apply cleanly so I'll leave them out unless you insist.

I agree, the logging changes are not that important. Thank you, @dvanders !

@dvanders
Contributor Author

jenkins test api

@yuriw yuriw modified the milestones: octopus, pacific Mar 16, 2022
@yuriw yuriw merged commit 3bdf9a2 into ceph:octopus May 9, 2022