Skip to content

rgw/admin: Fix assert on datalog list of invalid shard#62770

Merged
adamemerson merged 8 commits intoceph:mainfrom
adamemerson:wip-70882
Aug 7, 2025
Merged

rgw/admin: Fix assert on datalog list of invalid shard#62770
adamemerson merged 8 commits intoceph:mainfrom
adamemerson:wip-70882

Conversation

@adamemerson
Copy link
Contributor

@adamemerson adamemerson commented Apr 10, 2025

Fixes: https://tracker.ceph.com/issues/70882

Contribution Guidelines

  • To sign and title your commits, please refer to Submitting Patches to Ceph.

  • If you are submitting a fix for a stable branch (e.g. "quincy"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.

  • When filling out the below checklist, you may click boxes directly in the GitHub web UI. When entering or editing the entire PR message in the GitHub web UI editor, you may also select a checklist item by adding an x between the brackets: [x]. Spaces and capitalization matter when checking off items this way.

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)
Show available Jenkins commands

@adamemerson adamemerson requested a review from a team as a code owner April 10, 2025 21:07
@github-actions github-actions bot added the rgw label Apr 10, 2025
@adamemerson adamemerson requested a review from cbodley April 11, 2025 16:12
Copy link
Contributor

@cbodley cbodley left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i like the run_coro() stuff 👍

@cbodley
Copy link
Contributor

cbodley commented Apr 25, 2025

https://jenkins.ceph.com/job/ceph-pull-requests-arm64/73027/

run-cli-tests .............................***Failed

https://tracker.ceph.com/issues/71027

@cbodley
Copy link
Contributor

cbodley commented Apr 25, 2025

jenkins test make check arm64

@github-actions
Copy link

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

@NitzanMordhai
Copy link
Contributor

Rados approved: tracker.ceph.com/projects/rados/wiki/MAIN#httpstrackercephcomissues71678

@NitzanMordhai
Copy link
Contributor

jenkins test make check arm64

@SrinivasaBharath
Copy link
Contributor

jenkins test make check arm64

@cbodley
Copy link
Contributor

cbodley commented Jul 2, 2025

thanks @NitzanMordhai @SrinivasaBharath for testing, but please don't merge this until it's been through the rgw suite

@adamemerson
Copy link
Contributor Author

jenkins test make check arm64

@ivancich
Copy link
Member

This seems to have failed a QA run involving DataLogBulky.TestSemBulky.

2025-07-15T06:55:03.021 INFO:tasks.workunit.client.0.smithi081.stdout:[ RUN ] DataLogBulky.TestSemBulky
2025-07-15T06:55:06.034 INFO:tasks.workunit.client.0.smithi081.stdout:unknown file: Failure
2025-07-15T06:55:06.035 INFO:tasks.workunit.client.0.smithi081.stdout:C++ exception with description "1 is not a valid shard. Valid shards are integers in [0, 1): Invalid argument [generic:22]" thrown in the test body.
2025-07-15T06:55:06.035 INFO:tasks.workunit.client.0.smithi081.stdout:
2025-07-15T06:55:06.035 INFO:tasks.workunit.client.0.smithi081.stdout:[ FAILED ] DataLogBulky.TestSemBulky (3014 ms)

see: https://qa-proxy.ceph.com/teuthology/anuchaithra-2025-07-15_05:47:12-rgw-wip-anrao1-testing-2025-07-14-1609-distro-default-smithi/8388018/teuthology.log

@adamemerson
Copy link
Contributor Author

@anrao19 Fixed! Could you give this another go, please?

@anrao19
Copy link
Contributor

anrao19 commented Jul 17, 2025

@anrao19 Fixed! Could you give this another go, please?

sure @adamemerson

@anrao19
Copy link
Contributor

anrao19 commented Jul 17, 2025

jenkins test make check arm64

@anrao19
Copy link
Contributor

anrao19 commented Jul 22, 2025

@adamemerson , please find the result : https://pulpito.ceph.com/?branch=wip-anrao3-testing-2025-07-17-1157 there are some failures

Fixes: https://tracker.ceph.com/issues/70882
Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
@github-actions
Copy link

github-actions bot commented Aug 6, 2025

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

I had been thinking of list and trim as purely internal interfaces,
but they are called through HTTP and thus need to be prepared for bad
input.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Didn't include `associated_cancellation_slot.hpp`.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Asio does not have nearly as many actual explicit concepts one can use
as one might like.

And there's no reason we might not want our own asynchrony-related concepts.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Reimplement with `initiate` rather than the old style. This
necessitates getting rid of the old `async::Completion` in anything
that was calling it, and other changes.

Also, use disposition for error handling.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
A convenience function for turning coroutines that return values and
use exceptions, `error_code`, or similar into `int`-returning
functions that take references to out parameters.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
This avoids having two entry points with different error checking
preparation, etc. to get out of sync or have a fix get forgotten.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Easier to debug.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
@adamemerson
Copy link
Contributor Author

jenkins test api

@adamemerson
Copy link
Contributor Author

jenkins test make check arm64

@adamemerson adamemerson removed the mon label Aug 7, 2025
@adamemerson adamemerson merged commit 3f96a23 into ceph:main Aug 7, 2025
13 checks passed
while (!version_requests.empty()) {
ceph::async::post(std::move(version_requests.begin()->second),
monc_errc::session_reset, 0, 0);
asio::dispatch(asio::append(std::move(version_requests.begin()->second),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the switch from post() to dispatch() here is a bug -- it can cause the MonClient to lock up. I have proposed a fix in #65177.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants