osdc: Objecter::linger_by_cookie() for safe cast from uint64 by cbodley · Pull Request #65698 · ceph/ceph

cbodley · 2025-09-26T22:26:17Z

a linger_ops_set was added for Objecter::handle_watch_notify() as a safety check before casting uint64_t cookie to LingerOp* and dereferencing it

neorados also made use of this set through Objecter::is_valid_watch() checks. however, this approach was still susceptible to use-after-free, because the callers didn't preserve a LingerOp reference between this check and its use - and the Objecter lock is dropped in between. in addition, neorados::RADOS::unwatch_() was missing its check for is_valid_watch()

librados did not make use of this is_valid_watch() at all, so was casting cookies directly to LingerOp* and dereferencing. this results in use-after-free for any cookies invalidated by linger_cancel() - for example when called by CB_DoWatchError

replace is_valid_watch() with a linger_by_cookie() function that

* performs the validity check with linger_ops_set,
* safely reinterpret_casts the cookie to LingerOp*, and
* returns a reference to the caller via intrusive_ptr<LingerOp>

librados::IoCtxImpl::watch_check(), unwatch() and aio_unwatch() now call linger_by_cookie(), so have to handle the null case by returning -ENOTCONN (this matches neorados' existing behavior)
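
The lookup described above can be illustrated with a minimal, dependency-free sketch. This is not the Ceph implementation: `std::shared_ptr` stands in for `boost::intrusive_ptr`, a single `std::map` plays the role of `linger_ops_set` plus the internal reference, and the simplified `Objecter`, `LingerOp`, and free function `unwatch()` are hypothetical stand-ins for the real classes.

```cpp
#include <cerrno>
#include <cstdint>
#include <map>
#include <memory>
#include <mutex>

// Stand-in for Objecter::LingerOp. The real type is intrusively
// reference-counted; std::shared_ptr is an assumption of this sketch.
struct LingerOp {
  bool canceled = false;
};

// Simplified, hypothetical model of the Objecter's linger bookkeeping.
class Objecter {
  std::mutex lock;
  // Keys play the role of linger_ops_set; values hold the internal reference.
  std::map<LingerOp*, std::shared_ptr<LingerOp>> linger_ops;

 public:
  // Registers a new op and returns its address as the opaque cookie.
  uint64_t linger_register() {
    auto op = std::make_shared<LingerOp>();
    std::lock_guard<std::mutex> l(lock);
    linger_ops.emplace(op.get(), op);
    return reinterpret_cast<uint64_t>(op.get());
  }

  // Validates the cookie before anything dereferences it, and returns a
  // reference of the caller's own so the op stays alive after the lock drops.
  std::shared_ptr<LingerOp> linger_by_cookie(uint64_t cookie) {
    std::lock_guard<std::mutex> l(lock);
    auto it = linger_ops.find(reinterpret_cast<LingerOp*>(cookie));
    if (it == linger_ops.end()) {
      return nullptr;  // unknown or already-canceled cookie
    }
    return it->second;
  }

  // Marks the op canceled and drops the internal reference.
  void linger_cancel(std::shared_ptr<LingerOp> op) {
    std::lock_guard<std::mutex> l(lock);
    op->canceled = true;
    linger_ops.erase(op.get());
  }
};

// Caller-side pattern: a null result maps to -ENOTCONN.
int unwatch(Objecter& objecter, uint64_t cookie) {
  auto op = objecter.linger_by_cookie(cookie);
  if (!op) {
    return -ENOTCONN;
  }
  objecter.linger_cancel(op);
  return 0;
}
```

In this model a second `unwatch()` with the same cookie, or a call with a cookie that was never registered, returns `-ENOTCONN` without ever dereferencing the pointer value.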

Fixes: https://tracker.ceph.com/issues/72771

phlogistonjohn · 2025-09-29T16:46:08Z

jenkins test container make check

cbodley · 2025-10-02T16:33:03Z

> librados did not make use of this is_valid_watch() at all, so was casting cookies directly to LingerOp* and dereferencing. this results in use-after-free for any cookies invalidated by linger_cancel() - for example when called by CB_DoWatchError

i was mistaken about this part, as CB_DoWatchError doesn't call linger_cancel(). the valgrind report in https://tracker.ceph.com/issues/72771#note-2 shows ~CB_DoWatchError dropping the last ref, but it isn't clear what if anything had called linger_cancel() before

JonBailey1993 · 2025-11-19T11:03:31Z

Rados approved: https://tracker.ceph.com/projects/rados/wiki/MAIN#httpstrackercephcomissues73711

idryomov · 2025-12-11T12:57:05Z

src/librados/IoCtxImpl.cc

-  Objecter::LingerOp *linger_op = reinterpret_cast<Objecter::LingerOp*>(cookie);
+  boost::intrusive_ptr linger_op = objecter->linger_by_cookie(cookie);
+  if (!linger_op) {
+    return -ENOTCONN;


Hi @cbodley,

This introduced a regression in RBD: librbd asserts that IoCtx::aio_unwatch() returns 0 because that has been the expected behavior for years, so we are now experiencing sporadic crashes. If the linger op is canceled, we expect to get ENOTCONN via the callback, not from the initiating function. Having the initiating function fail this way is prone to inconsistent behavior: in the scenario where the removal of the object on which the watch is established races with a call to IoCtx::aio_unwatch(), one could get the error from either the initiating function or the callback depending on the outcome of the race. Writing code that is prepared to handle both cases is suboptimal and also isn't aligned with neorados, which always delivers the error via the callback.

> librados did not make use of this is_valid_watch() at all, so was casting cookies directly to LingerOp* and dereferencing. this results in use-after-free for any cookies invalidated by linger_cancel() - for example when called by CB_DoWatchError

I'm not convinced there was a potential for a use-after-free in IoCtx::unwatch() or IoCtx::aio_unwatch() for a cookie that had gotten invalidated in response to a watch error -- assuming the caller passed a cookie previously obtained from IoCtx::watch(), of course. My understanding has been that Objecter::linger_register() returns a linger op with two references, one of which is intended for the caller of IoCtx::watch() (i.e. the user of the public API). That reference remains alive until the user performs the unwatch. A linger op that is canceled just gets marked as such (LingerOp::canceled bool) and loses the internal reference -- nothing should happen to the external reference. If that is correct, an extra validity check isn't strictly required but is just a nice-to-have to catch totally bogus cookies. Am I missing something?
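
The two-reference model described in this comment can be sketched as follows. This is only an illustration of the stated understanding, not the Objecter code: the hand-rolled `get()`/`put()` counter, the `live_ops` instrumentation variable, the early return in `linger_cancel()`, and the free functions are all assumptions made for the example.

```cpp
#include <atomic>

int live_ops = 0;  // instrumentation for this sketch only

// Stand-in for Objecter::LingerOp with a hand-rolled intrusive refcount.
struct LingerOp {
  std::atomic<int> nref{1};
  bool canceled = false;

  LingerOp() { ++live_ops; }
  ~LingerOp() { --live_ops; }

  void get() { ++nref; }
  void put() {
    if (--nref == 0) {
      delete this;
    }
  }
};

// Returns an op carrying two references: one internal, one for the API user.
LingerOp* linger_register() {
  LingerOp* op = new LingerOp;  // internal reference
  op->get();                    // external reference for the caller
  return op;
}

// Marks the op canceled and drops only the internal reference.
// Skipping an already-canceled op is an assumption of this sketch.
void linger_cancel(LingerOp* op) {
  if (op->canceled) {
    return;
  }
  op->canceled = true;
  op->put();
}

// The API user's unwatch: cancel if still active, then drop the external ref.
void unwatch(LingerOp* op) {
  linger_cancel(op);
  op->put();
}
```

Under this model the op survives a cancel triggered by a watch error, because the external reference is only released by the user's single `unwatch()`. Calling `unwatch()` a second time on the same pointer would touch freed memory, which is the case a cookie validity check guards against.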

cbodley · 2025-12-11

> I'm not convinced there was a potential for a use-after-free

sorry for the confusion, i was mistaken about that comment and tried to clarify in #65698 (comment)

> If that is correct, an extra validity check isn't strictly required but just a nice to have to catch totally bogus cookies.

this includes cookies that the client already unwatched, so ioctx.unwatch(cookie); ioctx.unwatch(cookie); can lead to undefined behavior

in rgw, our librados::WatchCtx2::handle_error() override schedules calls to reinitialize the watch with ioctx.unwatch() -> ioctx.watch(). it doesn't look like anything prevents that unwatch from racing with our shutdown code that also calls unwatch, so there are still bugs to fix here

regardless, i'd argue that correct use of watch/notify is subtle enough that it's worth detecting these cases in librados to avoid dereferencing invalid memory

apologies for the regression. if you find my argument convincing, i can address the error handling issue in aio_unwatch()

idryomov · 2025-12-11

Sure, I don't see anything wrong with protecting against duplicate unwatch calls or similar mishaps. I just wanted to ensure that my understanding of the existing code was correct.

From the RBD POV, what we need to move past this is IoCtx::aio_unwatch() delivering this early ENOTCONN via the callback, just like its neorados counterpart. Is that what you have in mind?

cbodley · 2025-12-11

> From the RBD POV, what we need to move past this is IoCtx::aio_unwatch() delivering this early ENOTCONN via the callback, just like its neorados counterpart. Is that what you have in mind?

👍 on it
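
The difference between the two error-delivery styles discussed in this exchange can be shown with a small sketch. It is a hypothetical model, not librados: `Client`, `Completion`, `valid_cookies`, and `aio_unwatch_early_error()` are invented names, and the completion runs inline here, whereas a real asynchronous API would complete it later on another thread.

```cpp
#include <cerrno>
#include <cstdint>
#include <functional>
#include <set>

// Hypothetical completion type: receives the operation's result code.
using Completion = std::function<void(int)>;

// Hypothetical client holding the set of currently valid watch cookies.
struct Client {
  std::set<uint64_t> valid_cookies;

  // Early-error style: an invalid cookie fails the initiating call and the
  // completion never fires, so callers must handle two error paths.
  int aio_unwatch_early_error(uint64_t cookie, Completion on_done) {
    if (valid_cookies.erase(cookie) == 0) {
      return -ENOTCONN;
    }
    on_done(0);
    return 0;
  }

  // Callback style: the initiating call always returns 0 and the error,
  // if any, is delivered through the completion.
  int aio_unwatch(uint64_t cookie, Completion on_done) {
    int r = (valid_cookies.erase(cookie) == 0) ? -ENOTCONN : 0;
    on_done(r);
    return 0;
  }
};
```

With the callback style, a caller that asserts on a zero return from the initiating function keeps working, and the outcome of a race with cancellation is observed in exactly one place.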

cbodley added 2 commits September 26, 2025 17:24

librados: linger callbacks hold a reference to LingerOp

2455a71

preserve a reference to LingerOp in case their invocation races with another linger_cancel()

Signed-off-by: Casey Bodley <cbodley@redhat.com>

github-actions bot added the core label Sep 26, 2025

cbodley added the bug-fix label Sep 26, 2025

adamemerson self-assigned this Sep 26, 2025

cbodley marked this pull request as ready for review September 29, 2025 16:57

cbodley requested a review from a team as a code owner September 29, 2025 16:57

ivancich added the needs-review label Oct 2, 2025

cbodley requested review from adamemerson and idryomov October 2, 2025 14:19

ivancich approved these changes Oct 9, 2025

cbodley added needs-qa rgw labels Oct 23, 2025

SrinivasaBharath added the wip-bharath7-testing label Nov 4, 2025

anrao19 added wip-anrao3-testing wip-anrao1-testing and removed wip-anrao3-testing labels Nov 12, 2025

ljflores merged commit 0908b59 into ceph:main Nov 21, 2025
17 of 20 checks passed

anrao19 removed the wip-anrao1-testing label Dec 3, 2025

idryomov reviewed Dec 11, 2025

idryomov mentioned this pull request Dec 12, 2025

librados: aio_unwatch() delivers ENOTCONN to AioCompletion #66610

Merged

ljflores merged 2 commits into ceph:main from cbodley:wip-72771

9 participants
