Skip to content

mds: use SimpleLock::WAIT_ALL for wait mask#67496

Merged
vshankar merged 1 commit intoceph:mainfrom
batrick:i75143
Mar 17, 2026
Merged

mds: use SimpleLock::WAIT_ALL for wait mask#67496
vshankar merged 1 commit intoceph:mainfrom
batrick:i75143

Conversation

@batrick
Copy link
Member

@batrick batrick commented Feb 24, 2026

The Locker uses has_any_waiter for a particular lock to evaluate whether to nudge the log. For the squid, tentacle, and main branches, this larger bit mask (all 64 bits) will cause this to wrongly return true for other locks which have waiters. The side-effect of waking requests spuriously is undesirable but should not affect performance significantly.

For reef and older releases, using std::numeric_limits<uint64_t>::max() in has_any_waiter() causes a bitwise overflow that sets the wait-queue search bound impossibly high, resulting in the method always incorrectly returning false. This results in nudge_log never nudging the log!

Fixes: db5c9dc
Fixes: https://tracker.ceph.com/issues/75141

Contribution Guidelines

  • To sign and title your commits, please refer to Submitting Patches to Ceph.

  • If you are submitting a fix for a stable branch (e.g. "quincy"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.

  • When filling out the below checklist, you may click boxes directly in the GitHub web UI. When entering or editing the entire PR message in the GitHub web UI editor, you may also select a checklist item by adding an x between the brackets: [x]. Spaces and capitalization matter when checking off items this way.

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)
Show available Jenkins commands

You must only issue one Jenkins command per-comment. Jenkins does not understand
comments with more than one command.

@github-actions github-actions bot added the cephfs Ceph File System label Feb 24, 2026
@batrick
Copy link
Member Author

batrick commented Feb 24, 2026

The reef fix is tracked in #67495

@batrick batrick requested a review from a team February 24, 2026 20:31
@batrick
Copy link
Member Author

batrick commented Feb 25, 2026

The following tests FAILED:
	228 - unittest_peeringstate (Failed)

@batrick
Copy link
Member Author

batrick commented Feb 25, 2026

jenkins test make check

@batrick
Copy link
Member Author

batrick commented Feb 25, 2026

jenkins test make check arm64

The Locker uses has_any_waiter for a particular lock to evaluate whether
to nudge the log. For the squid, tentacle, and main branches, this
larger bit mask (all 64 bits) will cause this to wrongly return true for
other locks which have waiters. The side-effect of waking requests
spuriously is undesirable but should not affect performance
significantly.

For reef and older releases, using std::numeric_limits<uint64_t>::max()
in has_any_waiter() causes a bitwise overflow that sets the wait-queue
search bound impossibly high, resulting in the method always incorrectly
returning false. This results in nudge_log never nudging the log!

Fixes: db5c9dc
Fixes: https://tracker.ceph.com/issues/75141
Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
Copy link
Contributor

@vshankar vshankar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fantastic find!

@vshankar
Copy link
Contributor

vshankar commented Mar 4, 2026

This PR is under test in https://tracker.ceph.com/issues/75334.

vshankar added a commit to vshankar/ceph that referenced this pull request Mar 16, 2026
* refs/pull/67496/head:

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Copy link
Contributor

@vshankar vshankar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@vshankar vshankar merged commit 6896328 into ceph:main Mar 17, 2026
13 checks passed
@github-actions
Copy link

This is an automated message by src/script/redmine-upkeep.py.

I have resolved the following tracker ticket due to the merge of this PR:

No backports are pending for the ticket. If this is incorrect, please update the tracker
ticket and reset to Pending Backport state.

Update Log: https://github.com/ceph/ceph/actions/runs/23188475930

@github-actions
Copy link

This is an automated message by src/script/redmine-upkeep.py.

I found one or more Fixes: tags in the commit messages in

git log 6896328a86a99d04bd4d16a45146bb3da2bec883^..6896328a86a99d04bd4d16a45146bb3da2bec883

The referenced tickets are:

Those tickets do not reference this merged Pull Request. If this Pull Request merge resolves any of those tickets, please update the "Pull Request ID" field on each ticket. A future run of this script will appropriately update them.

Update Log: https://github.com/ceph/ceph/actions/runs/23188475930

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants