Skip to content

quincy: osd/PeeringState: fix missed recheck_readable from laggy#49304

Merged
yuriw merged 1 commit intoceph:quincyfrom
rzarzynski:wip-56736-quincy-v2
Jun 5, 2023
Merged

quincy: osd/PeeringState: fix missed recheck_readable from laggy#49304
yuriw merged 1 commit intoceph:quincyfrom
rzarzynski:wip-56736-quincy-v2

Conversation

@rzarzynski
Copy link
Contributor

Previously, the first pg_lease_ack_t after becoming laggy would not trigger recheck_readable. However, every other ack would trigger it. The logic is inverted, causing unnecessarily long laggy PG state.

Fixes: 3bb8a72 (osd: requeue ops when PG is no longer laggy) Fixes: https://tracker.ceph.com/issues/53806
Signed-off-by: 胡玮文 huww98@outlook.com
(cherry picked from commit caeca39)


This PR is minimized version of #47901 which got reverted in #48104. It looks the failures were caused by the first, not essential commit (see https://tracker.ceph.com/issues/57546#note-9), so let's try the essential one alone.

Contribution Guidelines

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)
Show available Jenkins commands
  • jenkins retest this please
  • jenkins test classic perf
  • jenkins test crimson perf
  • jenkins test signed
  • jenkins test make check
  • jenkins test make check arm64
  • jenkins test submodules
  • jenkins test dashboard
  • jenkins test dashboard cephadm
  • jenkins test api
  • jenkins test docs
  • jenkins render docs
  • jenkins test ceph-volume all
  • jenkins test ceph-volume tox
  • jenkins test windows

Previously, the first `pg_lease_ack_t` after becoming laggy would not
trigger `recheck_readable`. However, every other ack would trigger it.
The logic is inverted, causing unnecessarily long laggy PG state.

Fixes: 3bb8a72 (osd: requeue ops when PG is no longer laggy)
Fixes: https://tracker.ceph.com/issues/53806
Signed-off-by: 胡玮文 <huww98@outlook.com>
(cherry picked from commit caeca39)
@rzarzynski rzarzynski requested a review from a team as a code owner December 7, 2022 14:59
@github-actions github-actions bot added the core label Dec 7, 2022
@github-actions github-actions bot added this to the quincy milestone Dec 7, 2022
@neha-ojha neha-ojha requested a review from amathuria December 7, 2022 15:11
@rzarzynski rzarzynski changed the title osd/PeeringState: fix missed recheck_readable from laggy quincy: osd/PeeringState: fix missed recheck_readable from laggy Dec 7, 2022
rzarzynski added a commit to rzarzynski/ceph that referenced this pull request Dec 21, 2022
This reverts commit 9337fa6.

It's a `main`'s counterpart of quincy's PRs ceph#47901 and ceph#49304.

Fixes: https://tracker.ceph.com/issues/57546
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
rzarzynski added a commit to rzarzynski/ceph that referenced this pull request Dec 21, 2022
This reverts commit 9337fa6.

It's a `main`'s counterpart of quincy's PRs ceph#48104 and ceph#49304.

Fixes: https://tracker.ceph.com/issues/57546
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
@ljflores
Copy link
Member

@rzarzynski does this need qa?

@rzarzynski
Copy link
Contributor Author

@ljflores: I think so. Just added the label.

@ljflores
Copy link
Member

ljflores commented Jun 1, 2023

@yuriw yuriw merged commit 0cea26b into ceph:quincy Jun 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants