Skip to content

quincy: osd/PeeringState: fix missed recheck_readable from laggy#47901

Merged
yuriw merged 2 commits intoceph:quincyfrom
amathuria:wip-56736-quincy
Sep 7, 2022
Merged

quincy: osd/PeeringState: fix missed recheck_readable from laggy#47901
yuriw merged 2 commits intoceph:quincyfrom
amathuria:wip-56736-quincy

Conversation

@amathuria
Copy link
Contributor

backport tracker: https://tracker.ceph.com/issues/56736


backport of #44499
parent tracker: https://tracker.ceph.com/issues/53806

this backport was staged using ceph-backport.sh version 16.0.0.6848
find the latest version at https://github.com/ceph/ceph/blob/main/src/script/ceph-backport.sh

We should not have duplicated OSD ID in `acting`. So the loop would
execute once anyway.

Signed-off-by: 胡玮文 <huww98@outlook.com>
(cherry picked from commit 9337fa6)
Previously, the first `pg_lease_ack_t` after becoming laggy would not
trigger `recheck_readable`. However, every other ack would trigger it.
The logic is inverted, causing unnecessarily long laggy PG state.

Fixes: 3bb8a72 (osd: requeue ops when PG is no longer laggy)
Fixes: https://tracker.ceph.com/issues/53806
Signed-off-by: 胡玮文 <huww98@outlook.com>
(cherry picked from commit caeca39)
@amathuria amathuria requested a review from a team as a code owner September 1, 2022 13:39
@amathuria amathuria added this to the quincy milestone Sep 1, 2022
@amathuria amathuria added the core label Sep 1, 2022
@rzarzynski
Copy link
Contributor

The following tests FAILED:
	 31 - run-rbd-unit-tests-1.sh (Timeout)

@rzarzynski
Copy link
Contributor

jenkins test make check

@NitzanMordhai
Copy link
Contributor

Unrelated failure:

  1. 7009609\7009612\7009615 \7009621\7009622 \7009623\7009624 \7009628\7009631\7009635\7009637 - Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F22.04%2Fx86_64&sha1=fa057dc6789b82a9e40ab5fe39d555445fdd68f1
  2. 7009610\7009625\ 7009633- The CustomResourceDefinition "installations.operator.tigera.io" is invalid: metadata.annotations: Too long: must have at most 262144 bytes
  3. 7009611\7009629 \ - expected valgrind issues and found none
  4. 7009619\7009627 \ - Failed command: apt-get update: E: The repository 'https://download.ceph.com/debian-octopus jammy Release' does not have a Release file
  5. 7009620 -Updating cephclusters/rook-ceph is forbidden
  6. 7009632 - cephadm/test_dashboard_e2e.sh: Expected to find content: '/^foo$/' within the selector: 'cd-modal .badge' but never did
  7. 7009634 repro_long_log.sh:51: test_log_size: jq info.stats.log_dups_size

Tracked by:

  1. https://tracker.ceph.com/issues/57303
  2. https://tracker.ceph.com/issues/57368
  3. https://tracker.ceph.com/issues/57346
  4. https://tracker.ceph.com/issues/57297
  5. https://tracker.ceph.com/issues/56951
  6. https://tracker.ceph.com/issues/57386
  7. https://tracker.ceph.com/issues/57164

@yuriw yuriw merged commit 2a5682f into ceph:quincy Sep 7, 2022
@ljflores ljflores mentioned this pull request Sep 14, 2022
14 tasks
ljflores added a commit that referenced this pull request Sep 29, 2022
This PR was reverted.

Signed-off-by: Laura Flores <lflores@redhat.com>
rzarzynski added a commit to rzarzynski/ceph that referenced this pull request Dec 21, 2022
This reverts commit 9337fa6.

It's a `main`'s counterpart of quincy's PRs ceph#47901 and ceph#49304.

Fixes: https://tracker.ceph.com/issues/57546
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants