[DNM]quincy:mon/OSDMonitor: Added extra check before mon.go_recovery_stretch_mode()#48802

Closed
kamoltat wants to merge 3 commits into ceph:quincy from kamoltat:wip-ksirivad-backport-quincy-47340

kamoltat commented Nov 8, 2022

Problem:
There are certain scenarios in a degraded
stretch cluster where the monitor will try to
enter Monitor::go_recovery_stretch_mode(),
which leads to a ceph_assert.

Solution:
Make sure dead_mon_buckets.size() == 0
in OSDMonitor::update_from_paxos()
before calling Monitor::go_recovery_stretch_mode().
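
For readers who want to see the shape of the guard, here is a minimal, self-contained sketch. It is not the code from this PR: `ToyMonitor`, `maybe_go_recovery()` and the `osds_recovered` flag are hypothetical names made up for illustration, the container type of `dead_mon_buckets` is an assumption, and a thrown exception stands in for `ceph_assert` so the failure can be observed without aborting.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <stdexcept>
#include <string>

// Toy stand-in for the monitor state; NOT the real Ceph Monitor class.
struct ToyMonitor {
  bool degraded_stretch_mode = false;
  bool recovering_stretch_mode = false;
  // Assumed shape: bucket name -> names of monitors in that bucket that are down.
  std::map<std::string, std::set<std::string>> dead_mon_buckets;

  // Models the failure from the Problem section: entering recovery while
  // dead monitor buckets remain is treated as a fatal error. The throw is
  // a stand-in for ceph_assert.
  void go_recovery_stretch_mode() {
    if (!dead_mon_buckets.empty()) {
      throw std::logic_error("stretch recovery attempted with dead monitor buckets");
    }
    recovering_stretch_mode = true;
  }
};

// Hypothetical caller modelling the guarded call site. osds_recovered
// summarises whatever OSD-side conditions make recovery look possible.
// Returns true if recovery stretch mode was entered.
bool maybe_go_recovery(ToyMonitor& mon, bool osds_recovered) {
  if (!mon.degraded_stretch_mode || !osds_recovered) {
    return false;
  }
  // The extra check: skip recovery while any monitor bucket is still dead.
  if (mon.dead_mon_buckets.size() != 0) {
    return false;
  }
  mon.go_recovery_stretch_mode();
  assert(mon.recovering_stretch_mode);  // postcondition of a successful call
  return true;
}
```

With one entry in `dead_mon_buckets`, `maybe_go_recovery()` returns false and never reaches the fatal path. Once the map is empty, the same call enters recovery.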

Fixes:
https://tracker.ceph.com/issues/57017

Backporting relevant commits from main PR:

#47340

Signed-off-by: Kamoltat <ksirivad@redhat.com>

Contribution Guidelines

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)

kamoltat added this to the quincy milestone Nov 8, 2022
kamoltat requested a review from a team as a code owner November 8, 2022 20:58
kamoltat self-assigned this Nov 8, 2022
kamoltat changed the title from "quincy:mon/OSDMonitor: Added extra check before mon.go_recovery_stretch_mode()" to "[DNM]quincy:mon/OSDMonitor: Added extra check before mon.go_recovery_stretch_mode()" Dec 9, 2022

kamoltat commented Dec 9, 2022

#47340 introduced https://tracker.ceph.com/issues/58239,
so don't backport just yet!

Added bug reproducer for
https://bugzilla.redhat.com/show_bug.cgi?id=2104207

Added more logs in MON.

Signed-off-by: Kamoltat <ksirivad@redhat.com>
(cherry picked from commit 62fe3cb)

Problem:
There are certain scenarios in a degraded
stretch cluster where the monitor will try to
enter ``Monitor::go_recovery_stretch_mode()``,
which leads to a ``ceph_assert``.

Solution:
Make sure ``dead_mon_buckets.size() == 0``
in ``OSDMonitor::update_from_paxos()``
before calling ``Monitor::go_recovery_stretch_mode()``.

Fixes:
https://bugzilla.redhat.com/show_bug.cgi?id=2104207

Signed-off-by: Kamoltat <ksirivad@redhat.com>
(cherry picked from commit d95c41a)
Signed-off-by: Kamoltat <ksirivad@redhat.com>
kamoltat force-pushed the wip-ksirivad-backport-quincy-47340 branch from 3392f14 to 02b0579 February 8, 2023 19:40
kamoltat commented

PR moved to #51413.

Closing the PR since it is not relevant anymore.

kamoltat closed this May 18, 2023