Bug #69012

open

Commands using Mgr Modules fail if run immediately post a mgr failover/ restart

Added by Laura Flores over 1 year ago. Updated about 1 year ago.

Status: Fix Under Review
Priority: Normal
Assignee:
Category: -
Target version: -
% Done: 0%
Source:
Backport: squid
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Tags (freeform):
Merge Commit:
Fixed In:
Released In:
Upkeep Timestamp:

Description

Bug Description

Occasionally, after a ceph-mgr restart, some mgr modules may become temporarily unavailable, causing their commands to fail. For example:

# ceph mgr fail; ceph orch host ls
Error ENOTSUP: Warning: due to ceph-mgr restart, some PG states may not be up to date
Module 'orchestrator' is not enabled/loaded (required by command 'orch host ls'): use `ceph mgr module enable orchestrator` to enable it

Who is affected?

Users who run commands dependent on ceph-mgr modules may experience this after running `ceph mgr fail`. Users may also experience this bug when a mgr daemon restarts during an upgrade.

Workaround

Turning off the balancer module reduces the time it takes to load the rest of the mgr modules. Users experiencing the error above may disable the balancer module temporarily with `ceph balancer off` and rerun the commands.
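
The workaround can be wrapped in a small helper. This is a minimal sketch, not part of the ticket: the function name `run_with_balancer_off` and the `CEPH` override variable are hypothetical, and turning the balancer back on afterwards with `ceph balancer on` is an assumption (the ticket only says to disable it temporarily, and the sketch does not check whether the balancer was on beforehand).

```shell
#!/usr/bin/env bash
# Sketch of the workaround: pause the balancer, rerun the failing command,
# then turn the balancer back on.
# Assumption: CEPH names the ceph CLI; it can be overridden for a dry run.
CEPH="${CEPH:-ceph}"

run_with_balancer_off() {
    # Usage: run_with_balancer_off <ceph subcommand and arguments...>
    # Stop early if the balancer could not be turned off.
    "$CEPH" balancer off || return 1

    # Rerun the command that failed and remember its exit status.
    "$CEPH" "$@"
    local rc=$?

    # Assumption: the balancer should be re-enabled once the command has run.
    "$CEPH" balancer on
    return "$rc"
}

# Example: run_with_balancer_off orch host ls
```

Note that `ceph balancer off` is itself served by a mgr module, so it may also need to be reissued if it is run immediately after a failover.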

Related issues

Tracker ticket #68657 addresses an obvious bottleneck in the balancer module that exacerbated the above issue. With this fix, the frequency of the error was reduced.

Tracker ticket #68648 addresses the returned error code, changing it from ENOTSUP to the more appropriate EAGAIN to convey that the command may be retried.
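
Since the error is transient, a caller can simply retry the command until the mgr modules have loaded. The following is a minimal sketch under stated assumptions: the helper name `retry`, the attempt count, and the delay are all hypothetical, and it retries on any non-zero exit status rather than inspecting the specific error code.

```shell
#!/usr/bin/env bash
# Sketch: rerun a command until it succeeds or the attempts run out.
# Assumption: any non-zero exit status is treated as retryable.

retry() {
    # Usage: retry <max_attempts> <delay_seconds> <command...>
    local max_attempts=$1 delay=$2
    shift 2
    local attempt=1

    until "$@"; do
        if [ "$attempt" -ge "$max_attempts" ]; then
            echo "giving up after $attempt attempts: $*" >&2
            return 1
        fi
        attempt=$((attempt + 1))
        # Give the mgr modules time to finish loading before the next try.
        sleep "$delay"
    done
}

# Example: retry 5 2 ceph orch host ls
```

A fixed delay keeps the sketch short; scripts that run during upgrades may prefer a longer or increasing delay.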

Related BZ: https://bugzilla.redhat.com/show_bug.cgi?id=2314146


Related issues 2 (0 open, 2 closed)

Related to mgr - Bug #68657: squid: mgr/balancer preventing orchestrator and dashboard functionality (Resolved, Laura Flores)
Has duplicate mgr - Bug #68648: Commands using Mgr Modules fail with ENOTSUP if run immediately post a mgr failover/ restart (Duplicate, Brad Hubbard)
#1

Updated by Laura Flores over 1 year ago

  • Related to Bug #68657: squid: mgr/balancer preventing orchestrator and dashboard functionality added
  • Related to Bug #68648: Commands using Mgr Modules fail with ENOTSUP if run immediately post a mgr failover/ restart added
#2

Updated by Laura Flores over 1 year ago

  • Status changed from New to In Progress
  • Assignee set to Laura Flores
  • Backport set to squid
#3

Updated by Laura Flores about 1 year ago

  • Status changed from In Progress to Fix Under Review
  • Pull request ID set to 61325
#4

Updated by Laura Flores about 1 year ago

  • Related to deleted (Bug #68648: Commands using Mgr Modules fail with ENOTSUP if run immediately post a mgr failover/ restart)
#5

Updated by Laura Flores about 1 year ago

  • Has duplicate Bug #68648: Commands using Mgr Modules fail with ENOTSUP if run immediately post a mgr failover/ restart added