Bug #66123

closed

Quiesce timeout due to exporting

Added by Leonid Usov almost 2 years ago. Updated 5 months ago.

Status: Resolved
Priority: Normal
Assignee:
Category: Correctness/Safety
Target version:
% Done: 0%
Source:
Backport: squid
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): quiesce
Labels (FS): task(medium)
Pull request ID:
Tags (freeform):
Fixed In: v19.3.0-2435-g25e4ee2fa7
Released In: v20.2.0~2814
Upkeep Timestamp: 2025-11-01T01:11:32+00:00

Description

Failure: https://pulpito.ceph.com/leonidus-2024-05-19_09:55:47-fs-wip-lusov-quiesce-distro-default-smithi/7713434/

There are hundreds of outstanding ops at rank 0 (b). All quiesce_inode ops have the flag point:

      "flag_point": "failed to authpin, subtree is being exported",

There is no trace of an ongoing export operation on either of the MDSs.

Could this just be a very long export? More likely, there is some kind of deadlock with the quiescing, since the quiesce has completed on all other ranks.
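
For triaging a similar ops dump, a small script can tally the flag points of the quiesce_inode ops. This is only a sketch: it assumes the dump is a JSON object with an "ops" list whose entries carry a "description" string and a "type_data" object containing "flag_point". The sample data below is hypothetical and is not taken from the failed run.

```python
from collections import Counter


def count_flag_points(ops_dump, op_name="quiesce_inode"):
    """Tally flag_point values for ops whose description mentions op_name.

    Assumes the (hypothetical) dump shape described above:
    {"ops": [{"description": str, "type_data": {"flag_point": str}}, ...]}
    """
    counts = Counter()
    for op in ops_dump.get("ops", []):
        if op_name not in op.get("description", ""):
            continue  # not the kind of op we are interested in
        flag_point = op.get("type_data", {}).get("flag_point", "<none>")
        counts[flag_point] += 1
    return counts


# Hypothetical sample, shaped like the assumption above.
sample_dump = {
    "ops": [
        {
            "description": "internal op quiesce_inode (example 1)",
            "type_data": {
                "flag_point": "failed to authpin, subtree is being exported"
            },
        },
        {
            "description": "internal op quiesce_inode (example 2)",
            "type_data": {
                "flag_point": "failed to authpin, subtree is being exported"
            },
        },
        {
            "description": "client_request (example 3)",
            "type_data": {"flag_point": "dispatched"},
        },
    ]
}

counts = count_flag_points(sample_dump)
for flag_point, n in counts.most_common():
    print(f"{n:5d}  {flag_point}")
```

If every quiesce_inode op lands on the same flag point, as reported here, the tally has a single entry.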


Related issues 1 (0 open, 1 closed)

Copied to CephFS - Backport #66259: squid: Quiesce timeout due to exporting (Resolved, Leonid Usov)
Actions #1

Updated by Venky Shankar almost 2 years ago

  • Category set to Correctness/Safety
  • Status changed from New to Triaged
  • Assignee set to Leonid Usov
  • Target version set to v20.0.0
  • Labels (FS) task(medium) added
Actions #2

Updated by Leonid Usov almost 2 years ago

After analyzing this with @Patrick Donnelly, we concluded that the issue is caused by the remote authpinning that the quiesce performs on replicas. It prevents the migrator from reaching the frozen-tree state, while the quiesce itself may be stalled on some of the replicas because of the freezing.
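
One way to read the analysis above is as a cycle in a wait-for graph. The toy model below is a sketch of that reading, not MDS code: the node names are hypothetical labels, and the edge saying that remote authpins are held until the quiesce completes on all ranks is an assumption made for illustration.

```python
def find_cycle(waits_for):
    """Return one cycle in a wait-for graph as a list of nodes, or None.

    waits_for maps each node to the nodes it is waiting on.
    """
    visiting, done, path = set(), set(), []

    def visit(node):
        visiting.add(node)
        path.append(node)
        for nxt in waits_for.get(node, ()):
            if nxt in visiting:
                # Back edge: the cycle runs from nxt to node and back to nxt.
                return path[path.index(nxt):] + [nxt]
            if nxt not in done:
                cycle = visit(nxt)
                if cycle:
                    return cycle
        visiting.discard(node)
        done.add(node)
        path.pop()
        return None

    for node in waits_for:
        if node not in done:
            cycle = visit(node)
            if cycle:
                return cycle
    return None


# Hypothetical labels for the parties named in the comment above.
waits_for = {
    # The migrator cannot reach the frozen tree while remote authpins are held.
    "migrator: freeze subtree": ["quiesce: remote authpins"],
    # Assumption: the authpins are kept until the quiesce completes everywhere.
    "quiesce: remote authpins": ["quiesce: all ranks quiesced"],
    # The quiesce cannot complete while some of its ops are stalled.
    "quiesce: all ranks quiesced": ["quiesce_inode: stalled by freezing"],
    # Those ops are stalled because the subtree is freezing.
    "quiesce_inode: stalled by freezing": ["migrator: freeze subtree"],
}

cycle = find_cycle(waits_for)
print(" -> ".join(cycle))

# Illustration only: drop the dependency on remote authpins and the cycle goes away.
no_remote_authpin = {**waits_for, "migrator: freeze subtree": []}
print(find_cycle(no_remote_authpin))
```

In this model, neither side can make progress until the other does, which would match the observed timeout without any export ever completing. The second graph only illustrates that removing the authpin dependency breaks the cycle. It is not a description of the actual fix.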

Actions #3

Updated by Leonid Usov almost 2 years ago

  • Status changed from Triaged to Fix Under Review
  • Pull request ID set to 57579
Actions #4

Updated by Leonid Usov almost 2 years ago

  • Backport set to squid
Actions #5

Updated by Patrick Donnelly almost 2 years ago

  • Status changed from Fix Under Review to Pending Backport
Actions #6

Updated by Upkeep Bot almost 2 years ago

Actions #8

Updated by Leonid Usov almost 2 years ago

  • Status changed from Pending Backport to Resolved
Actions #9

Updated by Upkeep Bot 9 months ago

  • Merge Commit set to 25e4ee2fa7e8913bac09af9e43706ddeba1cd14a
  • Fixed In set to v19.3.0-2435-g25e4ee2fa7e
  • Upkeep Timestamp set to 2025-06-27T03:18:44+00:00
Actions #10

Updated by Upkeep Bot 8 months ago

  • Fixed In changed from v19.3.0-2435-g25e4ee2fa7e to v19.3.0-2435-g25e4ee2fa7
  • Upkeep Timestamp changed from 2025-06-27T03:18:44+00:00 to 2025-07-14T16:44:58+00:00
Actions #11

Updated by Upkeep Bot 5 months ago

  • Released In set to v20.2.0~2814
  • Upkeep Timestamp changed from 2025-07-14T16:44:58+00:00 to 2025-11-01T01:11:32+00:00