Project

General

Profile

Actions

Bug #54435

closed

bucket index completions may not retry after reshard

Added by Casey Bodley about 4 years ago. Updated 8 months ago.

Status:
Resolved
Priority:
Normal
Assignee:
Target version:
-
% Done:

100%

Source:
Development
Backport:
octopus pacific quincy
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Tags (freeform):
Fixed In:
v17.0.0-11009-ge11e62e5e57
Released In:
v18.2.0~2593
Upkeep Timestamp:
2025-07-14T06:47:58+00:00

Description

the RGWIndexCompletionThread stores a list of index completion ops that failed with ERR_BUSY_RESHARDING, for synchronous retry under guard_reshard() in RGWIndexCompletionThread::process()

however, after process() finishes it's work, it goes to sleep on its condition variable without checking for more completions. so it's easy for RGWIndexCompletionThread to miss signals on this condition variable, and not wake up even though there's still work to do. once completions are stalled this way, nothing will wake up the thread again until new completions race with another reshard


Related issues 3 (0 open3 closed)

Copied to rgw - Backport #54496: quincy: bucket index completions may not retry after reshardResolvedCory SnyderActions
Copied to rgw - Backport #54497: pacific: bucket index completions may not retry after reshardResolvedCory SnyderActions
Copied to rgw - Backport #54498: octopus: bucket index completions may not retry after reshardRejectedActions
Actions #1

Updated by Casey Bodley about 4 years ago

  • Status changed from New to Fix Under Review
  • Pull request ID set to 45212
Actions #2

Updated by Casey Bodley about 4 years ago

  • Status changed from Fix Under Review to Pending Backport
Actions #3

Updated by Upkeep Bot about 4 years ago

  • Copied to Backport #54496: quincy: bucket index completions may not retry after reshard added
Actions #4

Updated by Upkeep Bot about 4 years ago

  • Copied to Backport #54497: pacific: bucket index completions may not retry after reshard added
Actions #5

Updated by Upkeep Bot about 4 years ago

  • Copied to Backport #54498: octopus: bucket index completions may not retry after reshard added
Actions #7

Updated by Konstantin Shalygin over 2 years ago

  • Status changed from Pending Backport to Resolved
  • Assignee set to Casey Bodley
  • % Done changed from 0 to 100
  • Source set to Development
Actions #8

Updated by Upkeep Bot 8 months ago

  • Merge Commit set to e11e62e5e57dfb6b907654477cc52f9357612cb1
  • Fixed In set to v17.0.0-11009-ge11e62e5e57
  • Released In set to v18.2.0~2593
  • Upkeep Timestamp set to 2025-07-14T06:47:58+00:00
Actions

Also available in: Atom PDF