Bug #65660

closed

mds: drop client metrics during recovery

Added by Patrick Donnelly almost 2 years ago. Updated 4 months ago.

Status: Resolved
Priority: High
Category: Correctness/Safety
Target version:
% Done: 0%
Source: Development
Backport: squid,reef,quincy
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): MDS
Labels (FS):
Pull request ID:
Tags (freeform): backport_processed
Fixed In: v19.3.0-2917-g426beef37e
Released In: v20.2.0~2644
Upkeep Timestamp: 2025-11-20T13:09:19+00:00

Description

When the rank is coming up, there's little reason to record historical metrics from the clients. We've also seen floods of these metrics messages slow down up:rejoin significantly.
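
As an illustration only, the behaviour requested here can be sketched as a state gate in front of the metrics path. This is not the code from the actual fix: the class `MetricsSink`, its fields, and the choice to accept metrics only in up:active are assumptions made for the sketch. Only the rank state names are taken from CephFS.

```python
from dataclasses import dataclass, field
from enum import Enum


class RankState(Enum):
    # A subset of MDS rank states; only the names are taken from CephFS.
    RECONNECT = "up:reconnect"
    REJOIN = "up:rejoin"
    CLIENTREPLAY = "up:clientreplay"
    ACTIVE = "up:active"


@dataclass
class MetricsSink:
    """Hypothetical stand-in for the MDS-side consumer of client metrics."""

    state: RankState = RankState.REJOIN
    recorded: list = field(default_factory=list)
    dropped: int = 0

    def handle_client_metrics(self, client_id, payload):
        # Assumption: anything short of up:active counts as "recovering".
        # Dropping here keeps a flood of stale metrics from older clients
        # off the recovery path instead of queueing or recording them.
        if self.state is not RankState.ACTIVE:
            self.dropped += 1
            return False
        self.recorded.append((client_id, payload))
        return True
```

With this gate, a message arriving while the rank is in up:rejoin only increments `dropped`; once `state` is set to `RankState.ACTIVE`, the same call records the payload.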


Related issues 3 (0 open, 3 closed)

Copied to CephFS - Backport #66620: squid: mds: drop client metrics during recovery (Resolved, Patrick Donnelly)
Copied to CephFS - Backport #66621: quincy: mds: drop client metrics during recovery (Rejected, Patrick Donnelly)
Copied to CephFS - Backport #66622: reef: mds: drop client metrics during recovery (Resolved, Patrick Donnelly)
Actions #1

Updated by Xiubo Li almost 2 years ago

Is this new in upstream master? As I remember, we improved this so that clients only send metrics once the MDS is ready, that is, in the active state.

Actions #2

Updated by Christopher Hoffman almost 2 years ago

there's little reason to record historical metrics from the clients

Can you expand on this? Are we losing anything by dropping them?

Actions #3

Updated by Patrick Donnelly almost 2 years ago

Xiubo Li wrote in #note-1:

Is this new in upstream master? As I remember, we improved this so that clients only send metrics once the MDS is ready, that is, in the active state.

That may be, but the MDS should still be resilient to older clients.

Actions #4

Updated by Patrick Donnelly almost 2 years ago

Christopher Hoffman wrote in #note-2:

there's little reason to record historical metrics from the clients

Can you expand on this? Are we losing anything by dropping them?

The metrics are there to provide a real-time view of performance for clients. Jos or Venky may correct me if I'm wrong but I don't think there is any use for past metrics.

Actions #5

Updated by Venky Shankar almost 2 years ago

Patrick Donnelly wrote in #note-4:

Christopher Hoffman wrote in #note-2:

there's little reason to record historical metrics from the clients

Can you expand on this? Are we losing anything by dropping them?

The metrics are there to provide a real-time view of performance for clients. Jos or Venky may correct me if I'm wrong but I don't think there is any use for past metrics.

Unless the metrics are persisted by the exporter daemon, which is currently in the works. So, we do not lose anything by dropping them.

Actions #6

Updated by Dhairya Parmar almost 2 years ago

Venky Shankar wrote in #note-5:

Patrick Donnelly wrote in #note-4:

Christopher Hoffman wrote in #note-2:

there's little reason to record historical metrics from the clients

Can you expand on this? Are we losing anything by dropping them?

The metrics are there to provide a real-time view of performance for clients. Jos or Venky may correct me if I'm wrong but I don't think there is any use for past metrics.

Unless the metrics are persisted by the exporter daemon, which is currently in the works. So, we do not lose anything by dropping them.

So you mean it'd be fine to have past metrics once they are being persisted by the exporter daemon? So once the exporter daemon is operational, we'd need to adjust this code again, right?

Actions #7

Updated by Patrick Donnelly over 1 year ago

  • Category set to Correctness/Safety
  • Status changed from In Progress to Pending Backport
  • Backport changed from squid,reef to squid,reef,quincy
Actions #8

Updated by Upkeep Bot over 1 year ago

  • Copied to Backport #66620: squid: mds: drop client metrics during recovery added
Actions #9

Updated by Upkeep Bot over 1 year ago

  • Copied to Backport #66621: quincy: mds: drop client metrics during recovery added
Actions #10

Updated by Upkeep Bot over 1 year ago

  • Copied to Backport #66622: reef: mds: drop client metrics during recovery added
Actions #11

Updated by Upkeep Bot over 1 year ago

  • Tags (freeform) set to backport_processed
Actions #12

Updated by Venky Shankar over 1 year ago

  • Pull request ID set to 57084
Actions #13

Updated by Upkeep Bot 8 months ago

  • Merge Commit set to 426beef37ec414b84001d601ca9cb2febb628f1f
  • Fixed In set to v19.3.0-2917-g426beef37ec
  • Upkeep Timestamp set to 2025-07-09T13:46:26+00:00
Actions #14

Updated by Upkeep Bot 8 months ago

  • Fixed In changed from v19.3.0-2917-g426beef37ec to v19.3.0-2917-g426beef37e
  • Upkeep Timestamp changed from 2025-07-09T13:46:26+00:00 to 2025-07-14T17:11:31+00:00
Actions #15

Updated by Upkeep Bot 5 months ago

  • Released In set to v20.2.0~2644
  • Upkeep Timestamp changed from 2025-07-14T17:11:31+00:00 to 2025-11-01T00:57:42+00:00
Actions #16

Updated by Upkeep Bot 4 months ago

  • Status changed from Pending Backport to Resolved
  • Upkeep Timestamp changed from 2025-11-01T00:57:42+00:00 to 2025-11-20T13:09:19+00:00