Stats drop after hot restart #6924

@rgs1

Since stats moved from shared memory to being copied over IPC (#5910), our metrics show a transient drop during hot restarts. For example, for cluster.${cluster}.health_check.failure:

[image]

This isn't great because it triggers alerts, etc. I haven't looked into it much yet, but one suspicion is that it's due to the overhead of copying the stats over.
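To make that suspicion concrete, here is a toy model of the effect. It is not Envoy code, and every name in it (`scrape_series`, `parent_value`, `copy_delay`) is hypothetical. It only assumes that the new process reports a stat as 0 until the parent's value has been copied over, so any scrape that lands inside that window reads a lower number:

```python
# Toy model only: not Envoy code. All names here are hypothetical.

def scrape_series(parent_value, copy_delay, ticks):
    """Return what a scraper would read from the new process at each tick.

    Assumption: the new process starts at 0 and only reflects the parent's
    value once the copy has finished, `copy_delay` ticks after startup.
    """
    series = []
    for t in range(ticks):
        # Before the copy completes, the inherited value is not visible yet.
        inherited = parent_value if t >= copy_delay else 0
        series.append(inherited)
    return series


if __name__ == "__main__":
    # With a 2-tick copy delay, the first two scrapes see a dip.
    print(scrape_series(parent_value=120, copy_delay=2, ticks=5))
    # prints [0, 0, 120, 120, 120]
```

If the real cause is something like this, the dip should get longer as the number of stats to copy grows, which would be one way to check the theory.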

Thoughts?

cc: @fredlas @mattklein123 @fishcakez

Labels: bug, no stalebot
