Add distributed.metrics.monotonic #6181
1ff857e to fd31724
I think that maybe distributed.metrics.time would work in that case? Maybe we can just use that.
I'm worried about bad OS behaviour. Specifically, an NTP sync could, for whatever reason, suddenly set the wall clock back by several seconds, instead of the few tens of milliseconds you'd see on a healthy box. If that ever happens, the spill system will send the worker into complete apnoea until it's completely done, thus triggering timeouts. An even more delicate case is #6189. There, if the clock is set back a few seconds during a particularly painful spill, it may give the worker enough time to rise in memory usage from 70% to 95% without having a chance to pause.
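To make the failure mode concrete, here is a minimal sketch. It is not the code in this PR: `should_yield` and `FakeWallClock` are hypothetical names made up for illustration, and the stdlib `time.monotonic` stands in for whatever `distributed.metrics.monotonic` ends up doing. It shows an elapsed-time check that stops firing once the wall clock is set back, while the same check on a monotonic clock keeps working.

```python
import time
from typing import Callable


def should_yield(last_yield: float, interval: float, clock: Callable[[], float]) -> bool:
    """Hypothetical helper: True once `interval` seconds have passed on `clock`."""
    return clock() - last_yield >= interval


class FakeWallClock:
    """Hypothetical stand-in for a wall clock that can be adjusted (e.g. by NTP)."""

    def __init__(self, start: float) -> None:
        self.now = start

    def __call__(self) -> float:
        return self.now

    def advance(self, seconds: float) -> None:
        # A negative value simulates the clock being set back.
        self.now += seconds


# Wall clock: half a second of work passes, then the clock is set back 5 s.
wall = FakeWallClock(start=1000.0)
last_wall = wall()
wall.advance(0.5)   # real work happened
wall.advance(-5.0)  # simulated NTP correction
# Elapsed time now looks negative, so the check does not fire.
wall_says_yield = should_yield(last_wall, 0.1, wall)

# Monotonic clock: never goes backwards, so elapsed time is always >= 0.
last_mono = time.monotonic()
time.sleep(0.05)
mono_says_yield = should_yield(last_mono, 0.01, time.monotonic)
```

In the wall-clock case the check stays false until the clock has caught up with the old reading plus the interval, which is the window in which the worker would not get a chance to pause.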
To be used in #6174