We observed some cases (#50241 for example) where a data node responding slowly can cause accumulation of ResponseContexts for indices:monitor/recovery[n], indices:monitor/stats[n], cluster:monitor/stats[n] and cluster:monitor/xpack/ml/job/stats/get[n] which correspond to _xpack/usage and _nodes/stats calls.
We would like to improve robustness of stats and usage call in case of a slowly responding data nodes by
- introducing timeout on stats and usage APIs and/or
- making stats and usage APIs tasks cancellable and cancel them if the REST client disconnects
We observed some cases (#50241 for example) where a data node responding slowly can cause accumulation of ResponseContexts for
indices:monitor/recovery[n],indices:monitor/stats[n],cluster:monitor/stats[n]andcluster:monitor/xpack/ml/job/stats/get[n]which correspond to_xpack/usageand_nodes/statscalls.We would like to improve robustness of stats and usage call in case of a slowly responding data nodes by