https://docs.google.com/document/d/1NqsIgizseyMxUBimpE10m6sSnbebQ25VyJRUwxdmJyM/edit
Is your feature request related to a problem? Please describe.
The problem to solve it provide operators & others better signals of the healthiness of kvclient & below.
Operators & others can alert on these signals. CRL can use them to think about how "big" outages are & to understand where in CRDB stack (is KV broken or not?) & keyspace the impact is.
Describe the solution you'd like
Implement a prober that sends queries to KV in order to generate signal on the healthiness of KV client & below. For a POC that does reads only, see #58124.
Describe alternatives you've considered
More discussion in design doc linked above about alternatives.
By 21.2:
Later:
Epic CC-5241
https://docs.google.com/document/d/1NqsIgizseyMxUBimpE10m6sSnbebQ25VyJRUwxdmJyM/edit
Is your feature request related to a problem? Please describe.
The problem to solve it provide operators & others better signals of the healthiness of kvclient & below.
Operators & others can alert on these signals. CRL can use them to think about how "big" outages are & to understand where in CRDB stack (is KV broken or not?) & keyspace the impact is.
Describe the solution you'd like
Implement a prober that sends queries to KV in order to generate signal on the healthiness of KV client & below. For a POC that does reads only, see #58124.
Describe alternatives you've considered
More discussion in design doc linked above about alternatives.
By 21.2:
Getsto the start key of a randomly selected ranges in loop: kvprober: introduce a package for probing the KV layer that does reads only #58124timeutil.NewTimerper comments from Tobias here: kvprober: rate limit the planner #61275Later:
kvproberpkg/util/log/eventpbEpic CC-5241