raftstore: enhance the detection to cover I/O jitters on kvdb.#18439
raftstore: enhance the detection to cover I/O jitters on kvdb.#18439ti-chi-bot[bot] merged 6 commits intotikv:masterfrom
Conversation
Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>
|
Skipping CI for Draft Pull Request. |
Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>
Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>
Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>
|
/retest |
hbisheng
left a comment
There was a problem hiding this comment.
LGTM. Looks like the PR is mainly about tuning the params:
- inspect_kvdb_interval: 2s -> 100ms
- ratio_thresh:60% -> 10%
These changes will significantly increase the sensitivity of the KV IO jitter detection. This makes sense if we want to catch jitter as short as 100ms, though the trade-off is a higher risk of false positives.
Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>
Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: hbisheng, overvenus The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
In response to a cherrypick label: new pull request created to branch |
|
In response to a cherrypick label: new pull request created to branch |
close tikv#18463 Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io>
|
In response to a cherrypick label: new pull request created to branch |
…18439) close tikv#18463 Enhances the detection mechanism to cover the I/O jitters on kvdb disk if deploys with separated mount paths. Signed-off-by: lucasliang <nkcs_lykx@hotmail.com> Signed-off-by: okjiang <819421878@qq.com>
…18439) (tikv#18723) close tikv#18463 Enhances the detection mechanism to cover the I/O jitters on kvdb disk if deploys with separated mount paths. Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io> Signed-off-by: lucasliang <nkcs_lykx@hotmail.com> Co-authored-by: lucasliang <nkcs_lykx@hotmail.com>
…18439) (tikv#18723) close tikv#18463 Enhances the detection mechanism to cover the I/O jitters on kvdb disk if deploys with separated mount paths. Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io> Signed-off-by: lucasliang <nkcs_lykx@hotmail.com> Co-authored-by: lucasliang <nkcs_lykx@hotmail.com> Signed-off-by: okjiang <819421878@qq.com>
What is changed and how it works?
Issue Number: Close #18463
What's Changed:
In previous work #17801, TiKV has introduced the detection mechanism for kvdb disk to detect I/O hang issues.
However, recent customer feedback highlighted the need to extend detection coverage to I/O jitters, ensuring TiKV can automatically recover from abnormal states caused by KVDB I/O jitters.
Therefore, this ticket is built to tracks the development efforts to enhance TiKV’s I/O jitter detection and recovery mechanism. And the majority parts of this change are listed as followings show:
raftstore.inspect_kvdb_interval: 2s -> 100msSlowScore::ratio_thresh:60% -> 10%Related changes
pingcap/docs/pingcap/docs-cn:Check List
Tests
Side effects
Release note