kubelet: add failure threshold info to probe failure events #135879
sonikaarora wants to merge 1 commit into kubernetes:master
Welcome @sonikaarora! |
|
Hi @sonikaarora. Thanks for your PR.
I'm waiting for a github.com member to verify that this patch is reasonable to test. Once the patch is verified, the new status will be reflected by a label on this PR.
I understand the commands that are listed here.
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
|
This issue is currently awaiting triage. If a SIG or subproject determines this is a relevant issue, they will accept it by applying a triage label.
|
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: sonikaarora
The full list of commands accepted by this bot can be found here.
3de4109 to cc05909
|
/sig node |
|
/retitle kubelet: add failure threshold info to probe failure events |
|
@sonikaarora: Re-titling can only be requested by trusted users, like repository collaborators.
a1cff97 to a761084
When a probe fails, the event message now includes the failure count and threshold (e.g., 'Liveness probe failed (1/3, will be ignored): ...'). This helps users understand whether a failure will trigger action or be ignored due to FailureThreshold not being reached yet.
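The message format described above can be sketched as follows. This is a minimal illustration, not the kubelet's actual code: `formatProbeFailure` and its parameters are hypothetical names, and the rule that the "will be ignored" note appears only while the failure count is below the threshold is inferred from the two example messages in this PR.

```go
package main

import "fmt"

// formatProbeFailure is a hypothetical helper (not taken from the kubelet
// source). It builds a probe failure message that carries the current
// consecutive failure count and the configured FailureThreshold.
func formatProbeFailure(probeType string, failures, threshold int, output string) string {
	if failures < threshold {
		// Threshold not reached yet: no action is taken for this failure.
		return fmt.Sprintf("%s probe failed (%d/%d, will be ignored): %s",
			probeType, failures, threshold, output)
	}
	// Threshold reached: the failure will be acted on.
	return fmt.Sprintf("%s probe failed (%d/%d): %s",
		probeType, failures, threshold, output)
}

func main() {
	fmt.Println(formatProbeFailure("Liveness", 1, 3, "connection refused"))
	fmt.Println(formatProbeFailure("Liveness", 3, 3, "connection refused"))
}
```

Running this prints `Liveness probe failed (1/3, will be ignored): connection refused` followed by `Liveness probe failed (3/3): connection refused`.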
a761084 to 1c7426d
|
/cc @Random-Liu |
|
cc @ardaguclu @Random-Liu Could you please help review this PR?
|
@ardaguclu Could you please help review this PR?
|
Hi @ardaguclu, Hope you're doing well! I wanted to follow up on this PR that addresses #115823. This is a focused change that enhances probe failure events with threshold information, helping users understand whether failures will trigger action. The approach is minimal (modifies existing event messages only) and maintains full backward compatibility. I understand you're busy, but I'd really appreciate your feedback when you have a chance. Happy to make any adjustments based on your review! Thank you! 🙏 |
What this PR does / why we need it:
This PR addresses issue #115823 by enhancing existing probe failure events with threshold context. When a probe fails, the event message now includes the failure count and the failure threshold, and notes when the failure will be ignored because the threshold has not been reached yet.
Example event messages:
"Liveness probe failed (1/3, will be ignored): ..." ← No action taken yet
"Liveness probe failed (3/3): ..." ← Action will be taken (container restart)
Why this matters: Before this change, users consuming probe failure events had no way to know whether a failure would result in action or be ignored because FailureThreshold had not been reached. This could lead to confusion about whether containers were actually being restarted.
Approach: This PR modifies the existing event message format rather than creating new event types, making it a minimal, backward-compatible change that directly addresses the issue's request to "give an indication in container events."
Fixes #115823
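For tooling that reads these events, a tolerant parser might look like the sketch below. Everything here is illustrative: `parseProbeFailure`, the `probeFailure` struct, and the regular expression are hypothetical, and the older `"<Type> probe failed: <output>"` form without a count is an assumption about the previous message format rather than something stated in this PR.

```go
package main

import (
	"fmt"
	"regexp"
	"strconv"
)

// probeFailureRE matches both the assumed older form
// "<Type> probe failed: <output>" and the form proposed in this PR,
// "<Type> probe failed (<n>/<threshold>[, will be ignored]): <output>".
// (?s) lets the output span multiple lines.
var probeFailureRE = regexp.MustCompile(
	`(?s)^(\w+) probe failed(?: \((\d+)/(\d+)(, will be ignored)?\))?: (.*)$`)

// probeFailure is a hypothetical consumer-side view of one event message.
type probeFailure struct {
	ProbeType    string
	Failures     int
	Threshold    int
	HasThreshold bool // false when the message carries no count
	Ignored      bool // true when the message says "will be ignored"
	Output       string
}

// parseProbeFailure returns ok=false when msg is not a probe failure message.
func parseProbeFailure(msg string) (probeFailure, bool) {
	m := probeFailureRE.FindStringSubmatch(msg)
	if m == nil {
		return probeFailure{}, false
	}
	pf := probeFailure{ProbeType: m[1], Output: m[5]}
	if m[2] != "" {
		// The groups contain only digits; conversion errors (for example
		// overflow) are ignored in this sketch and leave the field at 0.
		pf.Failures, _ = strconv.Atoi(m[2])
		pf.Threshold, _ = strconv.Atoi(m[3])
		pf.HasThreshold = true
		pf.Ignored = m[4] != ""
	}
	return pf, true
}

func main() {
	for _, msg := range []string{
		"Liveness probe failed (1/3, will be ignored): timeout",
		"Liveness probe failed (3/3): timeout",
		"Readiness probe failed: timeout",
	} {
		pf, ok := parseProbeFailure(msg)
		fmt.Printf("%v %+v\n", ok, pf)
	}
}
```

Accepting both shapes lets a consumer keep working across kubelet versions, under the stated assumption about the older format.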
What type of PR is this?
/kind bug
/kind cleanup
Which issue(s) this PR is related to:
Fixes #115823
Special notes for your reviewer:
Does this PR introduce a user-facing change?
Yes - probe failure events now include threshold information to indicate if the failure is being ignored.
Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.: