Skip to content

Graduate PodLifecycleSleepAction to GA#128046

Merged
k8s-ci-robot merged 1 commit into
kubernetes:masterfrom
AxeZhan:ga3960
Nov 4, 2024
Merged

Graduate PodLifecycleSleepAction to GA#128046
k8s-ci-robot merged 1 commit into
kubernetes:masterfrom
AxeZhan:ga3960

Conversation

@AxeZhan

@AxeZhan AxeZhan commented Oct 14, 2024

Copy link
Copy Markdown
Member

What type of PR is this?

/kind feature

What this PR does / why we need it:

Graduate PodLifecycleSleepAction to GA

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

Does this PR introduce a user-facing change?

PodLifecycleSleepAction is graduated to GA

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

- [KEP]: https://github.com/kubernetes/enhancements/issues/3960

@k8s-ci-robot k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. kind/feature Categorizes issue or PR as related to a new feature. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. labels Oct 14, 2024
@k8s-ci-robot k8s-ci-robot added area/kubelet area/test sig/node Categorizes an issue or PR as relevant to SIG Node. sig/testing Categorizes an issue or PR as relevant to SIG Testing. and removed do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Oct 14, 2024
Comment thread test/e2e/common/node/lifecycle_hook.go Outdated
Comment thread test/e2e/common/node/lifecycle_hook.go

@SergeyKanzhelev SergeyKanzhelev left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

just a test classification suggestion, please fix

@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Oct 14, 2024
@AxeZhan AxeZhan force-pushed the ga3960 branch 2 times, most recently from bf786f4 to ba66acb Compare October 14, 2024 08:06
@k8s-ci-robot k8s-ci-robot added area/conformance Issues or PRs related to kubernetes conformance tests sig/architecture Categorizes an issue or PR as relevant to SIG Architecture. labels Oct 14, 2024
@AxeZhan AxeZhan force-pushed the ga3960 branch 3 times, most recently from 55be0cc to bbf3b3a Compare October 14, 2024 09:13
@bart0sh

bart0sh commented Oct 14, 2024

Copy link
Copy Markdown
Contributor

/triage accepted

@k8s-ci-robot k8s-ci-robot added the triage/accepted Indicates an issue or PR is ready to be actively worked on. label Oct 14, 2024
@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Nov 1, 2024
@k8s-ci-robot k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 1, 2024
@k8s-ci-robot k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Nov 1, 2024

@thockin thockin left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

API LGTM - notes on tests.

Comment thread test/e2e/common/node/lifecycle_hook.go Outdated
Comment thread test/e2e/common/node/lifecycle_hook.go Outdated
// longer than 5 seconds (pod should sleep for 5 seconds)
// shorter than gracePeriodSeconds (default 30 seconds here)
if !validDuration(cost, 5, 30) {
// shorter than gracePeriodSeconds (60 seconds here)

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why change to 60?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The same reason why change to 30 in another case: short duration led to flake tests in this submit.
Let me changed back to 30 and 15 to check the flake again.

Comment thread test/e2e/common/node/lifecycle_hook.go
Comment thread test/e2e/common/node/lifecycle_hook.go Outdated
// longer than 2 seconds (we change gracePeriodSeconds to 2 seconds here, and it's less than sleep action)
// shorter than sleep action (to make sure it doesn't take effect)
if !validDuration(cost, 2, 15) {
if !validDuration(cost, 5, 30) {

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think 15 makes more sense here - 30 is close enough to the original sleep that a little bit of nondeterminism could flake this test. 15 is 3x the expected value (~5) and 1/2 the requested value (30). If we hit 15 something went wrong, no?

@thockin

thockin commented Nov 4, 2024

Copy link
Copy Markdown
Member

Thanks!

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 4, 2024
@k8s-ci-robot

Copy link
Copy Markdown
Contributor

LGTM label has been added.

DetailsGit tree hash: 83f435f7069eb197628bf2f0d0835c29432a2dfb

@k8s-ci-robot

Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: AxeZhan, SergeyKanzhelev, thockin

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 4, 2024
@k8s-ci-robot k8s-ci-robot merged commit 9fe41b6 into kubernetes:master Nov 4, 2024
@k8s-ci-robot k8s-ci-robot added this to the v1.32 milestone Nov 4, 2024
@tengqm

tengqm commented Nov 5, 2024

Copy link
Copy Markdown
Contributor

Are we updating the website for this feature graduation?

@pacoxu

pacoxu commented Nov 5, 2024

Copy link
Copy Markdown
Member

Are we updating the website for this feature graduation?

kubernetes/website#48295 is a placeholder.

lifecycle := &v1.Lifecycle{
PreStop: &v1.LifecycleHandler{
Sleep: &v1.SleepAction{Seconds: 15},
Sleep: &v1.SleepAction{Seconds: 30},

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why did we change the test values during the graduation? we should avoid this things, or we modify the test first and wait some time to check its stability or we graduate and we modify the values later,

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

From what I can tell, this test was not run with periodics. I don't see a dedicated job for this feature. It was added to pull-kubernetes-e2e-gce-cos-alpha-features.

I think maybe we should pull back this PR from promotion and make sure there is a periodic that verifies that this test is working.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry for the flake😢.

From what I can tell, this test was not run with periodics. I don't see a dedicated job for this feature. It was added to pull-kubernetes-e2e-gce-cos-alpha-features.

Yes, I thought pull-kubernetes-e2e-gce-cos-alpha-features will run periodicity before, seems it will only run with submit.

why did we change the test values during the graduation?

The current test parameters showed flakiness with this PR submission (it didn't occur in pull-kubernetes-e2e-gce-cos-alpha-features, which I suspect is related to the cluster used in CI runs).

I guess we'll need to modify the tests first, and we may graduate this feature to GA in next release.
I opened #128642 for testing.
But I'm not sure how to make these tests run periodicity, can I just remove the feature in SIGDescribe ? Or should I somehow add a job in test-infra?

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no worries, this things happen to all of us ...

if the feature is beta enabled by default you can just remove the features tag

@liggitt

liggitt commented Nov 6, 2024

Copy link
Copy Markdown
Member

@thockin

thockin commented Nov 6, 2024

Copy link
Copy Markdown
Member

The PR originally changed some of the timeouts, I asked to revert those parts. Mea culpa - this flaking indicates that our ability to kill a pod "on time" could be off by more than 10 seconds.

@AxeZhan do you want to bump those again (sorry!!)?

@SergeyKanzhelev

Copy link
Copy Markdown
Member

@AxeZhan do you want to bump those again (sorry!!)?

Revert: #128619

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. area/conformance Issues or PRs related to kubernetes conformance tests area/kubelet area/test cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/feature Categorizes issue or PR as related to a new feature. lgtm "Looks good to me", indicates that a PR is ready to be merged. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. release-note Denotes a PR that will be considered when it comes time to generate release notes. sig/architecture Categorizes an issue or PR as relevant to SIG Architecture. sig/node Categorizes an issue or PR as relevant to SIG Node. sig/testing Categorizes an issue or PR as relevant to SIG Testing. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. triage/accepted Indicates an issue or PR is ready to be actively worked on.

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.