Skip to content

revert changes in sleep action test#128619

Closed
SergeyKanzhelev wants to merge 1 commit into
kubernetes:masterfrom
SergeyKanzhelev:revertSleepActionTest
Closed

revert changes in sleep action test#128619
SergeyKanzhelev wants to merge 1 commit into
kubernetes:masterfrom
SergeyKanzhelev:revertSleepActionTest

Conversation

@SergeyKanzhelev

Copy link
Copy Markdown
Member

What type of PR is this?

/kind failing-test
/sig node

What this PR does / why we need it:

See #128613 (comment)

NONE

@k8s-ci-robot k8s-ci-robot added release-note-none Denotes a PR that doesn't merit a release note. size/S Denotes a PR that changes 10-29 lines, ignoring generated files. kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test. sig/node Categorizes an issue or PR as relevant to SIG Node. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Nov 6, 2024
@k8s-ci-robot

Copy link
Copy Markdown
Contributor

This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@k8s-ci-robot k8s-ci-robot added needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. labels Nov 6, 2024
@k8s-ci-robot k8s-ci-robot added area/test sig/testing Categorizes an issue or PR as relevant to SIG Testing. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Nov 6, 2024
@SergeyKanzhelev

Copy link
Copy Markdown
Member Author

/assign @AxeZhan

@kannon92

kannon92 commented Nov 6, 2024

Copy link
Copy Markdown
Contributor

Should we just revert the entire PR?

@kannon92

kannon92 commented Nov 6, 2024

Copy link
Copy Markdown
Contributor

Ideally it would be better to revert the entire test including its movement to node conformance.

@SergeyKanzhelev

SergeyKanzhelev commented Nov 6, 2024

Copy link
Copy Markdown
Member Author

Ideally it would be better to revert the entire test including its movement to node conformance.

I remember looking at test history when reviewed the PR and it was stable green. This shows it was reasonably stable for a long time: https://storage.googleapis.com/k8s-triage/index.html?date=2024-11-06&pr=1&text=unexpected%20delay%20duration%20before%20killing%20the%20pod I don't remember seeing these 3 flakes it shows currently from before the promotion to conformance. Need to look deeper into those

@kannon92

kannon92 commented Nov 6, 2024

Copy link
Copy Markdown
Contributor

Looking at that I don't know if we had this test running as a periodic..

@liggitt

liggitt commented Nov 6, 2024

Copy link
Copy Markdown
Member

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 6, 2024
@k8s-ci-robot

Copy link
Copy Markdown
Contributor

LGTM label has been added.

DetailsGit tree hash: 16debf58cc5b2650f455f9c97039d1f8cb246acf

@k8s-ci-robot

Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: liggitt, SergeyKanzhelev

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@aojea

aojea commented Nov 6, 2024

Copy link
Copy Markdown
Member

E2eNode Suite: [It] [sig-storage] HostPath should support r/w [NodeConformance] expand_more

/test pull-kubernetes-node-e2e-containerd

@SergeyKanzhelev

Copy link
Copy Markdown
Member Author

/retest

Seems the be unrelated issue

@kannon92

kannon92 commented Nov 6, 2024

Copy link
Copy Markdown
Contributor

#128619 (comment)

I don't see any periodics that are actually running this test before the merge of the PR.

Looking at what you linked @SergeyKanzhelev I don't think this test was ever running as a periodic. There was one PR that it ran on a few weeks ago but I don't see actual test runs for this feature.

It looks like when we promoted this test to node conformance we started running it and it shows flakes. I worry that we will still have these flakes even if we lower the sleep value.

@liggitt

liggitt commented Nov 6, 2024

Copy link
Copy Markdown
Member

should we revert the promotion to NodeConformance entirely then if we can't find anywhere it was running flake-free, rather than try to deflake in master?

@thockin

thockin commented Nov 6, 2024

Copy link
Copy Markdown
Member

Isn't this making the timing tighter? Maybe I misunderstood the flake?

@liggitt

liggitt commented Nov 6, 2024

Copy link
Copy Markdown
Member

/hold

I didn't review the PR closely, I just assumed it was reverting to a more stable version of the test

#128619 (comment) indicates maybe there never was a stable version of the test

Would be good to figure out what we're doing in a single rollback or edit asap

@k8s-ci-robot k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 6, 2024
@aojea

aojea commented Nov 6, 2024

Copy link
Copy Markdown
Member

Would be good to figure out what we're doing in a single rollback or edit asap

I don't feel comfortable promoting the feature to GA in a pod lifecycle if we are not able to say if is being tested
My suggestion

  1. rollback the entire PR with the feature promotion Graduate PodLifecycleSleepAction to GA #128046 (postpone it for 1.33)
  2. figure out if the test are running or not and provide proof of stability
  3. promote to GA in 1.33 with guarantees

@kannon92

kannon92 commented Nov 6, 2024

Copy link
Copy Markdown
Contributor

https://github.com/kubernetes/kubernetes/pull/128046/files#r1831730521

The only job that runs this test is pull-kubernetes-e2e-gce-cos-alpha-features.

I agree with @aojea. I will put up a revert and we will require a periodic so we can monitor stability of this test before promoting to GA.

@kannon92

kannon92 commented Nov 6, 2024

Copy link
Copy Markdown
Contributor

#128627 is up.

@SergeyKanzhelev

Copy link
Copy Markdown
Member Author

/close

@k8s-ci-robot

Copy link
Copy Markdown
Contributor

@SergeyKanzhelev: Closed this PR.

Details

In response to this:

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. area/test cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test. lgtm "Looks good to me", indicates that a PR is ready to be merged. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. release-note-none Denotes a PR that doesn't merit a release note. sig/node Categorizes an issue or PR as relevant to SIG Node. sig/testing Categorizes an issue or PR as relevant to SIG Testing. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

7 participants