fix: set mtu as workaround to fix network timeouts in DinD test#23757
Conversation
remove empty line fix comment
|
/lgtm |
|
@lizhuqi: changing LGTM is restricted to collaborators DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/lgtm |
|
/lgtm |
|
Depending on what image is actually used in your jobs, you will need to:
|
|
Basically, I'm not opposed, but I'd like to understand more about the scope here:
|
|
For the Secrets Store CSI Driver:
All the jobs in this config using docker-in-docker are failing. This was blocking our release last week.
All the jobs are using
We merged PR to add the iptables rules in all the jobs after some validation in our project. The tests have been passing after the PR merge. Testgrid dashboard: https://testgrid.k8s.io/sig-auth-secrets-store-csi-driver All the failures are on 09/22, 09/23 and mitigated on 09/24 with the fix. Example failure: https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/directory/pull-secrets-store-csi-driver-image-scan/1441120709228105728 |
|
/cc |
|
The windows capz job is failing https://testgrid.k8s.io/sig-windows-master-release#capz-windows-dockershim-master, which uses DinD and kubekins. |
|
Examples of jobs that were fixed with the workaround: all the jobs changed in #23744, such as https://testgrid.k8s.io/sig-cluster-lifecycle-cluster-api-provider-azure#capz-periodic-conformance-v1alpha4-main. |
|
AWS docker in docker jobs that are failing: |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: andyzhangx, spiffxp The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
#23784 - autobump PR that auto-merged did this
#23795 - I just approved this autobump PR
I'm going to assume folks who have commented here can handle this part? |
|
We just got our first conformance test pass since 9/22 so looks like it's working 🎉 https://testgrid.k8s.io/provider-azure-master-signal#capz-conformance |
|
I'm planning on gradually rolling this back using a mechanism that will allow us to do so with fewer PRs if we need to enable it again in the future, ref: #23741 (comment) |
What type of PR is this?
/kind bug
What this PR does / why we need it:
fix: set mtu as workaround to fix network timeouts in DinD test, find detailed discussion here: https://kubernetes.slack.com/archives/C09QZ4DQB/p1632710433059900?thread_ts=1632414457.031600&cid=C09QZ4DQB
Which issue(s) this PR fixes:
Fixes #23741