Bump up timeout for pilot e2e tests #3653
Tests on prow seem to be timing out with 20m. Bumping up to 30m.
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: ldemailly |
|
@ldemailly it looks like prow is running really slowly for some reason. Also, #3516 (merged on 2/16) fixed the fact that we were not previously running the auth tests. Now that we're running the full suite of pilot tests, it's clear that the auth tests are taking a ridiculously long time on prow. When I run them locally on a GKE cluster, all of the tests take ~5 min (with the same parameters as prow). |
|
/test istio-presubmit |
|
/test all [submit-queue is verifying that this PR is safe to merge] |
|
can you raise this with @sebastienvas @chxchx cc @hklai |
|
Automatic merge from submit-queue. |
|
I've just managed to replicate the slowness of the pilot e2e tests from my machine on a GKE cluster. The problem appears to be that our test cluster has only 1 node. Previously, I was using a cluster with 4 nodes (nodeType=n1-standard-4) and the tests completed in ~5 min. With an otherwise identical cluster running a single node, the same tests took 1 h 12 min and several of them failed. As a short-term solution, we could bump up the cluster size for these tests to something reasonable (e.g. 4). Going forward, we might consider using a large shared cluster, separating the test resources by namespace. WDYT? |
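Since the root cause here was an undersized cluster rather than the tests themselves, one option is a fail-fast guard that checks the node count before the e2e run starts. The following is only a minimal sketch, not anything that exists in the repo: the names `countNodes` and `checkClusterSize` are hypothetical, the minimum of 4 is just the value suggested above, and it assumes the input is the JSON printed by `kubectl get nodes -o json` (a list object with an `items` array).

```go
package main

import (
	"encoding/json"
	"fmt"
)

// nodeList mirrors only the part of the `kubectl get nodes -o json`
// output that this sketch needs: the top-level "items" array.
type nodeList struct {
	Items []json.RawMessage `json:"items"`
}

// countNodes (hypothetical helper) returns the number of entries
// in the "items" array of a JSON node list.
func countNodes(data []byte) (int, error) {
	var nl nodeList
	if err := json.Unmarshal(data, &nl); err != nil {
		return 0, fmt.Errorf("parsing node list: %v", err)
	}
	return len(nl.Items), nil
}

// checkClusterSize (hypothetical helper) returns an error when the
// node list contains fewer than minNodes nodes.
func checkClusterSize(data []byte, minNodes int) error {
	n, err := countNodes(data)
	if err != nil {
		return err
	}
	if n < minNodes {
		return fmt.Errorf("cluster has %d node(s), want at least %d", n, minNodes)
	}
	return nil
}

func main() {
	// Sample input standing in for real kubectl output: a single-node cluster.
	sample := []byte(`{"kind":"List","items":[{"metadata":{"name":"node-1"}}]}`)
	if err := checkClusterSize(sample, 4); err != nil {
		fmt.Println("refusing to run e2e tests:", err)
	}
}
```

A guard like this would surface a misprovisioned cluster in seconds with a clear message, instead of as a timeout after 20 or 30 minutes.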
|
I've created a short-term fix in #3663 |
|
@nmittler: The following test failed, say
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
|
thanks for the investigation. in the past we've had the issue where some tasks were run on prow itself instead of being scheduled on the target cluster. is this another example of that? |
|
@ldemailly I think so ... the logs showed that the cluster had a single node. |
|
@ldemailly I've added details in #3663 (comment) |
|
ty! |