[Flaky pipeline]: Flaky test from submit-upgrade-test-cloud-build #4235

@lacroixthomas

What happened:
Step 25 (submit-upgrade-test-cloud-build) is flaky: it only passes once every x runs.

What you expected to happen:
It should pass every time (unless there is an issue with the test itself).

How to reproduce it (as minimally and precisely as possible):
Run the pipeline.

These are the logs we got:

Job completed successfully on cluster associated with log: /tmp/tmp.qakD2oDHTr/gke-autopilot-upgrade-test-cluster-1-31.log
Reading output from log file: /tmp/tmp.qakD2oDHTr/gke-autopilot-upgrade-test-cluster-1-33.log:
{
  "lastProbeTime": "2025-07-31T00:09:43Z",
  "lastTransitionTime": "2025-07-31T00:09:43Z",
  "message": "Job has reached the specified backoff limit",
  "reason": "BackoffLimitExceeded",
  "status": "True",
  "type": "FailureTarget"
}
Unexpected job status: 'FailureTarget' with message: 'Job has reached the specified backoff limit' in log /tmp/tmp.qakD2oDHTr/gke-autopilot-upgrade-test-cluster-1-33.log
ERROR: Upgrade test failed on cluster: gke-autopilot-upgrade-test-cluster-1-33
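
To make the failing check easier to reason about, here is a minimal sketch of how a Job condition like the one logged above could be classified. It is not the actual test script: the function name `classify_job_condition` and the two sets of condition types are assumptions for illustration. As far as I understand, Kubernetes sets `FailureTarget` once a Job has met its failure criteria and before the final `Failed` condition, so the sketch treats both as failures.

```python
import json

# Job condition types treated as terminal outcomes in this sketch (assumption).
# "FailureTarget" is the type shown in the log above; Kubernetes sets it once a
# Job has met its failure criteria, before the final "Failed" condition.
FAILURE_TYPES = {"Failed", "FailureTarget"}
SUCCESS_TYPES = {"Complete"}


def classify_job_condition(condition: dict) -> str:
    """Return 'success', 'failure', or 'unknown' for a single Job condition."""
    # Only conditions whose status is "True" are currently in effect.
    if condition.get("status") != "True":
        return "unknown"
    condition_type = condition.get("type")
    if condition_type in SUCCESS_TYPES:
        return "success"
    if condition_type in FAILURE_TYPES:
        return "failure"
    return "unknown"


# The condition copied from the log output above.
logged_condition = json.loads("""
{
  "lastProbeTime": "2025-07-31T00:09:43Z",
  "lastTransitionTime": "2025-07-31T00:09:43Z",
  "message": "Job has reached the specified backoff limit",
  "reason": "BackoffLimitExceeded",
  "status": "True",
  "type": "FailureTarget"
}
""")

# prints "failure BackoffLimitExceeded"
print(classify_job_condition(logged_condition), logged_condition["reason"])
```

Either way the Job really did fail here (`BackoffLimitExceeded`), so the flakiness comes from the Job itself rather than from how its status is read.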

It seems that the helm uninstall of Agones is failing because there are still GameServers running.
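
If that hypothesis is right, one possible mitigation is to delete the Agones resources before uninstalling the chart. The sketch below only illustrates that ordering and is not taken from the test harness: the helper names (`build_cleanup_commands`, `run_cleanup`, `shell_runner`), the release name `agones`, the namespace `agones-system`, and the 120s timeout are all assumptions.

```python
import subprocess
from typing import Callable, List

Command = List[str]


def build_cleanup_commands(release: str = "agones",
                           namespace: str = "agones-system",
                           timeout: str = "120s") -> List[Command]:
    """Build the commands that remove Agones resources, then the chart.

    The release name, namespace, and timeout are assumed defaults.
    """
    return [
        # Delete Fleets first so they cannot recreate GameServers.
        ["kubectl", "delete", "fleets", "--all", "--all-namespaces",
         f"--timeout={timeout}"],
        # Then delete any remaining GameServers; kubectl delete waits for
        # the objects to be gone (up to the timeout) before returning.
        ["kubectl", "delete", "gameservers", "--all", "--all-namespaces",
         f"--timeout={timeout}"],
        # Only uninstall the chart once no GameServers are left.
        ["helm", "uninstall", release, "--namespace", namespace],
    ]


def run_cleanup(runner: Callable[[Command], None], **kwargs) -> None:
    """Run each cleanup command in order using the supplied runner."""
    for command in build_cleanup_commands(**kwargs):
        runner(command)


def shell_runner(command: Command) -> None:
    """Execute a command and raise if it exits with a non-zero status."""
    subprocess.run(command, check=True)
```

Calling `run_cleanup(shell_runner)` would execute the three commands against the current kubectl context. Passing a different runner (for example `print`) shows the commands without running them.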

Anything else we need to know?:
