Skip to content

Handle missing upgrade-test-runner pod to avoid log collection errors#4224

Merged
igooch merged 12 commits intoagones-dev:mainfrom
0xaravindh:logs-ci
Aug 15, 2025
Merged

Handle missing upgrade-test-runner pod to avoid log collection errors#4224
igooch merged 12 commits intoagones-dev:mainfrom
0xaravindh:logs-ci

Conversation

@0xaravindh
Copy link
Copy Markdown

What type of PR is this?

Uncomment only one /kind <> line, press enter to put that in a new line, and remove leading whitespace from that line:

/kind breaking
/kind bug

/kind cleanup

/kind documentation
/kind feature
/kind hotfix
/kind release

What this PR does / Why we need it:

Which issue(s) this PR fixes:

Closes #

Special notes for your reviewer:

Avoids errors when no pod is found for the job. Adds a check before running 'kubectl logs' to prevent running it with an empty pod name
@github-actions github-actions bot added kind/cleanup Refactoring code, fixing up documentation, etc size/S labels Jul 23, 2025
@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: 207e43f9-5b7d-4a10-a6ea-b1934df227d2

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Succeeded 🥳

Build Id: 758d4b29-b376-4fa1-8472-a9d6794b85ec

The following development artifacts have been built, and will exist for the next 30 days:

A preview of the website (the last 30 builds are retained):

To install this version:

git fetch https://github.com/googleforgames/agones.git pull/4224/head:pr_4224 && git checkout pr_4224
helm install agones ./install/helm/agones --namespace agones-system --set agones.image.registry=us-docker.pkg.dev/agones-images/ci --set agones.image.tag=1.51.0-dev-e771a50

@0xaravindh 0xaravindh requested review from gongmax and igooch July 23, 2025 14:34
@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: 18a243e3-68f8-4586-9820-71c445cf56c1

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@0xaravindh
Copy link
Copy Markdown
Author

/gcbrun

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: d62b41aa-f2ab-4f91-87f0-58fc9b82b931

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@lacroixthomas
Copy link
Copy Markdown
Collaborator

From the error logs of the failed build, I can see that it the logs looks correct: {"lastProbeTime":"2025-07-25T06:23:12Z","lastTransitionTime":"2025-07-25T06:23:12Z","message":"Reached expected number of succeeded pods","reason":"CompletionsReached","status":"True","type":"SuccessCriteriaMet"}{"lastProbeTime":"2025-07-25T06:24:47Z","lastTransitionTime":"2025-07-25T06:24:47Z","message":"Reached expected number of succeeded pods","reason":"CompletionsReached","status":"True","type":"SuccessCriteriaMet"}{"lastProbeTime":"2025-07-25T06:27:03Z","lastTransitionTime":"2025-07-25T06:27:03Z","message":"Reached expected number of succeeded pods","reason":"CompletionsReached","status":"True","type":"SuccessCriteriaMet"}{"lastProbeTime":"2025-07-25T06:28:44Z","lastTransitionTime":"2025-07-25T06:28:44Z","message":"Reached expected number of succeeded pods","reason":"CompletionsReached","status":"True","type":"SuccessCriteriaMet"}

But I'm still pretty sure it's using the wrong kube context. It's the call of the print_failure_logs from this line: https://github.com/googleforgames/agones/blob/main/build/e2e_upgrade_test.sh#L191

        outputLog="${waitPids[$pid]}"
        echo "One of the upgrade tests pid $pid from cluster log $outputLog exited with a non-zero status ${status}."
        print_failure_logs "$(basename "$outputLog" .log)"

The function print_failure_logs do some calls with kubectl, but it doesn't have the right kube context, it only have the last kube context set (which currently seems to have valid logs / no errors). The last kube context set from this loop: https://github.com/googleforgames/agones/blob/main/build/e2e_upgrade_test.sh#L70-L84

It's maybe worth trying to run gcloud container clusters get-credentials "$testCluster" --region="$testClusterLocation" --project="$PROJECT_ID" before calling this print_failure_logs ? Or setting it in the function ?

What do you think about it ?

@0xaravindh
Copy link
Copy Markdown
Author

  • Cloud Build view

@lacroixthomas this issue is not related to the upgrade test. We're encountering problems in the e2e test on gke-autopilot-e2e-test-cluster-1-33 due to node unavailability. . link

@lacroixthomas
Copy link
Copy Markdown
Collaborator

  • Cloud Build view

@lacroixthomas this issue is not related to the upgrade test. We're encountering problems in the e2e test on gke-autopilot-e2e-test-cluster-1-33 due to node unavailability. . link

Oh ok, but I was expecting to see some error logs regarding it though, maybe I was not looking at the right logs, will double check that

I don't have access to this link, seems to be internal to Google, it's asking SS0 from google

@0xaravindh
Copy link
Copy Markdown
Author

/gcbrun

1 similar comment
@0xaravindh
Copy link
Copy Markdown
Author

/gcbrun

@0xaravindh
Copy link
Copy Markdown
Author

/gcbrun

1 similar comment
@0xaravindh
Copy link
Copy Markdown
Author

/gcbrun

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: fb60720a-60a5-46f4-9c82-d4a9f179fe10

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: 453e1389-ddff-4065-b723-e2f9ed2bb51c

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: 4e22ef2b-ec81-4dc1-ad1b-fd64b0740b02

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: 502f4f56-d54c-49fb-9d60-7a7de015598a

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: a055be56-6a12-487f-af4e-1ac1da618fbd

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: 23bb5739-3b26-4ce7-8b98-a99c92102552

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Succeeded 🥳

Build Id: 2f62dda8-8475-4424-8e10-9d63e5d47a32

The following development artifacts have been built, and will exist for the next 30 days:

A preview of the website (the last 30 builds are retained):

To install this version:

git fetch https://github.com/googleforgames/agones.git pull/4224/head:pr_4224 && git checkout pr_4224
helm install agones ./install/helm/agones --namespace agones-system --set agones.image.registry=us-docker.pkg.dev/agones-images/ci --set agones.image.tag=1.52.0-dev-940962d

@gongmax gongmax requested a review from igooch August 6, 2025 16:03
@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: 65666d55-bc2a-4afd-b71e-446d82a1038f

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@0xaravindh
Copy link
Copy Markdown
Author

Looks like the VirtualBox website (https://www.virtualbox.org/) is currently down. I’ll wait for a couple of hours and try again.

Copy link
Copy Markdown
Collaborator

@igooch igooch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There's two more calls to print_failure_logs "$(basename "$outputLog" .log)" that will need the correct cluster region passed through. Or alternatively you could make an associative array that maps cluster name -> cluster location, and just pass in the cluster name to the print_failure_logs() function, and query the associative array for the region.

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Succeeded 🥳

Build Id: 793cba50-b810-41b5-b1fe-c40d863ea711

The following development artifacts have been built, and will exist for the next 30 days:

A preview of the website (the last 30 builds are retained):

To install this version:

git fetch https://github.com/googleforgames/agones.git pull/4224/head:pr_4224 && git checkout pr_4224
helm install agones ./install/helm/agones --namespace agones-system --set agones.image.registry=us-docker.pkg.dev/agones-images/ci --set agones.image.tag=1.52.0-dev-d860f15

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: 65b7f682-adc6-496c-ad16-7e56fb533b73

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@0xaravindh
Copy link
Copy Markdown
Author

/gcbrun

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: cb706c18-f5ec-48fc-b654-44ad4db2c81b

Status: TIMEOUT

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@igooch igooch enabled auto-merge (squash) August 14, 2025 16:21
@igooch
Copy link
Copy Markdown
Collaborator

igooch commented Aug 14, 2025

/gcbrun

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Failed 😭

Build Id: cdda7cc9-bad4-43e9-90f3-8d236f0a3ff5

Status: FAILURE

To get permission to view the Cloud Build view, join the agones-discuss Google Group.

@lacroixthomas
Copy link
Copy Markdown
Collaborator

/gcbrun

@agones-bot
Copy link
Copy Markdown
Collaborator

Build Succeeded 🥳

Build Id: 77e500ac-0cde-4ee5-a857-c20b79afd9bf

The following development artifacts have been built, and will exist for the next 30 days:

A preview of the website (the last 30 builds are retained):

To install this version:

git fetch https://github.com/googleforgames/agones.git pull/4224/head:pr_4224 && git checkout pr_4224
helm install agones ./install/helm/agones --namespace agones-system --set agones.image.registry=us-docker.pkg.dev/agones-images/ci --set agones.image.tag=1.52.0-dev-5634b55

@igooch igooch merged commit f0c08d3 into agones-dev:main Aug 15, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

kind/cleanup Refactoring code, fixing up documentation, etc size/S

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants