
Split e2e cluster creation#371

Merged
danbar2 merged 59 commits into
ai-dynamo:mainfrom
danbar2:split-e2e-cluster-creation
Feb 9, 2026

@danbar2 danbar2 commented Jan 27, 2026

Contributor

What type of PR is this?

What this PR does / why we need it:

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

Does this PR introduce an API change?


Additional documentation e.g., enhancement proposals, usage docs, etc.:


@danbar2 danbar2 force-pushed the split-e2e-cluster-creation branch 6 times, most recently from a64386a to 966c360 Compare February 2, 2026 12:14
@danbar2 danbar2 marked this pull request as ready for review February 2, 2026 13:32

@shayasoolin shayasoolin left a comment

Contributor

Looks great

Comment thread operator/hack/create-e2e-cluster.sh Outdated
Comment thread operator/hack/create-e2e-cluster.sh Outdated
Comment thread .github/workflows/build-check-test.yaml Outdated

copy-pr-bot Bot commented Feb 3, 2026


This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@danbar2 danbar2 force-pushed the split-e2e-cluster-creation branch from a6f37e3 to 21e2734 Compare February 3, 2026 06:48
shayasoolin previously approved these changes Feb 3, 2026
Comment thread operator/hack/create-e2e-cluster.sh Outdated
@danbar2 danbar2 force-pushed the split-e2e-cluster-creation branch from 603a8bf to ac0110f Compare February 4, 2026 07:52

@gflarity gflarity left a comment

Contributor

Review of the Python cluster creation script. Nice work overall - a few suggestions below!

@gflarity gflarity left a comment

Contributor

Thanks for switching to python. I found a couple issues but otherwise it's looking good.

Comment thread operator/hack/create-e2e-cluster.py Outdated
Comment thread operator/hack/create-e2e-cluster.py Outdated
Comment thread operator/hack/create-e2e-cluster.py Outdated
Comment thread operator/hack/create-e2e-cluster.py Outdated
Comment thread operator/hack/create-e2e-cluster.py Outdated
Comment thread operator/hack/create-e2e-cluster.py Outdated
Comment thread operator/hack/create-e2e-cluster.py Outdated
Comment thread operator/e2e/tests/cert_management_test.go
Comment thread operator/hack/create-e2e-cluster.py Outdated
@danbar2 danbar2 marked this pull request as draft February 5, 2026 05:53
@danbar2 danbar2 marked this pull request as ready for review February 5, 2026 13:55
shayasoolin previously approved these changes Feb 5, 2026
Comment thread operator/hack/create-e2e-cluster.py Outdated
gflarity previously approved these changes Feb 6, 2026

@gflarity gflarity left a comment

Contributor

Left one nit. Looks like there's a conflicted rebase required, so I'd suggest fixing the nit as I believe a re-approval will be required anyways. But if you can merge it go for it, it's not a big deal.

@danbar2 danbar2 dismissed stale reviews from gflarity and shayasoolin via 1ae50a1 February 8, 2026 06:48
…actual booleans

Typer passes environment variables as strings, so 'False' was being
treated as truthy. Now properly convert string booleans to bool type.
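
The pitfall described in this commit message can be reproduced in plain Python. The sketch below is not the code from this PR: `parse_bool` is a hypothetical helper name, and the set of accepted spellings is an assumption chosen for illustration.

```python
def parse_bool(value):
    """Convert a bool or a boolean-like string to a real bool.

    Hypothetical helper; the accepted spellings are an assumption.
    """
    if isinstance(value, bool):
        return value
    text = str(value).strip().lower()
    if text in {"1", "true", "yes", "on"}:
        return True
    if text in {"0", "false", "no", "off", ""}:
        return False
    raise ValueError(f"not a boolean value: {value!r}")


# The pitfall: every non-empty string is truthy, including "False".
assert bool("False") is True

# The explicit conversion gives the intended result.
assert parse_bool("False") is False
```

Raising on unrecognized input, rather than defaulting to `False`, is a design choice here so that a misspelled value fails loudly instead of silently changing behavior.
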
Commit 34d129e added topology tests that use busybox images, but
commit 122d0bc deleted operator/e2e/dependencies.yaml which contained
the busybox reference. This caused topology tests to fail because
busybox wasn't being pre-pulled into the local registry.

The topology test YAMLs reference registry:5001/busybox:latest, which
requires busybox to be in the local k3d registry. Pre-pulling is not
optional - without it, test pods fail with ImagePullBackOff.

Added busybox back to operator/hack/dependencies.yaml and updated
create-e2e-cluster.py to pre-pull it into the local registry.
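
A pre-pull of this kind usually amounts to a pull, a retag, and a push. The following is a minimal sketch, not the logic in `create-e2e-cluster.py`: `mirror_commands` is a hypothetical helper, and it assumes the k3d registry is reachable from the host as `localhost:5001` (the commit message only states that pods address it as `registry:5001`). It builds the `docker` command lines without running them.

```python
def mirror_commands(image, push_registry="localhost:5001"):
    """Return the docker commands that copy `image` into a local registry.

    Hypothetical helper; `push_registry` is an assumed host-side address.
    """
    # Keep only the final path component with its tag,
    # e.g. "docker.io/library/busybox:latest" -> "busybox:latest".
    name = image.rsplit("/", 1)[-1]
    target = f"{push_registry}/{name}"
    return [
        ["docker", "pull", image],
        ["docker", "tag", image, target],
        ["docker", "push", target],
    ]


# Print the commands for inspection; nothing is executed here.
for cmd in mirror_commands("busybox:latest"):
    print(" ".join(cmd))
```

To actually run the commands, each list could be passed to `subprocess.run(cmd, check=True)`, which stops at the first failing step.
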
@danbar2 danbar2 force-pushed the split-e2e-cluster-creation branch from c8d07af to 330573e Compare February 9, 2026 06:34
shayasoolin previously approved these changes Feb 9, 2026
Comment thread operator/hack/create-e2e-cluster.py Outdated
Comment thread operator/hack/e2e-cluster/create-e2e-cluster.py
Comment thread operator/e2e/setup/k8s_clusters.go Outdated
Comment thread operator/e2e/setup/k8s_clusters.go Outdated
Comment thread operator/e2e/setup/shared_cluster.go Outdated
Comment thread operator/Makefile Outdated

@shmuel-runai shmuel-runai left a comment

Contributor

Great update, very needed
LGTM

@danbar2 danbar2 merged commit 91283f2 into ai-dynamo:main Feb 9, 2026
17 of 18 checks passed
4 participants