Auto-MNNVL: Add autoMNNVL e2e tests and cluster setup scripts #421

Merged: shmuel-runai merged 9 commits into ai-dynamo:main from shmuel-runai:RUN-36134/mnnvl-e2e-2 on Feb 24, 2026

@shmuel-runai shmuel-runai commented Feb 10, 2026

Contributor

What type of PR is this?

/kind feature

What this PR does / why we need it:

Auto-MNNVL: Add autoMNNVL e2e tests and cluster setup scripts

Introduce end-to-end tests for the autoMNNVL feature, covering all 4
configurations:
- supported + enabled
- supported + disabled
- unsupported + enabled
- unsupported + disabled.
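
The four configurations above are the Cartesian product of two independent switches. A minimal sketch of that matrix follows; the names `MNNVLConfig` and `all_configs` are hypothetical and do not come from this PR's scripts, and the field meanings are assumptions based only on the wording of the list.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class MNNVLConfig:
    """One cell of the test matrix (hypothetical type, for illustration only)."""

    supported: bool  # assumption: the cluster is prepared as MNNVL-capable
    enabled: bool    # assumption: autoMNNVL is switched on in the operator

    @property
    def name(self) -> str:
        left = "supported" if self.supported else "unsupported"
        right = "enabled" if self.enabled else "disabled"
        return f"{left}-{right}"


def all_configs() -> list[MNNVLConfig]:
    # product() yields (True, True), (True, False), (False, True), (False, False),
    # which matches the order of the bullet list above.
    return [MNNVLConfig(s, e) for s, e in product((True, False), repeat=2)]
```

Iterating over `all_configs()` and reading `cfg.name` gives one label per configuration, which could serve as a cluster or test-suite suffix.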

Each test suite validates operator behavior (annotation mutation, ComputeDomain lifecycle, resourceClaim injection,
annotation immutability) under its specific cluster configuration.

Add Python and shell scripts under hack/e2e-autoMNNVL/ to automate
k3d cluster creation, fake GPU operator installation, Grove operator
deployment, and test execution across all configurations.
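
The four stages named above run in a fixed order for each configuration. The sketch below shows one way to sequence them and stop at the first failure. It is not the PR's implementation: `STAGES`, `run_config`, and the `run_stage` callback are hypothetical, and the real commands behind each stage are deliberately left out.

```python
from typing import Callable, List

# Stage labels mirror the description above; they are plain labels here,
# not real commands from the PR's scripts.
STAGES = (
    "create k3d cluster",
    "install fake GPU operator",
    "deploy Grove operator",
    "run e2e tests",
)


def run_config(config_name: str, run_stage: Callable[[str, str], bool]) -> List[str]:
    """Run every stage in order for one configuration.

    run_stage(config_name, stage) is a caller-supplied hook (hypothetical)
    that returns True on success. Execution stops at the first failing stage.
    Returns the stages that completed successfully.
    """
    completed: List[str] = []
    for stage in STAGES:
        if not run_stage(config_name, stage):
            break
        completed.append(stage)
    return completed
```

Passing the stage runner as a callback keeps the ordering logic testable without a cluster: a dry run can supply a function that only logs and returns `True`.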

Auto-MNNVL: Add CI workflow and Makefile targets for autoMNNVL e2e tests

Add a dedicated e2e-mnnvl job to the GitHub Actions workflow that runs
all 4 autoMNNVL configurations (supported/unsupported x enabled/disabled)
in CI, in parallel with the existing e2e matrix jobs. The job uses the same
trigger conditions and self-hosted runner as the other e2e tests.

Add Makefile targets (run-e2e-mnnvl-full, e2e-mnnvl-cluster-down) to
invoke the autoMNNVL test orchestration scripts from hack/e2e-autoMNNVL/.
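
As an illustration of how the two targets might be paired from a wrapper script, the sketch below runs the full suite and then always tears the cluster down. Only the target names come from this PR. The helper names, the `operator` working directory (taken from the `operator/Makefile` path in the review threads), and the assumption that teardown should follow every run are illustrative and not confirmed by the description.

```python
import subprocess
from typing import Callable


def run_make(target: str, cwd: str = "operator") -> None:
    # Assumes `make` is on PATH and the Makefile lives in ./operator.
    # check=True raises CalledProcessError on a non-zero exit code.
    subprocess.run(["make", target], cwd=cwd, check=True)


def run_full_then_teardown(make: Callable[[str], None] = run_make) -> None:
    """Run the full autoMNNVL e2e suite, then tear the cluster down.

    The finally block runs the teardown target even when the test target
    fails; the original failure still propagates to the caller.
    """
    try:
        make("run-e2e-mnnvl-full")
    finally:
        make("e2e-mnnvl-cluster-down")
```

The `make` parameter exists so the sequencing can be exercised with a fake runner instead of a real cluster.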

Which issue(s) this PR fixes:

refs #270

Special notes for your reviewer:

Does this PR introduce an API change?

NONE


copy-pr-bot Bot commented Feb 10, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@shmuel-runai shmuel-runai self-assigned this Feb 10, 2026
Comment thread operator/hack/e2e-autoMNNVL/README.md Outdated
Comment thread operator/hack/e2e-autoMNNVL/run_autoMNNVL_e2e.py Outdated
Comment thread operator/hack/e2e-autoMNNVL/setup_autoMNNVL_cluster.py Outdated
Comment thread operator/hack/e2e-autoMNNVL/setup_autoMNNVL_cluster.py Outdated
Comment thread operator/e2e/tests/auto-mnnvl/shared_cases.go Outdated
@shmuel-runai force-pushed the RUN-36134/mnnvl-e2e-2 branch 3 times, most recently from 30b9e3b to f7bf382, on February 15, 2026 10:54
Comment thread operator/e2e/tests/auto-mnnvl/testutils.go
Comment thread operator/e2e/tests/auto-mnnvl/shared_cases.go
Comment thread operator/hack/e2e-cluster/config-cluster.py
shayasoolin previously approved these changes Feb 19, 2026

@gflarity gflarity left a comment

Contributor

Overall looks good. A few comments, ping me for a quick re-review.

Comment thread operator/hack/e2e-cluster/config-cluster.py
Comment thread operator/e2e/tests/auto-mnnvl/shared_cases.go Outdated
Comment thread operator/Makefile
Comment thread .github/actions/e2e-setup/action.yaml
Comment thread operator/e2e/tests/auto-mnnvl/supported_and_enabled_test.go
Comment thread operator/e2e/tests/auto-mnnvl/testutils.go Outdated
Comment thread operator/e2e/tests/auto-mnnvl/testutils.go Outdated
Comment thread operator/e2e/tests/auto-mnnvl/supported_and_enabled_test.go Outdated
Comment thread operator/e2e/utils/grove_resources.go
Comment thread operator/hack/e2e-autoMNNVL/README.md
Comment thread operator/hack/e2e-cluster/config-cluster.py Outdated
@shmuel-runai shmuel-runai merged commit 5c076fc into ai-dynamo:main Feb 24, 2026
20 of 21 checks passed
4 participants