Skip to content

Auto-MNNVL: add MNNVL configuration and startup validation#346

Merged
shmuel-runai merged 4 commits into
ai-dynamo:mainfrom
shmuel-runai:RUN-35353/main-mnnvl-opcfg
Jan 22, 2026
Merged

Auto-MNNVL: add MNNVL configuration and startup validation#346
shmuel-runai merged 4 commits into
ai-dynamo:mainfrom
shmuel-runai:RUN-35353/main-mnnvl-opcfg

Conversation

@shmuel-runai

Copy link
Copy Markdown
Contributor

/kind feature

Ref: GREP-270

Add MNNVL configuration support to the Grove operator. This includes configuration, startup validation, and Helm chart integration.

Changes:

  • Add MNNVLConfiguration struct with Enabled field (default: false) and to the OperatorConfiguration
  • Add validateMNNVLPrerequisites() to check for ComputeDomain CRD at startup
  • Update Helm chart values.yaml and _helpers.tpl with mnnvl.enabled
  • Add unit tests for config parsing and CRD detection logic

When MNNVL is enabled, but the ComputeDomain CRD is not installed, the operator exits with a non-zero exit code.

Tested

  • UT

Ref: GREP-270

Add MNNVL configuration support to the Grove operator.
This includes configuration, startup validation, and Helm chart integration.

Changes:
- Add MNNVLConfiguration struct with Enabled field (default: false) and to the OperatorConfiguration
- Add validateMNNVLPrerequisites() to check for ComputeDomain CRD at startup
- Update Helm chart values.yaml and _helpers.tpl with mnnvl.enabled
- Add unit tests for config parsing and CRD detection logic

When MNNVL is enabled but the ComputeDomain CRD is not installed,
the operator exits with a non-zero exit code.

Tested
- UT
Comment thread operator/cmd/main.go Outdated
Comment thread operator/api/common/constants/constants.go Outdated
shayasoolin
shayasoolin previously approved these changes Jan 19, 2026

@shayasoolin shayasoolin left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, I'd consider just one thing:
Instead of "MNNVL MNNVLConfiguration" residing right inside OperatorConfiguration, maybe we should have a sub section called "NetworkAcceleration" with MNNVL as one of its field. In the end MNNVL is NVIDIA's accelerator and we better leave room for other technologies without having to break the API or having too many fields in the config struct's root level.

Comment thread operator/api/config/v1alpha1/types.go Outdated
Comment thread operator/cmd/cli/cli_test.go
gflarity
gflarity previously approved these changes Jan 19, 2026

@gflarity gflarity left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Left a couple nits, but no reason to block. 👍

@sanjaychatterjee

Copy link
Copy Markdown
Collaborator

@unmarshall can you please review this as well.

@nvrohanv

Copy link
Copy Markdown
Contributor

LGTM, I'd consider just one thing: Instead of "MNNVL MNNVLConfiguration" residing right inside OperatorConfiguration, maybe we should have a sub section called "NetworkAcceleration" with MNNVL as one of its field. In the end MNNVL is NVIDIA's accelerator and we better leave room for other technologies without having to break the API or having too many fields in the config struct's root level.

I think @shayasoolin makes a great point, I would say we should shift to this kind of approach before we merge.

@shmuel-runai shmuel-runai force-pushed the RUN-35353/main-mnnvl-opcfg branch from 1d022e1 to 449c4fb Compare January 21, 2026 13:10

@gflarity gflarity left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PTAL at the mismatch, otherwise LGTM.

Comment thread operator/charts/templates/_helpers.tpl Outdated
Comment thread operator/cmd/cli/testdata/valid-config.yaml Outdated
danbar2
danbar2 previously approved these changes Jan 22, 2026

@danbar2 danbar2 left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, please fix the missmatches @gflarity pointed out

Comment thread operator/internal/mnnvl/preflight.go
Comment thread operator/internal/mnnvl/preflight.go
Comment thread operator/internal/mnnvl/preflight.go
Ronkahn21
Ronkahn21 previously approved these changes Jan 22, 2026
Comment thread operator/api/config/v1alpha1/types.go Outdated
@shmuel-runai shmuel-runai merged commit 0411a1e into ai-dynamo:main Jan 22, 2026
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants