Skip to content

test: Adjust e2e test for Docker 29.2.0#1615

Merged
elezar merged 1 commit intoNVIDIA:mainfrom
elezar:fix-e2e-for-docker-29.2.0
Jan 28, 2026
Merged

test: Adjust e2e test for Docker 29.2.0#1615
elezar merged 1 commit intoNVIDIA:mainfrom
elezar:fix-e2e-for-docker-29.2.0

Conversation

@elezar
Copy link
Member

@elezar elezar commented Jan 27, 2026

As of Docker 29.2.0, the --gpus=all flag is converted to a --device=nvidia.com/gpu=all request. This means that tests that expect DIFFERENT behaviour when using the --gpus=all flag and other flags are not applicable.

This change updates the firmware injection test for the new behaviour.

@coveralls
Copy link

coveralls commented Jan 27, 2026

Pull Request Test Coverage Report for Build 21436883306

Details

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage remained the same at 39.478%

Totals Coverage Status
Change from base Build 21413198353: 0.0%
Covered Lines: 5703
Relevant Lines: 14446

💛 - Coveralls

@elezar
Copy link
Member Author

elezar commented Jan 27, 2026

/cherry-pick release-1.18

Copy link
Collaborator

@jgehrcke jgehrcke left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeeeees 🚀


It("should not fail when using the nvidia-container-runtime", func(ctx context.Context) {
_, _, err := runner.Run("docker run --rm --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=all -e NVIDIA_DRIVER_CAPABILITIES=all firmware-test")
output, _, err := runner.Run("docker run --rm --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=all -e NVIDIA_DRIVER_CAPABILITIES=all firmware-test")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If I read correctly, this might be silently dropping error detail that may matter at some point to someone :-) We drop stderr of the child process here, right?

"Silently dropping or even changing error detail that may matter at some point to someone" is I think chapter one, sin 1 in my imaginary book about error handling :)

But you probably know better here!

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, we do drop stderr here. I understand that we should be more dilegent about checking these, but that's a bigger change than I want o make in this PR. I would like to revisit our testing strategy and organisation for this repo in general in the near future.

(Let me at least update this PR to explicitly check stderr).

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For the time being I updated the tests to consisder the docker version on the host and check the applicable conditions based on that.

@elezar elezar force-pushed the fix-e2e-for-docker-29.2.0 branch 3 times, most recently from 9631b9d to feb19b6 Compare January 28, 2026 10:53
Signed-off-by: Evan Lezar <elezar@nvidia.com>
@elezar elezar force-pushed the fix-e2e-for-docker-29.2.0 branch from feb19b6 to dd10e43 Compare January 28, 2026 11:43
@elezar elezar merged commit a462a19 into NVIDIA:main Jan 28, 2026
16 checks passed
@github-actions
Copy link

🤖 Backport PR created for release-1.18: #1617

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants