Switch container build to subprocess for Sagemaker by BenWilson2 · Pull Request #19277 · mlflow/mlflow

BenWilson2 · 2025-12-08T18:53:53Z

🛠 DevTools 🛠

Install mlflow from this PR

# mlflow
pip install git+https://github.com/mlflow/mlflow.git@refs/pull/19277/merge
# mlflow-skinny
pip install git+https://github.com/mlflow/mlflow.git@refs/pull/19277/merge#subdirectory=libs/skinny

For Databricks, use the following command:

%sh curl -LsSf https://raw.githubusercontent.com/mlflow/mlflow/HEAD/dev/install-skinny.sh | sh -s pull/19277/merge

Related Issues/PRs

#xxx

What changes are proposed in this pull request?

Change the container build process for EKS to use subprocess execution to eliminate an attack vector. This is how other similar operations are already done in MLflow.

How is this PR tested?

Existing unit/integration tests
New unit/integration tests
Manual tests

Does this PR require documentation update?

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

How should the PR be classified in the release notes? Choose one:

rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

Should this PR be included in the next patch release?

Yes should be selected for bug fixes, documentation updates, and other small changes. No should be selected for new features and larger changes. If you're unsure about the release classification of this PR, leave this unchecked to let the maintainers decide.

What is a minor/patch release?

Minor release: a release that increments the second part of the version number (e.g., 1.2.0 -> 1.3.0).
Bug fixes, doc updates and new features usually go into minor releases.
Patch release: a release that increments the third part of the version number (e.g., 1.2.0 -> 1.2.1).
Bug fixes and doc updates usually go into patch releases.

Yes (this PR will be cherry-picked and included in the next patch release)
No (this PR will be included in the next minor release)

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

Copilot

Pull request overview

This PR enhances security by replacing shell command execution (os.system()) with direct subprocess calls in the SageMaker container build and deployment process. This eliminates potential shell injection attack vectors.

Key Changes:

Replaced shell command concatenation and os.system() with individual subprocess.run() calls in push_image_to_ecr()
Changed from importing Popen directly to using subprocess.Popen for consistency
Removed platform-specific command separator logic (no longer needed with subprocess)

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

mlflow/sagemaker/__init__.py

github-actions · 2025-12-08T19:04:32Z

Documentation preview for d2abc1e is available at:

https://pr-19277--mlflow-docs-preview.netlify.app/docs/latest/

More info

Ignore this comment if this PR does not change the documentation.
The preview is updated when a new commit is pushed to this PR.
This comment was created by this workflow run.
The documentation was built by this workflow run.

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

harupy · 2025-12-09T02:29:20Z

mlflow/sagemaker/__init__.py

+        subprocess.run(
+            ["docker", "login", "--username", "AWS", "--password-stdin", registry],
+            input=aws_result.stdout,
+            capture_output=True,


do we need to capture_output? The original code doesn't.

The only one we need it for is the first one :) The others are not really super critical and it's best that both stdout and stderr just stream naturally. I'll remove those other ones!

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

harupy

LGTM!

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-12T11:30:53Z

mlflow/sagemaker/__init__.py

+            capture_output=True,
+            check=True,
+        )
+        subprocess.run(
+            ["docker", "login", "--username", "AWS", "--password-stdin", registry],
+            input=aws_result.stdout,
+            check=True,


For consistency with other subprocess usage in the codebase, consider adding text=True to subprocess.run calls when dealing with text output. While the current implementation works (passing bytes from aws_result.stdout to docker login stdin), using text=True would make the code more readable and consistent with patterns in mlflow/utils/environment.py and mlflow/utils/env_pack.py.

Additionally, the second subprocess.run call (line 154) should capture output with capture_output=True to enable better error diagnostics if the docker login command fails.

Suggested change

capture_output=True,

check=True,

)

subprocess.run(

["docker", "login", "--username", "AWS", "--password-stdin", registry],

input=aws_result.stdout,

check=True,

capture_output=True,

text=True,

check=True,

)

subprocess.run(

["docker", "login", "--username", "AWS", "--password-stdin", registry],

input=aws_result.stdout,

check=True,

capture_output=True,

text=True,

mlflow/sagemaker/__init__.py

harupy

LGTM！

Switch container build to subprocess

689c72e

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

Copilot AI review requested due to automatic review settings December 8, 2025 18:53

github-actions bot added v3.7.1 area/models MLmodel format, model serialization/deserialization, flavors rn/bug-fix Mention under Bug Fixes in Changelogs. labels Dec 8, 2025

Copilot started reviewing on behalf of BenWilson2 December 8, 2025 18:54 View session

BenWilson2 added the team-review Trigger a team review request label Dec 8, 2025

github-actions bot requested review from B-Step62, TomeHirata, WeichenXu123, daniellok-db, harupy, kevin-lyn, serena-ruan and xq-yin December 8, 2025 18:54

Copilot AI reviewed Dec 8, 2025

View reviewed changes

mlflow/sagemaker/__init__.py Outdated Show resolved Hide resolved

mlflow/sagemaker/__init__.py Outdated Show resolved Hide resolved

add useful failure messages

e6b1d13

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

harupy reviewed Dec 9, 2025

View reviewed changes

simplify

d2abc1e

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

BenWilson2 requested a review from harupy December 11, 2025 16:49

BenWilson2 added v3.8.0 and removed v3.7.1 labels Dec 12, 2025

harupy requested a review from Copilot December 12, 2025 11:23

Copilot started reviewing on behalf of harupy December 12, 2025 11:24 View session

harupy approved these changes Dec 12, 2025

View reviewed changes

Copilot AI reviewed Dec 12, 2025

View reviewed changes

github-actions bot assigned harupy Dec 12, 2025

harupy approved these changes Dec 12, 2025

View reviewed changes

BenWilson2 added this pull request to the merge queue Dec 12, 2025

Merged via the queue into mlflow:master with commit 8b8792a Dec 12, 2025
80 of 84 checks passed

BenWilson2 deleted the sagemaker-hardening branch December 12, 2025 16:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Switch container build to subprocess for Sagemaker#19277

Switch container build to subprocess for Sagemaker#19277
BenWilson2 merged 3 commits intomlflow:masterfrom
BenWilson2:sagemaker-hardening

BenWilson2 commented Dec 8, 2025 •

edited by github-actions bot

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Dec 8, 2025 •

edited

Loading

Uh oh!

harupy Dec 9, 2025

Uh oh!

BenWilson2 Dec 10, 2025

Uh oh!

harupy left a comment

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 12, 2025

Uh oh!

Uh oh!

Uh oh!

harupy left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

BenWilson2 commented Dec 8, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Install mlflow from this PR

Related Issues/PRs

What changes are proposed in this pull request?

How is this PR tested?

Does this PR require documentation update?

Release Notes

Is this a user-facing change?

What component(s), interfaces, languages, and integrations does this PR affect?

How should the PR be classified in the release notes? Choose one:

Should this PR be included in the next patch release?

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

harupy Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

BenWilson2 Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

harupy left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

harupy left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

BenWilson2 commented Dec 8, 2025 •

edited by github-actions bot

Loading

github-actions bot commented Dec 8, 2025 •

edited

Loading