Fix shell injection via `shell=True` in subprocess calls by vladapostol1 · Pull Request #23894 · astral-sh/ruff

vladapostol1 · 2026-03-11T15:32:46Z

Summary

Enable Ruff's own S602 rule (subprocess-popen-with-shell-equals-true from flake8-bandit) in pyproject.toml to enforce no-shell subprocess calls going forward, and fix the two existing violations it catches.

pyproject.toml - adds S602 to the lint select list so any future shell=True usage in the scripts is caught automatically by the linter.

scripts/setup_primer_project.py - project.install_cmd and project.deps come from the mypy-primer project registry (an externally-fetched third-party config). Both were joined into shell strings and executed with shell=True, making them a supply-chain injection vector. Fixed by tokenising with shlex.split() and dropping shell=True.

scripts/ty_benchmark/src/benchmark/snapshot.py - command.prepare was passed to subprocess.run(..., shell=True). While no current caller sets this field, it is a latent injection point. Fixed by tokenising with shlex.split() and dropping shell=True; adds the missing import shlex.

Test Plan

Both scripts are developer-only utilities. The changes are semantically equivalent for well-formed inputs - shlex.split() produces the same argument list the shell would have constructed, while refusing to pass metacharacters through to a shell process.

Verified no other shell=True uses remain in scripts/ or python/ that would be newly flagged by S602.

ntBre · 2026-03-11T15:45:36Z

I think if we're going to do this, we should enable the corresponding Ruff rule: subprocess-popen-with-shell-equals-true (S602), but these don't strike me as very problematic uses.

Enable Ruff's S602 rule (`subprocess-popen-with-shell-equals-true` from flake8-bandit) in pyproject.toml to enforce no-shell subprocess calls going forward, and fix the existing violations it would catch. pyproject.toml: - Add `S602` to the lint `select` list so any future `shell=True` usage in the scripts is caught automatically by the linter. scripts/setup_primer_project.py: - `project.install_cmd` (a freeform shell string from the mypy-primer project registry) was interpolated into a shell command and executed with `shell=True`. A compromised or malicious entry in the upstream registry could inject arbitrary shell commands. - `project.deps` were joined into a single shell string and also run with `shell=True`. - Fix: use `shlex.split()` to tokenise both strings into argument lists and pass them directly to subprocess without invoking a shell. scripts/ty_benchmark/src/benchmark/snapshot.py: - `command.prepare` was executed with `shell=True`, making it a latent injection point should the field ever be populated from external input. - Fix: tokenise with `shlex.split()` and drop `shell=True`; import `shlex` which was not previously imported in this module.

AlexWaygood · 2026-03-11T16:01:55Z

Mypy_primer is a highly trusted project that has very few maintainers -- and the maintainer team it does have includes several maintainers of ty. It feels pretty unlikely that an attacker would attempt to attack Ruff via mypy_primer.

I do see the theoretical security risk here, but we already effectively audit each new commit to mypy_primer whenever we manually bump the pinned SHA here that's used from our GitHub workflows:

ruff/scripts/mypy_primer.sh

Line 25 in 3d7eabe

    
           --from="git+https://github.com/hauntsaninja/mypy_primer@850d65d9da947ef9e02498b6f07598e7c8401641" \

. I feel like I'd prefer to plug the security hole here, such as it is, by just ensuring that these scripts use the same pinned SHA of the third-party mypy_primer project as we use from scripts/mypy_primer.sh.

Not sure if there's a good way to pin the mypy_primer dependencies of these scripts and ensure that that pin is kept in sync with mypy_primer.sh, though. Maybe this PR is fine.

vladapostol1 · 2026-03-11T16:10:05Z

Mypy_primer is a highly trusted project that has very few maintainers -- and the maintainer team it does have includes several maintainers of ty. It feels pretty unlikely that an attacker would attempt to attack Ruff via mypy_primer.

I do see the theoretical security risk here, but we already effectively audit each new commit to mypy_primer whenever we manually bump the pinned SHA here that's used from our GitHub workflows:

ruff/scripts/mypy_primer.sh

Line 25 in 3d7eabe

--from="git+https://github.com/hauntsaninja/mypy_primer@850d65d9da947ef9e02498b6f07598e7c8401641" \

. I feel like I'd prefer to plug the security hole here, such as it is, by just ensuring that these scripts use the same pinned SHA of the third-party mypy_primer project as we use from scripts/mypy_primer.sh.

Not sure if there's a good way to pin the mypy_primer dependencies of these scripts and ensure that that pin is kept in sync with mypy_primer.sh, though. Maybe this PR is fine.

Yeah that's true, the pinned SHA could be a stronger defense than just sanitizing subprocess arguments.
We could pin the SHA directly in setup_primer_project.py using the script metadata
To prevent them from drifting, I can add a quick CI check (or a pre-commit hook) to grep the SHA from mypy_primer.sh and compare it against this file. If they don't match, the build fails.

I’m happy to pivot the PR to this approach since it kills the supply-chain risk at the source. I'd personally keep the shell=False changes as cheap defense-in-depth tho.

AlexWaygood · 2026-03-11T16:17:18Z

Yeah that's true, the pinned SHA could be a stronger defense than just sanitizing subprocess arguments.
We could pin the SHA directly in setup_primer_project.py using the script metadata
To prevent them from drifting, I can add a quick CI check (or a pre-commit hook) to grep the SHA from mypy_primer.sh and compare it against this file. If they don't match, the build fails.

That could work. A simple pre-commit hook would be the option I'd prefer there (as long as it's fast), as we'd quickly spot the problem locally when running checks, as well as in CI.

I’m happy to pivot the PR to this approach since it kills the supply-chain risk at the source. I'd personally keep the shell=False changes as cheap defense-in-depth tho.

Fair enough.

Does shlex.split() always have good behaviour if this script is run on Windows? I see the shlex.split() docs don't have the big red warning box about platform portability that the shelx.quote() docs have, but IIRC the whole shlex module is mostly designed only for UNIX systems.

As suggested in review, pin the mypy-primer dependency in setup_primer_project.py to the same SHA already used in mypy_primer.sh, and add a pre-commit hook to keep them in sync. scripts/setup_primer_project.py: - Add `rev = "850d65d9..."` to the uv.sources block so the script uses the same audited mypy-primer commit as the CI workflow, closing the supply-chain vector at the source rather than just hardening the subprocess invocation. scripts/check_mypy_primer_sha.py: - New helper script that greps the SHA from both mypy_primer.sh and setup_primer_project.py and fails if they differ. Runs in ~55ms (pure Python, no imports beyond stdlib). .pre-commit-config.yaml: - Add `check-mypy-primer-sha` local hook that runs the above script only when either of the two relevant files is changed, so it is near-zero cost in the common case.

vladapostol1 · 2026-03-11T17:44:26Z

Yeah that's true, the pinned SHA could be a stronger defense than just sanitizing subprocess arguments.
We could pin the SHA directly in setup_primer_project.py using the script metadata
To prevent them from drifting, I can add a quick CI check (or a pre-commit hook) to grep the SHA from mypy_primer.sh and compare it against this file. If they don't match, the build fails.

That could work. A simple pre-commit hook would be the option I'd prefer there (as long as it's fast), as we'd quickly spot the problem locally when running checks, as well as in CI.

I’m happy to pivot the PR to this approach since it kills the supply-chain risk at the source. I'd personally keep the shell=False changes as cheap defense-in-depth tho.

Fair enough.

Does shlex.split() always have good behaviour if this script is run on Windows? I see the shlex.split() docs don't have the big red warning box about platform portability that the shelx.quote() docs have, but IIRC the whole shlex module is mostly designed only for UNIX systems.

I've updated the PR to use the SHA pin approach. setup_primer_project.py now matches the rev = "850d65d9..." in the uv.sources block from mypy_primer.sh. I also added a small scripts/check_mypy_primer_sha.py script as a local pre-commit hook—it only fires if one of those two files is touched.
Benchmarked locally at ~55ms(since it’s just two regex greps using the stdlib)

One quick sanity check on the shlex bit: Since shlex.split() defaults to POSIX rules, it'll probably trip over Windows-style quoting.

I noticed ty_benchmark already handles this with an mslex fallback so maybe I should add the same thing in snapshot.py

astral-sh-bot · 2026-03-11T17:54:03Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

Move the None-check into extract() so its return type is str rather than str | None, allowing ty to verify the final print without a not-subscriptable diagnostic.

Mirror the existing benchmark shlex/mslex import pattern in snapshot.py so command.prepare is tokenized with Windows-correct rules on win32, and remove the redundant unconditional shlex import from benchmark/__init__.py.

vladapostol1 · 2026-03-13T11:42:37Z

Does shlex.split() always have good behaviour if this script is run on Windows? I see the shlex.split() docs don't have the big red warning box about platform portability that the shelx.quote() docs have, but IIRC the whole shlex module is mostly designed only for UNIX systems.

I followed up on the Windows question, now for snapshot.py on windows we are using mslex as shlex, just to keep shell false while preserving Windows-correct tokenization.

* main: (94 commits) Fix shell injection via `shell=True` in subprocess calls (#23894) [ty] Refactor `relation.rs` to store state on a struct rather than passing around 7 arguments every time we recurse (#23837) Don't return code actions for non-Python documents (#23905) [ty] Make the default database truly statically infallible (#23929) [ty] Add `Download` button to ty playground which creates a zip export (#23478) [ty] Respect `kw_only` overwrites in dataclasses (#23930) [ty] Clarify in diagnostics that `from __future__ import annotations` only stringifies type annotations (#23928) [ty] Add a `Copy Markdown` button to playground (#23002) [ty] Fix folding range classification of lines starting with `#` (#23831) [ty] Fix folding ranges for notebooks (#23830) [ty] fix too-many-cycle panics when inferring literal type loop variables (#23875) Add `RegularCallableTypeOf` and `into_regular_callable` in `ty_extensions` (#23909) [ty] treat properties as full structural types (#23925) [ty] Avoid duplicated work during multi-inference (#23923) [ty]: make `possibly-missing-attribute` ignored by default [ty]: split out `possibly-missing-submodule` from `possibly-missing-attribute` Update astral-sh/setup-uv action to v7.5.0 (#23922) [ty] Show truthiness in ConstraintSet display and simplify falsy error message (#23913) Bump 0.15.6 (#23919) [ty] Narrow type context during collection literal inference (#23844) ...

vladapostol1 requested review from AlexWaygood, MichaReiser, carljm, dcreager, ibraheemdev and sharkdp as code owners March 11, 2026 15:32

vladapostol1 force-pushed the fix/subprocess-shell-injection branch from aac1288 to 06d2297 Compare March 11, 2026 15:53

AlexWaygood added the internal An internal refactor or improvement label Mar 11, 2026

vladapostol1 added 2 commits March 11, 2026 20:19

Fix ty type error in check_mypy_primer_sha.py

1b891b0

Move the None-check into extract() so its return type is str rather than str | None, allowing ty to verify the final print without a not-subscriptable diagnostic.

Use Windows-safe tokenization in benchmark snapshot

012f8cb

Mirror the existing benchmark shlex/mslex import pattern in snapshot.py so command.prepare is tokenized with Windows-correct rules on win32, and remove the redundant unconditional shlex import from benchmark/__init__.py.

carljm approved these changes Mar 13, 2026

View reviewed changes

carljm merged commit f854f4f into astral-sh:main Mar 13, 2026
61 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix shell injection via `shell=True` in subprocess calls#23894

Fix shell injection via `shell=True` in subprocess calls#23894
carljm merged 4 commits intoastral-sh:mainfrom
vladapostol1:fix/subprocess-shell-injection

vladapostol1 commented Mar 11, 2026 •

edited

Loading

Uh oh!

ntBre commented Mar 11, 2026 •

edited

Loading

Uh oh!

AlexWaygood commented Mar 11, 2026

Uh oh!

vladapostol1 commented Mar 11, 2026

Uh oh!

AlexWaygood commented Mar 11, 2026

Uh oh!

vladapostol1 commented Mar 11, 2026

Uh oh!

astral-sh-bot bot commented Mar 11, 2026 •

edited

Loading

Uh oh!

vladapostol1 commented Mar 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

vladapostol1 commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test Plan

Uh oh!

ntBre commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AlexWaygood commented Mar 11, 2026

Uh oh!

vladapostol1 commented Mar 11, 2026

Uh oh!

AlexWaygood commented Mar 11, 2026

Uh oh!

vladapostol1 commented Mar 11, 2026

Uh oh!

astral-sh-bot bot commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

ruff-ecosystem results

Linter (stable)

Linter (preview)

Formatter (stable)

Formatter (preview)

Uh oh!

vladapostol1 commented Mar 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vladapostol1 commented Mar 11, 2026 •

edited

Loading

ntBre commented Mar 11, 2026 •

edited

Loading

astral-sh-bot bot commented Mar 11, 2026 •

edited

Loading

`ruff-ecosystem` results