fix(sandbox): elevate Agent-mode shell sandbox to allow network access by Hmbown · Pull Request #274 · Hmbown/CodeWhale

Hmbown · 2026-05-02T00:39:31Z

DRAFT — security-relevant default change. Maintainer review requested.

Closes #272 (independent confirmation from external user @francofang).

Summary

In Agent mode (the default) every `exec_shell` command was wrapped in a seatbelt policy with `(deny default)` and no `(allow network-outbound)`, so DNS and outbound TCP/UDP failed at the kernel level. Symptom on macOS: "DNS resolution failed" for any URL the model tried to reach via `curl`, `yt-dlp`, package managers, …

External user repro (issue #272 from @francofang):

Start DeepSeek TUI in Agent mode (default). Ask the agent to fetch external content (e.g. YouTube). DNS fails.
Switching to YOLO mode resolves it. `config.toml` overrides have no effect.

Root cause

`engine.rs::build_tool_context` only set `elevated_sandbox_policy` for Yolo. Agent fell through to `ShellManager::sandbox_policy` = `SandboxPolicy::default()` = `WorkspaceWrite { network_access: false }`. `seatbelt::generate_policy` only emits the network-allow block when `policy.has_network_access()` is true — confirmed by the existing test `test_generate_policy_default` ("Default policy has no network").

Fix

Three-line shape change: `if mode == Yolo` → `match mode`, with Agent and Yolo both elevating to `WorkspaceWrite { network_access: true }`. Plan mode unchanged (read-only investigation never registers the shell tool).

Adds `agent_and_yolo_modes_elevate_shell_sandbox_to_allow_network` regression test so a future revert trips CI.

Trade-off the maintainer should weigh

This removes the OS-level network block from sandboxed shell commands. The remaining boundary is the application-level `NetworkPolicy` (`crates/tui/src/network_policy.rs`) plus the per-shell approval flow. Today `NetworkPolicy` only intercepts `fetch_url`, not shell commands — so a model using `curl` to reach a denied host could route around the policy. The seatbelt block was kind-of closing that hole on macOS only; it's reopened by this PR.

Options if that residual hole matters:

Land this as-is for v0.8.4 — restores parity with Claude Code/Codex; macOS Seatbelt sandbox blocks outbound DNS resolution in Agent mode #272 symptom goes away. File a follow-up to teach `NetworkPolicy` about shell-level egress.
Land this with a narrower policy (e.g. allow DNS + 80/443 only) — partial mitigation, more code.
Reject — keep the current behavior and document "use Yolo for network workflows".

Drafted (1) since it's the smallest fix that gets the user unstuck and matches what users expect from competitors.

Test plan

`cargo test -p deepseek-tui --bin deepseek-tui --locked sandbox` (27/27)
`cargo test -p deepseek-tui --bin deepseek-tui --locked core::engine` (62/62, includes new regression)
`cargo fmt --all -- --check`
`cargo clippy -p deepseek-tui --all-targets --locked -- -D warnings`

🤖 Generated with Claude Code

gemini-code-assist

Code Review

This pull request enables network access for shell commands in Agent mode by elevating the sandbox policy for both Agent and Yolo modes, while keeping Plan mode restricted. A regression test was added to verify this behavior. Feedback was provided to address inaccuracies in the code comments and changelog regarding the NetworkPolicy; specifically, it was noted that the policy currently only intercepts fetch_url and does not yet provide a complete outbound boundary for shell-level traffic.

gemini-code-assist · 2026-05-02T00:40:54Z

+            // curl, yt-dlp, …) without buying the user any safety the
+            // application-level NetworkPolicy / approval flow doesn't already
+            // provide. Elevate to workspace-write + network. (#273)


The comment suggests that the application-level NetworkPolicy provides equivalent safety to the sandbox block. However, as noted in the PR description, NetworkPolicy currently only intercepts fetch_url and does not apply to shell commands. This creates a security gap where shell tools (like curl) can bypass configured network restrictions. The comment should be updated to reflect this limitation and ideally include a TODO for implementing shell-level egress filtering.

// curl, yt-dlp, …) without buying the user any safety the // approval flow doesn't already provide. Note: NetworkPolicy // currently only covers fetch_url; shell-level egress filtering // is a planned follow-up. Elevate to workspace-write + network. (#273)

gemini-code-assist · 2026-05-02T00:40:55Z

+  investigation never registers the shell tool). The application-level
+  `NetworkPolicy` (`crates/tui/src/network_policy.rs`) remains the only
+  outbound-traffic boundary.


This statement is technically inaccurate and may give users a false sense of security. As noted in the PR description, the NetworkPolicy currently only intercepts fetch_url and does not apply to shell commands. Therefore, it is not yet a complete "outbound-traffic boundary" for the shell tool. It would be better to clarify that shell-level enforcement is a planned follow-up.

Hmbown · 2026-05-02T01:11:39Z

Maintainer signed off on the trade-off (Twitter, May 1) — "going with allowing web access on agent mode is the smart move." Marked ready for review.

Copilot

Pull request overview

This PR fixes macOS Agent-mode exec_shell network failures by elevating the shell sandbox policy in Agent mode (previously only done in YOLO), and adds a regression test + changelog entry to prevent regressions.

Changes:

Update Engine::build_tool_context to elevate the shell sandbox policy for both Agent and YOLO modes (enabling outbound network).
Add a regression unit test asserting Agent/YOLO get an elevated sandbox policy with network access.
Document the fix in CHANGELOG.md.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.

File	Description
`crates/tui/src/core/engine.rs`	Elevates sandbox policy for Agent + YOLO via a `match` on `AppMode`.
`crates/tui/src/core/engine/tests.rs`	Adds regression test covering sandbox elevation for Agent/YOLO (and assumptions about Plan).
`CHANGELOG.md`	Adds an Unreleased “Fixed” entry describing the sandbox/network behavior change.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+
+    // Plan mode is read-only investigation and does not register the shell
+    // tool, so it intentionally leaves the policy at the strict default.
+    assert!(
+        engine
+            .build_tool_context(AppMode::Plan, false)
+            .elevated_sandbox_policy
+            .is_none(),
+    );


+- **Agent-mode shell exec could not reach the network** (#273) — the seatbelt
+  default policy denies all outbound network including DNS, so any
+  `exec_shell` command needing the network (`curl`, `yt-dlp`, package
+  managers, …) failed in Agent mode unless the user dropped to Yolo. The
+  engine now elevates the sandbox policy to `WorkspaceWrite { network_access:
+  true, … }` for both Agent and Yolo. Plan mode is unchanged (read-only
+  investigation never registers the shell tool). The application-level
+  `NetworkPolicy` (`crates/tui/src/network_policy.rs`) remains the only
+  outbound-traffic boundary.


+            // Plan mode is read-only investigation; the shell tool is not
+            // registered, so leaving the sandbox policy at the seatbelt-strict
+            // default is fine.


+            // curl, yt-dlp, …) without buying the user any safety the
+            // application-level NetworkPolicy / approval flow doesn't already
+            // provide. Elevate to workspace-write + network. (#273)


+    // Regression for #273: the seatbelt-default policy denies all outbound
+    // network (including DNS), which broke `curl`, `yt-dlp`, package managers,
+    // and similar shell commands in Agent mode. Elevation must include
+    // network access so the application-level NetworkPolicy stays the only
+    // outbound boundary.


The seatbelt-default policy is `WorkspaceWrite { network_access: false }`, which on macOS emits `(deny default)` with no `(allow network-outbound)` / `(allow system-socket)`. Every outbound socket call from a sandboxed shell command — including `getaddrinfo` for DNS — gets denied by the kernel. Symptom: "DNS resolution failed" for any URL the model tries to reach via curl, yt-dlp, package managers, etc. Engine.build_tool_context only elevated the policy in Yolo mode, leaving Agent mode (the default) stuck on the strict default. That's tighter than competitors (Claude Code, Codex) without buying any safety the application-level NetworkPolicy or the approval flow doesn't already provide. Switch the elevation to a `match` so: - Plan → no elevation (read-only investigation; shell tool not registered) - Agent → WorkspaceWrite { network_access: true, … } - Yolo → WorkspaceWrite { network_access: true, … } (unchanged) Adds `agent_and_yolo_modes_elevate_shell_sandbox_to_allow_network` so a future revert to the no-network default trips CI immediately. Closes #273

gemini-code-assist Bot reviewed May 2, 2026

View reviewed changes

This was referenced May 2, 2026

macOS Seatbelt sandbox blocks outbound DNS resolution in Agent mode #272

Closed

exec_shell sandbox blocks network/DNS by default in Agent mode (YouTube fetch fails) #273

Closed

Hmbown marked this pull request as ready for review May 2, 2026 01:11

Copilot AI review requested due to automatic review settings May 2, 2026 01:11

Copilot started reviewing on behalf of Hmbown May 2, 2026 01:12 View session

Copilot AI reviewed May 2, 2026

View reviewed changes

Hmbown force-pushed the fix/agent-mode-shell-network-access branch from ae3f0a8 to d1beb69 Compare May 2, 2026 01:28

Hmbown merged commit 6b14e43 into feat/v0.8.4 May 2, 2026
2 checks passed

Hmbown deleted the fix/agent-mode-shell-network-access branch May 2, 2026 01:29

buko mentioned this pull request Jun 3, 2026

allow_shell does not remove the shell from the Constitution #2637

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(sandbox): elevate Agent-mode shell sandbox to allow network access#274

fix(sandbox): elevate Agent-mode shell sandbox to allow network access#274
Hmbown merged 1 commit into
feat/v0.8.4from
fix/agent-mode-shell-network-access

Hmbown commented May 2, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 2, 2026

Uh oh!

gemini-code-assist Bot May 2, 2026

Uh oh!

Hmbown commented May 2, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Hmbown commented May 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Root cause

Fix

Trade-off the maintainer should weigh

Test plan

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 2, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 2, 2026

Choose a reason for hiding this comment

Uh oh!

Hmbown commented May 2, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Hmbown commented May 2, 2026 •

edited

Loading