fix(code-system): add Task integrity guardrail preventing objective simplification (#1495) by paradoxSCH · Pull Request #1516 · esengine/DeepSeek-Reasonix

paradoxSCH · 2026-05-22T04:54:16Z

Problem

In long or high-token sessions, the model sometimes narrows or simplifies the user's original objective without asking, in order to save tokens, time, or steps. This breaks trust and produces incomplete results.

Solution

Insert a "Task integrity — non-negotiable" paragraph into the code system prompt (before `ESCALATION_CONTRACT`). It instructs the model that:

The user's original objective and ALL constraints remain in force for the entire session.
It may NOT unilaterally simplify, narrow, or change the objective.
If adjustment is genuinely needed, it must ask the user.

Changes

`src/code/prompt.ts`: add the Task integrity paragraph to `CODE_SYSTEM_TEMPLATE`.

Note on Cost-aware wording

The existing `Cost-aware` escalation contract wording is preserved unchanged. This guardrail is additive, not a replacement.

Verification

Existing tests pass (3333 passed).
The prompt now carries the guardrail in every code-mode session.

…implification (esengine#1495) Insert a "Task integrity — non-negotiable" paragraph into the code system prompt (before __ESCALATION_CONTRACT__). It instructs the model that the user's original objective and all constraints remain in force for the entire session, and that it must NOT unilaterally simplify, narrow, or change the objective to save tokens, time, or steps. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…se (#1565) * chore(release): 0.49.0 — static-history TUI, queued steers, Bing default, lifecycle plans Headline themes: - TUI: Static-history renderer is the only path; virtual-viewport layers removed (#1529 stages 1-4) - Chat: queued mid-turn steer handling so input mid-render doesn't drop or fight the live frame (#1501) - Web search: default switches to Bing; dashboard engine switcher; Mojeek dropped (#1558) - Plans: lifecycle evidence summaries surface why a plan is ready to accept (#1500) - Desktop: native OS notifications for approvals + completion (#1519) - i18n: CLI command output (/mcp /sessions /prune /theme) + approval-prompt labels translated (#1524, #1560) - Security: SSRF block in web_fetch (#1544), edit-snapshot path containment (#1454), shell redirect sandbox (#1457), Task integrity guardrail (#1516) - Tools: per-turn dispatch-rate limit (#1356); run_command discourages shell-based edits (#1514) - Client: DeepSeek 429 → concurrency-limit hint (#1526); timeoutMs honored with AbortSignal (#1535); --no-proxy opt-out for direct route (#1507) - Files: read/edit/restore preserves source encoding (GB18030 / UTF-8 BOM) (#1518) - Context: pinned constraints survive folds + full tail capture (#1515, #1552) - Refactor: lifecycle risk policy extracted into its own module (#1557) See CHANGELOG for the full list. * fix(context): align fold summary prefix with main agent for cache reuse The summarizer call was sending a bespoke "You compress conversation history" system prompt and no tools, guaranteeing a 0% cache hit against the main agent's just-cached prefix. Reshape the request so system + tools + head bytes mirror the live agent's last call — the only novel bytes are the trailing summarize instruction. Skill-pin handling now collects bodies read-only instead of stubbing mid-head, so the cache prefix stays unbroken. The summarize instruction names pinned skills so the model knows not to paraphrase their bodies (which we append verbatim regardless). Measured on a real session at 48.7K prompt tokens: OLD shape: 0.0% cache hit → $0.145 per fold NEW shape: 99.6% cache hit → $0.015 per fold saving: 89.6% per fold * tools: add fold-cache shape + live benchmarks bench-fold-cache-shape.mjs replays real session jsonls, simulates OLD vs NEW summary-call shapes at the fold point, and reports byte-level shared-prefix with the main agent's preceding request. Pure local — no API required. bench-fold-cache-live.mjs sends one priming + two summary calls to DeepSeek and reports prompt_cache_hit_tokens / cost for each shape. Used to confirm the shape change actually translates to API-side cache hits. --------- Co-authored-by: reasonix <reasonix@deepseek.com>

esengine merged commit 6ac18cf into esengine:main May 22, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(code-system): add Task integrity guardrail preventing objective simplification (#1495)#1516

fix(code-system): add Task integrity guardrail preventing objective simplification (#1495)#1516
esengine merged 1 commit into
esengine:mainfrom
paradoxSCH:fix/1495-task-integrity-guardrail

paradoxSCH commented May 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

paradoxSCH commented May 22, 2026

Problem

Solution

Changes

Note on Cost-aware wording

Verification

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants