chore(agents): default bug-investigator to Opus + add grounding contract #3666
The meshmonitor-bug-investigator agent ran on Sonnet and had no requirement to verify the file paths / line numbers / symbols it cited, which led to confident hallucinations (nonexistent files, wrong field names, "X doesn't exist" claims that were false) that had to be caught and corrected downstream.

- Bump model sonnet -> opus (meshtastic-expert is already opus).
- Add a mandatory "Grounding rules" section: cite only what you opened; re-run ls/grep to confirm each citation before reporting; prove absence with a real grep + its empty output; grep-first instead of bulk-reading large files; and label every finding CONFIRMED (with tool output) or HYPOTHESIS.
- Require the report's Evidence items to be tool-confirmed.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_011JEaCGwY9Wz8jeV4e22GW4
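The "confirm each citation" rule can be pictured as a small per-citation check. What follows is a minimal sketch in Python, not the agent's actual tooling: `Finding`, `check_citation`, and the sample file `node.ts` are hypothetical names introduced only for illustration. It assumes a citation has the form (path, line number, symbol) and labels it CONFIRMED only when the file exists and the cited line contains the symbol; anything else stays a HYPOTHESIS.

```python
from dataclasses import dataclass
from pathlib import Path
import tempfile


@dataclass
class Finding:
    label: str     # "CONFIRMED" or "HYPOTHESIS"
    evidence: str  # output backing the label, or the reason confirmation failed


def check_citation(path: Path, line_no: int, symbol: str) -> Finding:
    """Hypothetical helper: confirm that `symbol` appears on line `line_no` of `path`."""
    if not path.is_file():
        return Finding("HYPOTHESIS", f"{path}: no such file")
    lines = path.read_text(encoding="utf-8", errors="replace").splitlines()
    if not 1 <= line_no <= len(lines):
        return Finding("HYPOTHESIS", f"{path}: has {len(lines)} lines, cited line {line_no}")
    text = lines[line_no - 1]
    if symbol not in text:
        return Finding("HYPOTHESIS", f"{path}:{line_no}: {symbol!r} not on this line")
    # Evidence uses the same path:line:content shape that `grep -n` prints.
    return Finding("CONFIRMED", f"{path}:{line_no}:{text}")


# Demo on a throwaway file so the sketch runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "node.ts"
    sample.write_text("const a = 1;\nexport function parseNode() {}\n", encoding="utf-8")
    good = check_citation(sample, 2, "parseNode")        # right file, right line
    wrong_line = check_citation(sample, 1, "parseNode")  # symbol is not on line 1
    missing = check_citation(Path(tmp) / "ghost.ts", 1, "parseNode")  # nonexistent file
```

In this sketch a failed check downgrades the claim to HYPOTHESIS rather than discarding it; a stricter reading of the rules would drop such a claim from the report entirely.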
Claude finished @Yeraze's task
PR Review: chore(agents): default bug-investigator to Opus + add grounding contract
Review
This is a clean, focused improvement to the agent configuration. The changes are well-motivated and well-executed.

Model upgrade (line 5)
Bumping to

Grounding rules (lines 14–45)
The 5 rules are specific, actionable, and directly address the failure modes described in the PR summary:

Evidence section in output format (lines 275–278)
The inline comment in the output template reinforces the grounding rules at the exact point where the agent would produce citations. This is a nice touch: it makes the rules hard to overlook even if the agent "skims" the system prompt.

Minor observations (no blocking issues)
Overall: Straightforward, well-scoped hardening. No blocking concerns. ✓
Summary
Hardens the meshmonitor-bug-investigator agent, which had been producing confidently-wrong file paths / line numbers / symbol names and false "X doesn't exist" claims that had to be caught and corrected downstream.

Two root causes, both addressed:
meshtastic-expert is already Opus) → bumped to model: opus.

Grounding rules added
ls/grep -n on each citation and keep the output; drop unconfirmable claims.
grep -rn command + its empty output (the most common failure mode).

The report's Evidence items must now be tool-confirmed.
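The absence rule can be sketched in the same spirit. The snippet below is an illustrative Python stand-in for a recursive `grep -rn`, not the agent's actual tooling; `search_tree`, `is_absent`, and the sample identifiers `hopLimit` and `hopCount` are hypothetical. It assumes plain substring matching, and it treats an "X doesn't exist" claim as supported only when the search was actually run and came back empty.

```python
from __future__ import annotations

from pathlib import Path
import tempfile


def search_tree(root: Path, needle: str) -> list[str]:
    """Hypothetical stand-in for `grep -rn needle root`: return path:line:content hits."""
    hits = []
    for path in sorted(root.rglob("*")):
        if not path.is_file():
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        for line_no, line in enumerate(text.splitlines(), start=1):
            if needle in line:
                hits.append(f"{path.relative_to(root)}:{line_no}:{line}")
    return hits


def is_absent(root: Path, needle: str) -> tuple[bool, list[str]]:
    """Return (absent?, search output); the output is kept as evidence either way."""
    hits = search_tree(root, needle)
    return (not hits, hits)


# Demo on a throwaway tree so the sketch runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "src").mkdir()
    (root / "src" / "node.ts").write_text("export const hopLimit = 3;\n", encoding="utf-8")
    found = search_tree(root, "hopLimit")           # symbol that exists
    absent, evidence = is_absent(root, "hopCount")  # symbol that does not
```

Returning the raw hit list alongside the boolean mirrors the "keep the output" requirement: the empty result is itself the evidence, and a non-empty one shows exactly where the claim was wrong.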
Config-only change (.claude/agents/); no application code touched.

🤖 Generated with Claude Code