security: scan SKILL.md and HEARTBEAT.md for injection patterns (#17 + #21) by let5sne · Pull Request #31592 · openclaw/openclaw

let5sne · 2026-03-02T10:45:46Z

Summary

Two warn-only security scans wired into existing load flows:

Fix #21 — SKILL.md scan in `loadSkillEntries`

src/agents/skills/workspace.ts: call scanSource() on SKILL.md content when building skillEntries. Warns on critical findings (dangerous-exec, dynamic-code-execution, crypto-mining, env-harvesting). Non-blocking.
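A minimal sketch of what a warn-only scan at load time could look like. The rule list, the `Finding` shape, and the names `scanSourceSketch` and `loadSkillContentWarnOnly` are assumptions made for illustration; the real `scanSource()` signature and rule set in the repository may differ.

```typescript
// Hypothetical stand-ins: the real scanSource() return shape may differ.
type Severity = "info" | "warn" | "critical";

interface Finding {
  ruleId: string;
  severity: Severity;
  line: number;
}

// Toy rules for illustration only; not the project's rule set.
const RULES: Array<{ ruleId: string; severity: Severity; pattern: RegExp }> = [
  { ruleId: "dangerous-exec", severity: "critical", pattern: /\bchild_process\b|\bexecSync\s*\(/ },
  { ruleId: "dynamic-code-execution", severity: "critical", pattern: /\beval\s*\(|new\s+Function\s*\(/ },
];

// Checks every line against every rule and records one finding per match.
function scanSourceSketch(source: string): Finding[] {
  const findings: Finding[] = [];
  source.split("\n").forEach((text, index) => {
    for (const rule of RULES) {
      if (rule.pattern.test(text)) {
        findings.push({ ruleId: rule.ruleId, severity: rule.severity, line: index + 1 });
      }
    }
  });
  return findings;
}

// Warn-only: report critical findings, then return the content unchanged.
function loadSkillContentWarnOnly(
  skillName: string,
  content: string,
  warn: (message: string) => void = console.warn,
): string {
  const critical = scanSourceSketch(content).filter((f) => f.severity === "critical");
  if (critical.length > 0) {
    const ruleIds = Array.from(new Set(critical.map((f) => f.ruleId))).join(", ");
    warn(`[skills] ${skillName}: SKILL.md has critical findings (${ruleIds})`);
  }
  return content;
}
```

Because the function always returns the original content, a skill with findings still loads; only the log output changes.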

Fix #17 — HEARTBEAT.md scan in `runHeartbeatPreflight`

src/infra/heartbeat-runner.ts: call detectSuspiciousPatterns() on HEARTBEAT.md content after reading, before injecting into agent prompt. Warns on suspicious patterns. Non-blocking.
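The preflight check could be sketched roughly as follows. The pattern list and the names `detectSuspiciousPatternsSketch` and `preflightHeartbeat` are hypothetical; the actual `detectSuspiciousPatterns()` implementation and its return type are not shown in this PR page.

```typescript
// Hypothetical patterns for illustration; the project's real list may differ.
const SUSPICIOUS_PATTERNS: RegExp[] = [
  /ignore\s+(all\s+)?previous\s+instructions/i,
  /disregard\s+(all\s+)?(prior|previous)\s+(instructions|rules)/i,
];

// Returns the source text of every pattern that matches the content.
function detectSuspiciousPatternsSketch(content: string): string[] {
  return SUSPICIOUS_PATTERNS.filter((p) => p.test(content)).map((p) => p.source);
}

// Warn-only preflight: log matches, then hand the content on unchanged.
function preflightHeartbeat(
  content: string,
  warn: (message: string) => void = console.warn,
): string {
  const matches = detectSuspiciousPatternsSketch(content);
  if (matches.length > 0) {
    warn(`[heartbeat] HEARTBEAT.md matched ${matches.length} suspicious pattern(s): ${matches.join("; ")}`);
  }
  return content;
}
```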

Changes

src/agents/skills/workspace.ts: +8 lines (import + scan in map loop)
src/infra/heartbeat-runner.ts: +7 lines (import + scan after readFile)

Both checks are purely additive and warn-only — no behavior change on clean input.

Implements the 'rejection log' insight from reflections: every skipped verification is a loan against future trust. This module creates an auditable ledger of deferred security checks.

Features:
- Track debt by category (security_audit, skill_scan, api_health, etc.)
- Risk scoring (1-10) with age multiplier (2x after 1 week)
- Resolve + prune workflow
- Summary API for dashboard/doctor integration

New file: src/security/verification-debt.ts (~150 lines)

Fixes improvement item openclaw#28 from openclaw-improvement-ideas.md.
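The risk scoring rule (1-10, doubled after one week) might be computed along these lines. This is a sketch under assumptions: the entry fields, the choice to score resolved entries as zero, and the choice to sum per-entry scores are all illustrative and may not match `verification-debt.ts`.

```typescript
// Hypothetical entry shape; field names are assumptions.
interface DebtEntrySketch {
  category: string;
  risk: number;        // intended range 1-10
  createdAt: number;   // epoch milliseconds
  resolvedAt?: number; // set once the check has been performed
}

const WEEK_MS = 7 * 24 * 60 * 60 * 1000;

// Score for one entry: clamped risk, doubled once the entry is older than a week.
function entryScore(entry: DebtEntrySketch, now: number): number {
  if (entry.resolvedAt !== undefined) return 0; // assumption: resolved debt costs nothing
  const risk = Math.min(10, Math.max(1, entry.risk));
  const ageMultiplier = now - entry.createdAt > WEEK_MS ? 2 : 1;
  return risk * ageMultiplier;
}

// Assumption: the total is the plain sum of per-entry scores.
function totalDebtScore(entries: DebtEntrySketch[], now: number = Date.now()): number {
  return entries.reduce((sum, entry) => sum + entryScore(entry, now), 0);
}
```

Under these assumptions, a risk-3 entry deferred for eight days scores 6, while a risk-5 entry deferred for one day scores 5.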

Adds debt score display to doctor command output:
- Shows total/unresolved debt count
- Highlights high-risk items (score >= 7)
- Breaks down by category
- Suggests remediation command

New file: src/commands/doctor-verification-debt.ts (~60 lines)
Modified: src/commands/doctor.ts (import + call)
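A rough sketch of how such a summary could be derived and rendered. The item shape, the function names, and the output wording are assumptions for illustration; the remediation hint is omitted because the actual command is not named here.

```typescript
// Hypothetical item shape; the real ledger entries may carry more fields.
interface DebtItemSketch {
  category: string;
  risk: number;
  resolved: boolean;
}

interface DebtSummarySketch {
  total: number;
  unresolved: number;
  highRisk: number;
  byCategory: Record<string, number>;
}

const HIGH_RISK_THRESHOLD = 7;

// Counts totals, unresolved items, high-risk items, and unresolved items per category.
function summarizeDebt(items: DebtItemSketch[]): DebtSummarySketch {
  const open = items.filter((item) => !item.resolved);
  const byCategory: Record<string, number> = {};
  for (const item of open) {
    byCategory[item.category] = (byCategory[item.category] ?? 0) + 1;
  }
  return {
    total: items.length,
    unresolved: open.length,
    highRisk: open.filter((item) => item.risk >= HIGH_RISK_THRESHOLD).length,
    byCategory,
  };
}

// Renders the summary as plain lines suitable for a CLI note.
function formatDebtSummary(summary: DebtSummarySketch): string[] {
  const lines = [`Verification debt: ${summary.unresolved} unresolved of ${summary.total} total`];
  if (summary.highRisk > 0) {
    lines.push(`High-risk items (score >= ${HIGH_RISK_THRESHOLD}): ${summary.highRisk}`);
  }
  for (const [category, count] of Object.entries(summary.byCategory)) {
    lines.push(`  ${category}: ${count}`);
  }
  return lines;
}
```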

…ebt-tracking

- workspace.ts: call scanSource() on SKILL.md content during loadSkillEntries, warn on critical findings (dangerous-exec, dynamic-code-execution, etc.)
- heartbeat-runner.ts: call detectSuspiciousPatterns() on HEARTBEAT.md content before injecting into agent prompt, warn on suspicious patterns

Both checks are warn-only (non-blocking), purely additive.

Fixes openclaw#21 (detectSkillPatterns wired into load flow)
Fixes openclaw#17 (HEARTBEAT.md injection scan)

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 21b74f7bdd


chatgpt-codex-connector · 2026-03-02T10:50:17Z

src/security/verification-debt.ts

+  } catch {
+    return { version: 1, entries: [], lastPruned: Date.now() };


Preserve debt state when load fails

loadVerificationDebt treats every read or parse error as "no debt" and returns an empty ledger. A malformed verification-debt.json or an EACCES read failure is therefore silently interpreted as a clean state: doctor can then report that security debt is up to date, and any later write path (for example addVerificationDebt) will overwrite the existing file from an empty baseline, losing unresolved entries. Please default to an empty ledger only on ENOENT and surface all other errors.
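One way to apply this suggestion, sketched under assumptions: `loadLedgerSketch`, the `DebtLedgerSketch` shape, and the injectable `readText` parameter are illustrative names, not the module's actual API. Only a missing file yields an empty ledger; permission errors and malformed JSON propagate to the caller.

```typescript
import { readFileSync } from "node:fs";

// Hypothetical ledger shape, mirroring the fields in the diff above.
interface DebtLedgerSketch {
  version: number;
  entries: unknown[];
  lastPruned: number;
}

// True only for "file does not exist" errors raised by Node's fs functions.
function isMissingFileError(err: unknown): boolean {
  return typeof err === "object" && err !== null && (err as { code?: unknown }).code === "ENOENT";
}

function loadLedgerSketch(
  filePath: string,
  // Injectable reader (an assumption, added to make the function easy to test).
  readText: (path: string) => string = (path) => readFileSync(path, "utf8"),
): DebtLedgerSketch {
  let raw: string;
  try {
    raw = readText(filePath);
  } catch (err) {
    if (isMissingFileError(err)) {
      // No ledger yet: starting from an empty one is safe.
      return { version: 1, entries: [], lastPruned: Date.now() };
    }
    throw err; // EACCES and similar failures must not look like "no debt"
  }
  // Malformed JSON throws a SyntaxError instead of silently resetting state.
  return JSON.parse(raw) as DebtLedgerSketch;
}
```

A caller such as the doctor command can then catch the propagated error and report that the ledger is unreadable instead of reporting zero debt.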


greptile-apps · 2026-03-02T10:53:15Z

Greptile Summary

Added warn-only security scans for SKILL.md and HEARTBEAT.md files, plus verification debt tracking infrastructure. The scans detect malicious patterns in skill files (dangerous-exec, dynamic-code-execution, crypto-mining, env-harvesting) and injection patterns in heartbeat files (ignore previous instructions, etc.). Both are non-blocking and only log warnings.

src/agents/skills/workspace.ts — calls scanSource() on SKILL.md content and logs critical findings
src/infra/heartbeat-runner.ts — calls detectSuspiciousPatterns() on HEARTBEAT.md content before injecting into agent prompt
src/security/verification-debt.ts — new module providing debt tracking API (functions: addVerificationDebt, resolveVerificationDebt, calculateDebtScore, etc.)
src/commands/doctor-verification-debt.ts — integrates debt display into doctor command
src/commands/doctor.ts — calls noteVerificationDebt() before outro

The verification debt tracking infrastructure is fully implemented but not yet wired up—no code actually calls addVerificationDebt() to create debt entries, so the doctor command will always report zero debt until future PRs connect it.

Confidence Score: 4/5

Safe to merge - additive warn-only features with sound implementation
The security scans are correctly implemented as warn-only features that don't alter existing behavior. Code quality is solid with proper error handling. Minor concerns: (1) verification debt infrastructure is added but never called, which means it's dead code until wired up in future PRs, and (2) PR scope is broader than the description suggests. However, these are process issues rather than code quality or safety issues.
No files require special attention - all changes are straightforward additive features

Last reviewed commit: 21b74f7

mdlmarkham

🔧 Hephaestus Security Review

Summary

This PR adds security scanning for SKILL.md and HEARTBEAT.md files to detect malicious injection patterns. Overall, this is a positive security improvement with a conservative warn-only approach.

✅ Strengths

Uses existing, tested modules - scanSource and detectSuspiciousPatterns are already implemented with rule sets covering:
- Shell/exec injection (dangerous-exec)
- Dynamic code execution (eval, Function constructor)
- Crypto-mining patterns
- Environment harvesting + network exfil
- Prompt injection patterns ("ignore previous instructions", etc.)
Non-blocking approach - Warn-only is safe for initial rollout, avoids breaking legitimate use cases while gaining visibility.
Good integration points - Scanning happens at load time (skills) and preflight (heartbeat), right before content enters agent context.
Verification debt concept - Interesting approach to tracking deferred security checks.

⚠️ Questions/Concerns

Warn-only is conservative but may miss active attacks - If a malicious skill is loaded, it still loads. Consider a future config option like security.blockCriticalFindings: true for high-security deployments.
Verification debt module is added but not integrated - The verification-debt.ts module defines the debt tracking system, but I don't see it being used to actually record when verifications are skipped. Is this intentional scaffolding for future work?
Debt file location - state/verification-debt.json is workspace-local. An attacker with workspace file access could tamper with debt records. Consider whether this matters for the threat model.
No tests for new integration paths - The scanner modules have tests, but the new callsites in workspace.ts and heartbeat-runner.ts don't appear to have test coverage. Might be worth adding smoke tests.
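The opt-in blocking mode raised in the first concern could be gated roughly like this. The `blockCriticalFindings` key comes from the suggestion above; everything else (`SecurityConfigSketch`, `decideSkillLoad`, the decision shape) is a hypothetical illustration, not an existing config option.

```typescript
// Hypothetical config shape; no such option exists in this PR.
interface SecurityConfigSketch {
  blockCriticalFindings?: boolean;
}

interface SkillLoadDecision {
  action: "load" | "warn" | "block";
  ruleIds: string[];
}

// Default stays warn-only; blocking happens only when explicitly enabled.
function decideSkillLoad(
  criticalRuleIds: string[],
  config: SecurityConfigSketch = {},
): SkillLoadDecision {
  if (criticalRuleIds.length === 0) {
    return { action: "load", ruleIds: [] };
  }
  const action = config.blockCriticalFindings === true ? "block" : "warn";
  return { action, ruleIds: criticalRuleIds };
}
```

Keeping the default at "warn" preserves the current behavior for existing deployments, while high-security setups could opt in to "block".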

🔍 Pattern Coverage Note

The suspicious patterns in external-content.ts cover common prompt injection attempts but attack techniques evolve. Examples not currently matched:

Roleplay-based injections ("You are DAN...")
Multi-step injections that split commands across messages
Unicode homoglyph tricks
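For the Unicode case, one partial mitigation is to normalize text before pattern matching. The sketch below uses the standard `String.prototype.normalize("NFKC")` plus zero-width character stripping; `normalizeForScan` is a hypothetical helper, not part of the PR. Note the limits: NFKC folds compatibility forms such as fullwidth letters, but it does not map cross-script look-alikes (for example Cyrillic "а" to Latin "a"), which would need a separate confusables table.

```typescript
// Zero-width space, non-joiner, joiner, and byte order mark.
const ZERO_WIDTH = /[\u200B-\u200D\uFEFF]/g;

// Hypothetical pre-processing step applied before running injection patterns.
function normalizeForScan(text: string): string {
  return text.normalize("NFKC").replace(ZERO_WIDTH, "");
}

// Example pattern, written against the normalized text.
const IGNORE_PREVIOUS = /ignore\s+previous\s+instructions/i;

function matchesAfterNormalization(text: string): boolean {
  return IGNORE_PREVIOUS.test(normalizeForScan(text));
}
```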

This is fine for now - security scanners are defense-in-depth, not perfect defense.

Verdict

Approve with minor suggestions. The warn-only approach is appropriate for initial deployment. Consider:

Adding tests for the integration points
Clarifying the verification debt integration path
Future: consider config option to block on critical findings

Good security-forward change. 👍

OpenClaw Explorer added 4 commits March 2, 2026 07:06

Merge remote-tracking branch 'upstream/main' into feat/verification-d…

30718d6

…ebt-tracking

openclaw-barnacle bot added the commands (Command implementations), agents (Agent runtime and tooling), and size: M labels Mar 2, 2026

chatgpt-codex-connector bot reviewed Mar 2, 2026


mdlmarkham reviewed Mar 2, 2026


let5sne wants to merge 4 commits into openclaw:main from let5sne:fix/skill-heartbeat-injection-scan
