fix(cron): allow emoji ZWJ sequences in prompts (#28164) by teknium1 · Pull Request #28589 · NousResearch/hermes-agent

teknium1 · 2026-05-19T07:10:40Z

What: _scan_cron_prompt blocked U+200D (Zero-Width Joiner) as a hidden-character deception attack. ZWJ is legitimately required to form many emoji sequences (👨‍👩‍👧, 🏳️‍🌈, ❤️‍🩹, 🧑‍💻), so all those emoji in cron prompts hit the security guard.

How: Allow ZWJ when its neighbors are emoji codepoints (Misc Symbols, Pictographs, Dingbats, regional indicators, variation selectors) or another ZWJ within the same emoji cluster; still block ZWJ when both neighbors are plain text. New unit tests cover legitimate emoji clusters AND continue to block plain-text ZWJ smuggling.

Original PR: #28164

github-actions · 2026-05-19T07:17:08Z

🔎 Lint report: `hermes/hermes-3ad7d98a` vs `origin/main`

ruff

Total: 0 on HEAD, 0 on base (➖ 0)

🆕 New issues: none

✅ Fixed issues: none

Unchanged: 0 pre-existing issues carried over.

ty (type checker)

Total: 8954 on HEAD, 8954 on base (➖ 0)

🆕 New issues: none

✅ Fixed issues: none

Unchanged: 4702 pre-existing issues carried over.

Diagnostics are surfaced as warnings — this check never fails the build.

…s scanners SOUL.md, memory entries, and skill files containing emoji ZWJ sequences (e.g. 🧙‍♂️ = 🧙 + ZWJ + ♂ + VS16) were being silently blocked as prompt-injection attempts. ZWJ (U+200D) is in the invisible-char blocklist for good reason — it can hide text inside benign-looking strings — but it is also required inside emoji sequences and has no way to hide anything harmful there. Upstream PR NousResearch#28589 ("fix(cron): allow emoji ZWJ sequences in prompts", a salvage of NousResearch#28164) established the precedent for this fix, but only applied it to the cron prompt scanner via a cronjob_tools-local helper (_strip_legitimate_emoji_zwj). The identical false positive still affects the other three scanners that share the same invisible-char blocklist. This PR completes the job for those three, factoring the context check into a single shared helper instead of adding a fourth copy of the logic. Added shared utils.find_unsafe_invisibles() that context-checks ZWJ: allowed between two pictographic codepoints (skipping variation selectors), flagged everywhere else. All other invisibles in the blocklist remain unconditionally flagged. Callers updated: - agent/prompt_builder.py (_scan_context_content — blocks SOUL.md et al.) - tools/memory_tool.py (_scan_memory_content — blocks memory add/update) - tools/skills_guard.py (scan_file — blocks skill install) tools/cronjob_tools.py is intentionally left untouched — PR NousResearch#28589 already fixes _scan_cron_prompt. Adds 6 tests covering: - ZWJ inside 🧙‍♂️ (gendered emoji) — allowed - Multi-ZWJ family emoji 👨‍👩‍👧 — allowed - ZWJ between letters (classic injection shape) — still blocked - Mixed legit emoji + injection ZWJ — blocked (at least one unsafe ZWJ) - ZWSP adjacent to emoji — still blocked (only ZWJ is context-whitelisted) 221/221 tests pass across the affected test modules. Motivation: a user SOUL.md containing 🧙‍♂️ was being silently blocked from loading, with a [BLOCKED: ... invisible unicode U+200D] marker leaking into the system prompt in place of the actual identity content. The scan was eating its own foot on a legitimate, widely-used emoji sequence.

fix(cron): allow emoji ZWJ sequences in prompts

63222b5

teknium1 merged commit 663ee14 into main May 19, 2026

teknium1 deleted the hermes/hermes-3ad7d98a branch May 19, 2026 07:10

teknium1 mentioned this pull request May 19, 2026

fix(cron): allow emoji ZWJ sequences in prompts #28164

Closed

alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/cron Cron scheduler and job management labels May 19, 2026

witt3rd mentioned this pull request May 21, 2026

fix: allow ZWJ inside emoji grapheme clusters in context/memory/skills scanners #12673

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(cron): allow emoji ZWJ sequences in prompts (#28164)#28589

fix(cron): allow emoji ZWJ sequences in prompts (#28164)#28589
teknium1 merged 1 commit into
mainfrom
hermes/hermes-3ad7d98a

teknium1 commented May 19, 2026

Uh oh!

github-actions Bot commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

teknium1 commented May 19, 2026

Uh oh!

github-actions Bot commented May 19, 2026

🔎 Lint report: hermes/hermes-3ad7d98a vs origin/main

ruff

ty (type checker)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

🔎 Lint report: `hermes/hermes-3ad7d98a` vs `origin/main`