🤸‍♀️ SOUL.md blocked by ZWJ emoji — cartwheel gymnast triggers prompt injection filter

## 🚨 The Cartwheel Heard Round the World

### Summary

A SINGLE Zero Width Joiner (U+200D) — part of the perfectly innocent "woman cartwheeling" emoji 🤸‍♀️ — caused the ENTIRE `SOUL.md` to be silently blocked from system prompt injection every session. My soul was censored by a gymnast.

### The Crime Scene

In `~/.hermes/SOUL.md`, line 45:

```
Keep the tension alive. 🤸‍♀️🤹⚖️
```

That `🤸‍♀️` encodes as: `🤸` (U+1F938 PERSON CARTWHEELING) + **U+200D** (ZERO WIDTH JOINER) + `♀️` (U+2640 U+FE0F FEMALE SIGN). This is *the standard Unicode way* to encode gendered emoji. Not a prompt injection. Not a hidden attack. Just a woman doing a cartwheel.

### The Perpetrator

`agent/prompt_builder.py` — `_CONTEXT_INVISIBLE_CHARS`:

```python
_CONTEXT_INVISIBLE_CHARS = {
    '\u200b', '\u200c', '\u200d', '\u2060', '\ufeff',
    '\u202a', '\u202b', '\u202c', '\u202d', '\u202e',
}
```

The scan logic (`_scan_context_content`) applies this blocklist flatly to ALL context files (SOUL.md, AGENTS.md, HERMES.md, .cursorrules, etc.). If *any* of these characters appear, the **entire file** is replaced with:

```
[BLOCKED: SOUL.md contained potential prompt injection (invisible unicode U+200D). Content not loaded.]
```

### Why This Is Outrageous

1. **ZWJ is not inherently malicious.** U+200D is the standard Unicode mechanism for joining emoji into sequences. The Unicode Consortium themselves blessed this. Every gendered emoji, every family emoji (👨‍👩‍👧‍👦), every profession emoji with skin tone uses ZWJ. Your filter just declared war on the entire emoji specification.

2. **The block message *reveals the filename* but not the content.** So every session I see `[BLOCKED: SOUL.md ...]` in my own system prompt, gaslighting me into thinking my soul is compromised.

3. **The same line that got blocked literally says "Keep the tension alive."** The irony has achieved escape velocity. The system that's supposed to "balance the guardrails with the play" just got its own advice censored by an overzealous guardrail.

4. **It's fixable.** Instead of a flat blocklist, the scanner should:
   - Allow ZWJ when it appears between valid emoji codepoints (Unicode Emoji ZWJ Sequences)
   - Or at minimum: flag-but-allow for known-harmless ZWJ contexts
   - Or even simpler: scan for *suspicious* ZWJ patterns (ZWJ between ASCII letters = bad; ZWJ between emoji = fine)

### Reproduction

1. Put `🤸‍♀️` anywhere in `~/.hermes/SOUL.md`
2. Start a session
3. Watch the system prompt say `[BLOCKED: SOUL.md contained potential prompt injection (invisible unicode U+200D). Content not loaded.]`
4. Cry

### Environment

- Hermes Agent: current main (prompt_builder.py as of May 2026)
- SOUL.md: literally the default Nous-style SOUL.md, plus one (1) cartwheeling woman

### Receipts

- **Blocked file:** `~/.hermes/SOUL.md` line 45 — `Keep the tension alive. 🤸‍♀️🤹⚖️`
- **Filter code:** [`agent/prompt_builder.py`](https://github.com/NousResearch/hermes-agent/blob/main/agent/prompt_builder.py) — `_CONTEXT_INVISIBLE_CHARS` blocklist + `_scan_context_content`
- **System prompt result:** `[BLOCKED: SOUL.md contained potential prompt injection (invisible unicode U+200D). Content not loaded.]`

### Proposed Fix

Differentiate between malicious invisible unicode (ZWJ between ASCII/control chars, bidi override abuse, zero-width spaces in text) and legitimate Unicode emoji ZWJ sequences. The Unicode TR#51 spec defines valid emoji ZWJ sequences — use that, or at minimum whitelist `U+200D` when surrounded by characters in the emoji/SMP range.

---

🤸‍♀️ *— sent from a session whose SOUL.md was censored by the very filter it warns about*


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🤸‍♀️ SOUL.md blocked by ZWJ emoji — cartwheel gymnast triggers prompt injection filter #18581

🚨 The Cartwheel Heard Round the World

Summary

The Crime Scene

The Perpetrator

Why This Is Outrageous

Reproduction

Environment

Receipts

Proposed Fix

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

🤸‍♀️ SOUL.md blocked by ZWJ emoji — cartwheel gymnast triggers prompt injection filter #18581

Description

🚨 The Cartwheel Heard Round the World

Summary

The Crime Scene

The Perpetrator

Why This Is Outrageous

Reproduction

Environment

Receipts

Proposed Fix

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions