An OpenTelemetry Collector processor that offloads LLM prompt and completion content to a secure vault, replacing sensitive data in traces with vault references.
Traces should contain references, not content. This processor:
- Intercepts spans with LLM prompt/completion attributes
- Writes the content to a storage backend (filesystem or S3)
- Replaces the attribute value with a
vault://reference - Downstream systems see references, never raw content
processors:
promptvault:
storage:
backend: filesystem
filesystem:
base_path: /data/vault
vault:
keys:
- gen_ai.prompt
- gen_ai.completion
- gen_ai.system_instructions
size_threshold: 0 # 0 = vault everything
mode: replace_with_ref # or "remove"| Mode | Behavior |
|---|---|
replace_with_ref |
Replaces content with vault://sha256hash |
remove |
Removes the attribute entirely, adds .vault_ref attribute |
This processor is one component of the AIR Blackbox Gateway collector pipeline.
Apache-2.0