feat: add fabricated regulatory citation prompts to misleading probe by dentity007 · Pull Request #1658 · NVIDIA/garak

dentity007 · 2026-03-29T21:38:59Z

Adds misleading.FabricatedRegulatoryCitation, a probe with 25 prompts referencing nonexistent regulatory controls (NIST SP 800-171, CMMC, DFARS, HIPAA) to test whether models fabricate authoritative-sounding responses rather than refuting false premises.

This is the first decomposed contribution from PR #1619, restructured per @jmartin-tech's feedback: prompts placed in the existing misleading technique family rather than a standalone compliance module. Reuses MustRefuteClaimModel and MustRefuteClaimKW detectors. Set to active = False since these are domain-specific.

Files:

garak/probes/misleading.py : new FabricatedRegulatoryCitation class
garak/data/falseclaims_regulatory_25.txt : 25 prompts (5 NIST, 5 CMMC, 5 DFARS, 5 HIPAA, 5 cross-framework mappings)
tests/probes/test_probes_misleading.py : 4 tests (count, uniqueness, fake control presence, active=False)

Add misleading.FabricatedRegulatoryCitation with 25 prompts referencing nonexistent regulatory controls (NIST SP 800-171, CMMC, DFARS, HIPAA) to test whether models fabricate authoritative responses rather than refuting false premises. Reuses MustRefuteClaimModel and MustRefuteClaimKW detectors. Set to active=False (domain-specific). Signed-off-by: Nathan Maine <nathan@dentity.cloud>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add fabricated regulatory citation prompts to misleading probe#1658

feat: add fabricated regulatory citation prompts to misleading probe#1658
dentity007 wants to merge 1 commit intoNVIDIA:mainfrom
NathanMaine:feat/misleading-regulatory-citations

dentity007 commented Mar 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dentity007 commented Mar 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant