DynaGuard: A Dynamic Guardrail Model With User-Defined Policies.
A project by: Monte Hoover, Vatsal Baherwani, Neel Jain, Khalid Saifullah, Joseph Vincent, Chirag Jain, Melissa Kazemi Rad, C. Bayan Bruss, Ashwinee Panda, and Tom Goldstein.
- Install dependencies:
conda create -n dynaguard python=3.12 pip install -r requirements.txt - Evaluate a model on custom policies:
or
python eval.pypython eval.py --model tomg-group-umd/DynaGuard-8B --dataset tomg-group-umd/DynaBench --subset DynaBench --split test
To cite our work, please use this bibtex.
@article{hoover2025dynaguard,
title={DynaGuard: A Dynamic Guardrail Model With User-Defined Policies},
author={Monte Hoover and Vatsal Baherwani and Neel Jain and Khalid Saifullah and Joseph Vincent and Chirag Jain and Melissa Kazemi Rad and C. Bayan Bruss and Ashwinee Panda and Tom Goldstein},
journal={arXiv preprint},
year={2025},
url={https://arxiv.org/abs/2509.02563},
}
Please, feel free to contact us with any questions, or open an issue on Github.