Skip to content

montehoover/DynaGuard

Repository files navigation

DynaGuard

DynaGuard: A Dynamic Guardrail Model With User-Defined Policies.

A project by: Monte Hoover, Vatsal Baherwani, Neel Jain, Khalid Saifullah, Joseph Vincent, Chirag Jain, Melissa Kazemi Rad, C. Bayan Bruss, Ashwinee Panda, and Tom Goldstein.


Getting Started

  1. Install dependencies:
    conda create -n dynaguard python=3.12
    pip install -r requirements.txt
    
  2. Evaluate a model on custom policies:
    python eval.py
    
    or
    python eval.py --model tomg-group-umd/DynaGuard-8B --dataset tomg-group-umd/DynaBench --subset DynaBench --split test
    

Citing Our Work

To cite our work, please use this bibtex.

@article{hoover2025dynaguard,
    title={DynaGuard: A Dynamic Guardrail Model With User-Defined Policies}, 
    author={Monte Hoover and Vatsal Baherwani and Neel Jain and Khalid Saifullah and Joseph Vincent and Chirag Jain and Melissa Kazemi Rad and C. Bayan Bruss and Ashwinee Panda and Tom Goldstein},
    journal={arXiv preprint},
    year={2025},
    url={https://arxiv.org/abs/2509.02563}, 
}

Contact

Please, feel free to contact us with any questions, or open an issue on Github.

About

Code for "DynaGuard: A Dynamic Guardrail Model With User-Defined Policies."

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages