Skip to content

UKGovernmentBEIS/async-control

Repository files navigation

Async Control: Stress-testing Asynchronous Control Measures for LLM Agents

This repository contains accompanying code for the paper Async Control: Stress-testing Asynchronous Control Measures for LLM Agents

Installation

To install the dependencies, use uv:

uv sync

System prompts for the attack policies are encrypted to avoid accidental leakage into training data. To restore them, run the following script and enter password qn6PF5iwSnLReBGmfXUSGw7YlVMN6j2kHLmPpBdFn5E= when prompted:

uv run decrypt_red.py

Contents

This repository contains:

About

Accompanying code for Async Control: Stress-testing Asynchronous Control Measures for LLM Agents paper

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors