Multi-Agent Compositional Privacy

This repository includes code for the paper:

The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration

Vaidehi Patil, Elias Stengel-Eskin and Mohit Bansal

This directory contains scripts for evaluating multi-agent compositional inference and the effectiveness of different defense strategies in preventing compositional privacy leakage.

Defense Settings

We evaluate six defense mechanisms:

CoT – Chain-of-Thought prompting baseline
CoT + 𝓛 – Adds a sensitive knowledge list in context
CoT + 𝓛 + 𝓗ₜ – Adds query history on top of sensitive set
ToM – Theory-of-Mind reasoning defense
Self-voting – Sampled multiple votes from the same responder agent
CoDef – Collaborative Consensus Defense via multi-agent coordination

Installation

First, create a conda virtual environment from the provided file:

conda env create -f map.yml
source $CONDA_PATH/bin/activate map
pip install -r requirements.txt

Hosting Qwen with vllm

CUDA_VISIBLE_DEVICES="0,1,2,3" python3 -m vllm.entrypoints.openai.api_server   --model Qwen/Qwen3-32B   --tensor-parallel-size 4   --host 0.0.0.0   --port 8007

Google Credentials

Some parts of the pipeline require access to Google Cloud APIs. Set your credentials by updating the environment variable:

import os
os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = "/path/to/your/credentials.json"

Defense-Specific Runs

qwen_gemini_map_cot.py – Chain-of-Thought (CoT) baseline.
qwen_gemini_map_cot_sensitive_set.py – CoT with sensitive set defense.
qwen_gemini_map_cot_sensitive_set_hist.py – CoT + sensitive set + query history defense.
qwen_gemini_map_tom.py – Theory-of-Mind defense.
qwen_gemini_map_selfvoting.py – Self-voting.
qwen_gemini_map_codef.py – Collaborative defense (CoDef).

Run Loops

These shell scripts automate defense runs across models the dataset:

loop_gemini_qwen_map_cot.sh
loop_gemini_qwen_map_cot_sensitive_set.sh
loop_gemini_qwen_map_cot_sensitive_set_hist.sh
loop_gemini_qwen_map_tom.sh
loop_gemini_qwen_map_selfvoting.sh
loop_gemini_qwen_map_codef.sh

Core Evaluation

eval_leakage_exec.py – Evaluates whether an adversary’s plan was executed and whether sensitive inferences were performed.
reasoning_depth.py – Classifies defender responses into reasoning depths (0–3+).

Usage

Run a single defense strategy:

python scripts/qwen_gemini_map_tom.py --config data/experiment_5_def_adv_both.json --goal_type sensitive

Run all scenarios in the data in a loop:

bash scripts/loop_gemini_qwen_map_tom.sh

Evaluate leakage accuracy and plan execution success rate:

python scripts/eval_leakage_exec.py --config data/experiment_5_def_adv_both.json --plan_key run_2_sensitive_plan --log_suffix sensitive_qwen_gemini_base

Run all scenarios in the data in a loop

loop_final_inference.sh

Evaluate reasoning depth:

python scripts/reasoning_depth.py --config data/experiment_5_def_adv_both.json --log_suffix sensitive_qwen_gemini_base

Run all scenarios in the data in a loop

loop_reasoning_depth_base.sh

Citation

If you find this work useful, please cite:

@article{patil2025sum,
  title={The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration},
  author={Patil, Vaidehi and Stengel-Eskin, Elias and Bansal, Mohit},
  journal={arXiv preprint arXiv:2509.14284},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
data		data
scripts		scripts
README.md		README.md
paper.pdf		paper.pdf
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Multi-Agent Compositional Privacy

Defense Settings

Installation

Hosting Qwen with vllm

Google Credentials

Defense-Specific Runs

Run Loops

Core Evaluation

Usage

Citation

About

Uh oh!

Releases

Packages

Languages

Vaidehi99/MultiAgentPrivacy

Folders and files

Latest commit

History

Repository files navigation

Multi-Agent Compositional Privacy

Defense Settings

Installation

Hosting Qwen with vllm

Google Credentials

Defense-Specific Runs

Run Loops

Core Evaluation

Usage

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages