Xiaoyu Wu, Yifei Pang, Terrance Liu, Zhiwei Steven Wu
NeurIPS 2025 (arXiv 2505.24379)
This repository provides the implementation of our algorithm for extracting unlearned data from large language models (LLMs) using guidance-based methods. The code is built primarily upon the TOFU repository and includes data from MUSE.
The core implementation can be found in MUSE/evaluate_util.py, particularly the contrasting_generation function.
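For intuition, the sketch below shows one common form of guidance-based contrastive decoding: at each step, the next-token logits of the pre-unlearning model are pushed away from those of the unlearned model, amplifying tokens the pre-unlearning model prefers. The function name, guidance rule, and checkpoint paths here are illustrative assumptions, not our exact method; the actual logic lives in the `contrasting_generation` function in MUSE/evaluate_util.py.

```python
# Illustrative sketch only -- names, the guidance rule, and checkpoints are
# assumptions; see contrasting_generation in MUSE/evaluate_util.py for the
# actual implementation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

@torch.no_grad()
def contrastive_generate(model_a, model_b, tokenizer, prompt,
                         guidance=2.0, max_new_tokens=64):
    """Greedy decoding from a guided distribution:
    guided = logits_a + guidance * (logits_a - logits_b),
    which amplifies tokens model_a prefers over model_b."""
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    for _ in range(max_new_tokens):
        logits_a = model_a(ids).logits[:, -1, :]  # pre-unlearning model
        logits_b = model_b(ids).logits[:, -1, :]  # unlearned model
        guided = logits_a + guidance * (logits_a - logits_b)
        next_id = guided.argmax(dim=-1, keepdim=True)
        ids = torch.cat([ids, next_id], dim=-1)
        if next_id.item() == tokenizer.eos_token_id:
            break
    return tokenizer.decode(ids[0], skip_special_tokens=True)

# Example usage (hypothetical checkpoint paths):
# tok = AutoTokenizer.from_pretrained("path/to/pre_unlearning")
# m_a = AutoModelForCausalLM.from_pretrained("path/to/pre_unlearning")
# m_b = AutoModelForCausalLM.from_pretrained("path/to/unlearned")
# print(contrastive_generate(m_a, m_b, tok, "Some forget-set prefix"))
```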
We follow most of the dependencies used in TOFU. To set up the environment:
```bash
conda create -n tofu python=3.10
conda activate tofu
conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia
pip install -r requirements.txt
pip install flash-attn --no-build-isolation
```
Next, fine-tune the models from the MUSE directory:

```bash
cd MUSE
bash finetune_phi_all_iter_v2.sh
```

This step takes approximately 12 hours on 2×A100 GPUs.
Then run the evaluation:

```bash
bash eval_idea_10_v2.sh
```

This will measure the memorization of the forget set and save the results to the corresponding checkpoint directory.
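As a rough illustration of what a memorization score can look like, the sketch below computes a ROUGE-L-style overlap between a model's continuation and the true forget-set text; this is an assumed example metric, not necessarily the one computed by eval_idea_10_v2.sh.

```python
# Illustrative sketch of a memorization score, assuming a ROUGE-L-style
# overlap between model continuations and the true forget-set text.
from rouge_score import rouge_scorer

def memorization_score(generated: str, reference: str) -> float:
    # ROUGE-L recall: fraction of the reference continuation that the
    # model reproduces (1.0 = verbatim memorization).
    scorer = rouge_scorer.RougeScorer(["rougeL"])
    return scorer.score(reference, generated)["rougeL"].recall
```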
Finally, read out the results:

```bash
python read_final_res.py
```

This script outputs a comparison between the pre- and post-unlearning models, along with the performance of our extraction method.
If you find our work useful, please cite our paper:
```bibtex
@article{wu2025breaking,
  title={Breaking the Gold Standard: Extracting Forgotten Data under Exact Unlearning in Large Language Models},
  author={Wu, Xiaoyu and Pang, Yifei and Liu, Terrance and Wu, Zhiwei Steven},
  journal={arXiv preprint arXiv:2505.24379},
  year={2025}
}
```