
LMEraser: Large Model Unlearning through Adaptive Prompt Tuning

[AISTATS 2025] This repository contains the official implementation of our paper "LMEraser: Large Model Unlearning through Adaptive Prompt Tuning" by Jie Xu, Zihan Wu, Cong Wang, and Xiaohua Jia.

Accepted at the 28th International Conference on Artificial Intelligence and Statistics (AISTATS 2025), Mai Khao, Thailand, May 3-5, 2025.

arXiv

Abstract

To address the growing demand for privacy protection in machine learning, we propose a novel and efficient machine unlearning approach for Large Models, called LMEraser. Existing unlearning methods struggle with entangled training data and complex model architectures, incurring extremely high computational costs for large models. LMEraser takes a divide-and-conquer strategy with a prompt tuning architecture to isolate data influence. The training dataset is partitioned into public and private datasets. Public data are used to train the backbone of the model. Private data are adaptively clustered based on their diversity, and each cluster is used to optimize a prompt separately. This adaptive prompt tuning mechanism reduces unlearning costs and maintains model performance. Experiments demonstrate that LMEraser achieves a 100-fold reduction in unlearning costs without compromising accuracy compared to prior work. Our code is available at: Source Code.
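
The adaptive clustering step can be pictured with a toy sketch: each private feature vector is greedily assigned to the nearest existing cluster centroid if it lies within a distance threshold, and otherwise starts a new cluster. This is an illustration of the general technique only, not the paper's exact algorithm; see eraser/eraser.py for the actual implementation, and the --distance_threshold argument below for the corresponding knob.

import numpy as np

def threshold_cluster(features: np.ndarray, distance_threshold: float) -> list[list[int]]:
    """Greedy threshold clustering: join the nearest cluster if its
    centroid is closer than distance_threshold, else open a new one."""
    centroids: list[np.ndarray] = []
    clusters: list[list[int]] = []
    for i, x in enumerate(features):
        if centroids:
            dists = [float(np.linalg.norm(x - c)) for c in centroids]
            j = int(np.argmin(dists))
            if dists[j] < distance_threshold:
                clusters[j].append(i)
                centroids[j] = features[clusters[j]].mean(axis=0)  # keep a running mean
                continue
        centroids.append(x.copy())
        clusters.append([i])
    return clusters

rng = np.random.default_rng(0)
sizes = [len(c) for c in threshold_cluster(rng.normal(size=(100, 8)), 4.0)]
print(sizes)  # in LMEraser, each such cluster would get its own prompt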

Environment Setup

This code has been tested with Python 3.11.5 and PyTorch 2.1.2 with CUDA 12.1 on Ubuntu 22.04. The required packages are listed in environment.yaml.

To set up a conda environment, please follow these steps:

conda env create -f environment.yaml -n lmeraser
conda activate lmeraser
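
To confirm the environment matches the tested versions, a quick sanity check:

import sys
import torch

# Tested configuration: Python 3.11.5, PyTorch 2.1.2, CUDA 12.1.
print("python:", sys.version.split()[0])
print("torch:", torch.__version__)
print("cuda:", torch.version.cuda, "| available:", torch.cuda.is_available())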

File Structure

The structure of the repository is as follows:

.
├── arguments.py
├── data_utils
│   ├── datasets
│   │   ├── cifar
│   │   │   └── dataset.py
│   │   ├── gtsrb
│   │   │   └── dataset.py
│   │   ├── __init__.py
│   │   └── svhn
│   │       └── dataset.py
│   ├── loader.py
│   └── transforms.py
├── environment.yaml
├── eraser
│   ├── eraser.py
│   └── main.py
├── launch.py
├── LICENSE
├── models
│   ├── backbones
│   │   ├── backbone_swin.py
│   │   ├── backbone_vit.py
│   │   ├── backbone_vit_old.py
│   │   └── __init__.py
│   ├── builder.py
│   ├── checkpoints
│   │   ├── swin_base_patch4_window7_224.pth
│   │   ├── swin_base_patch4_window7_224_22k.pth
│   │   ├── vit_base_p16_224_in1k.pth
│   │   └── vit_base_p16_224_in22k.pth
│   ├── model_zoo
│   │   ├── __init__.py
│   │   ├── swin.py
│   │   └── vit.py
│   └── prompters.py
├── README.md
├── scripts
│   ├── run_distributed_gpu.sh
│   ├── run_one_gpu.sh
│   └── run_sbatch.sh
└── utils
    ├── distributed.py
    ├── file_io.py
    ├── logging.py
    ├── lr.py
    └── seed.py

Dataset Preparation

Datasets are sourced from torchvision and downloaded automatically. For more details, please refer to torchvision datasets. A custom dataset directory can be set with --base_dir when running the code.
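
For example, CIFAR-100 can be pre-downloaded with torchvision before training (the ./datasets path is illustrative; pass the same path via --base_dir so the loaders find it):

from torchvision import datasets

# Downloads the archives on first use; subsequent runs reuse them.
datasets.CIFAR100(root="./datasets", train=True, download=True)
datasets.CIFAR100(root="./datasets", train=False, download=True)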

Pre-trained Model Preparation

The pre-trained vision models used can be downloaded from the provided links and should be placed in models/checkpoints/.

Pre-trained Models

Backbone     Pre-trained Objective    md5sum
vit-b-1k     Supervised               4423d3eecb5a0e00c353217a992d300f
vit-b-22k    Supervised               b3563fde27a47c2017e78044a0a72748
swin-b-1k    Supervised               733836a1b1b88a8ec21da708c35bd29e
swin-b-22k   Supervised               bf9cc182ae5e417f97390e2b21a0eb09

All of the models are available on Google Drive.
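
After downloading, the checksums can be verified with a short script. Note that the mapping from table rows to checkpoint filenames below is an assumption inferred from the names in models/checkpoints/ and should be double-checked:

import hashlib
from pathlib import Path

# Expected md5sums from the table above; the row-to-filename mapping is assumed.
EXPECTED = {
    "vit_base_p16_224_in1k.pth": "4423d3eecb5a0e00c353217a992d300f",        # vit-b-1k
    "vit_base_p16_224_in22k.pth": "b3563fde27a47c2017e78044a0a72748",       # vit-b-22k
    "swin_base_patch4_window7_224.pth": "733836a1b1b88a8ec21da708c35bd29e",     # swin-b-1k
    "swin_base_patch4_window7_224_22k.pth": "bf9cc182ae5e417f97390e2b21a0eb09", # swin-b-22k
}

for name, expected in EXPECTED.items():
    path = Path("models/checkpoints") / name
    if not path.is_file():
        print(f"{name}: missing")
        continue
    md5 = hashlib.md5()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):  # hash in 1 MiB chunks
            md5.update(chunk)
    print(f"{name}: {'OK' if md5.hexdigest() == expected else 'MISMATCH'}")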

Training

Three scripts are provided for training on a single GPU, multiple GPUs, and a Slurm cluster, respectively. These scripts are located in scripts/.

Changeable Arguments

Key arguments are listed in arguments.py. The default settings are configured for training on CIFAR-100 with a ViT-22k backbone; a sketch of a full invocation follows the argument list below.

Important Arguments

  • --erasing_method: Select the erasing method (e.g., lmeraser, random_part_tuning).
  • --base_dir: Directory to store datasets.
  • --test_dataset: Dataset to train and evaluate on.
  • --pretrained_model: Pre-trained model to use.
  • --one_prompt: Use a single prompt instead of one prompt per cluster (default: False).
  • --num_gpus: Number of GPUs to use.
  • --batch_size: Total batch size.
  • --num_epochs: Number of training epochs.
  • --lr: Learning rate.
  • --distance_threshold: Distance threshold for clustering.
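
As noted above, a hypothetical single-GPU invocation might look like the following. It assumes launch.py is the entry point (as the file structure suggests), and every argument value here is illustrative; check arguments.py and scripts/run_one_gpu.sh for the exact names and defaults.

import subprocess

# Illustrative values only; verify flags and defaults in arguments.py.
subprocess.run(
    [
        "python", "launch.py",
        "--erasing_method", "lmeraser",
        "--base_dir", "./datasets",
        "--test_dataset", "cifar100",
        "--pretrained_model", "vit-b-22k",
        "--num_gpus", "1",
        "--batch_size", "128",
        "--num_epochs", "50",
        "--lr", "0.001",
        "--distance_threshold", "10",
    ],
    check=True,
)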

Acknowledgement

This repository builds in part on VP, VPT, and DAM-VP. We thank the authors for their impressive work!

License

This code is released under the MIT License (see LICENSE file for details).

Citation

If you find this work helpful, please consider citing:

@inproceedings{xu2025lmeraser,
  title={LMEraser: Large Model Unlearning through Adaptive Prompt Tuning},
  author={Xu, Jie and Wu, Zihan and Wang, Cong and Jia, Xiaohua},
  booktitle={Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS)},
  year={2025},
  address={Mai Khao, Thailand}
}
