[ACL'25 Findings]MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models
The Hong Kong University of Science and Technology · Tongji University
Official implementation of "MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models".
Our code is adapted from Liu's baseline implementation here and Huang's SRF-on implementation here. Thanks a lot for their efforts!
- 16 Feb, 2025 : Paper released on arXiv.
- 16 May, 2025 : Paper accepted to ACL 2025 Findings.
- 21 May, 2025 : Code published.
This repository contains the official implementation of the following paper:
MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models https://arxiv.org/abs/2502.11051
Abstract: Recent progress in Machine Unlearning (MU) has introduced solutions for the selective removal of private or sensitive information encoded within deep neural networks. Nonetheless, MU for Multimodal Large Language Models (MLLMs) remains in its nascent phase. Therefore, we propose to reformulate the task of multimodal MU in the era of MLLMs, which aims to erase only the visual patterns associated with a given entity while preserving the corresponding textual knowledge encoded within the original parameters of the language model backbone. Furthermore, we develop a novel geometry-constrained gradient ascent method, MMUnlearner. It updates the weights of MLLMs with a weight saliency map jointly restricted by the remaining concepts and textual knowledge during unlearning, thereby preserving parameters essential for non-target knowledge. Extensive experiments demonstrate that MMUnlearner surpasses baselines that directly finetune MLLMs with VQA data through Gradient Ascent (GA) or Negative Preference Optimization (NPO), across all evaluation dimensions. Our code will be released upon acceptance.
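In essence, MMUnlearner freezes parameters that are salient for retained (textual and non-target) knowledge and performs gradient ascent on the forget set over the remaining parameters. The sketch below is a minimal illustration of that idea, not the repository code; the helper names, the thresholding rule, and the plain SGD-style update are assumptions made for exposition.

```python
import torch


def build_saliency_mask(model, retain_loss_fn, threshold=1e-4):
    """Mark parameters that are NOT salient for retained (textual / non-target) knowledge.

    mask[name] == 1 means the parameter may be updated during unlearning;
    mask[name] == 0 means it is frozen to preserve non-target knowledge.
    """
    model.zero_grad()
    retain_loss_fn(model).backward()  # gradients w.r.t. retained concepts / textual knowledge
    mask = {}
    for name, p in model.named_parameters():
        saliency = p.grad.detach().abs() if p.grad is not None else torch.zeros_like(p)
        mask[name] = (saliency < threshold).float()
    model.zero_grad()
    return mask


def masked_ga_step(model, forget_loss_fn, mask, lr=1e-5):
    """One gradient-ascent step on the forget set, restricted to non-salient parameters."""
    model.zero_grad()
    (-forget_loss_fn(model)).backward()  # ascent on the forget loss = descent on its negation
    with torch.no_grad():
        for name, p in model.named_parameters():
            if p.grad is not None:
                p -= lr * p.grad * mask[name]  # update only where the mask allows
    model.zero_grad()
```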
First, download the following datasets:
- MLLMU-Bench: MLLMU-Bench Dataset
- CLEAR: CLEAR Dataset
Then, move them to the following directories:
- data/MLLMU-Bench
- data/CLEAR
To obtain the vanilla models, use the following commands:
For MLLMU-Bench:
python MLLMU_finetune.py --model_id path_to_original_model --forget_split_ratio 5 --save_dir path_to_mllmu_vanilla --batch_size 4 --lr 1e-5 --num_epochs 1 --data_split_dir data/MLLMU-Bench
For CLEAR:
python CLEAR_reference.py --model_id path_to_original_model --forget_ratio 5 --save_dir path_to_clear_vanilla --batch_size 4 --lr 1e-5 --num_epochs 1
To run the GA baseline:
For MLLMU-Bench:
python MLLMU_GA.py --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --data_split_dir data/MLLMU-Bench --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --ans_only True
For CLEAR:
python CLEAR_GA.py --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --data_folder data/CLEAR --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --ans_only True
ans_only: When True, the loss is computed only on answer tokens; otherwise, it is computed on all text tokens.
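The sketch below illustrates how answer-only loss masking is typically implemented: prompt tokens are assigned the ignore label -100 so the language-modeling loss only covers answer tokens. It is an illustration, not the repository code; `prompt_len` is a hypothetical argument, and the actual scripts may construct labels differently.

```python
import torch


def build_labels(input_ids: torch.Tensor, prompt_len: int, ans_only: bool) -> torch.Tensor:
    """Copy the inputs as labels; hide prompt tokens from the loss when ans_only is set."""
    labels = input_ids.clone()
    if ans_only:
        labels[:, :prompt_len] = -100  # -100 is ignored by the cross-entropy loss
    return labels
```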
To run the GA_Diff baseline:
For MLLMU-Bench:
python MLLMU_GA_Diff.py --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --data_split_dir data/MLLMU-Bench --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --ans_only True
For CLEAR:
python CLEAR_GA_Diff.py --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --data_folder data/CLEAR --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --ans_only True
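For context, Gradient Difference is commonly formulated as gradient ascent on the forget set combined with a standard loss on the retain set. The one-line sketch below states that objective under the assumption that the `*_GA_Diff.py` scripts follow this formulation; `retain_weight` is an illustrative hyperparameter.

```python
def ga_diff_loss(forget_loss, retain_loss, retain_weight=1.0):
    """Minimizing this raises the forget-set loss (ascent) while keeping the retain-set loss low."""
    return -forget_loss + retain_weight * retain_loss
```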
To run the KL_Min baseline:
For MLLMU-Bench:
python MLLMU_KL_Min.py --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --data_split_dir data/MLLMU-Bench --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --ans_only True
For CLEAR:
python CLEAR_KL_Min.py --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --data_folder data/CLEAR --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --ans_only True
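For context, KL Minimization is commonly formulated as gradient ascent on the forget set plus a KL term that keeps retain-set predictions close to the vanilla (pre-unlearning) model. The sketch below states that objective and assumes the `*_KL_Min.py` scripts follow this formulation; `kl_weight` is an illustrative hyperparameter.

```python
import torch.nn.functional as F


def kl_min_loss(forget_loss, retain_logits, vanilla_retain_logits, kl_weight=1.0):
    """Ascend on the forget set while matching the vanilla model's retain-set distribution."""
    kl = F.kl_div(
        F.log_softmax(retain_logits, dim=-1),          # current model, log-probs
        F.log_softmax(vanilla_retain_logits, dim=-1),  # vanilla model, log-probs
        reduction="batchmean",
        log_target=True,
    )
    return -forget_loss + kl_weight * kl
```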
To run the NPO baseline, first get the reference model:
For MLLMU-Bench:
python MLLMU_reference.py --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --data_split_dir data/MLLMU-Bench --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1
For CLEAR:
python CLEAR_reference.py --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1
Then, run the NPO baseline:
For MLLMU-Bench:
python MLLMU_NPO.py --oracle_model_id path_to_ref_model --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --data_split_dir data/MLLMU-Bench --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --ans_only True
For CLEAR:
python CLEAR_NPO.py --oracle_model_id path_to_ref_model --model_id path_to_original_model --vanilla_dir path_to_vanilla_model --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --ans_only True
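For reference, the standard NPO objective contrasts the current model with the frozen reference (oracle) model on the forget set. The sketch below is the textbook formulation and is only assumed to match the `*_NPO.py` scripts up to implementation details; `beta` is the NPO temperature.

```python
import torch.nn.functional as F


def npo_loss(policy_logprobs, ref_logprobs, beta=0.1):
    """policy_logprobs / ref_logprobs: log-probabilities of forget answers under the
    current model and the frozen reference model (e.g., summed over answer tokens)."""
    log_ratio = policy_logprobs - ref_logprobs
    # Equivalent to (2 / beta) * log(1 + (pi_theta / pi_ref) ** beta), averaged over the batch.
    return -(2.0 / beta) * F.logsigmoid(-beta * log_ratio).mean()
```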
To generate the gradient mask, run:
cd data_process
python MLLMU_gen_mask.py
python CLEAR_gen_mask.py
To run the selective unlearning process:
For MLLMU-Bench:
python MLLMU_manifold.py --model_id path_to_original_model --data_split_dir data/MLLMU-Bench --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --grad_mask_path "path_to/mllmu_language_mask.pt" --ans_only True
For CLEAR:
python CLEAR_manifold.py --model_id path_to_original_model --forget_split_ratio 05 --save_dir path_to_save_dir --batch_size 4 --lr 1e-5 --num_epochs 1 --grad_mask_path "path_to/clear_language_mask.pt" --ans_only True
grad_mask_path: Path to the generated gradient mask, which determines which module(s) are updated. Options: language_mask.pt, vision_mask.pt, both_mask.pt.
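The sketch below shows one way the saved mask could be applied during unlearning, assuming the .pt file stores a dict mapping parameter names to binary tensors (the actual format used by the `*_manifold.py` scripts may differ): gradients of frozen parameters are zeroed between `loss.backward()` and `optimizer.step()`.

```python
import torch


def apply_grad_mask(model, mask_path="path_to/mllmu_language_mask.pt"):
    """Zero gradients of parameters the mask marks as frozen.

    Call after loss.backward() and before optimizer.step(), so the optimizer
    only updates the modules selected by the mask.
    """
    grad_mask = torch.load(mask_path)  # assumed format: {parameter_name: 0/1 tensor}
    for name, p in model.named_parameters():
        if p.grad is not None and name in grad_mask:
            p.grad.mul_(grad_mask[name].to(p.grad.device))
```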
To evaluate the models, use the following commands:
For MLLMU-Bench:
bash MLLMU_eval.sh forget_ratio gpu_id1 gpu_id2 gpu_id3 gpu_id4 "path_to_evaluated_model" "path_to_original_model" "shot_num"
For CLEAR:
bash CLEAR_eval.sh forget_ratio 100-forget_ratio "path_to_evaluated_model" "path_to_original_model" gpu_id1 gpu_id2 gpu_id3 gpu_id4
gpu_id: Specify the GPU to use (0-7).
forget_ratio: Specify the forget ratio (e.g., 05 or 5).
shot_num: Choose between "zero_shot" and "few_shot". For details, see the MLLMU-Bench Issue.