GitHub - doem97/metalora: [CVPR'25 Highlight ]Official code for the paper "Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning"

Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

🌟 CVPR 2025 Highlight 🌟

Zichen Tian¹, Yaoyao Liu², Qianru Sun¹

¹Singapore Management University ²University of Illinois Urbana-Champaign

Official source code for the paper "Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning" (CVPR 2025 Highlight). This repository provides a comprehensive framework for exploring various Parameter-Efficient Fine-Tuning (PEFT) methods on long-tailed datasets and introduces a novel meta-learning approach for optimizing their hyperparameters.

🚀 Getting Started

1. Requirements

Python 3.8
PyTorch 2.0
Torchvision 0.15
Tensorboard
CUDA 11.7

Our experiments were primarily conducted on DGX V100 servers, but most can be reproduced on a single GPU with at least 20GB of memory.

2. Installation

# Create and activate a conda environment
conda create -n metalora python=3.8 -y
conda activate metalora

# Install core dependencies
conda install pytorch==2.0.0 torchvision==0.15.0 pytorch-cuda=11.7 -c pytorch -c nvidia
conda install tensorboard

# Install other requirements
pip install -r requirements.txt

3. Dataset Preparation

We provide a convenient script in data/hf_dataset.sh to download and prepare all datasets from Hugging Face.

First, ensure you have huggingface-cli and the accelerated transfer library hf_transfer installed:

pip install -U "huggingface_hub[cli]"
pip install hf_transfer

Then, execute the script from the repository's root directory:

bash data/hf_dataset.sh

🔧 Usage

Basic Command Structure

The general command to launch an experiment is:

python main.py --dataset [data_config] --model [model_config] --tuner [tuner_config] --opts [OPTIONS]

--dataset: A data config from configs/data (e.g., cifar100_ir100).
--model: A model config from configs/model (e.g., clip_vit_b16).
--tuner: (Optional) A tuner config from configs/tuner (e.g., adaptformer).
--opts: (Optional) A list of key=value pairs to override any setting from the base and YAML configs. For a full list of options, please see the files in the configs directory.

Training Strategies

Our framework supports a wide range of fine-tuning strategies, from classic full fine-tuning to a rich set of PEFT methods.

The issue of non-monotonic complexity of PEFT hyperparameters.

Supported PEFT & Backbones

The following PEFT methods can be applied to CLIP-ViT, timm-ViT, and SatMAE-ViT backbones.

Method Family	Options	Description
Prompt Tuning	`vpt_shallow`, `vpt_deep`	Adds learnable tokens to the input sequence.
Adapter-based	`adapter`, `adaptformer`	Inserts small, trainable modules between transformer layers.
LoRA-based	`lora`, `lora_mlp`, `use_flora`	Uses low-rank decomposition for weight updates. `FLoRA` offers fine-grained control.
Feature Scaling	`ssf_attn`, `ssf_mlp`, `ssf_ln`	Learns to scale and shift features within the network.
Subset Tuning	`bias_tuning`, `ln_tuning`, `mask`	Fine-tunes only a subset of existing parameters (biases, LayerNorms, or a random mask).

Execution Modes

You can control which parts of the model are trained using the --opts flag.

PEFT (Default): If a --tuner is specified, only the PEFT modules and the classifier head are trained.

# Run AdaptFormer on CIFAR-100-LT
python main.py --dataset cifar100_ir100 --model clip_vit_b16 --tuner adaptformer

Full Fine-tuning: To update all weights of the backbone model, set fine_tuning=True.

# Full fine-tuning on Places-LT
python main.py --dataset places_lt --model clip_vit_b16 --opts fine_tuning=True

Linear Probing: To train only the classifier head, set head_only=True.

# Linear probing on CIFAR-100-LT
python main.py --dataset cifar100_ir100 --model clip_vit_b16 --opts head_only=True

Evaluation

Test a trained model: Use test_only=True and specify the model's output directory.

python main.py --dataset [data] --model [model] --opts test_only=True model_dir=path/to/your/checkpoint_dir

Evaluate on the training set: Use test_train=True.

python main.py --dataset [data] --model [model] --opts test_train=True model_dir=path/to/your/checkpoint_dir

💡 Meta-Training for Hyperparameter Optimization

This is the core contribution of our work: a framework to optimize PEFT hyperparameters via bi-level optimization.

Concept

When enabled, the training data is split into a primary training set, a meta-train set, and a meta-validation set. The optimization proceeds in two nested loops:

Inner Loop: The model's standard parameters (e.g., LoRA weights) are trained on batches from the primary training set.
Outer Loop: Periodically, the framework simulates a training step on the meta-train set and evaluates the performance on the meta-validation set. The resulting validation loss is used to update the meta-parameters (e.g., the learning rates or ranks within the PEFT modules).

Overview of our meta-learning framework for optimizing PEFT hyperparameters.

How to Use

Enable meta-training by setting use_meta=True.

Example: Run FLoRA with Meta-Training on CIFAR-100-LT

python main.py \
    --dataset cifar100_ir100 \
    --model clip_vit_b16 \
    --tuner flora \
    --opts use_meta=True meta_lr=0.001

Key Meta-Training Options

use_meta: Set to True to enable the feature.
meta_data_ratio: Fraction of data to reserve for meta-learning (e.g., 0.1 for 10%).
meta_lr: The learning rate for the outer-loop meta-optimizer.
meta_update_freq: How many epochs between each meta-optimization step.
meta_inner_steps: Number of optimization steps in the inner loop during a meta-update.

✍️ Citation

If you find this work useful for your research, please consider citing our paper:

@InProceedings{Tian_2025_CVPR,
    author    = {Tian, Zichen and Liu, Yaoyao and Sun, Qianru},
    title     = {Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning},
    booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
    month     = {June},
    year      = {2025},
    pages     = {23037-23047}
}

🙏 Acknowledgment

We gratefully acknowledge the support from the DSO research grant awarded by DSO National Laboratories, Singapore. We also extend sincere gratitude to Prof. Antoine Ledent (Singapore Management University) for his insightful guidance of PEFT optimization during the rebuttal. We thank the authors for the following repositories for code reference: [OLTR], [Classifier-Balancing], [Dassl], [CoOp]. Our code is largely re-implement based on [LIFT], many thanks to LIFT authors' significant contributions!

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
assets		assets
clip		clip
configs		configs
data		data
datasets		datasets
exp		exp
ipython		ipython
models		models
output		output
utils		utils
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
main.py		main.py
trainer.py		trainer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

🌟 CVPR 2025 Highlight 🌟

🚀 Getting Started

1. Requirements

2. Installation

3. Dataset Preparation

🔧 Usage

Basic Command Structure

Training Strategies

Supported PEFT & Backbones

Execution Modes

Evaluation

💡 Meta-Training for Hyperparameter Optimization

Concept

How to Use

Key Meta-Training Options

✍️ Citation

🙏 Acknowledgment

About

Uh oh!

Releases

Packages

Languages

License

doem97/metalora

Folders and files

Latest commit

History

Repository files navigation

Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

🌟 CVPR 2025 Highlight 🌟

🚀 Getting Started

1. Requirements

2. Installation

3. Dataset Preparation

🔧 Usage

Basic Command Structure

Training Strategies

Supported PEFT & Backbones

Execution Modes

Evaluation

💡 Meta-Training for Hyperparameter Optimization

Concept

How to Use

Key Meta-Training Options

✍️ Citation

🙏 Acknowledgment

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages