
Put the Space of LoRA Initialization to the Extreme to Preserve Pre-trained Knowledge (AAAI 2026)

Getting Started

Download the repo and install dependencies.
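For example, the repository can be cloned from GitHub (note that git creates a directory named LoRA-Null, while the commands below assume LoRA_Null; adjust the path as needed):

git clone https://github.com/HungerPWAY/LoRA-Null.git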

cd LoRA_Null
pip install -r requirements.txt

Step 1: LoRA-Null Initialization

sh step1.sh

CUDA_VISIBLE_DEVICES=0 python build_corda.py \
    --model_id "meta-llama/Llama-2-7b-hf" \
    --singular_aware \
    --r {rank} \
    --use_cache \
    --calib_dataset "nqopen" \
    --calib_loader_size 256 \
    --save_model \
    --save_path {path_to_decomposed_model}

Arguments:

  • --model_id is the pre-trained model to decompose.
  • --singular_aware adopts our LoRA-Null initialization (a conceptual sketch follows this list).
  • --r is the LoRA rank, e.g. 128.
  • --use_cache reuses the dataloader and covariance matrices cached in Adapter/cache, so the covariance matrices are not computed again.
  • --calib_dataset specifies the dataset from which calibration samples are drawn to compute the covariance matrices. We use the QA dataset "nqopen".
  • --calib_loader_size is the number of sampled calibration examples.
  • --save_model saves the initialized model to --save_path.
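The sketch below is a conceptual illustration of a null-space-aware initialization, not the code in build_corda.py: the LoRA down-projection A is aligned with the directions of the calibration-input covariance that carry the least energy, so the adapter output B @ A @ x stays close to zero on pre-training-like inputs at initialization. Function and variable names are illustrative.

# Conceptual sketch only; see build_corda.py for the actual LoRA-Null logic.
import torch

def null_space_lora_init(cov: torch.Tensor, d_out: int, r: int):
    """Initialize LoRA factors so the adapter is inactive on calibration-like inputs.

    cov:   (d_in, d_in) covariance of a layer's inputs, collected on calibration data
    d_out: output dimension of the layer
    r:     LoRA rank
    """
    # Eigendecomposition of the input covariance; eigenvalues come back in ascending order.
    eigenvalues, eigenvectors = torch.linalg.eigh(cov)
    # Rows of A span the r directions with the smallest eigenvalues, i.e. an
    # (approximate) null space of the calibration activations.
    A = eigenvectors[:, :r].T.contiguous()      # (r, d_in)
    # With B = 0, the update B @ A contributes nothing at initialization, and
    # A @ x remains near zero for inputs resembling the calibration data.
    B = torch.zeros(d_out, r)
    return A, B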

Step 2: Adapter Training

sh step2.sh
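step2.sh wraps the repo's own training script. As a rough illustration of what it operates on, the sketch below loads the step-1 output as a trainable PEFT model; this assumes the checkpoint is saved in standard PEFT format (the actual script may load it differently), and all paths are placeholders.

# Illustrative only; step2.sh is the authoritative training entry point.
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", torch_dtype=torch.bfloat16
)
model = PeftModel.from_pretrained(base, "path/to/decomposed_model", is_trainable=True)
model.print_trainable_parameters()  # only the low-rank adapter factors should be trainable
# Fine-tune with any standard causal-LM training loop (or the HF Trainer),
# then save the adapter, e.g. model.save_pretrained("path/to/adapter").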

Step 3: Merging

After training, the LoRA adapter can be merged with the base model by running: sh step3.sh
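For reference, a minimal sketch of the same operation using PEFT is shown below, assuming the trained adapter is a standard PEFT checkpoint; paths are placeholders and step3.sh remains the authoritative procedure.

# Minimal merge sketch.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", torch_dtype=torch.float16
)
model = PeftModel.from_pretrained(base, "path/to/adapter")
merged = model.merge_and_unload()             # fold the low-rank update into the base weights
merged.save_pretrained("path/to/merged_model")

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
tokenizer.save_pretrained("path/to/merged_model")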

Step 4: Inference on World Knowledge

Inference on world knowledge benchmarks is based on EleutherAI/lm-evaluation-harness. For example, we evaluate by running: sh step4.sh

accelerate launch -m lm_eval \
    --model hf \
    --model_args pretrained={path_to_merged_model},trust_remote_code=True,dtype=float16 \
    --output_path {result_path}/nq_open.json \
    --tasks nq_open,triviaqa \
    --batch_size auto \
    --max_batch_size 8 \
    --device cuda
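lm-evaluation-harness has to be installed beforehand; assuming the standard PyPI package name, this is typically:

pip install lm-eval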

Step 5: Inference on Downstream Tasks

Inference on Math:

Evaluation on GSM8K and MATH can be performed by running: sh step5.sh

sh tools/inference_Math.sh {path_to_merged_model}

Inference on Code and Instruction Following:

Evaluation on HumanEval and MBPP is based on bigcode-evaluation-harness. Evaluation on MT-Bench is based on FastChat. We use their default settings for evaluation.
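For reference, an illustrative bigcode-evaluation-harness invocation for HumanEval is sketched below; the flag names follow that project's documented interface and may differ between versions, and {path_to_merged_model} is the model merged in Step 3.

accelerate launch main.py \
    --model {path_to_merged_model} \
    --tasks humaneval \
    --allow_code_execution \
    --batch_size 1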

Acknowledgement

Our code is modified from https://github.com/iboing/CorDA
