# Less is More: Pay Less Attention in Vision Transformers

Training and evaluation code for LIT-S, LIT-M and LIT-B.

## Training

First, activate your Python environment:

```bash
conda activate lit
```

Make sure you have set the correct ImageNet `DATA_PATH` in `configs/*.yaml`.
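The exact layout of the config files is not reproduced here, but the entry should simply point at your local ImageNet root; as a rough sketch (the placement of the key within the file is assumed), it might look like:

```yaml
# DATA_PATH should point at your local ImageNet root directory
# (the surrounding structure of the config file is assumed here)
DATA_PATH: /path/to/imagenet
```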

To train LIT-S:

```bash
bash scripts/lit-s.sh [GPUs]
```

To train LIT-M:

```bash
bash scripts/lit-m.sh [GPUs]
```

To train LIT-B:

```bash
bash scripts/lit-b.sh [GPUs]
```
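For example, to train LIT-S with 8 GPUs, you would run something like:

```bash
# [GPUs] is the number of GPUs to train on, as in the evaluation example below
bash scripts/lit-s.sh 8
```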

Note: We use a total batch size of 1024 for all experiments on ImageNet. Depending on your hardware, you may need a different per-GPU batch size, which you can set by editing `BATCH_SIZE` in `configs/*.yaml`. For example, by setting `BATCH_SIZE` to 64 and training with 8 GPUs, your total batch size is 512.
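As a sketch of what that edit might look like (only the `BATCH_SIZE` key is taken from the note above; any surrounding keys are assumed):

```yaml
# Per-GPU batch size; with 8 GPUs this gives a total batch size of 8 * 64 = 512
BATCH_SIZE: 64
```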

## Evaluation

We provide scripts to evaluate LIT-S, LIT-M and LIT-B. To evaluate a model, you can run:

```bash
bash scripts/lit-b-eval.sh [GPUs] [path/to/checkpoint]
```

For example, to evaluate LIT-B with 1 GPU, you can run:

```bash
bash scripts/lit-b-eval.sh 1 checkpoint/lit_b.pth
```

This should give:

```
* Acc@1 83.366 Acc@5 96.254
Accuracy of the network on the 50000 test images: 83.4%
```

Results could be slightly different depending on your environment.
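LIT-S and LIT-M can be evaluated in the same way. Assuming their evaluation scripts and checkpoint files follow the same naming pattern as LIT-B (the names below are not confirmed by this README), evaluating LIT-S on a single GPU would look like:

```bash
# Hypothetical names, by analogy with scripts/lit-b-eval.sh and checkpoint/lit_b.pth
bash scripts/lit-s-eval.sh 1 checkpoint/lit_s.pth
```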

## Results

| Name  | Params (M) | FLOPs (G) | Top-1 Acc. (%) | Model               | Log |
|-------|------------|-----------|----------------|---------------------|-----|
| LIT-S | 27         | 4.1       | 81.5           | google drive/github | log |
| LIT-M | 48         | 8.6       | 83.0           | google drive/github | log |
| LIT-B | 86         | 15.0      | 83.4           | google drive/github | log |