This is the official PyTorch implementation of our under-review paper "Towards Language-guided Visual Recognition via Dynamic Convolutions". In this paper, we propose a compact and unified vision-language network, termed LaConvNet. LaConvNets unify visual recognition and multi-modal reasoning in one forward structure via a novel language-guided convolution (LaConv). On 9 benchmarks, LaConvNets demonstrate better trade-offs between efficiency and performance than existing methods.
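For intuition, below is a minimal PyTorch sketch of a language-guided dynamic convolution, in which per-sample depthwise kernels are predicted from a sentence-level language feature. All module and variable names are illustrative, and the actual LaConv design (e.g., how kernels are generated and normalized) follows the paper and the code in this repository.

```python
# A minimal sketch (not the exact LaConv in the paper): convolution kernels are
# generated from a language feature, so the visual filtering is conditioned on text.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LanguageGuidedConv(nn.Module):
    def __init__(self, in_channels, out_channels, kernel_size, lang_dim):
        super().__init__()
        self.in_channels = in_channels
        self.kernel_size = kernel_size
        # Predict one depthwise kernel per input channel from the language feature.
        self.kernel_gen = nn.Linear(lang_dim, in_channels * kernel_size * kernel_size)
        # A pointwise convolution mixes channels after the language-conditioned filtering.
        self.pointwise = nn.Conv2d(in_channels, out_channels, 1)

    def forward(self, x, lang_feat):
        # x: (B, C, H, W) visual features; lang_feat: (B, lang_dim) sentence feature.
        b, c, h, w = x.shape
        k = self.kernel_size
        weight = self.kernel_gen(lang_feat).view(b * c, 1, k, k)
        # Grouped conv applies a distinct language-conditioned kernel per sample and channel.
        x = x.reshape(1, b * c, h, w)
        x = F.conv2d(x, weight, padding=k // 2, groups=b * c)
        x = x.view(b, c, h, w)
        return self.pointwise(x)

# Usage: fuse a language embedding into visual features.
feat = torch.randn(2, 64, 32, 32)
lang = torch.randn(2, 512)
laconv = LanguageGuidedConv(64, 64, 3, 512)
out = laconv(feat, lang)  # (2, 64, 32, 32)
```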
- (2023/4/13) Release our LaConvNet project.
pip install -r requirements.txt
wget https://github.com/explosion/spacy-models/releases/download/en_vectors_web_lg-2.1.0/en_vectors_web_lg-2.1.0.tar.gz -O en_vectors_web_lg-2.1.0.tar.gz
pip install en_vectors_web_lg-2.1.0.tar.gz
pip install cupy-cuda11x==11.6
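After installation, you can quickly check that the GloVe vectors are importable. The snippet assumes a spaCy 2.x environment compatible with `en_vectors_web_lg-2.1.0`; the 300-dimensional vector size is the published size of that package.

```python
# Sanity check: load the installed word-vector package and read one token vector.
import spacy

nlp = spacy.load("en_vectors_web_lg")
doc = nlp("a quick sanity check sentence")
print(doc[0].vector.shape)  # should print (300,)
```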
- Follow the instructions in DATA_PRE_README.md to prepare the necessary training data.
- Prepare your settings. To train a model, modify `./config/config.yaml` to adjust the settings you want.
- Train the model. Run `train.py` under the main folder to start training:
python train.py --config ./config/config.yaml
- Test the model. Then, run `test.py` with:
python test.py --eval-weights ./weights/det_best.pth
- Training log. Logs are stored in the `./logs` directory, which records the detailed training curve and accuracy per epoch. If you want to log the visualizations, please set `LOG_IMAGE` to `True` in `config.yaml`.
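As an example, the sketch below shows one way to flip the `LOG_IMAGE` flag programmatically before launching training. The key name comes from the note above; the flat YAML layout of `config.yaml` is an assumption, so adapt the path into the config as needed.

```python
# Minimal sketch: enable image logging by editing config.yaml before training.
# LOG_IMAGE is the key mentioned above; a flat YAML layout is assumed.
import yaml

cfg_path = "./config/config.yaml"
with open(cfg_path) as f:
    cfg = yaml.safe_load(f)

cfg["LOG_IMAGE"] = True  # log visualizations during training

with open(cfg_path, "w") as f:
    yaml.safe_dump(cfg, f)
```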
We provide the results of LaConvNets on referring expression comprehension (REC) and referring expression segmentation (RES). Results and pre-trained checkpoints are available in the Model Zoo.
