Code DOI: https://zenodo.org/badge/latestdoi/585223762
OpenReview URL: https://openreview.net/forum?id=nsrHznwHhl
Paper PDF: https://openreview.net/pdf?id=nsrHznwHhl
This repository is a heavily refactored implementation of Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention, produced as part of the Machine Learning Reproducibility Challenge (MLRC) 2022. The original code from the authors can be found here.
One of our main contributions is making the experiments and code easy to reproduce. We have therefore taken great care to keep the following guide as concise and easy to follow as possible. If you have any questions or comments, don't hesitate to contact us (preferably via email).
To set up the correct environment, we recommend installing the Conda package manager. Once it is installed, create a new environment with the following commands:
conda config --set channel_priority flexible
conda env create -f fact2023.yml
Be aware that this can take a while (around 10 to 40 minutes, depending on hardware and network speed). Once the environment is created, activate it with:
conda activate fact2023
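To quickly verify that the environment works and that a GPU is visible (the timings below assume a CUDA-capable GPU such as an A100 or Titan RTX), you can run a short check. This is only a minimal sketch and assumes that PyTorch is installed as part of the fact2023 environment:

# check_env.py -- minimal sanity check; assumes PyTorch is part of the fact2023 environment
import torch

print(f"PyTorch version: {torch.__version__}")
print(f"CUDA available:  {torch.cuda.is_available()}")
if torch.cuda.is_available():
    print(f"GPU: {torch.cuda.get_device_name(0)}")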
The original paper uses two datasets: Common Objects in Context (COCO) and Visual Genome (VG). We recommend downloading them with the scripts below, which also preprocess the data. Alternatively, you can download the datasets manually from their respective websites and preprocess them yourself. The datasets will be downloaded to the data folder.
Common Objects in Context (COCO) is a large-scale object detection, segmentation, and captioning dataset. To download COCO and preprocess the dataset, run:
sh ./setup_coco.sh
Around 20 GB of data will be downloaded and processed.
Visual Genome (VG) is a large-scale dataset of images annotated with object and region bounding boxes, object and attribute labels, and image-level relationships. To download VG and preprocess the dataset, run:
sh ./setup_vg.sh
Around 15 GB of data will be downloaded and processed.
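As a quick sanity check after both downloads finish, you can confirm that the dataset folders exist and are non-empty. The snippet below is a rough sketch only; the exact subfolder names inside data depend on the setup scripts, so the data/coco and data/vg paths used here are assumptions:

# check_data.py -- rough sketch; data/coco and data/vg are assumed folder names
from pathlib import Path

for name in ("coco", "vg"):
    folder = Path("data") / name
    if not folder.is_dir():
        print(f"{folder}: missing -- rerun the corresponding setup script")
        continue
    files = [p for p in folder.rglob("*") if p.is_file()]
    size_gb = sum(p.stat().st_size for p in files) / 1e9
    print(f"{folder}: {len(files)} files, ~{size_gb:.1f} GB")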
Train an explainer with:
python3 train_explainer.py
Explain a target filter of any model with:
python3 infer_filter.py
To train an explainer that explains ResNet-18's layer 4 using the VG dataset, run:
python3 train_explainer.py --model resnet18 --layer-target layer4 --layer-classifier fc --refer vg --epochs 10
Each epoch takes around 30 minutes on a single A100 GPU, and around an hour on a Titan RTX GPU. The trained explainer will be saved to the outputs folder.
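The saved checkpoint path is needed later for inference (the --path-model flag below). If you are unsure where the checkpoint ended up, the following sketch lists the most recently modified files under outputs/; the exact file names and extensions are determined by train_explainer.py and are not assumed here:

# list_outputs.py -- list the most recently modified files under outputs/
from pathlib import Path

out_dir = Path("outputs")
files = sorted((p for p in out_dir.rglob("*") if p.is_file()),
               key=lambda p: p.stat().st_mtime, reverse=True)
if not files:
    print("No files found under outputs/ -- has training finished?")
for p in files[:5]:
    print(p)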
To train an explainer that explains AlexNet's features layer using the COCO dataset, run:
python3 train_explainer.py --model alexnet --layer-target features --layer-classifier classifier --refer coco --epochs 10
As with the ResNet-18 run, each epoch takes around 30 minutes on a single A100 GPU (around an hour on a Titan RTX), and the trained explainer will be saved to the outputs folder.
To run inference on the trained ResNet-18 layer 4 explainer and evaluate it with recall@20, use the following command:
python3 infer_filter.py --model resnet18 --layer-target layer4 --layer-classifier fc --refer vg --path-model "path-to-trained-model-here" --s 20
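Here, recall@20 (the --s 20 flag) measures the fraction of ground-truth concept words for a filter that appear among the explainer's top-20 predicted words. As an illustration of the metric only (not the repository's evaluation code), a minimal recall@k sketch with made-up word lists:

# recall_at_k.py -- illustrative recall@k computation (not the repo's evaluation code)
def recall_at_k(predicted, ground_truth, k=20):
    """Fraction of ground-truth items found among the top-k predictions."""
    top_k = set(predicted[:k])
    return len(top_k & set(ground_truth)) / len(ground_truth)

# Hypothetical example: words ranked by the explainer vs. annotated concepts.
predicted = ["dog", "grass", "ball", "tree", "sky"]
ground_truth = ["dog", "ball", "frisbee"]
print(f"{recall_at_k(predicted, ground_truth, k=5):.2f}")  # 2 of 3 ground-truth words retrieved

If you build on this repository, please cite the original paper: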
@inproceedings{yang2022explaining,
author = {Yang, Yu and Kim, Seungbae and Joo, Jungseock},
title = {Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention},
booktitle = {2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2022},
pages = {8323-8333},
doi = {10.1109/CVPR52688.2022.00815}
}