Emergence of Concepts in Vision Transformers

This is the official repository for the paper "From Colors to Classes: Emergence of Concepts in Vision Transformers" by Teresa Dorszewski, Lenka Tětková, Robert Jenssen, Lars Kai Hansen, Kristoffer Knutsen Wickstrøm.

The basis of this code is from the github repo Trustworthy-ML-Lab/CLIP-dissect for the paper CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks published at ICLR 2023. It was adapted to support models from huggingface and save the top activating images for each neuron.

The labeled neuron lists for all models from our analysis can be found in the results folder. The concept set and categoization of concepts is in the data folder.

All figures can be reproduced with the final_figures.ipynb and vis_neurons.ipynb notebook.

Quick guide:

Analysing your own model

Add your path to the datasets and models in data_utils.py
Dissect the model by running: python describe_neurons.py --target_model {model_name} --target_layers {layers} --d_probe "imagenet_broden" --concept_set "data/20k.txt"

You can run this with any model on huggingface, just insert the right layer names, e.g. for vit: vit.encoder.layer[0],vit.encoder.layer[1],...

The probing set and concept set can be changed as liked.

Note: The code is written to run on GPU (with cuda), it will take very long to run on a CPU, if it even runs at all.

Reproducing our results

To reproduce the results saved in results, run run_clipdissect.sh. It requires a lot of memory, so it might be needed to split by layers.

Cite this work

Dorszewski, Teresa, et al. "From colors to classes: Emergence of concepts in vision transformers." arXiv preprint arXiv:2503.24071 (2025)

@article{dorszewski2025colors,
  title={From colors to classes: Emergence of concepts in vision transformers},
  author={Dorszewski, Teresa and T{\v{e}}tkov{\'a}, Lenka and Jenssen, Robert and Hansen, Lars Kai and Wickstr{\o}m, Kristoffer Knutsen},
  journal={arXiv preprint arXiv:2503.24071},
  year={2025}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Emergence of Concepts in Vision Transformers

Quick guide:

Analysing your own model

Reproducing our results

Cite this work

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors 1

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
results		results
.gitignore		.gitignore
README.md		README.md
data_utils.py		data_utils.py
describe_neurons.py		describe_neurons.py
dlbroden.sh		dlbroden.sh
final_figures.ipynb		final_figures.ipynb
requirements.txt		requirements.txt
run_clipdissect.sh		run_clipdissect.sh
similarity.py		similarity.py
utils.py		utils.py
vis_neurons.ipynb		vis_neurons.ipynb

Folders and files

Latest commit

History

Repository files navigation

Emergence of Concepts in Vision Transformers

Quick guide:

Analysing your own model

Reproducing our results

Cite this work

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors 1

Languages

Packages