Semantic Masked Autoencoders

This repository hosts the code for our accepted submission to the ML Reproducibility Challenge 2022: Masked Autoencoders are Small Scale Vision Learners: A Reproducibility Study Under Resource Constraints

Table of Contents

  1. Development Setup
  2. Model Checkpoints
  3. Project Structure
  4. How To Run
  5. Experiments

Development Setup

Setting up the environment

pip

To set up using pip, run the following:

cd <project_root>
pip install -r requirements.txt

conda

To set up using conda, run the following:

cd ./environment
conda env create -f [os].yaml

where [os] is the YAML file in the environment directory that matches your operating system.

Datasets

For our experiments we used two datasets, both fetched automatically on first use (a quick directory check is sketched after this list):

  • Tiny-ImageNet (TIN): downloaded automatically from Hugging Face into the data/tiny-imagenet-200 directory, which our dataloader creates if it does not exist.
  • CUB-200 (CUB): downloaded automatically from caltech.edu into the data/cub-200-2011 directory, which our dataloader also creates if it does not exist.
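
If the automatic download succeeded, both directories should exist under data. The following is a minimal sanity-check sketch (not part of the repository; it only uses the paths listed above) that you can run from the project root:

from pathlib import Path

# Expected dataset locations relative to the project root
# (the dataloaders create and populate these on first use).
data_dirs = {
    "Tiny-ImageNet": Path("data/tiny-imagenet-200"),
    "CUB-200-2011": Path("data/cub-200-2011"),
}

for name, path in data_dirs.items():
    status = "found" if path.is_dir() and any(path.iterdir()) else "missing or empty"
    print(f"{name}: {path} -> {status}")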

Model Checkpoints

Below we provide checkpoints for our trained models on the corresponding datasets. Please place them inside the checkpoints directory under the project root.

Model           Pre-Train   Fine-Tune   Masking Method   Link
SqueezeNet      -           TIN         -                Google Drive
MAE             TIN         TIN         Random           Google Drive
MAE             TIN         CUB         Random           Google Drive
SMAE (α=0.5)    TIN         TIN         Random           Google Drive
SMAE (α=0.5)    TIN         CUB         Block            Google Drive

Feel free to contact any of the authors if the links above stop working.
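
The checkpoints are PyTorch .pth files (see the evaluation example further below). A minimal inspection sketch, assuming they are standard torch.save outputs (the exact dictionary layout is defined by src/main.py and src/evaluate.py, so treat whatever keys are printed as informational only):

import torch

# Inspect a downloaded checkpoint from the project root
# (assumes the file is a regular torch.save artifact).
ckpt = torch.load("checkpoints/mae_e7d2_128_finetune.pth", map_location="cpu")
if isinstance(ckpt, dict):
    print(list(ckpt.keys()))
else:
    print(type(ckpt))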

Project Structure

Under the project root one can find the following directory tree:

📦checkpoints: model checkpoints
📂config: config files
 ┣ 📂model: for different model versions
 ┗ 📂train: for different training hyperparameters
📂data
 ┣ 📦cub-200-2011: CUB dataset files
 ┗ 📦tiny-imagenet-200: TIN dataset files
📂src: sources root; make sure it is on your PYTHONPATH or start from inside it (see the example after the tree)
 ┣ 📂dataset: dataloader classes
 ┣ 📂loss: loss classes
 ┣ 📂model: model classes and their dependencies
 ┣ 📂utilities: utility methods and classes
 ┃ 📜...
 ┣ 📜evaluate.py: model evaluation entry point
 ┗ 📜main.py: training entry point
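
As noted in the tree above, src has to be importable. If you do not want to change into src every time, one option (a hedged suggestion for Unix-like shells, not taken from the repository docs) is to export it from the project root:

export PYTHONPATH="$PWD/src:$PYTHONPATH"

Running the entry points from inside src, as described in the next section, works without this step.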

How To Run

After making sure that the development environment is set up and that the SqueezeNet checkpoint exists in the checkpoints directory, you can train an MAE model using the main.py script. Run it from inside the src directory.

To evaluate a trained model from a saved checkpoint, you can use the evaluate.py script.

CLI arguments

The main.py script accepts the following CLI arguments (enumerating all the possible values):

usage: main.py 
	[--model_key MODEL_KEY]         mae, mae-blockmask, squeeze
	[--model_config MODEL_CONFIG]   e7d2_128, e12d2_128, 200 (for SqueezeNet)
	[--train_config TRAIN_CONFIG]   default, default_uncertainty, default_combined, cub
	[--intention INTENTION]         pretrain, finetune, linearprobing, fromscratch, dino
	[--resume]                      <flag; resumes from the last checkpoint found, or from the pre-training checkpoint when intention is finetune/linearprobing>
	[--seed SEED]                   <RNG seed>
	[--device DEVICE]               cuda

Please see src/main.py for a more detailed description.

The evaluate.py script accepts the following CLI arguments:

usage: evaluate.py CHECKPOINTFILE	path to the saved checkpoint to evaluate

Example Runs

  • Self-Supervised Pre-Training

    • Pixel Loss, Random Masking:
      python main.py --intention pretrain --train_config default --model_key mae
    • Combined Perceptual + Pixel Loss (uncertainty-based weighting), Block Masking:
      python main.py --intention pretrain --train_config default_uncertainty --model_key mae-blockmask
    • Combined Perceptual + Pixel Loss (fixed weights), Block Masking:
      python main.py --intention pretrain --train_config default_combined --model_key mae-blockmask
  • Supervised Fine-Tuning

    • On TIN:
      python main.py --intention finetune --model_key mae --train_config default --resume
  • Transfer Learning

    • On CUB:
      python main.py --intention finetune --model_key mae --train_config cub --resume
  • Evaluate Trained Model

    • python evaluate.py ../checkpoints/mae_e7d2_128_finetune.pth

Experiments

  • To generate reconstructions of pre-trained models, please consult:
    src/experiments/create_figure_2.py
  • <we will update this list with new experiments>
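
Assuming the experiment scripts follow the same convention as the other entry points (run from inside src, with the relevant checkpoints in place), generating the reconstruction figure would look roughly like:

cd src
python experiments/create_figure_2.py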
