MRI Brain Alzheimer Classification Under Adversarial Attacks

This project is part of the EE046211 Deep Learning course at the Technion.

Dor Lerman:

Niv Bar-Tov:

Project Overview

In the realm of cyber security, efforts are made to protect computing systems from digital attacks, which are an emerging threat nowadays. In machine learning, attackers develop adversarial attacks that are designed to trick models using deceiving data. This deceptive data is given to the models as an input, causing classifiers to make incorrect classifications. Particularly, medical images are vulnerable to these adversarial attacks^[1]. Acknowledging this vulnerability emphasizes the importance of enhancing the model's resilience.

In this project, we aim to design a robust model for detecting and classifying Alzheimer disease using MRI brain images. The model simulates a radiologist's diagnostic process by classifying images into four severity levels. We evaluate and compare several well-known unsupervised pre-trained models for classification tasks. Then, we train and evaluate these models under adversarial attacks, which can significantly reduce model's performance. By combining the models, we aim to create an ensemble, a unified and robust model that maximizes resilience against adversarial attacks while maintaining high classification performance.

The project includes the following steps:

Transfer Learning: We used transfer learning to fine-tune and extract features from three well-known unsupervised pre-trained models to perform well on our specific task: DINOv2, ResNet34 and EfficientNet-B0.
Adversarial Attacks Implementation: We performed two adversarial attacks on each one of the models: Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD), which were found to be effective attacks^[2].
Adversarial Training: To enhance model robustness, we trained these models with adversarial examples, using the PGD attack^[3].A weighted cross-entropy loss combined the standard loss with an adversarial component scaled by the adv_weight parameter. For models that required a different adversarial training approach, we applied Curriculum Adversarial Training^[4].
Ensemble Models: Finally, we combined these three models using weighted ensemble approach to create a robust model, without affecting performance.

The project is implemented in Python using the PyTorch framework, which allows us to build and train the models efficiently throughout these steps.

Repository Structure

Directory Name	Content
`assets`	Contains images for each model, including confusion matrices, loss curves, and accuracy curves.
`checkpoints`	An empty directory for saving notebook's output: Optuna hyperparameters and trained models.
`dataset`	Contains two sub-directories: `raw_dataset` with the original raw data, and `dataset_variables` with processed dataset splitted into train, validation and test sets.
`env`	Contains the project environment configuration file and the requirements file.
`models`	Contains all model's training notebooks.
`utils`	Contains utility flies including `optuna_search.py` for hyperparameter optimization, `utils_funcs.py` with general helper functions, and `Grad_cam.py` for generating Grad-CAM visualizations.

Files In The Repository

File Name	Description
`dataset/prepare_dataset.ipynb`	Notebook used to split the raw dataset into train, validation and test sets and applies a resize transformation of 224x224 pixels.
`dataset/dataset_variables/*.pt`	Processed dataset files: `train_set.pt`, `validation_set.pt`, and `test_set.pt` (created by `prepare_dataset.ipynb`).
`env/requirements.txt`	List of required Python packages for setting up the environment.
`env/project_env.yaml`	Environment configuration file.
`utils/optuna_search.py`	Script for performing hyperparameters search using Optuna. It allows customization of epochs, trials and hyperparameters.
`utils/Grad_cam.py`	Script for generating Grad-CAM heatmaps. User must specify required parameters as per function definitions.
`utils/utils_funcs.py`	Contains general utility functions such as saving models, loading images, displaying graphs, and training.
`models/def_models.py`	Definition of class objects used for the trained models.
`models/*_model.ipynb`	Model-specific notebook (e.g. `resnet_model.ipynb`). Used for data loading, training, evaluation and results generation: accuracy, confusion matrix, loss curve and accuracy curve.
`models/*_model_atk.ipynb`	Adversarial training model-specific notebook (e.g. `resnet_model_atk.ipynb`). Loads a pre-trained model, applies adversarial attacks, and trains the model under these attacks.

Installation

General Prerequisites

Library	Version
`Python`	`3.10`
`torch`	`>= 1.4.0`
`matplotlib`	`>= 3.7.1`
`numpy`	`>= 1.24.3`
`opencv`	`>= 4.5.0`
`pandas`	`>= 1.5.0`
`tqdm`	`>= 4.65.0`
`scipy`	`>= 1.8.1`
`seaborn`	`>= 0.11.2`
`plotly`	`>= 5.7.0`
`notebook`	`>= 6.5.4`
`ipywidgets`	`>= 7.6.0`
`torchmetrics`	`>= 0.10.0`
`optuna`	`>= 2.10.0`
`fvcore`	`>= 0.1.5`
`iopath`	`>= 0.1.9`
`submitit`	`>= 1.3.0`
`kornia`	`>= 0.6.0`
`prettytable`	`>= 2.4.0`
`pickleshare`	`>= 0.7.5`
`torchcam`	`>= 0.1.2`
`torchattacks`	`>= 0.2.0`

DINOv2 Specific Requirements

Library	Version
`torch`	`== 2.0.0`
`torchvision`	`== 0.15.0`
`omegaconf`	`>= 2.3.0`
`torchmetrics`	`== 0.10.3`
`fvcore`	`>= 0.1.6`
`iopath`	`>= 0.1.9`
`submitit`	`>= 1.4.5`
`xformers`	`== 0.0.18`
`cuml-cu11`	`>= 23.04`

To set up the required dependencies, please follow one of the options below:

1. Conda (Recommended)

Clone this repository and then create and activate the conda environment (env/project_env.yaml) using the following commands:

conda env create -f env/project_env.yaml
conda activate project_env

2. Pip Install

Clone this repository and then use the provided env/requirements.txt file to install the required dependencies:

pip install -r env/requirements.txt

Dataset

We used a pre-processed dataset of 11,519 axial MRI brain images: 6,400 images from real patients and 5,119 synthetic images that were developed to rectify the class imbalance of the original dataset. The images are classified into four categories: "Non Demented", "Very Mild Demented", "Mild Demented", and "Moderate Demented". Each category had 100, 70, 28, and 2 patients, respectively, and each patient's brain was sliced into 32 horizontal axial MRIs. The images have a resolution of 128x128 pixels and are in the “.jpg” format. All images have been pre-processed to remove the skull.

The dataset was split according to the train-validation-test methodology: the train set contains 8,192 real and synthetic images, the validation set contains 2,048 real and synthetic images and the test set contains 1,279 real images only. We resized the images into 224x224 pixels to match the input size required for the pre-trained models.

Trained Models

We provide the files of our trained models, as well as the hyperparameters used for the training. These files can be loaded to the notebooks as mentioned in the next section.

Model Type	Google Drive Link	Optuna Params
DINOv2	Download .pth file	Download JSON file
DINOv2 with adversarial attacks	Download .pth file	Download JSON file
Resnet34	Download .pth file	Download JSON file
Resnet34 with adversarial attacks	Download .pth file	Download JSON file
Efficientnet-B0	Download .pth file	Download JSON file
Efficientnet-B0 with adversarial attacks	Download .pth file	Download JSON file

Usage

In order to train a specific model, open one of the notebooks *_model.ipynb. You can load the backbone of your required model, choose the hyperparameters using Optuna, define data augmentations and then train the model. If you are interested in transfer learning, set requires_grad = False for layers you want to freeze. Otherwise, set requires_grad = True for layers you want to fine-tune. Notice that all models structs exist in def_models.py and you can add there new models of your own. It is possible to load a trained model using load_existing_params = True or to load existing hyperparameters using load_existing_hyperparams = True.

Alternatively, to run the model under adversarial attacks, open one of the *_model_atk.ipynb notebooks.Here, you can apply attacks such as FGSM, PGD, or implement a custom attack. Select the parameters for your attacks, load your model, and train it under adversarial conditions.

All notebooks provide confusion matrices, loss and accuracy curves to analyze the results (these are saved into assets/<name_of_model> directory). During the training process, every 5 epochs the model's state and statistics are saved under: checkpoints/<name_of_model_with_time_stamp>/<name_of_model_with_loss_statistics>

Sources and References

[1] Madry, A., Makelov, A., Schmidt, L., Tsipras, D., & Vladu, A. (2017). Understanding adversarial attacks on deep learning based medical image analysis systems. arXiv:1907.10456

[2] Zhang, H., Li, Y., & Chen, X. (2023). Adversarial Attack and Defense for Medical Image Analysis: Methods and Applications. arXiv:2308.14597

[3] Madry, A., Makelov, A., Schmidt, L., Tsipras, D., & Vladu, A. (2018). Towards Deep Learning Models Resistant to Adversarial Attacks. arXiv:1706.06083

[4] Cai, Q.-Z., Du, M., Liu, C., & Song, D. (2018). Curriculum Adversarial Training. arXiv:1805.04807

[5] Luke Chugh. (2021). Best Alzheimer MRI dataset. Kaggle dataset. https://www.kaggle.com/datasets/lukechugh/best-alzheimer-mri-dataset-99-accuracy

[6] Optuna. (2023). Optuna: A hyperparameter optimization framework. GitHub repository. https://github.com/optuna/optuna

[7] Facebook Research. (2023). DINOv2. GitHub repository. https://github.com/facebookresearch/dinov2

[8] Chen, X., Zhang, H., & Li, Y. (2022). Exploring adversarial attacks and defenses in vision transformers trained with DINO. arXiv:2206.06761

[9] Gil, J. (2020). PyTorch Grad-CAM. GitHub repository. https://github.com/jacobgil/pytorch-grad-cam

[10] Hoki. (2020). Torchattack: PyTorch adversarial attack library. GitHub repository. https://github.com/Harry24k/torchattacks

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MRI Brain Alzheimer Classification Under Adversarial Attacks

Dor Lerman:

Niv Bar-Tov:

Table of Contents

Project Overview

Repository Structure

Files In The Repository

Installation

General Prerequisites

DINOv2 Specific Requirements

1. Conda (Recommended)

2. Pip Install

Dataset

Trained Models

Usage

Sources and References

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
assets		assets
checkpoints		checkpoints
dataset		dataset
env		env
models		models
utils		utils
LICENSE.md		LICENSE.md
MRI Brain Alzheimer Classification Under Adversarial Attacks.pdf		MRI Brain Alzheimer Classification Under Adversarial Attacks.pdf
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

MRI Brain Alzheimer Classification Under Adversarial Attacks

Dor Lerman:

Niv Bar-Tov:

Table of Contents

Project Overview

Repository Structure

Files In The Repository

Installation

General Prerequisites

DINOv2 Specific Requirements

1. Conda (Recommended)

2. Pip Install

Dataset

Trained Models

Usage

Sources and References

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages