This project implements a deep learning model for image classification on the CIFAR-10 dataset using ResNet18 architecture. CIFAR-10 contains 60,000 32x32 color images across 10 object categories (airplanes, automobiles, birds, cats, deer, dogs, frogs, horses, ships, and trucks).
| Dhanush Srinivas & Ashwin Athappan |
|---|
| 
- 18-layer deep residual network
- Residual connections (skip connections) every 2 layers
- Batch normalization after each convolution
- ~11 million parameters

- 3 convolutional layers with max pooling
- 2 fully connected layers
- Dropout for regularization
- ~0.5 million parameters

- Optimizer: SGD with momentum (0.9)
- Learning Rate: 0.1 with MultiStepLR scheduler
- Weight Decay: 1e-4
- Batch Size: 128
- Epochs: 50
- Loss Function: Cross Entropy Loss
- Random horizontal flip (p=0.5)
- Random crop (32x32 with padding=4)
- Normalization using CIFAR-10 statistics
- Transfer learning with pre-trained weights
- Advanced data augmentation (AutoAugment, CutMix)
- Experiment with ResNet34, ResNet50
- Apply to CIFAR-100 or ImageNet subsets
- Model compression and optimization
- Attention mechanisms (CBAM, SE blocks)
- He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. CVPR.
- Krizhevsky, A. (2009). Learning multiple layers of features from tiny images. Technical Report.
- PyTorch Documentation: https://pytorch.org/docs/stable/index.html
This project is for educational purposes.
Created as part of a machine learning project for CIFAR-10 image classification.