The project uses PyTorch to implement two networks that generate descriptive sentences from an input image. The models were inspired by a couple of existing projects [1, 2].
The networks were trained on the Flickr8k dataset.
Model 1: CNN-RNN with a single-layer LSTM, without attention
- Encoder: pre-trained Inception v3
- Decoder: single-layer LSTM

Model 2: CNN-RNN with a single-layer LSTM, with attention
- Encoder: pre-trained ResNet-50
- Decoder: single-layer LSTM with soft attention
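The soft-attention decoder in Model 2 re-weights the encoder's spatial feature map at every decoding step, using the LSTM hidden state to decide which image regions to attend to. As a rough sketch (not the project's actual module; the class name, layer names, and dimensions are assumptions), additive soft attention can be implemented like this:

```python
import torch
import torch.nn as nn

class SoftAttention(nn.Module):
    """Additive (Bahdanau-style) soft attention over encoder regions.

    Hypothetical sketch: projects the image features and the decoder
    hidden state into a shared space, scores each region, and returns
    a context vector as the attention-weighted sum of the features.
    """

    def __init__(self, feat_dim, hidden_dim, attn_dim):
        super().__init__()
        self.feat_proj = nn.Linear(feat_dim, attn_dim)    # project image features
        self.hidden_proj = nn.Linear(hidden_dim, attn_dim)  # project decoder state
        self.score = nn.Linear(attn_dim, 1)               # scalar score per region

    def forward(self, features, hidden):
        # features: (batch, num_regions, feat_dim)
        # hidden:   (batch, hidden_dim)
        e = self.score(torch.tanh(
            self.feat_proj(features) + self.hidden_proj(hidden).unsqueeze(1)
        )).squeeze(-1)                                    # (batch, num_regions)
        alpha = torch.softmax(e, dim=1)                   # attention weights, sum to 1
        context = (alpha.unsqueeze(-1) * features).sum(dim=1)  # (batch, feat_dim)
        return context, alpha
```

The returned `alpha` weights are what the attention visualizations below overlay on the input image.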
Below are a few examples of the generated captions:

Below are a few examples of the generated captions and the attention weights:

- Fill in the following paths:

```python
path_images = "/content/flickr8k/Images"          # dataset images
path_captions = "/content/flickr8k/captions.txt"  # dataset captions
path_examples = ""     # images to caption
path_checkpoints = ""  # model checkpoints
```

- Use the following function to caption images:

```python
print_examples(model, device, dataset, path, transform, attention=False, save=False, max_imgs=5, dpi=None)
```

Check out the notebook for additional information.

- `model`: model to evaluate
- `device`: device to use, e.g. `device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")`
- `dataset`: dataset used for the vocabulary
- `path`: directory of the images to caption
- `transform`: transform depending on the model
- `attention`: `True` for Model 2, `False` for Model 1
- `save`: saves the figures with the generated captions
- `max_imgs`: generates captions for at most `max_imgs` random pictures from the folder
- `dpi`: resolution of the saved figures
- Examples: a few sample images from the Flickr30k dataset.
- Checkpoints: checkpoints for the trained models.
- Code: .py and .ipynb files with the code.
[2] https://www.kaggle.com/mdteach/image-captioning-with-attention-pytorch

