Udacity Deep Learning Nanodegree (nd101) — Course 4: RNNs and Transformers, final project.
Analyze sentiment of movie reviews in three languages (English, French, Spanish):
- Translation — fr→en and es→en via HuggingFace
Helsinki-NLP/opus-mt-*MarianMT models - Merge — combine the three CSVs into one dataframe
- Sentiment — binary classification via DistilBERT (
distilbert-base-uncased-finetuned-sst-2-english) - Export —
result/reviews_with_sentiment.csv
30 reviews (10 per language) → 15 Positive / 15 Negative.
- Python 3.14 / PyTorch 2.10 / HuggingFace transformers 5.3
- Runs on CPU or MPS (Apple Silicon); no GPU required
pip install transformers sentencepiece sacremoses pandas torch
python run_project.py # inference-only, ~2-3 minproject.ipynb/project.html— executed submission notebookrun_project.py— standalone training/inference scriptdata/movie_reviews_{eng,fr,sp}.csv— source reviewsresult/reviews_with_sentiment.csv— final output
Part of Leo's Udacity Masters in AI journey. See also:
- landmark-classifier — Course 3 CNN project
- face-generation — Course 5 DCGAN project