Audio similarity experiments using scikit-learn random forest classification on extracted audio feature vectors.
The dataset is sourced from Kaggle.
To create and activate a virtual environment and install the dependencies:
python3 -m venv env
source env/bin/activate
pip install -r requirements.txt
Create your own samples/ directory, or download the dataset and extract it to samples/. The directory is gitignored, so it is not included in the repository. The extracted samples should total about 2.5 GB.
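
As a rough illustration of the approach named at the top (a random forest classifier trained on feature vectors), here is a minimal sketch. It is not this repository's code. The feature vectors are synthetic placeholders generated with NumPy rather than features extracted from samples/, and the function names make_synthetic_features and train_and_score, the two classes, and the vector length of 20 are all assumptions made for the example.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split


def make_synthetic_features(n_per_class=100, n_features=20, seed=0):
    """Return (X, y): placeholder feature vectors for two made-up classes.

    Assumption: real audio features would be extracted from files in
    samples/; random data stands in for them here.
    """
    rng = np.random.default_rng(seed)
    class_a = rng.normal(loc=0.0, scale=1.0, size=(n_per_class, n_features))
    class_b = rng.normal(loc=3.0, scale=1.0, size=(n_per_class, n_features))
    X = np.vstack([class_a, class_b])
    y = np.array([0] * n_per_class + [1] * n_per_class)
    return X, y


def train_and_score(X, y, seed=0):
    """Fit a random forest on a train split and return (model, test accuracy)."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=seed, stratify=y
    )
    clf = RandomForestClassifier(n_estimators=100, random_state=seed)
    clf.fit(X_train, y_train)
    return clf, clf.score(X_test, y_test)


if __name__ == "__main__":
    X, y = make_synthetic_features()
    clf, accuracy = train_and_score(X, y)
    print(f"held-out accuracy: {accuracy:.2f}")
```

With real data, X would hold one extracted feature vector per audio sample and y the corresponding labels; the rest of the sketch would stay the same.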