Skip to content

Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music, NTUA

Notifications You must be signed in to change notification settings

agelosk/Instrument_Classification_Paper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 

Repository files navigation

Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music

Image

This is the official implementation code of the paper:

Abstract

Instrument classification is one of the fields in Music Information Retrieval (MIR) that has attracted a lot of research interest. However, the majority of that is dealing with monophonic music, while efforts on polyphonic material mainly focus on predominant instrument recognition or multi-instrument recognition for entire tracks. We present an approach for instrument classification in polyphonic music using monophonic training data that involves mixing-augmentation methods. Specifically, we experiment with pitch and tempo-based synchronization, as well as mixes of tracks with similar music genres. Further, a custom CNN model is proposed, that uses the augmented training data efficiently and a plethora of suitable evaluation metrics are discussed as well. The tempo-sync and genre techniques stand out, achieving an 81% label ranking average precision accuracy, detecting up to 9 instruments in over 2300 testing tracks.

About

Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music, NTUA

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published