Here we present two applications of the pre-trained speechVGG introduced in (Beckmann et al., 2019):
- Speaker identification (TIMIT dataset) - example of fine tuning the pre-trained extractor to a specific task
- Speech-music-noise classification (MUSAN dataset) - example of the direct deployment of the pre-trained feature extractor