
terranivium/speech-emotion-recognition

MSc Software Development (2019-2020)

University of Glasgow

Source code for the research paper 'Deep learning for robust dimensional characterisation of affect in speech'.

Paper abstract:

* "This paper seeks to evaluate machine learning methodology 
in the task of speech emotion recognition (SER) by utilising a 
signal processing approach. This is done with aim of assessing
user behaviour during the use of online voice communication."

* "We use dimensional models of emotion, put forward by empirical
research to gain nuanced information about sampled emotion data,
disseminating greater insight into user voice communication activity."

* "This paper focuses on the extraction and use of Mel-frequency 
cepstral coefficients (MFCC) as feature vectors for feed 
forward neural network architectures." 

* "Experiments conducted show evidence that the methodology 
proposed in this paper is partially effective on unseen, 
dissimilar in structure real-world data, which has proven 
to be a hurdle to deployment of solutions in the area of 
automatic speech recognition (ASR) and SER. These findings 
provide a framework to enable more precise, automated user handling."
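
As a rough illustration of the MFCC feature-extraction step described in the
abstract, a minimal sketch using librosa (the library choice, file name and
parameter values here are assumptions; see the notebook for the actual setup):

    import librosa

    # Load a speech clip and extract MFCC feature vectors.
    signal, sr = librosa.load("speech_clip.wav", sr=22050, mono=True)
    mfcc = librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=13)  # (13, n_frames)

    # Average over time to obtain a fixed-length vector suitable as
    # input to a feed-forward network.
    features = mfcc.mean(axis=1)  # (13,)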

Files:

'speech_emotion_recognition.ipynb' - main notebook
	- PyTorch (MLP) model (see the first sketch after this list)

'augment_data.ipynb' - data augmentation batch scripts
	- white noise (see the second sketch after this list)
	- simulated chatter, background noise
	- overdrive
	- reverb
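
A minimal sketch of the kind of PyTorch MLP the main notebook implements
(layer sizes and the number of emotion classes are illustrative assumptions,
not the actual architecture):

    import torch.nn as nn

    # Feed-forward (MLP) classifier over fixed-length MFCC feature vectors.
    class EmotionMLP(nn.Module):
        def __init__(self, n_features=13, n_classes=8):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(n_features, 128),
                nn.ReLU(),
                nn.Linear(128, 64),
                nn.ReLU(),
                nn.Linear(64, n_classes),
            )

        def forward(self, x):
            return self.net(x)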
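
And a sketch of the white-noise augmentation listed under 'augment_data.ipynb'
(the noise level is an assumed value; the notebook may use different levels
and also applies further effects such as chatter, overdrive and reverb):

    import numpy as np

    # Additive white-noise augmentation for a 1-D audio signal.
    def add_white_noise(signal, noise_level=0.005):
        noise = np.random.randn(len(signal))
        return signal + noise_level * noise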

Datasets used to train, validate and test the model:

+ RAVDESS
+ CREMA-D

Datasets used only to emulate in-the-wild test performance:

+ TESS

Usage guide:

+ To generate results and plots, skip to the 'Testing' section of
'speech_emotion_recognition.ipynb' and run its cells; this will load
the provided 'best model' state.
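
Loading the saved state roughly corresponds to the standard PyTorch pattern
below (the file name and model class are assumptions; the notebook defines
the actual ones):

    import torch

    # Restore the provided 'best model' weights before evaluation.
    model = EmotionMLP()  # hypothetical class from the sketch above
    model.load_state_dict(torch.load("best_model.pt", map_location="cpu"))
    model.eval()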
