Developing a speech emotion recognition solution using ensemble learning for children with autism spectrum disorder to help identify human emotions

dc.contributor.advisorValles, Damian
dc.contributor.advisorViswanathan, Vishu
dc.contributor.authorMatin, Rezwan
dc.contributor.committeeMemberResendiz, Maria
dc.date.accessioned2020-12-02T21:06:31Z
dc.date.available2020-12-02T21:06:31Z
dc.date.issued2020-12
dc.description.abstractIn this thesis work, a robust speech emotion recognition system has been developed to be used by children with autism spectrum disorder (ASD). Children with ASD have difficulty identifying human emotions during social interactions, and the goal of this work was to develop a tool that could be used by these children to better understand the emotions of people around them. The speech emotion recognition solution was created using machine learning and deep learning techniques. A novel approach was taken, which involves joining multiple machine learning algorithms using ensemble learning to classify speech recordings in real-time. A support vector machine (SVM), a multilayer perceptron (MLP), and a recurrent neural network model were trained on the Ryerson Audio-Visual Database of Emotional Speech and Songs (RAVDESS), the Toronto Emotional Speech Set (TESS), the Crowd-sourced Emotional Multimodal Actors Dataset (CREMA-D), and a custom dataset which contains utterances from the three datasets with added background noise. Two separate audio feature sets were used, and their performances were compared. One of them was a custom feature set created specifically for this study and the other contained features from a popular speech emotion feature set. Furthermore, once the speech emotion recognizer was developed, it was joined with a facial expression recognition model to create a robust, multimodal emotion recognition system. The purpose was to get more accurate predictions of emotions by processing data from the audio and video mode.
dc.description.departmentEngineering
dc.formatText
dc.format.extent126 pages
dc.format.medium1 file (.pdf)
dc.identifier.citationMatin, R. (2020). <i>Developing a speech emotion recognition solution using ensemble learning for children with autism spectrum disorder to help identify human emotions</i> (Unpublished thesis). Texas State University, San Marcos, Texas.
dc.identifier.urihttps://hdl.handle.net/10877/13037
dc.language.isoen
dc.subjectMachine learning
dc.subjectDeep learning
dc.subjectSpeech emotion recognition
dc.subject.lcshAutistic children
dc.subject.lcshPattern recognition systems
dc.titleDeveloping a speech emotion recognition solution using ensemble learning for children with autism spectrum disorder to help identify human emotions
dc.typeThesis
thesis.degree.departmentEngineering
thesis.degree.disciplineEngineering
thesis.degree.grantorTexas State University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MATIN-THESIS-2020.pdf
Size:
2.47 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.53 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.96 KB
Format:
Plain Text
Description: