Developing a speech emotion recognition solution using ensemble learning for children with autism spectrum disorder to help identify human emotions

Matin, Rezwan

Developing a speech emotion recognition solution using ensemble learning for children with autism spectrum disorder to help identify human emotions

dc.contributor.advisor	Valles, Damian
dc.contributor.advisor	Viswanathan, Vishu
dc.contributor.author	Matin, Rezwan
dc.contributor.committeeMember	Resendiz, Maria
dc.date.accessioned	2020-12-02T21:06:31Z
dc.date.available	2020-12-02T21:06:31Z
dc.date.issued	2020-12
dc.description.abstract	In this thesis work, a robust speech emotion recognition system has been developed to be used by children with autism spectrum disorder (ASD). Children with ASD have difficulty identifying human emotions during social interactions, and the goal of this work was to develop a tool that could be used by these children to better understand the emotions of people around them. The speech emotion recognition solution was created using machine learning and deep learning techniques. A novel approach was taken, which involves joining multiple machine learning algorithms using ensemble learning to classify speech recordings in real-time. A support vector machine (SVM), a multilayer perceptron (MLP), and a recurrent neural network model were trained on the Ryerson Audio-Visual Database of Emotional Speech and Songs (RAVDESS), the Toronto Emotional Speech Set (TESS), the Crowd-sourced Emotional Multimodal Actors Dataset (CREMA-D), and a custom dataset which contains utterances from the three datasets with added background noise. Two separate audio feature sets were used, and their performances were compared. One of them was a custom feature set created specifically for this study and the other contained features from a popular speech emotion feature set. Furthermore, once the speech emotion recognizer was developed, it was joined with a facial expression recognition model to create a robust, multimodal emotion recognition system. The purpose was to get more accurate predictions of emotions by processing data from the audio and video mode.
dc.description.department	Engineering
dc.format	Text
dc.format.extent	126 pages
dc.format.medium	1 file (.pdf)
dc.identifier.citation	Matin, R. (2020). <i>Developing a speech emotion recognition solution using ensemble learning for children with autism spectrum disorder to help identify human emotions</i> (Unpublished thesis). Texas State University, San Marcos, Texas.
dc.identifier.uri	https://hdl.handle.net/10877/13037
dc.language.iso	en
dc.subject	Machine learning
dc.subject	Deep learning
dc.subject	Speech emotion recognition
dc.subject.lcsh	Autistic children
dc.subject.lcsh	Pattern recognition systems
dc.title	Developing a speech emotion recognition solution using ensemble learning for children with autism spectrum disorder to help identify human emotions
dc.type	Thesis
thesis.degree.department	Engineering
thesis.degree.discipline	Engineering
thesis.degree.grantor	Texas State University
thesis.degree.level	Masters
thesis.degree.name	Master of Science

Files

Original bundle

Now showing 1 - 1 of 1

Name:: MATIN-THESIS-2020.pdf
Size:: 2.47 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: PROQUEST_LICENSE.txt
Size:: 4.53 KB
Format:: Plain Text
Description:

Download

Name:: LICENSE.txt
Size:: 2.96 KB
Format:: Plain Text
Description:

Download

Collections

Graduate Theses and Dissertations