A Study on a Speech Emotion Recognition System with Effective Acoustic Features Using Deep Learning Algorithms

2021 ◽  
Vol 11 (4) ◽  
pp. 1890
Author(s):  
Sung-Woo Byun ◽  
Seok-Pil Lee

The goal of the human interface is to recognize the user's emotional state precisely. In speech emotion recognition research, the most important issue is the effective parallel use of proper speech-feature extraction and an appropriate classification engine. Well-defined speech databases are also needed to accurately recognize and analyze emotions from speech signals. In this work, we constructed a Korean emotional speech database for speech emotion analysis and proposed a feature combination that can improve emotion recognition performance using a recurrent neural network model. To investigate the acoustic features that can reflect distinct momentary changes in emotional expression, we extracted F0, Mel-frequency cepstrum coefficients, spectral features, harmonic features, and others. Statistical analysis was performed to select an optimal combination of acoustic features that affect the emotion conveyed in speech. We used a recurrent neural network model to classify emotions from speech. The results show that the proposed system achieves higher accuracy than previous studies.
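As a rough illustration (not the authors' code), the pipeline this abstract describes — frame-level acoustic features such as F0 and MFCCs fed to a recurrent classifier — can be sketched with a minimal vanilla RNN in NumPy. All dimensions, weights, and feature values below are placeholder assumptions:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - np.max(x))
    return e / e.sum()

def rnn_classify(features, Wxh, Whh, Why, bh, by):
    """Run a vanilla RNN over a (T, D) sequence of frame-level
    acoustic features (e.g. F0 + MFCCs) and return emotion-class
    probabilities computed from the final hidden state."""
    h = np.zeros(Whh.shape[0])
    for x in features:                       # one acoustic frame per step
        h = np.tanh(Wxh @ x + Whh @ h + bh)  # recurrent state update
    return softmax(Why @ h + by)             # distribution over emotions

rng = np.random.default_rng(0)
T, D, H, C = 50, 14, 32, 5  # frames, feature dim, hidden size, emotion classes
feats = rng.standard_normal((T, D))  # stand-in for F0/MFCC/spectral features
probs = rnn_classify(
    feats,
    rng.standard_normal((H, D)) * 0.1,  # input-to-hidden weights
    rng.standard_normal((H, H)) * 0.1,  # hidden-to-hidden weights
    rng.standard_normal((C, H)) * 0.1,  # hidden-to-output weights
    np.zeros(H), np.zeros(C),
)
```

In practice the features would come from a real extractor and the weights from training; the sketch only shows how a sequence of per-frame feature vectors maps to a single emotion distribution.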

Sensors ◽  
2020 ◽  
Vol 20 (18) ◽  
pp. 5212 ◽  
Author(s):  
Tursunov Anvarjon ◽  
Mustaqeem ◽  
Soonil Kwon

Artificial intelligence (AI) and machine learning (ML) are employed to make systems smarter. Today, a speech emotion recognition (SER) system evaluates the emotional state of the speaker by investigating his/her speech signal. Emotion recognition is a challenging task for a machine, and making a machine smart enough to recognize emotions efficiently is equally challenging. The speech signal is hard to examine using signal-processing methods because it consists of different frequencies and features that vary according to emotions such as anger, fear, sadness, happiness, boredom, disgust, and surprise. Even though different algorithms are being developed for SER, success rates remain low and vary with the language, the emotion set, and the database. In this paper, we propose a new lightweight, effective SER model with low computational complexity and high recognition accuracy. The suggested method uses a convolutional neural network (CNN) to learn deep frequency features, using a plain rectangular filter with a modified pooling strategy that provides more discriminative power for SER. The proposed CNN model was trained on the frequency features extracted from the speech data and was then tested to predict the emotions. The proposed SER model was evaluated on two benchmarks, the Interactive Emotional Dyadic Motion Capture (IEMOCAP) and the Berlin Emotional Speech Database (EMO-DB) speech datasets, achieving 77.01% and 92.02% recognition accuracy, respectively. The experimental results demonstrate that the proposed CNN-based SER system achieves better recognition performance than state-of-the-art SER systems.
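A minimal sketch (with assumed shapes, not the paper's implementation) of the core operation the abstract names — a plain rectangular convolution filter applied to a time–frequency spectrogram, followed by pooling:

```python
import numpy as np

def conv2d_valid(spec, kernel):
    """Valid 2D convolution of an (F, T) spectrogram with a
    rectangular kernel, e.g. tall in frequency, narrow in time."""
    kh, kw = kernel.shape
    F, T = spec.shape
    out = np.empty((F - kh + 1, T - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(spec[i:i + kh, j:j + kw] * kernel)
    return out

def max_pool(x, ph, pw):
    """Non-overlapping max pooling; trims edges that do not fit."""
    F, T = x.shape
    F2, T2 = F // ph, T // pw
    return x[:F2 * ph, :T2 * pw].reshape(F2, ph, T2, pw).max(axis=(1, 3))

rng = np.random.default_rng(1)
spec = rng.standard_normal((40, 100))  # 40 frequency bins x 100 time frames
kernel = rng.standard_normal((9, 3))   # rectangular filter: 9 (freq) x 3 (time)
fmap = conv2d_valid(spec, kernel)      # feature map: (32, 98)
pooled = max_pool(fmap, 2, 2)          # pooled map: (16, 49)
```

The rectangular (frequency-elongated) kernel emphasizes spectral structure across frequency bins at each time step; the specific kernel sizes and the paper's "modified pooling strategy" are not specified here, so standard max pooling stands in for it.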


2021 ◽  
Vol 193 (12) ◽  
Author(s):  
Salar Valizadeh Moghadam ◽  
Ahmad Sharafati ◽  
Hajar Feizi ◽  
Seyed Mohammad Saeid Marjaie ◽  
Seyed Babak Haji Seyed Asadollah ◽  
...  

Author(s):  
C. Fernando Mugarra Gonzalez ◽  
Stanisław Jankowski ◽  
Jacek J. Dusza ◽  
Vicente Carrilero López ◽  
Javier M. Duart Clemente
