Speech emotion recognition based on data enhancement in time-frequency domain

Time-Frequency Representation Learning with Graph Convolutional Network for Dialogue-Level Speech Emotion Recognition

10.21437/interspeech.2021-2067 ◽

2021 ◽

Author(s):

Jiaxing Liu ◽

Yaodong Song ◽

Longbiao Wang ◽

Jianwu Dang ◽

Ruiguo Yu

Keyword(s):

Emotion Recognition ◽

Representation Learning ◽

Speech Emotion Recognition ◽

Convolutional Network ◽

Time Frequency ◽

Frequency Representation

Download Full-text

EEG-Based Emotion Recognition Using Quadratic Time-Frequency Distribution

Sensors ◽

10.3390/s18082739 ◽

2018 ◽

Vol 18 (8) ◽

pp. 2739 ◽

Cited By ~ 22

Author(s):

Rami Alazrai ◽

Rasha Homoud ◽

Hisham Alwanni ◽

Mohammad Daoud

Keyword(s):

Emotion Recognition ◽

Frequency Domain ◽

Frequency Distribution ◽

Support Vector ◽

Eeg Signals ◽

Time Frequency ◽

Labeling Schemes ◽

Frequency Representation ◽

Frequency Features ◽

Time Frequency Distribution

Accurate recognition and understating of human emotions is an essential skill that can improve the collaboration between humans and machines. In this vein, electroencephalogram (EEG)-based emotion recognition is considered an active research field with challenging issues regarding the analyses of the nonstationary EEG signals and the extraction of salient features that can be used to achieve accurate emotion recognition. In this paper, an EEG-based emotion recognition approach with a novel time-frequency feature extraction technique is presented. In particular, a quadratic time-frequency distribution (QTFD) is employed to construct a high resolution time-frequency representation of the EEG signals and capture the spectral variations of the EEG signals over time. To reduce the dimensionality of the constructed QTFD-based representation, a set of 13 time- and frequency-domain features is extended to the joint time-frequency-domain and employed to quantify the QTFD-based time-frequency representation of the EEG signals. Moreover, to describe different emotion classes, we have utilized the 2D arousal-valence plane to develop four emotion labeling schemes of the EEG signals, such that each emotion labeling scheme defines a set of emotion classes. The extracted time-frequency features are used to construct a set of subject-specific support vector machine classifiers to classify the EEG signals of each subject into the different emotion classes that are defined using each of the four emotion labeling schemes. The performance of the proposed approach is evaluated using a publicly available EEG dataset, namely the DEAPdataset. Moreover, we design three performance evaluation analyses, namely the channel-based analysis, feature-based analysis and neutral class exclusion analysis, to quantify the effects of utilizing different groups of EEG channels that cover various regions in the brain, reducing the dimensionality of the extracted time-frequency features and excluding the EEG signals that correspond to the neutral class, on the capability of the proposed approach to discriminate between different emotion classes. The results reported in the current study demonstrate the efficacy of the proposed QTFD-based approach in recognizing different emotion classes. In particular, the average classification accuracies obtained in differentiating between the various emotion classes defined using each of the four emotion labeling schemes are within the range of 73 . 8 % – 86 . 2 % . Moreover, the emotion classification accuracies achieved by our proposed approach are higher than the results reported in several existing state-of-the-art EEG-based emotion recognition studies.

Download Full-text

Time-Frequency Deep Representation Learning for Speech Emotion Recognition Integrating Self-attention

Communications in Computer and Information Science - Neural Information Processing ◽

10.1007/978-3-030-36808-1_74 ◽

2019 ◽

pp. 681-689

Author(s):

Jiaxing Liu ◽

Zhilei Liu ◽

Longbiao Wang ◽

Lili Guo ◽

Jianwu Dang

Keyword(s):

Emotion Recognition ◽

Representation Learning ◽

Speech Emotion Recognition ◽

Time Frequency

Download Full-text

Music Composition and Emotion Recognition Using Big Data Technology and Neural Network Algorithm

Computational Intelligence and Neuroscience ◽

10.1155/2021/5398922 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Yu Wang

Keyword(s):

Neural Network ◽

Big Data ◽

Emotion Recognition ◽

Frequency Domain ◽

Music Composition ◽

Feature Recognition ◽

Short Term Memory ◽

Learning Ability ◽

Time Frequency ◽

Big Data Technology

To implement a mature music composition model for Chinese users, this paper analyzes the music composition and emotion recognition of composition content through big data technology and Neural Network (NN) algorithm. First, through a brief analysis of the current music composition style, a new Music Composition Neural Network (MCNN) structure is proposed, which adjusts the probability distribution of the Long Short-Term Memory (LSTM) generation network by constructing a reasonable Reward function. Meanwhile, the rules of music theory are used to restrict the generation of music style and realize the intelligent generation of specific style music. Afterward, the generated music composition signal is analyzed from the time-frequency domain, frequency domain, nonlinearity, and time domain. Finally, the emotion feature recognition and extraction of music composition content are realized. Experiments show that: when the iteration times of the function increase, the number of weight parameter adjustments and learning ability will increase, and thus the accuracy of the model for music composition can be greatly improved. Meanwhile, when the iteration times increases, the loss function will decrease slowly. Moreover, the music composition generated through the proposed model includes the following four aspects: sadness, joy, loneliness, and relaxation. The research results can promote music composition intellectualization and impacts traditional music composition mode.

Download Full-text

Emotion Recognition of Single-electrode EEG based on Multi-feature Combination in Time-frequency Domain

Journal of Physics Conference Series ◽

10.1088/1742-6596/1827/1/012031 ◽

2021 ◽

Vol 1827 (1) ◽

pp. 012031

Author(s):

Xiang Cao ◽

Kunyuan Zhao ◽

Dan Xu

Keyword(s):

Emotion Recognition ◽

Frequency Domain ◽

Feature Combination ◽

Time Frequency

Download Full-text

Time-Frequency Feature Representation Using Multi-Resolution Texture Analysis and Acoustic Activity Detector for Real-Life Speech Emotion Recognition

Sensors ◽

10.3390/s150101458 ◽

2015 ◽

Vol 15 (1) ◽

pp. 1458-1478 ◽

Cited By ~ 17

Author(s):

Kun-Ching Wang

Keyword(s):

Emotion Recognition ◽

Texture Analysis ◽

Real Life ◽

Feature Representation ◽

Speech Emotion Recognition ◽

Acoustic Activity ◽

Time Frequency ◽

Frequency Feature

Download Full-text

Speech Emotion Recognition Based on Sparse Representation

Archives of Acoustics ◽

10.2478/aoa-2013-0055 ◽

2013 ◽

Vol 38 (4) ◽

pp. 465-470 ◽

Cited By ~ 11

Author(s):

Jingjie Yan ◽

Xiaolan Wang ◽

Weiyi Gu ◽

LiLi Ma

Keyword(s):

Dimensionality Reduction ◽

Emotion Recognition ◽

Least Squares ◽

Partial Least Squares ◽

Partial Least Squares Regression ◽

Speech Emotion Recognition ◽

Least Squares Regression ◽

Computer Science Pedagogy ◽

Reduction Methods ◽

Analysis Computer

Abstract Speech emotion recognition is deemed to be a meaningful and intractable issue among a number of do- mains comprising sentiment analysis, computer science, pedagogy, and so on. In this study, we investigate speech emotion recognition based on sparse partial least squares regression (SPLSR) approach in depth. We make use of the sparse partial least squares regression method to implement the feature selection and dimensionality reduction on the whole acquired speech emotion features. By the means of exploiting the SPLSR method, the component parts of those redundant and meaningless speech emotion features are lessened to zero while those serviceable and informative speech emotion features are maintained and selected to the following classification step. A number of tests on Berlin database reveal that the recogni- tion rate of the SPLSR method can reach up to 79.23% and is superior to other compared dimensionality reduction methods.

Download Full-text

Time-Frequency Domain Techniques For Identification Of Power System Transients

i-manager’s Journal on Electrical Engineering ◽

10.26634/jee.5.2.1586 ◽

2011 ◽

Vol 5 (2) ◽

pp. 42-48

Author(s):

Ashwani Kumar Chandel ◽

◽

P. Srikanth ◽

Keyword(s):

Power System ◽

Frequency Domain ◽

Time Frequency ◽

Power System Transients ◽

Frequency Domain Techniques

Download Full-text

Classifier fusion for speech emotion recognition based on improved queuing voting algorithm

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.00381 ◽

2009 ◽

Vol 29 (2) ◽

pp. 381-385 ◽

Cited By ~ 1

Author(s):

Li-qin FU ◽

Xia MAO ◽

Li-jiang CHEN

Keyword(s):

Emotion Recognition ◽

Classifier Fusion ◽

Speech Emotion Recognition

Download Full-text

Fiscal Policy Tracking Design in the Time Frequency Domain Using Wavelet Analysis (VVreanalyysin KKyttt Optimaalisen Finanssipolitiikan JJljitttmiseen Aika-Taajuusalueella)

SSRN Electronic Journal ◽

10.2139/ssrn.2676984 ◽

2014 ◽

Author(s):

Patrick M. Crowley ◽

David Hudgins

Keyword(s):

Fiscal Policy ◽

Wavelet Analysis ◽

Frequency Domain ◽

Time Frequency

Download Full-text