Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing

Sudarsana Reddy Kadiri; Paavo Alku

doi:10.1121/1.5131043

Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing

The Journal of the Acoustical Society of America ◽

10.1121/1.5131043 ◽

2019 ◽

Vol 146 (5) ◽

pp. EL418-EL423

Author(s):

Sudarsana Reddy Kadiri ◽

Paavo Alku

Keyword(s):

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients ◽

Zero Time

Download Full-text

SMCS: Automatic Real-Time Classification of Ambient Sounds, Based on a Deep Neural Network and Mel Frequency Cepstral Coefficients

Communications in Computer and Information Science - Applied Technologies ◽

10.1007/978-3-030-42520-3_20 ◽

2020 ◽

pp. 245-253

Author(s):

María José Mora-Regalado ◽

Omar Ruiz-Vivanco ◽

Alexandra González-Eras ◽

Pablo Torres-Carrión

Keyword(s):

Neural Network ◽

Real Time ◽

Deep Neural Network ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients ◽

Real Time Classification

Download Full-text

Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features

Computational and Mathematical Methods in Medicine ◽

10.1155/2015/956249 ◽

2015 ◽

Vol 2015 ◽

pp. 1-12 ◽

Cited By ~ 7

Author(s):

Ömer Eskidere ◽

Ahmet Gürhanlı

Keyword(s):

Voice Disorder ◽

Mel Frequency Cepstral Coefficients ◽

Essential Information ◽

Audio Processing ◽

Voice Signal ◽

Window Technique ◽

Normal Voice ◽

Cepstral Coefficients ◽

Better Than

The Mel Frequency Cepstral Coefficients (MFCCs) are widely used in order to extract essential information from a voice signal and became a popular feature extractor used in audio processing. However, MFCC features are usually calculated from a single window (taper) characterized by large variance. This study shows investigations on reducing variance for the classification of two different voice qualities (normal voice and disordered voice) using multitaper MFCC features. We also compare their performance by newly proposed windowing techniques and conventional single-taper technique. The results demonstrate that adapted weighted Thomson multitaper method could distinguish between normal voice and disordered voice better than the results done by the conventional single-taper (Hamming window) technique and two newly proposed windowing methods. The multitaper MFCC features may be helpful in identifying voices at risk for a real pathology that has to be proven later.

Download Full-text

Particle Swarm Optimisation of Mel-frequency Cepstral Coefficients computation for the classification of asphyxiated infant cry

2010 3rd International Conference on Biomedical Engineering and Informatics ◽

10.1109/bmei.2010.5639674 ◽

2010 ◽

Cited By ~ 4

Author(s):

A. Zabidi ◽

W. Mansor ◽

Y. K. Lee ◽

A. I. Mohd Yassin ◽

R. Sahak

Keyword(s):

Particle Swarm ◽

Particle Swarm Optimisation ◽

Mel Frequency Cepstral Coefficients ◽

Infant Cry ◽

Cepstral Coefficients

Download Full-text

Fusion of Linear and Mel Frequency Cepstral Coefﬁcients for Automatic Classiﬁcation of Reptiles

Applied Sciences ◽

10.3390/app7020178 ◽

2017 ◽

Vol 7 (2) ◽

pp. 178 ◽

Cited By ~ 5

Author(s):

Juan Noda ◽

Carlos Travieso ◽

David Sánchez-Rodríguez

Keyword(s):

Automatic Classification ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients

Download Full-text

Classification of heart sounds using linear prediction coefficients and mel-frequency cepstral coefficients as acoustic features

2017 IEEE Colombian Conference on Communications and Computing (COLCOM) ◽

10.1109/colcomcon.2017.8088215 ◽

2017 ◽

Author(s):

Pedro Narvaez ◽

Katerine Vera ◽

Nhikolas Bedoya ◽

Winston S. Percybrooks

Keyword(s):

Linear Prediction ◽

Heart Sounds ◽

Acoustic Features ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients

Download Full-text

Quartiles and Mel Frequency Cepstral Coefficients vectors in Hidden Markov-Gaussian Mixture Models classification of merged heart sounds and lung sounds signals

2015 International Conference on High Performance Computing & Simulation (HPCS) ◽

10.1109/hpcsim.2015.7237053 ◽

2015 ◽

Cited By ~ 3

Author(s):

Pedro Mayorga ◽

Daniela Ibarra ◽

Vesna Zeljkovic ◽

Christopher Druzgalski

Keyword(s):

Mixture Models ◽

Hidden Markov ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Heart Sounds ◽

Lung Sounds ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients

Download Full-text

Genre Classification of Indian Tamil Music using Mel-Frequency Cepstral Coefficients

International Journal of Engineering Research and ◽

10.17577/ijertv4is120465 ◽

2015 ◽

Vol V4 (12) ◽

Cited By ~ 1

Author(s):

Betsy. S ◽

Bhalke. D. G ◽

Keyword(s):

Mel Frequency Cepstral Coefficients ◽

Genre Classification ◽

Cepstral Coefficients

Download Full-text

Analysis of Accent-Sensitive Words in Multi-Resolution Mel-Frequency Cepstral Coefficients for Classification of Accents in Malaysian English

International Journal of Automotive and Mechanical Engineering ◽

10.15282/ijame.7.2012.21.0086 ◽

2013 ◽

Vol 7 ◽

pp. 1053-1073 ◽

Cited By ~ 4

Author(s):

M.A. Yusnita ◽

M.P. Paulraj ◽

Sazali Yaacob ◽

R. Yusuf ◽

A.B. Shahriman

Keyword(s):

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients

Download Full-text

Mel-Frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech

10.21437/interspeech.2019-2863 ◽

2019 ◽

Cited By ~ 1

Author(s):

Sudarsana Reddy Kadiri ◽

Paavo Alku

Keyword(s):

Mel Frequency Cepstral Coefficients ◽

Voice Source ◽

Cepstral Coefficients

Download Full-text

Discriminant Analysis of Voice Commands in the Presence of an Unmanned Aerial Vehicle

Information ◽

10.3390/info12010023 ◽

2021 ◽

Vol 12 (1) ◽

pp. 23

Author(s):

Marzena Mięsikowska

Keyword(s):

Discriminant Analysis ◽

Unmanned Aerial Vehicle ◽

Speech Intelligibility ◽

Measuring Equipment ◽

Verbal Communication ◽

Mel Frequency Cepstral Coefficients ◽

Sound Levels ◽

Aerial Vehicle ◽

Cepstral Coefficients

The aim of this study was to perform discriminant analysis of voice commands in the presence of an unmanned aerial vehicle equipped with four rotating propellers, as well as to obtain background sound levels and speech intelligibility. The measurements were taken in laboratory conditions in the absence of the unmanned aerial vehicle and the presence of the unmanned aerial vehicle. Discriminant analysis of speech commands (left, right, up, down, forward, backward, start, and stop) was performed based on mel-frequency cepstral coefficients. Ten male speakers took part in this experiment. The unmanned aerial vehicle hovered at a height of 1.8 m during the recordings at a distance of 2 m from the speaker and 0.3 m above the measuring equipment. Discriminant analysis based on mel-frequency cepstral coefficients showed promising classification of speech commands equal to 76.2% for male speakers. Evaluated speech intelligibility during recordings and obtained sound levels in the presence of the unmanned aerial vehicle during recordings did not exclude verbal communication with the unmanned aerial vehicle for male speakers.

Download Full-text