Comparison of feature extraction methods for speech recognition in noise-free and in traffic noise environment

The performance of a machine learning model depends on the quality of the features used as its input. Research into feature extraction methods for convolutional neural network (CNN)-based diagnostics of rotating machinery remains at a developmental stage. In general, the input to CNN-based diagnostics is a spectrogram that has undergone no significant pre-processing. This paper introduces octave-band filtering as a feature extraction method for preprocessing a spectrogram before it is passed to a CNN. The method is an adaptation of a feature extraction method originally developed for speech recognition. It differs from the filtering methods applied to speech recognition in its use of octave bands, to which a weighting that is optimal for machinery diagnosis has been applied. A case study demonstrates the effectiveness of octave-band filtering: the method not only improves the accuracy of the CNN-based diagnostics but also reduces the size of the CNN.
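As a rough illustration of the idea in this abstract, the sketch below pools the frequency bins of a spectrogram into octave bands and applies one weight per band. It is a minimal sketch under stated assumptions, not the paper's implementation: the function names `octave_band_edges` and `octave_band_pool` are hypothetical, the pooling uses a plain mean per band, and the default weights are uniform placeholders because the abstract does not give the weighting used for machinery diagnosis.

```python
from typing import List, Optional, Sequence, Tuple


def octave_band_edges(f_min: float, f_max: float) -> List[Tuple[float, float]]:
    """Return (low, high) edges of consecutive octave bands covering [f_min, f_max)."""
    edges = []
    low = f_min
    while low < f_max:
        high = min(low * 2.0, f_max)  # an octave spans a doubling of frequency
        edges.append((low, high))
        low *= 2.0
    return edges


def octave_band_pool(
    spectrogram: Sequence[Sequence[float]],
    freqs: Sequence[float],
    edges: Sequence[Tuple[float, float]],
    weights: Optional[Sequence[float]] = None,
) -> List[List[float]]:
    """Reduce each frame of a spectrogram to one weighted mean value per octave band.

    spectrogram[t][k] is the power in frame t at frequency bin k,
    and freqs[k] is the centre frequency of bin k in Hz.
    """
    if weights is None:
        # Assumption: uniform weights stand in for the paper's (unstated) weighting.
        weights = [1.0] * len(edges)
    pooled = []
    for frame in spectrogram:
        row = []
        for (low, high), weight in zip(edges, weights):
            in_band = [p for p, f in zip(frame, freqs) if low <= f < high]
            mean_power = sum(in_band) / len(in_band) if in_band else 0.0
            row.append(weight * mean_power)
        pooled.append(row)
    return pooled


# Usage: six frequency bins collapse to three octave-band features per frame.
edges = octave_band_edges(100.0, 800.0)
freqs = [100.0, 150.0, 200.0, 300.0, 400.0, 600.0]
features = octave_band_pool([[1.0, 3.0, 2.0, 4.0, 5.0, 7.0]], freqs, edges)
```

Because each frame shrinks from one value per frequency bin to one value per octave band, the input to the CNN is smaller, which is consistent with the reduction in network size the abstract reports.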

Kernel based Non-linear Feature Extraction Methods for Speech Recognition

Sixth International Conference on Intelligent Systems Design and Applications ◽

10.1109/isda.2006.253706 ◽

2006 ◽

Cited By ~ 4

Author(s):

Hao Huang ◽

Jie Zhu

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Extraction Methods ◽

Linear Feature ◽

Linear Feature Extraction ◽

Non Linear

Wavelet-MFCC and Correlation Methods in Digit Speech Recognition (Metode Wavelet-MFCC dan Korelasi dalam Pengenalan Suara Digit)

JTIM : Jurnal Teknologi Informasi dan Multimedia ◽

10.35746/jtim.v2i2.99 ◽

2020 ◽

Vol 2 (2) ◽

pp. 100-108

Author(s):

Zaurarista Dyarbirru ◽

Syahroni Hidayat

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Recognition System ◽

Extraction Process ◽

Extraction Methods ◽

Transform Method ◽

Feature Extraction Method ◽

Average Value ◽

Advantages And Disadvantages ◽

Comparison Results

Voice is the sound emitted by living things. With the development of Automatic Speech Recognition (ASR) technology, voice can be used to make tasks easier for humans. Feature extraction plays an important role in the ASR recognition process. The feature extraction methods commonly applied to ASR are MFCC and wavelets, each of which has advantages and disadvantages. This study therefore combines wavelet feature extraction with MFCC to make the most of the advantages of both; the proposed method is called Wavelet-MFCC. The voice recognition method does not use recommendations. System performance is measured with the Word Recognition Rate (WRR), validated using K-fold cross-validation with 5 folds. The dataset consists of voice recordings of the digits 0-9 in English. The results show that the digit speech recognition system gives its highest average value, 63%, for digit 4, using the Daubechies DB3 wavelet with the dyadic wavelet transform. Comparing the wavelet decomposition methods, the dyadic wavelet transform performs better than the wavelet packet.
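The evaluation protocol in this abstract (WRR validated with 5-fold cross-validation) can be sketched as follows. This is a minimal sketch under assumptions, not the authors' code: the abstract does not define WRR, so it is taken here as the percentage of test utterances whose recognized digit matches the reference, and the helper names `k_fold_indices` and `word_recognition_rate` are hypothetical.

```python
from typing import List, Sequence, Tuple


def k_fold_indices(n_samples: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Split sample indices 0..n_samples-1 into k (train, test) pairs.

    Samples are dealt to folds round-robin, so fold sizes differ by at most one.
    """
    folds = [list(range(i, n_samples, k)) for i in range(k)]
    splits = []
    for i in range(k):
        test = folds[i]
        train = [j for m, fold in enumerate(folds) if m != i for j in fold]
        splits.append((train, test))
    return splits


def word_recognition_rate(predicted: Sequence[str], reference: Sequence[str]) -> float:
    """Percentage of utterances recognized correctly (assumed definition of WRR)."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        raise ValueError("reference must not be empty")
    correct = sum(p == r for p, r in zip(predicted, reference))
    return 100.0 * correct / len(reference)


# Usage: three of four digits recognized correctly gives a WRR of 75%.
splits = k_fold_indices(10, k=5)
wrr = word_recognition_rate(["4", "2", "9", "4"], ["4", "2", "7", "4"])
```

In a full evaluation, a recognizer would be fitted on each `train` split and scored on the matching `test` split, and the five WRR values would then be averaged.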

Two novel FDLP based feature extraction methods for improvement of speech recognition

2010 5th International Symposium on Telecommunications ◽

10.1109/istel.2010.5734095 ◽

2010 ◽

Cited By ~ 1

Author(s):

Yasser Shekofteh ◽

Farshad AlmasGanj ◽

Ahmadreza Rezaei ◽

Mohammad Mohsen Goodarzi

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Extraction Methods

Speech Recognition with Advanced Feature Extraction Methods Using Adaptive Particle Swarm Optimization

International Journal of Intelligent Engineering and Systems ◽

10.22266/ijies2016.1231.03 ◽

2016 ◽

Vol 9 (4) ◽

pp. 21-30

Author(s):

Bright Kanisha ◽

Ganesan Balarishnanan

Keyword(s):

Feature Extraction ◽

Particle Swarm Optimization ◽

Speech Recognition ◽

Particle Swarm ◽

Extraction Methods ◽

Swarm Optimization ◽

Adaptive Particle Swarm Optimization

Analysis of Correlation between Cognitive Function and Speech Recognition in Noise

Korean Journal of Otorhinolaryngology - Head and Neck Surgery ◽

10.3342/kjorl-hns.2010.53.4.215 ◽

2010 ◽

Vol 53 (4) ◽

pp. 215 ◽

Cited By ~ 3

Author(s):

Seong Jun Song ◽

Hyun Joon Shim ◽

Chul Ho Park ◽

Seong Hee Lee ◽

Sang Won Yoon

Keyword(s):

Speech Recognition ◽

Cognitive Function ◽

Speech Recognition In Noise
