Comparison of Parametric representations of Birdcall in Gaussian Mixture model

Ricky Mohanty; Sandeep Singh Solanki

doi:10.11591/aptikom.j.csit.71

Comparison of Parametric representations of Birdcall in Gaussian Mixture model

APTIKOM Journal on Computer Science and Information Technologies ◽

10.11591/aptikom.j.csit.71 ◽

2017 ◽

Vol 2 (3) ◽

pp. 124-130

Author(s):

Ricky Mohanty ◽

Sandeep Singh Solanki

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Bird Species ◽

Gaussian Mixture ◽

Recognition System ◽

Extraction Methods ◽

Paper Briefly ◽

Mel Frequency Cepstral Coefficients ◽

Model Classification ◽

Audio Recordings

This paper focuses on the methods of automatic classifications of birds into different species based on feature extraction methods & audio recordings of their sounds. The recognition system uses Gaussian mixture model (GMM) to model 14 poultry bird species calls. Mel frequency cepstral coefficients (MFCC) parameters & wavelet parameters are used for feature vector extraction. The paper briefly explains the methods & also evaluates the performance of these methods in Gaussian Mixture Model classification .The results depicts the performance of Gaussian Mixture Model classification using wavelet was more efficient in terms of percentage of accuracy at around 80% and computation was also faster.

Download Full-text

A GAUSSIAN MIXTURE MODEL-BASED SPEAKER RECOGNITION SYSTEM

Asian Journal of Pharmaceutical and Clinical Research ◽

10.22159/ajpcr.2017.v10s1.19596 ◽

2017 ◽

Vol 10 (13) ◽

pp. 140

Author(s):

Kumari Piu Gorai ◽

Thomas Abraham

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Speaker Recognition ◽

Gaussian Mixture ◽

Recognition System ◽

Human Being ◽

Biometric System ◽

Voice Signal ◽

Signal Characteristics ◽

Authentication Technique

A human being has lot of unique features and one of them is voice. Speaker recognition is the use of a system to distinguish and identify a person from his/her vocal sound. A speaker recognition system (SRS) can be used as one of the authentication technique, in addition to the conventional authentication methods. This paper represents the overview of voice signal characteristics and speaker recognition techniques. It also discusses the advantages and problem of current SRS. The only biometric system that allows users to authenticate remotely is voice-based SRS, we are in the need of a robust SRS.

Download Full-text

A Gaussian Mixture Model Based Speech Recognition System Using Matlab

Signal & Image Processing An International Journal ◽

10.5121/sipij.2013.4409 ◽

2013 ◽

Vol 4 (4) ◽

pp. 109-118 ◽

Cited By ~ 5

Author(s):

Manan Vyas

Keyword(s):

Speech Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Recognition System ◽

Speech Recognition System ◽

Model Based

Download Full-text

Study on Gender Identification Based on Audio Recordings Using Gaussian Mixture Model and Mel Frequency Cepstrum Coefficient Technique

International Journal of Innovative Computing ◽

10.11113/ijic.v11n2.343 ◽

2021 ◽

Vol 11 (2) ◽

pp. 35-41

Author(s):

Thurgeaswary Rokanatnam ◽

Hazinah Kutty Mammi

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Noise Removal ◽

Accuracy Rate ◽

Speech Corpus ◽

Speech Database ◽

Audio Recordings ◽

Speech Data ◽

Mel Frequency Cepstrum Coefficient

Speaker recognition is an ability to identify speaker’s characteristics based from spoken language. The purpose of this study is to identify gender of speakers based on audio recordings. The objective of this study is to evaluate the accuracy rate of this technique to differentiate the gender and also to determine the performance rate to classify even when using self-acquired recordings. Audio forensics uses voice recordings as part of evidence to solve cases. This study is mainly conducted to provide an easier technique to identify the unknown speaker characteristics in forensic field. This experiment is fulfilled by training the pattern classifier using gender dependent data. In order to train the model, a speech database is obtained from an online speech corpus comprising of both male and female speakers. During the testing phase, apart from the data from speech corpus, audio recordings of UTM students will too be used to determine the accuracy rate of this speaker identification experiment. As for the technique to run this experiment, Mel Frequency Cepstrum Coefficient (MFCC) algorithm is used to extract the features from speech data while Gaussian Mixture Model (GMM) is used to model the gender identifier. Noise removal was not used for any speech data in this experiment. Python software is used to extract using MFCC coefficients and model the behavior using GMM technique. Experiment results show that GMM-MFCC technique can identify gender regardless of language but with varying accuracy rate.

Download Full-text

Speech Emotion Recognition under White Noise

Archives of Acoustics ◽

10.2478/aoa-2013-0054 ◽

2013 ◽

Vol 38 (4) ◽

pp. 457-463 ◽

Cited By ~ 14

Author(s):

Chengwei Huang ◽

Guoming Chen ◽

Hua Yu ◽

Yongqiang Bao ◽

Li Zhao

Keyword(s):

White Noise ◽

Emotion Recognition ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Speech Enhancement ◽

Gaussian Mixture ◽

Recognition System ◽

Class Model ◽

Space Model ◽

Dimension Space

Abstract Speaker‘s emotional states are recognized from speech signal with Additive white Gaussian noise (AWGN). The influence of white noise on a typical emotion recogniztion system is studied. The emotion classifier is implemented with Gaussian mixture model (GMM). A Chinese speech emotion database is used for training and testing, which includes nine emotion classes (e.g. happiness, sadness, anger, surprise, fear, anxiety, hesitation, confidence and neutral state). Two speech enhancement algorithms are introduced for improved emotion classification. In the experiments, the Gaussian mixture model is trained on the clean speech data, while tested under AWGN with various signal to noise ratios (SNRs). The emotion class model and the dimension space model are both adopted for the evaluation of the emotion recognition system. Regarding the emotion class model, the nine emotion classes are classified. Considering the dimension space model, the arousal dimension and the valence dimension are classified into positive regions or negative regions. The experimental results show that the speech enhancement algorithms constantly improve the performance of our emotion recognition system under various SNRs, and the positive emotions are more likely to be miss-classified as negative emotions under white noise environment.

Download Full-text

Gaussian Mixture Model Based Classification of Stuttering Dysfluencies

Journal of Intelligent Systems ◽

10.1515/jisys-2014-0140 ◽

2016 ◽

Vol 25 (3) ◽

pp. 387-399

Author(s):

P. Mahesha ◽

D.S. Vinod

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Speaker Recognition ◽

Gaussian Mixture ◽

Modeling Technique ◽

Mel Frequency Cepstral Coefficients ◽

Automatic Speaker Recognition ◽

Word Repetition ◽

Syllable Repetition

AbstractThe classification of dysfluencies is one of the important steps in objective measurement of stuttering disorder. In this work, the focus is on investigating the applicability of automatic speaker recognition (ASR) method for stuttering dysfluency recognition. The system designed for this particular task relies on the Gaussian mixture model (GMM), which is the most widely used probabilistic modeling technique in ASR. The GMM parameters are estimated from Mel frequency cepstral coefficients (MFCCs). This statistical speaker-modeling technique represents the fundamental characteristic sounds of speech signal. Using this model, we build a dysfluency recognizer that is capable of recognizing dysfluencies irrespective of a person as well as what is being said. The performance of the system is evaluated for different types of dysfluencies such as syllable repetition, word repetition, prolongation, and interjection using speech samples from the University College London Archive of Stuttered Speech (UCLASS).

Download Full-text

Object Activity Recognition System with Shadow Suppression Using Adaptive Gaussian Mixture Model

British Journal of Mathematics & Computer Science ◽

10.9734/bjmcs/2016/25119 ◽

2016 ◽

Vol 17 (2) ◽

pp. 1-15

Author(s):

A Adekunle ◽

E Omidiora ◽

S Olabiyisi ◽

J Ojo

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Activity Recognition ◽

Gaussian Mixture ◽

Recognition System ◽

Shadow Suppression

Download Full-text

Gaussian mixture model classification: A projection pursuit approach

Computational Statistics & Data Analysis ◽

10.1016/j.csda.2006.12.038 ◽

2007 ◽

Vol 52 (1) ◽

pp. 471-482 ◽

Cited By ~ 9

Author(s):

Daniela G. Calò

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Projection Pursuit ◽

Gaussian Mixture ◽

Model Classification

Download Full-text

SAR image retrieval based on Gaussian Mixture Model classification

2009 2nd Asian-Pacific Conference on Synthetic Aperture Radar ◽

10.1109/apsar.2009.5374176 ◽

2009 ◽

Cited By ~ 1

Author(s):

Biao Hou ◽

Xu Tang ◽

Licheng Jiao ◽

Shuang Wang

Keyword(s):

Image Retrieval ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Sar Image ◽

Model Classification

Download Full-text

Gaussian mixture model classification of odontocetes in the Southern California Bight and the Gulf of California

The Journal of the Acoustical Society of America ◽

10.1121/1.2400663 ◽

2007 ◽

Vol 121 (3) ◽

pp. 1737-1748 ◽

Cited By ~ 55

Author(s):

Marie A. Roch ◽

Melissa S. Soldevilla ◽

Jessica C. Burtenshaw ◽

E. Elizabeth Henderson ◽

John A. Hildebrand

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Gulf Of California ◽

Southern California ◽

Gaussian Mixture ◽

Southern California Bight ◽

Model Classification

Download Full-text

Single frame IR point target detection based on a Gaussian mixture model classification

10.1117/12.974492 ◽

2012 ◽

Cited By ~ 3

Author(s):

Laure Genin ◽

Frédéric Champagnat ◽

Guy Le Besnerais

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Target Detection ◽

Gaussian Mixture ◽

Model Classification ◽

Point Target ◽

Single Frame ◽

Point Target Detection

Download Full-text