Improving speaker verification performance against long-term speaker variability

2016 ◽ Vol 79 ◽ pp. 14-29
Author(s):  
Linlin Wang ◽  
Jun Wang ◽  
Lantian Li ◽  
Thomas Fang Zheng ◽  
Frank K. Soong
2017 ◽ Vol 17 (4) ◽ pp. 114-133
Author(s):  
Atanas Ouzounov

Abstract: This paper proposes a new contour-based speech endpoint detector that combines the log-Group Delay Mean-Delta (log-GDMD) feature, an adaptive two-threshold scheme and an eight-state automaton. The adaptive scheme uses two pairs of thresholds, one for the starting point and one for the ending point. Each pair is calculated from the contour characteristics in the corresponding region of the utterance. Experimental results show that the proposed detector achieves better endpoint accuracy than the Long-Term Spectral Divergence (LTSD) detector. Additional fixed-text speaker verification tests with short phrases of telephone speech, based on the Dynamic Time Warping (DTW) and left-to-right Hidden Markov Model (HMM) frameworks, confirm that the improved endpoint accuracy translates into a higher verification rate.
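The two-threshold idea can be sketched as follows. This is a minimal illustration, not the paper's implementation: the log-GDMD contour computation and the eight-state automaton are out of scope, so a generic per-frame contour and simple forward/backward scans stand in for them, and the function name and the `k_low`/`k_high` constants are assumptions.

```python
import numpy as np

def detect_endpoints(contour, region=10, k_low=1.0, k_high=2.5):
    """Locate speech start/end in a per-frame contour (e.g. log-GDMD
    or log-energy) using a pair of adaptive thresholds per endpoint.

    Each threshold pair is derived from the contour statistics of the
    presumed non-speech region nearest that endpoint (the first/last
    `region` frames), mirroring the per-region threshold pairs in the
    abstract. Returns (start, end) frame indices, or None if no frame
    ever exceeds the high threshold.
    """
    c = np.asarray(contour, dtype=float)

    def thresholds(noise_frames):
        mu, sd = noise_frames.mean(), noise_frames.std()
        return mu + k_low * sd, mu + k_high * sd   # (low, high) pair

    lo_s, hi_s = thresholds(c[:region])            # pair for the start point
    lo_e, hi_e = thresholds(c[-region:])           # pair for the end point

    # Start: first frame above the high threshold, then back off while
    # the contour still exceeds the low threshold (captures weak onsets).
    above = np.flatnonzero(c > hi_s)
    if above.size == 0:
        return None                                # no speech detected
    start = above[0]
    while start > 0 and c[start - 1] > lo_s:
        start -= 1

    # End: mirror of the start logic, scanning from the tail.
    tail = np.flatnonzero(c > hi_e)
    if tail.size == 0:
        return None
    end = tail[-1]
    while end < len(c) - 1 and c[end + 1] > lo_e:
        end += 1

    return start, end
```

The low threshold only extends a region that the high threshold has already confirmed, which is what makes the scheme more robust to weak speech onsets and offsets than a single fixed threshold.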


2014 ◽  
Author(s):  
Finnian Kelly ◽  
Rahim Saeidi ◽  
Naomi Harte ◽  
David A. van Leeuwen

2009 ◽  
Author(s):  
A. D. Lawson ◽  
A. R. Stauffer ◽  
B. Y. Smolenski ◽  
B. B. Pokines ◽  
M. Leonard ◽  
...  

2006 ◽  
Author(s):  
Claudio Garreton ◽  
Nestor Becerra Yoma ◽  
Carlos Molina ◽  
Fernando Huenupan

Author(s):  
Tuan Pham ◽  
Michael Wagner

Most speaker verification systems are based on similarity or likelihood normalization techniques, as these help to cope with speaker variability. In conventional normalization, the a priori probabilities of the cohort speakers are assumed to be equal. From this standpoint, we apply the fuzzy integral and genetic algorithms to combine the likelihood values of the cohort speakers, relaxing the assumption of equal a priori probabilities. This approach replaces the conventional normalization term with the fuzzy integral, which acts as a non-linear fusion of the similarity measures of an utterance against the cohort speakers. Furthermore, genetic algorithms are applied to find the optimal fuzzy densities, which are crucial for the fuzzy fusion. We illustrate the performance of the proposed approach by testing the speaker verification system with both the conventional and the proposed algorithms on the commercial speech corpus TI46. The results, in terms of equal error rates, show that the speaker verification system using the fuzzy integral outperforms the conventional normalization method.
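The fusion step described above can be sketched with a fuzzy integral over the cohort scores. The abstract does not specify which fuzzy integral is used, so the Choquet form with a Sugeno λ-fuzzy measure is an assumption here, as are all function names; the genetic-algorithm search is out of scope, so the fuzzy densities are plain inputs rather than tuned parameters.

```python
import math

def sugeno_lambda(densities, iters=200):
    """Solve prod(1 + lam*g_i) = 1 + lam for the Sugeno λ-measure
    parameter, given at least two densities, each in (0, 1).
    λ = 0 (an additive measure) when the densities sum to one.
    """
    s = sum(densities)
    if abs(s - 1.0) < 1e-12:
        return 0.0
    f = lambda lam: math.prod(1 + lam * g for g in densities) - (1 + lam)
    if s < 1.0:                      # sub-additive densities -> λ > 0
        lo, hi = 1e-12, 1.0
        while f(hi) < 0:             # grow bracket until the sign flips
            hi *= 2.0
    else:                            # super-additive densities -> -1 < λ < 0
        lo, hi = -1.0 + 1e-12, -1e-12
    for _ in range(iters):           # plain bisection on the sign change
        mid = 0.5 * (lo + hi)
        if f(lo) * f(mid) <= 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def choquet(scores, densities):
    """Choquet integral of cohort scores w.r.t. the Sugeno λ-measure:
    sort scores descending and weight each by the measure increment of
    the growing coalition of cohort speakers."""
    lam = sugeno_lambda(densities)
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    total, g_prev = 0.0, 0.0
    for i in order:
        g_cur = densities[i] + g_prev + lam * densities[i] * g_prev
        total += scores[i] * (g_cur - g_prev)
        g_prev = g_cur
    return total

def normalized_score(claimant_loglik, cohort_logliks, densities):
    # The conventional normalization term (e.g. the mean cohort score)
    # is replaced by the non-linear Choquet fusion; in the paper the
    # densities would be found offline by a genetic algorithm.
    return claimant_loglik - choquet(cohort_logliks, densities)
```

When the densities sum to one, λ = 0 and the Choquet integral reduces to a weighted average of the cohort scores, i.e. the conventional normalization with unequal priors; unequal density sums are what introduce the non-linear interaction between cohort speakers.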

