Speaker verification with long-term ageing data

Author(s):  
Finnian Kelly ◽  
Andrzej Drygajlo ◽  
Naomi Harte
2016 ◽  
Vol 79 ◽  
pp. 14-29 ◽  
Author(s):  
Linlin Wang ◽  
Jun Wang ◽  
Lantian Li ◽  
Thomas Fang Zheng ◽  
Frank K. Soong

2017 ◽  
Vol 17 (4) ◽  
pp. 114-133
Author(s):  
Atanas Ouzounov

Abstract: This paper proposes a new contour-based speech endpoint detector that combines the log-Group Delay Mean-Delta (log-GDMD) feature, an adaptive two-threshold scheme and an eight-state automaton. The adaptive threshold scheme uses two pairs of thresholds, one for the starting point and one for the ending point. Each pair of thresholds is calculated from the contour characteristics in the corresponding region of the utterance. The experimental results show that the proposed detector achieves better endpoint accuracy than the Long-Term Spectral Divergence (LTSD) detector. Additional fixed-text speaker verification tests with short phrases of telephone speech, based on the Dynamic Time Warping (DTW) and left-to-right Hidden Markov Model (HMM) frameworks, confirm that the better endpoint accuracy improves the verification rate.
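The two-threshold idea in this abstract can be illustrated with a minimal sketch. It is not the paper's detector: it assumes a generic per-frame contour (for example, log-energy) instead of log-GDMD, takes the two threshold pairs as given rather than computing them adaptively, and replaces the eight-state automaton with a simple search. The function name `detect_endpoints` and all parameter names are hypothetical.

```python
def detect_endpoints(contour, start_high, start_low, end_high, end_low):
    """Return (start, end) frame indices of the speech region, or None.

    Hypothetical simplification: each endpoint uses its own pair of
    thresholds (high to confirm speech, low to extend the boundary).
    """
    n = len(contour)

    # Starting point: find the first frame that clearly exceeds start_high.
    start = None
    for i, value in enumerate(contour):
        if value >= start_high:
            start = i
            break
    if start is None:
        return None  # no frame is confidently speech

    # Walk backwards while the contour stays above the lower start threshold.
    while start > 0 and contour[start - 1] >= start_low:
        start -= 1

    # Ending point: find the last frame that clearly exceeds end_high.
    end = None
    for i in range(n - 1, -1, -1):
        if contour[i] >= end_high:
            end = i
            break
    if end is None or end < start:
        return None

    # Walk forwards while the contour stays above the lower end threshold.
    while end < n - 1 and contour[end + 1] >= end_low:
        end += 1

    return start, end


# Toy contour: silence, a rise, a speech plateau, a decay, silence.
contour = [0.1, 0.2, 0.6, 1.5, 2.0, 1.8, 0.7, 0.3, 0.1]
endpoints = detect_endpoints(contour, start_high=1.0, start_low=0.5,
                             end_high=1.0, end_low=0.5)
print(endpoints)
```

Using separate pairs for the start and the end lets the onset and the decay of an utterance be treated differently, which is the motivation the abstract gives for computing each pair from its own region of the contour.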


2014 ◽  
Author(s):  
Finnian Kelly ◽  
Rahim Saeidi ◽  
Naomi Harte ◽  
David A. van Leeuwen
Author(s):  
Alex Marino Gonçalves De Almeida ◽  
Claudineia Helena Recco ◽  
Rodrigo Capobianco Guido

State-of-the-art models for speech synthesis and voice conversion can generate synthetic speech that is perceptually indistinguishable from human speech, so speaker verification is crucial to prevent breaches. Which features best distinguish genuine speech from spoof attacks remains an open research question. For this investigation, we used the ASVSpoof2017 baseline, a Transfer Learning (TL) set, and Symlet and Daubechies Discrete Wavelet Packet Transform (DWPT) features. To qualitatively assess the features, we used Paraconsistent Feature Engineering (PFE). Our experiments indicated that, where more robust classifiers are to be used, the best choice would be the AlexNet method, while in terms of the Equal Error Rate metric the best choice would be the Daubechies filter with support 21. Finally, our findings indicate that the Symlet filter with support 17 is the most promising feature, which is evidence that PFE is a useful tool that contributes to feature selection.
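The Equal Error Rate mentioned in this abstract is the operating point at which the false acceptance rate equals the false rejection rate. The following is a minimal sketch of how it can be estimated from two lists of scores. It assumes that higher scores mean "more likely genuine"; the function name `equal_error_rate` and the toy scores are hypothetical and are not taken from the paper.

```python
def equal_error_rate(genuine_scores, impostor_scores):
    """Estimate the EER by sweeping every observed score as a threshold.

    A trial is accepted when its score is >= the threshold.
    """
    thresholds = sorted(set(genuine_scores) | set(impostor_scores))
    best_gap = None
    eer = None
    for t in thresholds:
        # False acceptance rate: impostor (spoof) trials that are accepted.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        # False rejection rate: genuine trials that are rejected.
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best_gap is None or gap < best_gap:
            best_gap = gap
            eer = (far + frr) / 2
    return eer


# Toy scores, invented purely for illustration.
genuine = [0.9, 0.8, 0.7, 0.4]
impostor = [0.6, 0.3, 0.2, 0.1]
print(equal_error_rate(genuine, impostor))  # 0.25
```

With a finite set of trials the two error rates rarely cross exactly, so this sketch reports the average of the two at the closest threshold; evaluation toolkits often interpolate instead.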


2019 ◽  
Vol 42 ◽  
Author(s):  
John P. A. Ioannidis

Abstract: Neurobiology-based interventions for mental diseases and searches for useful biomarkers of treatment response have largely failed. Clinical trials should assess interventions related to environmental and social stressors, with long-term follow-up; social rather than biological endpoints; personalized outcomes; and suitable cluster, adaptive, and n-of-1 designs. Labor, education, financial, and other social/political decisions should be evaluated for their impacts on mental disease.


2016 ◽  
Vol 39 ◽  
Author(s):  
Mary C. Potter

Abstract: Rapid serial visual presentation (RSVP) of words or pictured scenes provides evidence for a large-capacity conceptual short-term memory (CSTM) that momentarily provides rich associated material from long-term memory, permitting rapid chunking (Potter 1993; 2009; 2012). In perception of scenes as well as language comprehension, we make use of knowledge that briefly exceeds the supposed limits of working memory.

