Tree-based context clustering using speech recognition features for acoustic model training of speech synthesis

2015 12th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON) ◽

10.1109/ecticon.2015.7207094 ◽

2015 ◽

Author(s):

Supadaech Chanjaradwichai ◽

Atiwong Suchato ◽

Proadpran Punyabukkana

Keyword(s):

Speech Recognition ◽

Speech Synthesis ◽

Acoustic Model ◽

Model Training

Download Full-text

Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis

10.21437/interspeech.2006-389 ◽

2006 ◽

Author(s):

Katsumi Ogata ◽

Makoto Tachibana ◽

Junichi Yamagishi ◽

Takao Kobayashi

Keyword(s):

Linear Transformation ◽

Speech Synthesis ◽

Acoustic Model ◽

Model Training

Download Full-text

Robust Acoustic Model Training Against Phoneme Variations for Large Vocabulary Continuous Speech Recognition

Signal and Image Processing ◽

10.2316/p.2012.759-070 ◽

2012 ◽

Author(s):

Gil Ho Lee ◽

Nam Soo Kim

Keyword(s):

Speech Recognition ◽

Acoustic Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Large Vocabulary ◽

Model Training

Download Full-text

Generative approach for robust acoustic model training for blindly separated speech recognition

The Journal of the Acoustical Society of America ◽

10.1121/1.4805203 ◽

2013 ◽

Vol 133 (5) ◽

pp. 3245-3245

Author(s):

Norihide Kitaoka ◽

Yuto Dekiura ◽

Kazuya Takeda

Keyword(s):

Speech Recognition ◽

Acoustic Model ◽

Model Training ◽

Generative Approach

Download Full-text

Generative approach for robust acoustic model training for blindly separated speech recognition

10.1121/1.4800633 ◽

2013 ◽

Cited By ~ 1

Author(s):

Yuto Dekiura ◽

Norihide Kitaoka ◽

Kazuya Takeda

Keyword(s):

Speech Recognition ◽

Acoustic Model ◽

Model Training ◽

Generative Approach

Download Full-text

Pipelining Acoustic Model Training for Speech Recognition Using Storm

2013 Fifth International Conference on Computational Intelligence, Modelling and Simulation ◽

10.1109/cimsim.2013.42 ◽

2013 ◽

Cited By ~ 1

Author(s):

Dinkar Sitaram ◽

Haripriya Srinivasaraghavan ◽

Kapish Agarwal ◽

Amritanshu Agrawal ◽

Neha Joshi ◽

...

Keyword(s):

Speech Recognition ◽

Acoustic Model ◽

Model Training

Download Full-text

Cross-language bootstrapping for unsupervised acoustic model training: rapid development of a Polish speech recognition system

10.21437/interspeech.2009-20 ◽

2009 ◽

Author(s):

Jonas Lööf ◽

Christian Gollan ◽

Hermann Ney

Keyword(s):

Speech Recognition ◽

Rapid Development ◽

Recognition System ◽

Speech Recognition System ◽

Acoustic Model ◽

Model Training ◽

Cross Language

Download Full-text

Acoustic Model Training, using Kaldi, for Automatic Whispery Speech Recognition

Position Papers of the 2018 Federated Conference on Computer Science and Information Systems ◽

10.15439/2018f255 ◽

2018 ◽

Cited By ~ 1

Author(s):

Piotr Kozierski ◽

Talar Sadalla ◽

Szymon Drgas ◽

Adam Dąbrowski ◽

Joanna Ziętkiewicz ◽

...

Keyword(s):

Speech Recognition ◽

Acoustic Model ◽

Model Training

Download Full-text

Lightly Supervised Acoustic Model Training for Mandarin Continuous Speech Recognition

Intelligent Science and Intelligent Data Engineering - Lecture Notes in Computer Science ◽

10.1007/978-3-642-36669-7_88 ◽

2013 ◽

pp. 727-734

Author(s):

Xiangang Li ◽

Zaihu Pang ◽

Xihong Wu

Keyword(s):

Speech Recognition ◽

Acoustic Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Model Training

Download Full-text

An i-vector based approach to acoustic sniffing for irrelevant variability normalization based acoustic model training and speech recognition

10.21437/interspeech.2011-186 ◽

2011 ◽

Author(s):

Jian Xu ◽

Yu Zhang ◽

Zhi-Jie Yan ◽

Qiang Huo

Keyword(s):

Speech Recognition ◽

Acoustic Model ◽

Model Training

Download Full-text

Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training

Frontiers in Artificial Intelligence and Applications - Human Language Technologies – The Baltic Perspective ◽

10.3233/faia200601 ◽

2020 ◽

Author(s):

Askars Salimbajevs

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

State Of The Art ◽

Speaker Verification ◽

Voice Conversion ◽

Acoustic Model ◽

Acoustic Models ◽

Speech Data ◽

Model Training ◽

The Voice

Automatic Speech Recognition (ASR) requires huge amounts of real user speech data to reach state-of-the-art performance. However, speech data conveys sensitive speaker attributes like identity that can be inferred and exploited for malicious purposes. Therefore, there is an interest in the collection of anonymized speech data that is processed by some voice conversion method. In this paper, we evaluate one of the voice conversion methods on Latvian speech data and also investigate if privacy-transformed data can be used to improve ASR acoustic models. Results show the effectiveness of voice conversion against state-of-the-art speaker verification models on Latvian speech and the effectiveness of using privacy-transformed data in ASR training.

Download Full-text