Detailed pronunciation variant modeling for speech transcription

The Vicomtech-PRHLT Speech Transcription Systems for the IberSPEECH-RTVE 2018 Speech to Text Transcription Challenge

10.21437/iberspeech.2018-56 ◽

2018 ◽

Cited By ~ 1

Author(s):

Haritz Arzelus ◽

Aitor Alvarez ◽

Conrad Bernath ◽

Eneritz García ◽

Emilio Granell ◽

...

Keyword(s):

Speech Transcription

Download Full-text

On Employing a Highly Mismatched Crowd for Speech Transcription

10.21437/interspeech.2016-673 ◽

2016 ◽

Author(s):

Purushotam Radadia ◽

Rahul Kumar ◽

Kanika Kalra ◽

Shirish Karande ◽

Sachin Lodha

Keyword(s):

Speech Transcription

Download Full-text

Lithuanian Broadcast Speech Transcription Using Semi-supervised Acoustic Model Training

Procedia Computer Science ◽

10.1016/j.procs.2016.04.037 ◽

2016 ◽

Vol 81 ◽

pp. 107-113 ◽

Cited By ~ 6

Author(s):

Rasa Lileikytė ◽

Arseniy Gorin ◽

Lori Lamel ◽

Jean-Luc Gauvain ◽

Thiago Fraga-Silva

Keyword(s):

Acoustic Model ◽

Model Training ◽

Speech Transcription

Download Full-text

Advances in speech transcription at IBM under the DARPA EARS program

IEEE Transactions on Audio Speech and Language Processing ◽

10.1109/tasl.2006.879814 ◽

2006 ◽

Vol 14 (5) ◽

pp. 1596-1608 ◽

Cited By ~ 59

Author(s):

S.F. Chen ◽

B. Kingsbury ◽

Lidia Mangu ◽

D. Povey ◽

G. Saon ◽

...

Keyword(s):

Speech Transcription

Download Full-text

Integrating hidden Markov model and PRAAT: a toolbox for robust automatic speech transcription

10.1117/12.872211 ◽

2010 ◽

Author(s):

A. Kabir ◽

J. Barker ◽

M. Giurgiu

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

Speech Transcription

Download Full-text

Improving automatic forced alignment for dysarthric speech transcription

10.21437/interspeech.2015-619 ◽

2015 ◽

Author(s):

Yu Ting Yeung ◽

Ka Ho Wong ◽

Helen Meng

Keyword(s):

Dysarthric Speech ◽

Speech Transcription

Download Full-text

Reducing Costs in Human Assisted Speech Transcription

10.15368/theses.2016.10 ◽

2016 ◽

Author(s):

Justin Rovin

Keyword(s):

Reducing Costs ◽

Speech Transcription

Download Full-text

TEDxSK and JumpSK: A New Slovak Speech Recognition Dedicated Corpus

Journal of Linguistics/Jazykovedný casopis ◽

10.1515/jazcas-2017-0044 ◽

2017 ◽

Vol 68 (2) ◽

pp. 346-354

Author(s):

Ján Staš ◽

Daniel Hládek ◽

Peter Viszlay ◽

Tomáš Koctúr

Keyword(s):

Speech Recognition ◽

Total Duration ◽

Principal Component ◽

Speech Segmentation ◽

Word Error Rate ◽

Speech Database ◽

Recognition Systems ◽

Speech Transcription ◽

Speech Segments

Abstract This paper describes a new Slovak speech recognition dedicated corpus built from TEDx talks and Jump Slovakia lectures. The proposed speech database consists of 220 talks and lectures in total duration of about 58 hours. Annotated speech database was generated automatically in an unsupervised manner by using acoustic speech segmentation based on principal component analysis and automatic speech transcription using two complementary speech recognition systems. The evaluation data consisting of 50 manually annotated talks and lectures in total duration of about 12 hours, has been created for evaluation of the quality of Slovak speech recognition. By unsupervised automatic annotation of TEDx talks and Jump Slovakia lectures we have obtained 21.26% of new speech segments with approximately 9.44% word error rate, suitable for retraining or adaptation of acoustic models trained beforehand.

Download Full-text