Towards Robust Indonesian Speech Recognition with Spontaneous-Speech Adapted Acoustic Models

Procedia Computer Science ◽

10.1016/j.procs.2016.04.045 ◽

2016 ◽

Vol 81 ◽

pp. 167-173 ◽

Cited By ~ 7

Author(s):

Devin Hoesen ◽

Cil Hardianto Satriawan ◽

Dessi Puji Lestari ◽

Masayu Leylia Khodra

Keyword(s):

Speech Recognition ◽

Spontaneous Speech ◽

Acoustic Models

Download Full-text

Improving Acoustic Models for Russian Spontaneous Speech Recognition

Speech and Computer - Lecture Notes in Computer Science ◽

10.1007/978-3-319-23132-7_29 ◽

2015 ◽

pp. 234-242 ◽

Cited By ~ 10

Author(s):

Alexey Prudnikov ◽

Ivan Medennikov ◽

Valentin Mendelev ◽

Maxim Korenevsky ◽

Yuri Khokhlov

Keyword(s):

Speech Recognition ◽

Spontaneous Speech ◽

Acoustic Models

Download Full-text

Improved acoustic models for spontaneous speech recognition

The Journal of the Acoustical Society of America ◽

10.1121/1.4708075 ◽

2012 ◽

Vol 131 (4) ◽

pp. 3236-3236

Author(s):

Qingqing Zhang ◽

Shang Cai ◽

Jielin Pan ◽

Yonghong Yan

Keyword(s):

Speech Recognition ◽

Spontaneous Speech ◽

Acoustic Models

Download Full-text

Use of Global and Acoustic Features Associated with Contextual Factors to Adapt Language Models for Spontaneous Speech Recognition

10.21437/interspeech.2017-717 ◽

2017 ◽

Cited By ~ 3

Author(s):

Shohei Toyama ◽

Daisuke Saito ◽

Nobuaki Minematsu

Keyword(s):

Speech Recognition ◽

Contextual Factors ◽

Spontaneous Speech ◽

Language Models ◽

Acoustic Features

Download Full-text

CTC Training of Multi-Phone Acoustic Models for Speech Recognition

10.21437/interspeech.2017-505 ◽

2017 ◽

Cited By ~ 3

Author(s):

Olivier Siohan

Keyword(s):

Speech Recognition ◽

Acoustic Models

Download Full-text

Acoustic models adaptation in large vocabulary continuous mandarin speech recognition for non-native speakers

Proceedings 7th International Conference on Signal Processing, 2004. Proceedings. ICSP '04. 2004. ◽

10.1109/icosp.2004.1452756 ◽

2005 ◽

Author(s):

Jian Yang ◽

Yuanyuan Pu ◽

Hong Wei ◽

Zhengpeng Zhao

Keyword(s):

Speech Recognition ◽

Native Speakers ◽

Large Vocabulary ◽

Acoustic Models ◽

Mandarin Speech Recognition

Download Full-text

Limited Training Data Robust Speech Recognition Using Kernel-Based Acoustic Models

2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings ◽

10.1109/icassp.2006.1660226 ◽

2006 ◽

Cited By ~ 1

Author(s):

M. Schaffoner ◽

S.E. Kruger ◽

E. Andelic ◽

M. Katz ◽

A. Wendemuth

Keyword(s):

Speech Recognition ◽

Training Data ◽

Robust Speech Recognition ◽

Acoustic Models

Download Full-text

Evaluation of English Speech Recognition for Japanese Learners Using DNN-Based Acoustic Models

Recent Advances in Intelligent Information Hiding and Multimedia Signal Processing - Smart Innovation, Systems and Technologies ◽

10.1007/978-3-030-03748-2_11 ◽

2018 ◽

pp. 93-100 ◽

Cited By ~ 1

Author(s):

Jiang Fu ◽

Yuya Chiba ◽

Takashi Nose ◽

Akinori Ito

Keyword(s):

Speech Recognition ◽

Acoustic Models ◽

Japanese Learners

Download Full-text

Fast and accurate recurrent neural network acoustic models for speech recognition

10.21437/interspeech.2015-350 ◽

2015 ◽

Cited By ~ 3

Author(s):

Haşim Sak ◽

Andrew Senior ◽

Kanishka Rao ◽

Françoise Beaufays

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Recurrent Neural Network ◽

Acoustic Models

Download Full-text

A De Novo Divide-and-Merge Paradigm for Acoustic Model Optimization in Automatic Speech Recognition

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/513 ◽

2020 ◽

Author(s):

Conghui Tan ◽

Di Jiang ◽

Jinhua Peng ◽

Xueyang Wu ◽

Qian Xu ◽

...

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

De Novo ◽

Superior Performance ◽

Acoustic Model ◽

Acoustic Models ◽

Public Data ◽

Speech Data ◽

Low Efficiency ◽

Novel Algorithms

Due to the rising awareness of privacy protection and the voluminous scale of speech data, it is becoming infeasible for Automatic Speech Recognition (ASR) system developers to train the acoustic model with complete data as before. In this paper, we propose a novel Divide-and-Merge paradigm to solve salient problems plaguing the ASR field. In the Divide phase, multiple acoustic models are trained based upon different subsets of the complete speech data, while in the Merge phase two novel algorithms are utilized to generate a high-quality acoustic model based upon those trained on data subsets. We first propose the Genetic Merge Algorithm (GMA), which is a highly specialized algorithm for optimizing acoustic models but suffers from low efficiency. We further propose the SGD-Based Optimizational Merge Algorithm (SOMA), which effectively alleviates the efficiency bottleneck of GMA and maintains superior performance. Extensive experiments on public data show that the proposed methods can significantly outperform the state-of-the-art.

Download Full-text

Boosting Thai Syllable Speech Recognition Using Acoustic Models Combination

2008 International Conference on Computer and Electrical Engineering ◽

10.1109/iccee.2008.130 ◽

2008 ◽

Cited By ~ 7

Author(s):

Supachai Tangwongsan ◽

Rong Phoophuangpairoj

Keyword(s):

Speech Recognition ◽

Acoustic Models

Download Full-text