Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages

Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages

10.21437/interspeech.2014-212 ◽

2014 ◽

Author(s):

Shakti P. Rath ◽

Kate M. Knill ◽

Anton Ragni ◽

Mark J. F. Gales

Keyword(s):

Speech Recognition ◽

Hybrid Systems ◽

Keyword Spotting ◽

Low Resource

Download Full-text

Enhancing low resource keyword spotting with automatically retrieved web documents

10.21437/interspeech.2015-262 ◽

2015 ◽

Author(s):

Le Zhang ◽

Damianos Karakos ◽

William Hartmann ◽

Roger Hsiao ◽

Richard Schwartz ◽

...

Keyword(s):

Keyword Spotting ◽

Web Documents ◽

Low Resource

Download Full-text

Generalisation Gap of Keyword Spotters in a Cross-Speaker Low-Resource Scenario

Sensors ◽

10.3390/s21248313 ◽

2021 ◽

Vol 21 (24) ◽

pp. 8313

Author(s):

Łukasz Lepak ◽

Kacper Radzikowski ◽

Robert Nowak ◽

Karol J. Piczak

Keyword(s):

Distribution Data ◽

Acoustic Similarity ◽

Call Centre ◽

Keyword Spotting ◽

Data Annotation ◽

Low Resource ◽

Generic Models ◽

Training Resources ◽

Cross Lingual ◽

Cross Language

Models for keyword spotting in continuous recordings can significantly improve the experience of navigating vast libraries of audio recordings. In this paper, we describe the development of such a keyword spotting system detecting regions of interest in Polish call centre conversations. Unfortunately, in spite of recent advancements in automatic speech recognition systems, human-level transcription accuracy reported on English benchmarks does not reflect the performance achievable in low-resource languages, such as Polish. Therefore, in this work, we shift our focus from complete speech-to-text conversion to acoustic similarity matching in the hope of reducing the demand for data annotation. As our primary approach, we evaluate Siamese and prototypical neural networks trained on several datasets of English and Polish recordings. While we obtain usable results in English, our models’ performance remains unsatisfactory when applied to Polish speech, both after mono- and cross-lingual training. This performance gap shows that generalisation with limited training resources is a significant obstacle for actual deployments in low-resource languages. As a potential countermeasure, we implement a detector using audio embeddings generated with a generic pre-trained model provided by Google. It has a much more favourable profile when applied in a cross-lingual setup to detect Polish audio patterns. Nevertheless, despite these promising results, its performance on out-of-distribution data are still far from stellar. It would indicate that, in spite of the richness of internal representations created by more generic models, such speech embeddings are not entirely malleable to cross-language transfer.

Download Full-text

Usage of acoustic cues in spoken term detection keyword spotting for zero low resource languages

Fifth International Conference on Advances in Computing, Communication and Information Technology - CCIT 2017 ◽

10.15224/978-1-63248-131-3-54 ◽

2017 ◽

Author(s):

PATHA SREEDHAR ◽

SURYAKANTH V

Keyword(s):

Keyword Spotting ◽

Acoustic Cues ◽

Spoken Term Detection ◽

Low Resource

Download Full-text

Investigation of DNN-Based Keyword Spotting in Low Resource Environments

International Journal of Future Computer and Communication ◽

10.18178/ijfcc.2016.5.2.458 ◽

2016 ◽

Vol 5 (2) ◽

pp. 125-129 ◽

Cited By ~ 1

Author(s):

Kaixiang Shen ◽

◽

Meng Cai ◽

Wei-Qiang Zhang ◽

Yao Tian ◽

...

Keyword(s):

Keyword Spotting ◽

Low Resource

Download Full-text

Low-Resource Speech Recognition and Keyword-Spotting

Speech and Computer - Lecture Notes in Computer Science ◽

10.1007/978-3-319-66429-3_1 ◽

2017 ◽

pp. 3-19 ◽

Cited By ~ 2

Author(s):

Mark J. F. Gales ◽

Kate M. Knill ◽

Anton Ragni

Keyword(s):

Speech Recognition ◽

Keyword Spotting ◽

Low Resource

Download Full-text

Low resource point process models for keyword spotting using unsupervised online learning

2017 25th European Signal Processing Conference (EUSIPCO) ◽

10.23919/eusipco.2017.8081265 ◽

2017 ◽

Author(s):

Samik Sadhu ◽

Prasanta Kumar Ghosh

Keyword(s):

Online Learning ◽

Point Process ◽

Process Models ◽

Keyword Spotting ◽

Low Resource ◽

Point Process Models

Download Full-text

Feature learning for efficient ASR-free keyword spotting in low-resource languages

Computer Speech & Language ◽

10.1016/j.csl.2021.101275 ◽

2021 ◽

pp. 101275

Author(s):

Ewald van der Westhuizen ◽

Herman Kamper ◽

Raghav Menon ◽

John Quinn ◽

Thomas Niesler

Keyword(s):

Feature Learning ◽

Keyword Spotting ◽

Low Resource

Download Full-text

Handbook of Hybrid Systems Control

10.1017/cbo9780511807930 ◽

2009 ◽

Cited By ~ 141

Keyword(s):

Hybrid Systems ◽

Systems Control

Download Full-text

Diagnosis of neonatal sepsis in low resource settings: C-reactive protein or procalcitonin?

Journal of Pediatric Biochemistry ◽

10.1055/s-0036-1586432 ◽

2016 ◽

Vol 03 (02) ◽

pp. 079-083

Author(s):

Lawrence Mbuagbaw ◽

Francisca Monebenimp ◽

Bolaji Obadeyi ◽

Grace Bissohong ◽

Marie-Thérèse Obama ◽

...

Keyword(s):

Neonatal Sepsis ◽

C Reactive Protein ◽

Low Resource Settings ◽

Low Resource ◽

Reactive Protein

Download Full-text