Two-step correction of speech recognition errors based on n-gram and long contextual information

Author(s): Ryohei Nakatani, Tetsuya Takiguchi, Yasuo Ariki
2020, Vol 2020, pp. 1-18
Author(s): Sonia Setia, Jyoti Verma, Neelam Duhan

The continuous growth of the World Wide Web has led to the problem of long access delays. To reduce this delay, prefetching techniques predict users' browsing behavior and fetch web pages before the user explicitly requests them. Making near-accurate predictions of users' search behavior is a complex task that researchers have faced for many years, and various web mining techniques have been applied to it. However, each of these methods has its own drawbacks. In this paper, a novel approach is proposed: a hybrid prediction model that integrates usage mining and content mining techniques to tackle the individual shortcomings of both approaches. The proposed method uses N-gram parsing along with the click counts of queries to capture more contextual information and thereby improve the prediction of web pages. Evaluation of the proposed hybrid approach on AOL search logs shows, on average, a 26% increase in prediction precision and a 10% increase in hit ratio compared to other mining techniques.
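The combination the abstract describes, transition statistics from usage logs plus click counts as a relevance signal, can be sketched as follows. This is an illustrative toy model, not the paper's actual formulation: the bigram representation, the multiplicative click-count weighting, and all names here are assumptions.

```python
from collections import defaultdict

# Hypothetical sketch of a hybrid next-page predictor: n-gram (here bigram)
# transitions over page-visit sessions, reweighted by per-page click counts.

def build_bigram_model(sessions):
    """Count page-to-page transitions across user sessions (usage mining)."""
    transitions = defaultdict(lambda: defaultdict(int))
    for session in sessions:
        for prev, nxt in zip(session, session[1:]):
            transitions[prev][nxt] += 1
    return transitions

def predict_next(transitions, current_page, click_counts, top_k=3):
    """Rank candidate next pages by transition count times click weight."""
    candidates = transitions.get(current_page, {})
    scored = {page: count * (1 + click_counts.get(page, 0))
              for page, count in candidates.items()}
    return sorted(scored, key=scored.get, reverse=True)[:top_k]

sessions = [["home", "news", "sports"],
            ["home", "news", "weather"],
            ["home", "mail"]]
clicks = {"sports": 5, "weather": 1, "mail": 2}
model = build_bigram_model(sessions)
print(predict_next(model, "news", clicks))  # ['sports', 'weather']
```

The pages ranked this way would then be prefetched into a cache; a higher-order n-gram would capture longer query context at the cost of sparser counts.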


2006, Vol 32 (3), pp. 417-438
Author(s): Diane Litman, Julia Hirschberg, Marc Swerts

This article focuses on the analysis and prediction of corrections, defined as turns in which a user tries to correct a prior error made by a spoken dialogue system. We describe our labeling procedure for various correction types and statistical analyses of their features in a corpus collected from a train information spoken dialogue system. We then present the results of machine-learning experiments designed to identify user corrections of speech recognition errors. We investigate the predictive power of features automatically computable from the prosody of the turn, the speech recognition process, experimental conditions, and the dialogue history. Our best-performing features reduce classification error from baselines of 25.70–28.99% to 15.72%.
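The feature families the abstract names (prosody, recognizer output, dialogue history) could feed a correction classifier along these lines. This is a toy hand-written rule, not the machine-learned models the article evaluates; every feature name and threshold here is a hypothetical stand-in.

```python
# Illustrative sketch of flagging a user turn as a correction from
# automatically computable features. The intuition: corrections are often
# hyperarticulated (e.g. raised pitch) and tend to follow turns the system
# misrecognized or rejected. Thresholds below are invented for the example.

def extract_features(turn):
    """Features drawn from prosody, ASR output, and dialogue history."""
    return {
        "f0_max": turn["f0_max"],                # prosody: pitch peak (Hz)
        "asr_confidence": turn["asr_conf"],      # recognizer confidence
        "prev_turn_rejected": turn["prev_rej"],  # dialogue history
    }

def is_correction(turn, f0_thresh=220.0, conf_thresh=0.4):
    """Toy rule: high-pitch turns after a rejection, or very low-confidence
    turns, are flagged as likely corrections."""
    f = extract_features(turn)
    if f["prev_turn_rejected"] and f["f0_max"] > f0_thresh:
        return True
    return f["asr_confidence"] < conf_thresh

turn = {"f0_max": 250.0, "asr_conf": 0.8, "prev_rej": True}
print(is_correction(turn))  # True
```

In the article's setting, a learned model (e.g. a decision tree over such features) replaces the hand-set thresholds.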


2020, pp. 1237-1247
Author(s): Xiangdong Wang, Yang Yang, Hong Liu, Yueliang Qian, Duan Jia

In real-world applications of speech recognition, recognition errors are inevitable and manual correction is necessary. This paper presents an approach for refining Mandarin speech recognition results by exploiting user feedback. An interface incorporating character-based candidate lists and feedback-driven updating of those lists is introduced. For dynamic updating of the candidate lists, a novel method based on lattice modification and rescoring is proposed. By adding words with pronunciations similar to the candidates adjacent to the corrected character into the lattice and then rescoring the modified lattice, the proposed method improves the accuracy of the candidate lists even when the correct characters are not in the original lattice, at a much lower computational cost than speech re-recognition. Experimental results show that the proposed method reduces user inputs by 24.03% and improves the average candidate rank by 25.31%.
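The lattice-modification idea can be illustrated with a minimal sketch: after the user corrects one character, characters sharing a pronunciation are inserted as extra arcs at the affected position, and candidates are re-ranked. The lattice representation, the homophone lexicon, and the flat insertion score below are all assumptions for illustration, not the paper's actual data structures or scoring.

```python
# Toy homophone lexicon (pinyin -> characters); an assumed resource.
HOMOPHONES = {"shi4": ["是", "事", "市"], "ji4": ["记", "技", "际"]}

def modify_lattice(lattice, position, pinyin):
    """Add arcs for same-pronunciation characters at the given position,
    so correct characters absent from the original lattice become reachable."""
    extra = [(ch, 0.1) for ch in HOMOPHONES.get(pinyin, [])
             if ch not in dict(lattice[position])]
    lattice[position] = lattice[position] + extra
    return lattice

def best_candidates(lattice, position, top_k=3):
    """Rescore: rank candidate characters at a position by arc score."""
    arcs = sorted(lattice[position], key=lambda a: a[1], reverse=True)
    return [ch for ch, _ in arcs[:top_k]]

# One position of a toy character lattice: (character, posterior score).
lattice = {1: [("是", 0.6), ("时", 0.3)]}
modify_lattice(lattice, 1, "shi4")
print(best_candidates(lattice, 1))  # ['是', '时', '事']
```

Because only the affected lattice positions are touched and rescored, this stays far cheaper than re-running recognition on the audio, which is the cost advantage the paper reports.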

