Learning N-Gram Language Models from Uncertain Data

Author(s):  
Vitaly Kuznetsov ◽  
Hank Liao ◽  
Mehryar Mohri ◽  
Michael Riley ◽  
Brian Roark
2020 ◽  
Author(s):  
Grant P. Strimel ◽  
Ariya Rastrow ◽  
Gautam Tiwari ◽  
Adrien Piérard ◽  
Jon Webb

Author(s):  
ROMAN BERTOLAMI ◽  
HORST BUNKE

Current multiple classifier systems for unconstrained handwritten text recognition do not provide a straightforward way to utilize language model information. In this paper, we describe a generic method to integrate a statistical n-gram language model into the combination of multiple offline handwritten text line recognizers. The proposed method first builds a word transition network and then rescores this network with an n-gram language model. Experimental evaluation conducted on a large dataset of offline handwritten text lines shows that the proposed approach improves the recognition accuracy over a reference system as well as over the original combination method that does not include a language model.


2008 ◽  
Author(s):  
Ahmad Emami ◽  
Imed Zitouni ◽  
Lidia Mangu
Keyword(s):  

Author(s):  
Varalakshmi Konagala ◽  
Shahana Bano

The engendering of uncertain data in ordinary access news sources, for example, news sites, web-based life channels, and online papers, have made it trying to recognize capable news sources, along these lines expanding the requirement for computational instruments ready to give into the unwavering quality of online substance. For instance, counterfeit news outlets were observed to be bound to utilize language that is abstract and enthusiastic. At the point when specialists are chipping away at building up an AI-based apparatus for identifying counterfeit news, there wasn't sufficient information to prepare their calculations; they did the main balanced thing. In this chapter, two novel datasets for the undertaking of phony news locations, covering distinctive news areas, distinguishing proof of phony substance in online news has been considered. N-gram model will distinguish phony substance consequently with an emphasis on phony audits and phony news. This was pursued by a lot of learning analyses to fabricate precise phony news identifiers and showed correctness of up to 80%.


2001 ◽  
Author(s):  
Gary E. Kopec ◽  
Maya R. Said ◽  
Kris Popat

Sign in / Sign up

Export Citation Format

Share Document