N-gram language models for document image decoding

Author(s):  
Gary E. Kopec ◽  
Maya R. Said ◽  
Kris Popat

This paper presents a feature extraction method for an optical Braille recognition (OBR) system to locate, extract and convert the Braille cells in one-sided Indian language Braille documents. The Braille cells are located by applying a grid-box designed from the physical properties of a Braille cell. A Braille document image is a collection of groups of six dots. The physical position of each dot and its relation to the neighboring dots in a single cell determine the Braille character. After the grid-box is mapped onto the Braille cells in the document, the mesh characters are extracted and then matched against an existing database to translate them into the required text. Mapping Braille cells to the grid-box and separating characters and words in a Braille document were challenging tasks. Unwanted or degraded dots may result in incorrect mapping of characters. In this paper we use N-gram language models to predict the word sequence when characters are mapped incorrectly during extraction and conversion of the Braille cells.
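The last step of the abstract above, using an n-gram model to recover from wrongly mapped characters, can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a bigram model with add-one smoothing trained on a toy corpus, and the function names (train_bigram, bigram_logprob, pick_word) and the candidate lists are hypothetical.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count bigrams and their left contexts over whitespace-tokenized sentences."""
    contexts = Counter()  # how often each word occurs as a left context
    bigrams = Counter()   # how often each (previous, current) pair occurs
    vocab = set()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split()
        contexts.update(tokens[:-1])
        bigrams.update(zip(tokens, tokens[1:]))
        vocab.update(tokens[1:])
    return contexts, bigrams, vocab


def bigram_logprob(prev, word, contexts, bigrams, vocab):
    """Add-one smoothed log P(word | prev); unseen pairs get a small nonzero mass."""
    return math.log((bigrams[(prev, word)] + 1) / (contexts[prev] + len(vocab)))


def pick_word(prev, candidates, contexts, bigrams, vocab):
    """Choose the candidate reading that the bigram model finds most likely after prev."""
    return max(
        candidates,
        key=lambda w: bigram_logprob(prev, w, contexts, bigrams, vocab),
    )


# Toy corpus standing in for real training text (an assumption of this sketch).
contexts, bigrams, vocab = train_bigram(["the cat sat", "the cat ran", "the dog sat"])
# Suppose a degraded dot makes a cell ambiguous, so the word reads "cot" or "cat".
print(pick_word("the", ["cot", "cat"], contexts, bigrams, vocab))
```

In this sketch the recognizer is assumed to supply a short list of alternative readings for a doubtful word; the language model only ranks them given the preceding word.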


Author(s):  
Vitaly Kuznetsov ◽  
Hank Liao ◽  
Mehryar Mohri ◽  
Michael Riley ◽  
Brian Roark

2020
Author(s):  
Grant P. Strimel ◽  
Ariya Rastrow ◽  
Gautam Tiwari ◽  
Adrien Piérard ◽  
Jon Webb

Author(s):  
Roman Bertolami ◽  
Horst Bunke

Current multiple classifier systems for unconstrained handwritten text recognition do not provide a straightforward way to utilize language model information. In this paper, we describe a generic method to integrate a statistical n-gram language model into the combination of multiple offline handwritten text line recognizers. The proposed method first builds a word transition network and then rescores this network with an n-gram language model. Experimental evaluation conducted on a large dataset of offline handwritten text lines shows that the proposed approach improves the recognition accuracy over a reference system as well as over the original combination method that does not include a language model.
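The two steps named in the abstract above, building a word transition network and rescoring it with an n-gram language model, can be sketched as follows. This is only an illustration under stated assumptions, not the authors' method: the network is assumed to be a linear sequence of slots, each holding alternative words with log-scores from the recognizer combination; the language model is a hand-written toy bigram table; and the names rescore, toy_lm and lm_weight are hypothetical.

```python
import math


def rescore(network, lm_logprob, lm_weight=1.0):
    """Return the best word sequence through a linear word transition network.

    network: list of slots; each slot is a list of (word, log_score) pairs.
    lm_logprob: function (previous_word, word) -> bigram log-probability.
    """
    # best maps the last word of a partial path to (total score, path so far).
    best = {"<s>": (0.0, [])}
    for slot in network:
        new_best = {}
        for word, score in slot:
            # Viterbi step: pick the best predecessor for this word.
            cand = max(
                (
                    (total + score + lm_weight * lm_logprob(prev, word), path + [word])
                    for prev, (total, path) in best.items()
                ),
                key=lambda t: t[0],
            )
            if word not in new_best or cand[0] > new_best[word][0]:
                new_best[word] = cand
        best = new_best
    return max(best.values(), key=lambda t: t[0])[1]


# Toy bigram model: a few favoured pairs, everything else is unlikely.
FAVOURED = {("<s>", "the"), ("the", "cat"), ("cat", "sat")}


def toy_lm(prev, word):
    return math.log(0.5 if (prev, word) in FAVOURED else 0.01)


# Toy network: the combined recognizers slightly prefer "cut" over "cat".
network = [
    [("the", math.log(0.6)), ("she", math.log(0.4))],
    [("cat", math.log(0.4)), ("cut", math.log(0.6))],
    [("sat", 0.0)],
]
print(rescore(network, toy_lm, lm_weight=0.0))  # recognizer scores only
print(rescore(network, toy_lm))                 # with the bigram model
```

With lm_weight set to 0 the path follows the recognizer scores alone; with the bigram term included, the language model can overturn a locally preferred but implausible word.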


2008
Author(s):  
Ahmad Emami ◽  
Imed Zitouni ◽  
Lidia Mangu