Optical character recognition of handwritten Arabic using hidden Markov models

An Efficient Algorithm for Estimating State Sequences in Imprecise Hidden Markov Models

Journal of Artificial Intelligence Research ◽

10.1613/jair.4385 ◽

2014 ◽

Vol 50 ◽

pp. 189-233 ◽

Cited By ~ 11

Author(s):

J. De Bock ◽

G. De Cooman

Keyword(s):

Hidden Markov Models ◽

Character Recognition ◽

Optical Character Recognition ◽

Markov Models ◽

Hidden Markov ◽

Exact Algorithm ◽

Mass Function ◽

The State ◽

Model Parameters ◽

State Sequence

We present an efficient exact algorithm for estimating state sequences from outputs or observations in imprecise hidden Markov models (iHMMs). The uncertainty linking one state to the next, and that linking a state to its output, is represented by a set of probability mass functions instead of a single such mass function. We consider as best estimates for state sequences the maximal sequences for the posterior joint state model conditioned on the observed output sequence, associated with a gain function that is the indicator of the state sequence. This corresponds to and generalises finding the state sequence with the highest posterior probability in (precise-probabilistic) HMMs, thereby making our algorithm a generalisation of the one by Viterbi. We argue that the computational complexity of our algorithm is at worst quadratic in the length of the iHMM, cubic in the number of states, and essentially linear in the number of maximal state sequences. An important feature of our imprecise approach is that there may be more than one maximal sequence, typically in those instances where its precise-probabilistic counterpart is sensitive to the choice of prior. For binary iHMMs, we investigate experimentally how the number of maximal state sequences depends on the model parameters. We also present an application in optical character recognition, demonstrating that our algorithm can be usefully applied to robustify the inferences made by its precise-probabilistic counterpart.
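For orientation, the precise-probabilistic baseline that the abstract says is being generalised (the Viterbi algorithm) can be sketched as follows. This is not the iHMM algorithm of the paper: it assumes a single mass function for the initial state, for each transition and for each emission, and the names `viterbi`, `init`, `trans`, `emit` and `obs` are illustrative choices, not taken from the paper.

```python
import math


def viterbi(init, trans, emit, obs):
    """Most probable state sequence of a precise HMM (illustrative sketch).

    init[s]     : probability of starting in state s
    trans[r][s] : probability of moving from state r to state s
    emit[s][o]  : probability of observing symbol o in state s
    obs         : non-empty list of observed symbol indices
    Returns (path, log_probability).
    """
    n = len(init)

    def log(p):
        # Work in log space to avoid underflow on long sequences.
        return math.log(p) if p > 0 else float("-inf")

    # delta[s]: best log-probability of any state path ending in s so far.
    delta = [log(init[s]) + log(emit[s][obs[0]]) for s in range(n)]
    back = []  # back[t][s]: best predecessor of s at step t + 1

    for o in obs[1:]:
        prev, delta, ptr = delta, [], []
        for s in range(n):
            best_r = max(range(n), key=lambda r: prev[r] + log(trans[r][s]))
            ptr.append(best_r)
            delta.append(prev[best_r] + log(trans[best_r][s]) + log(emit[s][o]))
        back.append(ptr)

    # Trace the best final state back to the start.
    last = max(range(n), key=lambda s: delta[s])
    path = [last]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    path.reverse()
    return path, delta[last]


# Toy two-state model (made-up numbers, for illustration only).
init = [0.6, 0.4]
trans = [[0.7, 0.3], [0.4, 0.6]]
emit = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]
path, log_p = viterbi(init, trans, emit, [0, 1, 2])
```

The sketch always returns exactly one sequence, which is the contrast the abstract draws: in the imprecise setting there may be several maximal sequences.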

HIDDEN MARKOV MODELS IN TEXT RECOGNITION

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001495000389 ◽

1995 ◽

Vol 09 (06) ◽

pp. 925-958 ◽

Cited By ~ 11

Author(s):

J.C. ANIGBOGU ◽

A. BELAÏD

Keyword(s):

Hidden Markov Models ◽

Character Recognition ◽

Markov Models ◽

Hidden Markov ◽

Majority Vote ◽

The Other ◽

Distance Measures ◽

Original Form ◽

Verification Methods ◽

Font Identification

A multi-level multifont character recognition system is presented. The system proceeds by first delimiting the context of the characters. To enhance system performance, typographical information is extracted and used for font identification before actual character recognition is performed. This has the advantage of reliable character identification as well as reproduction of the text in its original form. The font identification is based on decision trees in which the characters are automatically arranged in confusion classes according to the physical characteristics of the fonts. The character recognizers are built around first- and second-order hidden Markov models (HMMs) as well as Euclidean distance measures. The HMMs use the Viterbi and the Extended Viterbi algorithms, to which enhancements were made. Also present is a majority-vote system that polls the other systems for “advice” before deciding on the identity of a character. Among other things, this last system is shown to give better results than each of the other systems applied individually. Finally, the system uses combinations of stochastic and dictionary verification methods for word recognition and error correction.
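The majority-vote step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how ties are handled or whether votes are weighted, so the unweighted count and the `None` result on a tie are assumptions made here, and `majority_vote` is a hypothetical name.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label most recognizers agree on, or None on a tie.

    predictions: non-empty list of labels, one per recognizer
    (for example, one from each HMM and one from a distance classifier).
    """
    ranked = Counter(predictions).most_common()
    best_label, best_count = ranked[0]
    # Assumption: an exact tie for first place is reported as undecided.
    if len(ranked) > 1 and ranked[1][1] == best_count:
        return None
    return best_label


decided = majority_vote(["a", "a", "o"])
undecided = majority_vote(["a", "o"])
```

An undecided result could then be passed on to a later verification stage, such as the dictionary check the abstract describes for word recognition.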

Hidden Markov model based optical character recognition in the presence of deterministic transformations

Pattern Recognition ◽

10.1016/0031-3203(93)90178-y ◽

1993 ◽

Vol 26 (12) ◽

pp. 1813-1826 ◽

Cited By ~ 53

Author(s):

Oscar E Agazzi ◽

Shyh-shiaw Kuo

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Character Recognition ◽

Optical Character Recognition ◽

Hidden Markov ◽

Model Based ◽

Optical Character

Hidden Markov models for character recognition

10.1109/icassp.1989.266780 ◽

2003 ◽

Cited By ~ 4

Author(s):

J.A. Vlontzos ◽

S.Y. Kung

Keyword(s):

Hidden Markov Models ◽

Character Recognition ◽

Markov Models ◽

Hidden Markov

Hand-written Chinese character recognition by first and second order Hidden Markov Models and radical modeling

10.5353/th_b2777086 ◽

2003 ◽

Author(s):

Ho-ting Wong

Keyword(s):

Hidden Markov Models ◽

Character Recognition ◽

Chinese Character ◽

Markov Models ◽

Hidden Markov ◽

Second Order ◽

Chinese Character Recognition

An off-line oriental character recognition system (OOCRS): synergy of distortion modeling, hidden Markov models and vector quantization

Pattern Recognition ◽

10.1016/s0031-3203(01)00090-5 ◽

2002 ◽

Vol 35 (5) ◽

pp. 1007-1023

Author(s):

Khue Hiang Chan

Keyword(s):

Hidden Markov Models ◽

Vector Quantization ◽

Character Recognition ◽

Markov Models ◽

Hidden Markov ◽

Recognition System

A Survey on Arabic Handwritten Script Recognition Systems

International Journal of Artificial Intelligence and Machine Learning ◽

10.4018/ijaiml.20210701.oa9 ◽

2021 ◽

Vol 11 (2) ◽

pp. 1-17

Author(s):

Soumia Djaghbellou ◽

Abderraouf Bouziane ◽

Abdelouahab Attia ◽

Zahid Akhtar

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Research Field ◽

Research Directions ◽

Optical Character ◽

On Line ◽

Recognition Systems ◽

Active Research ◽

Handwritten Arabic ◽

Open Issues

Optical character recognition (OCR) is still an active research field in pattern recognition. OCR systems can identify, recognize and distinguish electronically between characters and texts, whether printed or handwritten. They can also transform such data into machine-processable form to facilitate interaction between user and machine in various applications. In this paper, we present the global structure of an OCR system, with its types (on-line and off-line), categories (printed and handwritten) and main steps. We also focus on off-line handwritten Arabic character recognition and provide a list of the main publicly available datasets. This paper also surveys the work carried out in recent years. Finally, some open issues and potential research directions are highlighted.

Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set

IEEE Access ◽

10.1109/access.2020.2970438 ◽

2020 ◽

Vol 8 ◽

pp. 23011-23021 ◽

Cited By ~ 2

Author(s):

Ahmed Talat Sahlol ◽

Mohamed Abd Elaziz ◽

Mohammed A. A. Al-Qaness ◽

Sunghwan Kim

Keyword(s):

Rough Set ◽

Optimization Algorithm ◽

Character Recognition ◽

Optical Character Recognition ◽

Whale Optimization Algorithm ◽

Optical Character ◽

Whale Optimization ◽

Neighborhood Rough Set ◽

Handwritten Arabic

Hidden Markov models for character recognition

IEEE Transactions on Image Processing ◽

10.1109/83.199925 ◽

1992 ◽

Vol 1 (4) ◽

pp. 539-543 ◽

Cited By ~ 35

Author(s):

J.A. Vlontzos ◽

S.Y. Kung

Keyword(s):

Hidden Markov Models ◽

Character Recognition ◽

Markov Models ◽

Hidden Markov

Printed Amazigh character recognition by a hybrid approach based on Hidden Markov Models and the Hough transform

2009 International Conference on Multimedia Computing and Systems ◽

10.1109/mmcs.2009.5256672 ◽

2009 ◽

Cited By ~ 3

Author(s):

M. Amrouch ◽

Y. Es saady ◽

A. Rachidi ◽

M. El Yassa ◽

D. Mammass

Keyword(s):

Hidden Markov Models ◽

Hough Transform ◽

Character Recognition ◽

Markov Models ◽

Hidden Markov ◽

Hybrid Approach
