General Models for Handwritten Text Recognition: Feasibility and State-of-the Art. German Kurrent as an Example

Boosting of Deep Convolutional Architectures for Arabic Handwriting Recognition

International Journal of Multimedia Data Engineering and Management ◽

10.4018/ijmdem.2019100102 ◽

2019 ◽

Vol 10 (4) ◽

pp. 26-45 ◽

Cited By ~ 1

Author(s):

Mohamed Elleuch ◽

Monji Kherallah

Keyword(s):

Character Recognition ◽

State Of The Art ◽

Handwriting Recognition ◽

Image Data ◽

Text Recognition ◽

Deep Belief Networks ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Accuracy Rates ◽

Hierarchical Representations

In recent years, deep learning (DL) based systems have become very popular for constructing hierarchical representations from unlabeled data. Moreover, DL approaches have been shown to exceed foregoing state of the art machine learning models in various areas, by pattern recognition being one of the more important cases. This paper applies Convolutional Deep Belief Networks (CDBN) to textual image data containing Arabic handwritten script (AHS) and evaluated it on two different databases characterized by the low/high-dimension property. In addition to the benefits provided by deep networks, the system is protected against over-fitting. Experimentally, the authors demonstrated that the extracted features are effective for handwritten character recognition and show very good performance comparable to the state of the art on handwritten text recognition. Yet using Dropout, the proposed CDBN architectures achieved a promising accuracy rates of 91.55% and 98.86% when applied to IFN/ENIT and HACDB databases, respectively.

Download Full-text

HTR for Greek Historical Handwritten Documents

Journal of Imaging ◽

10.3390/jimaging7120260 ◽

2021 ◽

Vol 7 (12) ◽

pp. 260

Author(s):

Lazaros Tsochatzidis ◽

Symeon Symeonidis ◽

Alexandros Papazoglou ◽

Ioannis Pratikakis

Keyword(s):

Network Architecture ◽

State Of The Art ◽

Text Recognition ◽

Historical Period ◽

Neural Network Architecture ◽

Handwritten Documents ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Historical Manuscripts

Offline handwritten text recognition (HTR) for historical documents aims for effective transcription by addressing challenges that originate from the low quality of manuscripts under study as well as from several particularities which are related to the historical period of writing. In this paper, the challenge in HTR is related to a focused goal of the transcription of Greek historical manuscripts that contain several particularities. To this end, in this paper, a convolutional recurrent neural network architecture is proposed that comprises octave convolution and recurrent units which use effective gated mechanisms. The proposed architecture has been evaluated on three newly created collections from Greek historical handwritten documents that will be made publicly available for research purposes as well as on standard datasets like IAM and RIMES. For evaluation we perform a concise study which shows that compared to state of the art architectures, the proposed one deals effectively with the challenging Greek historical manuscripts.

Download Full-text

Towards the Natural Language Processing as Spelling Correction for Offline Handwritten Text Recognition Systems

Applied Sciences ◽

10.3390/app10217711 ◽

2020 ◽

Vol 10 (21) ◽

pp. 7711

Author(s):

Arthur Flor de Sousa Neto ◽

Byron Leite Dantas Bezerra ◽

Alejandro Héctor Toselli

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Network Architecture ◽

State Of The Art ◽

Language Models ◽

Text Recognition ◽

Spelling Correction ◽

Handwritten Text ◽

Handwritten Text Recognition

The increasing portability of physical manuscripts to the digital environment makes it common for systems to offer automatic mechanisms for offline Handwritten Text Recognition (HTR). However, several scenarios and writing variations bring challenges in recognition accuracy, and, to minimize this problem, optical models can be used with language models to assist in decoding text. Thus, with the aim of improving results, dictionaries of characters and words are generated from the dataset and linguistic restrictions are created in the recognition process. In this way, this work proposes the use of spelling correction techniques for text post-processing to achieve better results and eliminate the linguistic dependence between the optical model and the decoding stage. In addition, an encoder–decoder neural network architecture in conjunction with a training methodology are developed and presented to achieve the goal of spelling correction. To demonstrate the effectiveness of this new approach, we conducted an experiment on five datasets of text lines, widely known in the field of HTR, three state-of-the-art Optical Models for text recognition and eight spelling correction techniques, among traditional statistics and current approaches of neural networks in the field of Natural Language Processing (NLP). Finally, our proposed spelling correction model is analyzed statistically through HTR system metrics, reaching an average sentence correction of 54% higher than the state-of-the-art method of decoding in the tested datasets.

Download Full-text

Effective offline handwritten text recognition model based on a sequence-to-sequence approach with CNN–RNN networks

Neural Computing and Applications ◽

10.1007/s00521-020-05556-5 ◽

2021 ◽

Author(s):

R. Geetha ◽

T. Thilagam ◽

T. Padmavathy

Keyword(s):

Text Recognition ◽

Recognition Model ◽

Model Based ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text

Offline Handwritten Text Recognition Using Deep Learning: A Review

Journal of Physics Conference Series ◽

10.1088/1742-6596/1848/1/012015 ◽

2021 ◽

Vol 1848 (1) ◽

pp. 012015

Author(s):

Yintong Wang ◽

Wenjie Xiao ◽

Shuo Li

Keyword(s):

Deep Learning ◽

Text Recognition ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text

ICFHR2014 Competition on Handwritten Text Recognition on Transcriptorium Datasets (HTRtS)

2014 14th International Conference on Frontiers in Handwriting Recognition ◽

10.1109/icfhr.2014.137 ◽

2014 ◽

Cited By ~ 18

Author(s):

Joan Andreu Sanchez ◽

Veronica Romero ◽

Alejandro H. Toselli ◽

Enrique Vidal

Keyword(s):

Text Recognition ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text

OCFormer: A Transformer-Based Model For Arabic Handwritten Text Recognition

2021 International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC) ◽

10.1109/miucc52538.2021.9447608 ◽

2021 ◽

Author(s):

Aly Mostafa ◽

Omar Mohamed ◽

Ali Ashraf ◽

Ahmed Elbehery ◽

Salma Jamal ◽

...

Keyword(s):

Text Recognition ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text

Handwritten Text Recognition using Deep Learning with TensorFlow

International Journal of Engineering Research and ◽

10.17577/ijertv9is050534 ◽

2020 ◽

Vol V9 (05) ◽

Author(s):

Sri. Yugandhar Manchala ◽

Jayaram Kinthali ◽

Kowshik Kotha ◽

Kanithi Santosh Kumar, Jagilinki Jayalaxmi ◽

Keyword(s):

Deep Learning ◽

Text Recognition ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text

MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition

10.1109/cvpr46437.2021.01557 ◽

2021 ◽

Author(s):

Ayan Kumar Bhunia ◽

Shuvozit Ghose ◽

Amandeep Kumar ◽

Pinaki Nath Chowdhury ◽

Aneeshan Sain ◽

...

Keyword(s):

Text Recognition ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text

Fast writer adaptation with style extractor network for handwritten text recognition

Neural Networks ◽

10.1016/j.neunet.2021.12.002 ◽

2021 ◽

Author(s):

Zi-Rui Wang ◽

Jun Du

Keyword(s):

Text Recognition ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text