A robust video text extraction method for character recognition

This paper proposes a new feature extraction method for off-line recognition of Myanmar printed documents. One of the most important factors to achieve high recognition performance in Optical Character Recognition (OCR) system is the selection of the feature extraction methods. Different types of existing OCR systems used various feature extraction methods because of the diversity of the scripts’ natures. One major contribution of the work in this paper is the design of logically rigorous coding based features. To show the effectiveness of the proposed method, this paper assumed the documents are successfully segmented into characters and extracted features from these isolated Myanmar characters. These features are extracted using structural analysis of the Myanmar scripts. The experimental results have been carried out using the Support Vector Machine (SVM) classifier and compare the pervious proposed feature extraction method.

Download Full-text

Multi-Oriented Text Extraction in Stylistic Documents

International Journal of Image and Graphics ◽

10.1142/s0219467815500023 ◽

2015 ◽

Vol 15 (01) ◽

pp. 1550002

Author(s):

Brij Mohan Singh ◽

Rahul Sharma ◽

Debashis Ghosh ◽

Ankush Mittal

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Morphological Operations ◽

Document Images ◽

Font Size ◽

Text Extraction ◽

Optical Character ◽

Engineering Drawings ◽

Flood Fill

In many documents such as maps, engineering drawings and artistic documents, etc. there exist many printed as well as handwritten materials where text regions and text-lines are not parallel to each other, curved in nature, and having various types of text such as different font size, text and non-text areas lying close to each other and non-straight, skewed and warped text-lines. Optical character recognition (OCR) systems available commercially such as ABYY fine reader and Free OCR, are not capable of handling different ranges of stylistic document images containing curved, multi-oriented, and stylish font text-lines. Extraction of individual text-lines and words from these documents is generally not straight forward. Most of the segmentation works reported is on simple documents but still it remains a highly challenging task to implement an OCR that works under all possible conditions and gives highly accurate results, especially in the case of stylistic documents. This paper presents dilation and flood fill morphological operations based approach that extracts multi-oriented text-lines and words from the complex layout or stylistic document images in the subsequent stages. The segmentation results obtained from our method proves to be superior over the standard profiling-based method.

Download Full-text

Robust Character Recognition Using Adaptive Feature Extraction Method

IEICE Transactions on Information and Systems ◽

10.1587/transinf.e93.d.125 ◽

2010 ◽

Vol E93-D (1) ◽

pp. 125-133

Author(s):

Minoru MORI ◽

Minako SAWAKI ◽

Junji YAMATO

Keyword(s):

Feature Extraction ◽

Character Recognition ◽

Extraction Method ◽

Feature Extraction Method ◽

Adaptive Feature Extraction

Download Full-text

Introduction of Balinese Script Handwriting Using Zoning and Multilayer Perceptron

ACSIE (International Journal of Application Computer Science and Informatic Engineering) ◽

10.33173/acsie.34 ◽

2019 ◽

Vol 1 (1) ◽

pp. 1-10

Author(s):

I Komang Arya Ganda Wiguna ◽

Agus Muliantara

Keyword(s):

Character Recognition ◽

Extraction Method ◽

Handwriting Recognition ◽

The Other ◽

Test Results ◽

The Difference ◽

Hidden Layer ◽

The Many ◽

Offline Testing ◽

Value Of Learning

Handwriting identification is one out of the many research ever conducted. In its development, the handwriting can be written in real time by the user by using the mouse (online character recognition). Various studies on the traditional character handwriting recognition continue to be developed. One of them is the recognition of the Balinese characters. Balinese characters have their own unique characters compared with the other regions. The difference between the shapes of the characters with the other characters are quite similar, or there are some characters that can only be distinguished by a small sketch or doodle.This study uses Artificial Neural Network with Backpropagation algorithm to perform the Balinese characters recognition and zoning as a method of feature extraction. In a variation of the extraction method, the characteristics used are Image Centroid and Zone (ICZ), Zone Centroid and Zone (ZCZ) and normalization of features. Of the three methods, it will be determined the best method used in the Balinese characters recognition.From the test results of the extraction method, the combined characteristics of the ICZ, ZCZ and normalization of features were the most effective to be used for the recognition of the Balinese characters. The level of accuracy obtained from the results of the online testing was 71,28% and 72,31% for offline testing, with parameters of Backpropagation, which used the value of learning rate of 0,03, a momentum value of 0,5 and the number of neurons in the hidden layer of 130.

Download Full-text

A Detailed Review on Text Extraction Using Optical Character Recognition

ICT Analysis and Applications - Lecture Notes in Networks and Systems ◽

10.1007/978-981-16-5655-2_69 ◽

2022 ◽

pp. 719-728

Author(s):

Chhanam Thorat ◽

Aishwarya Bhat ◽

Padmaja Sawant ◽

Isha Bartakke ◽

Swati Shirsath

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Detailed Review ◽

Text Extraction ◽

Optical Character

Download Full-text

Video Text Extraction from Images for Character Recognition

2006 Canadian Conference on Electrical and Computer Engineering ◽

10.1109/ccece.2006.277408 ◽

2006 ◽

Cited By ~ 1

Author(s):

Basavaraj Amarapur ◽

Nagaraj Patil

Keyword(s):

Character Recognition ◽

Text Extraction

Download Full-text

A dictionary learning and KPCA-based feature extraction method for off-line handwritten Tibetan character recognition

Optik ◽

10.1016/j.ijleo.2015.07.144 ◽

2015 ◽

Vol 126 (23) ◽

pp. 3795-3800 ◽

Cited By ~ 3

Author(s):

He-ming Huang ◽

Fei-peng Da

Keyword(s):

Feature Extraction ◽

Dictionary Learning ◽

Character Recognition ◽

Extraction Method ◽

Feature Extraction Method

Download Full-text

A robust video text extraction method based on text traversing line and stroke connectivity

2008 9th International Conference on Signal Processing ◽

10.1109/icosp.2008.4697297 ◽

2008 ◽

Cited By ~ 2

Author(s):

Peng Tianqiang ◽

Tian Pohuang ◽

Li Bicheng

Keyword(s):

Extraction Method ◽

Text Extraction

Download Full-text