Video Text Extraction from Images for Character Recognition

Multi-Oriented Text Extraction in Stylistic Documents

International Journal of Image and Graphics ◽

10.1142/s0219467815500023 ◽

2015 ◽

Vol 15 (01) ◽

pp. 1550002

Author(s):

Brij Mohan Singh ◽

Rahul Sharma ◽

Debashis Ghosh ◽

Ankush Mittal

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Morphological Operations ◽

Document Images ◽

Font Size ◽

Text Extraction ◽

Optical Character ◽

Engineering Drawings ◽

Flood Fill

In many documents such as maps, engineering drawings and artistic documents, etc. there exist many printed as well as handwritten materials where text regions and text-lines are not parallel to each other, curved in nature, and having various types of text such as different font size, text and non-text areas lying close to each other and non-straight, skewed and warped text-lines. Optical character recognition (OCR) systems available commercially such as ABYY fine reader and Free OCR, are not capable of handling different ranges of stylistic document images containing curved, multi-oriented, and stylish font text-lines. Extraction of individual text-lines and words from these documents is generally not straight forward. Most of the segmentation works reported is on simple documents but still it remains a highly challenging task to implement an OCR that works under all possible conditions and gives highly accurate results, especially in the case of stylistic documents. This paper presents dilation and flood fill morphological operations based approach that extracts multi-oriented text-lines and words from the complex layout or stylistic document images in the subsequent stages. The segmentation results obtained from our method proves to be superior over the standard profiling-based method.

Download Full-text

A video text extraction method for character recognition

Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318) ◽

10.1109/icdar.1999.791716 ◽

1999 ◽

Cited By ~ 17

Author(s):

O. Hori

Keyword(s):

Character Recognition ◽

Extraction Method ◽

Text Extraction

Download Full-text

An Improved Scene Text Extraction Method Using Conditional Random Field and Optical Character Recognition

2011 International Conference on Document Analysis and Recognition ◽

10.1109/icdar.2011.148 ◽

2011 ◽

Cited By ~ 20

Author(s):

Hongwei Zhang ◽

Changsong Liu ◽

Cheng Yang ◽

Xiaoqing Ding ◽

KongQiao Wang

Keyword(s):

Random Field ◽

Character Recognition ◽

Optical Character Recognition ◽

Extraction Method ◽

Conditional Random Field ◽

Text Extraction ◽

Optical Character ◽

Scene Text

Download Full-text

A Detailed Review on Text Extraction Using Optical Character Recognition

ICT Analysis and Applications - Lecture Notes in Networks and Systems ◽

10.1007/978-981-16-5655-2_69 ◽

2022 ◽

pp. 719-728

Author(s):

Chhanam Thorat ◽

Aishwarya Bhat ◽

Padmaja Sawant ◽

Isha Bartakke ◽

Swati Shirsath

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Detailed Review ◽

Text Extraction ◽

Optical Character

Download Full-text

A robust video text extraction method for character recognition

Systems and Computers in Japan ◽

10.1002/scj.10148 ◽

2005 ◽

Vol 36 (9) ◽

pp. 87-96 ◽

Cited By ~ 2

Author(s):

Osamu Hori ◽

Takeshi Mita

Keyword(s):

Character Recognition ◽

Extraction Method ◽

Text Extraction

Download Full-text

Development and customization of in-house developed OCR and its evaluation

The Electronic Library ◽

10.1108/el-01-2018-0011 ◽

2018 ◽

Vol 36 (5) ◽

pp. 766-781

Author(s):

Rajeswari S. ◽

Sai Baba Magapu

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Web Based ◽

Content Type ◽

Text Extraction ◽

Manual Intervention ◽

Key Phrases ◽

Microsoft Office ◽

Read Accuracy ◽

Scanned Documents

Purpose The purpose of this paper is to develop a text extraction tool for scanned documents that would extract text and build the keywords corpus and key phrases corpus for the document without manual intervention. Design/methodology/approach For text extraction from scanned documents, a Web-based optical character recognition (OCR) tool was developed. OCR is a well-established technology, so to develop the OCR, Microsoft Office document imaging tools were used. To account for the commonly encountered problem of skew being introduced, a method to detect and correct the skew introduced in the scanned documents was developed and integrated with the tool. The OCR tool was customized to build keywords and key phrases corpus for every document. Findings The developed tool was evaluated using a 100 document corpus to test the various properties of OCR. The tool had above 99 per cent word read accuracy for text only image documents. The customization of the OCR was tested with samples of Microfiches, sample of Journal pages from back volumes and samples from newspaper clips and the results are discussed in the summary. The tool was found to be useful for text extraction and processing. Social implications The scanned documents are converted to keywords and key phrases corpus. The tool could be used to build metadata for scanned documents without manual intervention. Originality/value The tool is used to convert unstructured data (in the form of image documents) to structured data (the document is converted into keywords, and key phrases database). In addition, the image document is converted to editable and searchable document.

Download Full-text

SELECTION TECHNIQUE FOR MULTIPLE OUTPUTS OF OPTICAL CHARACTER RECOGNITION

Eurasian Journal of Mathematical and Computer Applications ◽

10.32523/2306-6172-2020-8-2-41-51 ◽

2020 ◽

Vol 8 (2) ◽

pp. 41-51

Author(s):

I.Q. Habeeb ◽

Z.Q. Al-Zaydi ◽

H.N. Abdulkhudhur

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Selection Technique ◽

Multiple Outputs ◽

Optical Character

Download Full-text

A Structured Method for the Recognition of Complex Historical Tables

History and Computing ◽

10.3366/hac.1997.9.1-3.58 ◽

1997 ◽

Vol 9 (1-3) ◽

pp. 58-77

Author(s):

Vitaly Kliatskine ◽

Eugene Shchepin ◽

Gunnar Thorvaldsen ◽

Konstantin Zingerman ◽

Valery Lazarev

Keyword(s):

Nineteenth Century ◽

Character Recognition ◽

Optical Character Recognition ◽

Complex Structure ◽

Source Material ◽

Historical Sources ◽

Tax Assessment ◽

Optical Character ◽

Algorithmic Model ◽

Machine Readable

In principle, printed source material should be made machine-readable with systems for Optical Character Recognition, rather than being typed once more. Offthe-shelf commercial OCR programs tend, however, to be inadequate for lists with a complex layout. The tax assessment lists that assess most nineteenth century farms in Norway, constitute one example among a series of valuable sources which can only be interpreted successfully with specially designed OCR software. This paper considers the problems involved in the recognition of material with a complex table structure, outlining a new algorithmic model based on ‘linked hierarchies’. Within the scope of this model, a variety of tables and layouts can be described and recognized. The ‘linked hierarchies’ model has been implemented in the ‘CRIPT’ OCR software system, which successfully reads tables with a complex structure from several different historical sources.

Download Full-text