From object detection to text detection and recognition: A brief evolution history of optical character recognition

CNN-based Rain Reduction in Street View Images

London Imaging Meeting ◽

10.2352/issn.2694-118x.2020.lim-12 ◽

2020 ◽

Vol 2020 (1) ◽

pp. 78-81

Author(s):

Simone Zini ◽

Simone Bianco ◽

Raimondo Schettini

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

State Of The Art ◽

Weather Conditions ◽

Specific Interest ◽

Optical Character ◽

Street View ◽

In The Wild ◽

Bad Weather ◽

Detection And Recognition

Rain removal from pictures taken under bad weather conditions is a challenging task that aims to improve the overall quality and visibility of a scene. The enhanced images usually constitute the input for subsequent Computer Vision tasks such as detection and classification. In this paper, we present a Convolutional Neural Network, based on the Pix2Pix model, for rain streaks removal from images, with specific interest in evaluating the results of the processing operation with respect to the Optical Character Recognition (OCR) task. In particular, we present a way to generate a rainy version of the Street View Text Dataset (R-SVTD) for "text detection and recognition" evaluation in bad weather conditions. Experimental results on this dataset show that our model is able to outperform the state of the art in terms of two commonly used image quality metrics, and that it is capable to improve the performances of an OCR model to detect and recognise text in the wild.

Download Full-text

ConvNet-Based Optical Recognition for Engineering Drawings

Volume 1: 37th Computers and Information in Engineering Conference ◽

10.1115/detc2017-68186 ◽

2017 ◽

Author(s):

Andrew Brock ◽

Theodore Lim ◽

J. M. Ritchie ◽

Nick Weston

Keyword(s):

Object Detection ◽

Character Recognition ◽

Optical Character Recognition ◽

Cross Validation ◽

Convolutional Networks ◽

Optical Character ◽

Engineering Drawings ◽

Machine Analysis ◽

Fold Cross Validation ◽

Optical Recognition

End-to-end machine analysis of engineering document drawings requires a reliable and precise vision frontend capable of localizing and classifying various characters in context. We develop an object detection framework, based on convolutional networks, designed specifically for optical character recognition in engineering drawings. Our approach enables classification and localization on a 10-fold cross-validation of an internal dataset for which other techniques prove unsuitable.

Download Full-text

A Dual-Purpose Refreshable Braille Display Based on Real Time Object Detection and Optical Character Recognition

2019 IEEE International Conference on Signal Processing, Information, Communication & Systems (SPICSCON) ◽

10.1109/spicscon48833.2019.9065110 ◽

2019 ◽

Author(s):

K M Naimul Hassan ◽

Subrata Kumar Biswas ◽

Md Shakil Anwar ◽

Md Shakhrul Iman Siam ◽

Celia Shahnaz

Keyword(s):

Object Detection ◽

Real Time ◽

Character Recognition ◽

Optical Character Recognition ◽

Dual Purpose ◽

Optical Character ◽

Braille Display

Download Full-text

Video text detection and segmentation for optical character recognition

Multimedia Systems ◽

10.1007/s00530-004-0157-0 ◽

2005 ◽

Vol 10 (3) ◽

pp. 261-272 ◽

Cited By ~ 29

Author(s):

Chong-Wah Ngo ◽

Chi-Kwong Chan

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Text Detection ◽

Optical Character ◽

Video Text Detection

Download Full-text

YOLOv3-Tesseract model for improved intelligent form recognition

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813666191204141610 ◽

2019 ◽

Vol 13 ◽

Author(s):

Zhang Yun-An ◽

Pan Ziheng ◽

Dui Hongyan ◽

Bai Guanghan

Keyword(s):

Big Data ◽

Object Detection ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition Task ◽

Experimental Simulation ◽

Form Recognition ◽

Optical Character

Background: YOLOv3-Tesseract is widely used for the intelligent form recognition because it exhibits several attractive properties. It is important to improve the accuracy and efficiency of the optical character recognition. Methods: The YOLOv3 exhibits the classification advantages for the object detection. Tesseract can effectively recognize regular characters in the field of the optical character recognition. In this study, a YOLOv3 and Tesseract-based model of improved intelligent form recognition is proposed. Results: First, YOLOv3 is trained to detect the position of the text in the table and to subsequently segment text blocks. Second, Tesseract is used to individually detect separated text blocks and combine YOLOv3 and Tesseract to achieve the goal of table character recognition. Conclusion: Based on the Tianchi big data, experimental simulation is used to demonstrate the proposed method. The YOLOv3-Tesseract model is trained and tested to effectively accomplish the recognition task.

Download Full-text

Scribble-Scrabble Genius

The Connected Condition ◽

10.11126/stanford/9781503610040.003.0001 ◽

2019 ◽

pp. 37-73

Author(s):

Yohei Igarashi

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Writing System ◽

Optical Character ◽

History Of ◽

Recognition Software

Although Coleridge is mostly known for being a copious talker who was impossible to transcribe, this chapter recovers Coleridge’s role as transcriber, theorist of transcription practices, and inventor of his own idiosyncratic shorthand. Considering Coleridge’s time as a parliamentary reporter, his self-reflexive notebook entries, and the history of stenography, this chapter posits that Coleridge pursued an efficient writing system to record not speech but the flow of his own silent thoughts. Also discussing today’s optical character recognition software and the shorthand effect (when letters or words uncannily become illegible shapes, and non-linguistic shapes come to look like linguistic signs), this chapter culminates in a reading of the “signs” in “The Rime of the Ancient Mariner.”

Download Full-text

Optical Character Recognition for scene text detection, mining and recognition

2013 IEEE International Conference on Computational Intelligence and Computing Research ◽

10.1109/iccic.2013.6724165 ◽

2013 ◽

Cited By ~ 3

Author(s):

N. Nathiya ◽

K. Pradeepa

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Text Detection ◽

Optical Character ◽

Scene Text Detection ◽

Scene Text

Download Full-text

Real-time Automated Detection and Recognition of Nigerian License Plates via Deep Learning Single Shot Detection and Optical Character Recognition

Computer and Information Science ◽

10.5539/cis.v14n4p11 ◽

2021 ◽

Vol 14 (4) ◽

pp. 11

Author(s):

Kayode David Adedayo ◽

Ayomide Oluwaseyi Agunloye

Keyword(s):

Real Time ◽

Character Recognition ◽

Optical Character Recognition ◽

Character Segmentation ◽

Detection Accuracy ◽

License Plate ◽

Single Shot ◽

Optical Character ◽

License Plate Detection ◽

Detection And Recognition

License plate detection and recognition are critical components of the development of a connected Intelligent transportation system, but are underused in developing countries because to the associated costs. Existing license plate detection and recognition systems with high accuracy require the usage of Graphical Processing Units (GPU), which may be difficult to come by in developing nations. Single stage detectors and commercial optical character recognition engines, on the other hand, are less computationally expensive and can achieve acceptable detection and recognition accuracy without the use of a GPU. In this work, a pretrained SSD model and a tesseract tessdata-fast traineddata were fine-tuned on a dataset of more than 2,000 images of vehicles with license plate. These models were combined with a unique image preprocessing algorithm for character segmentation and tested using a general-purpose personal computer on a new collection of 200 automobiles with license plate photos. On this testing set, the plate detection system achieved a detection accuracy of 99.5 % at an IOU threshold of 0.45 while the OCR engine successfully recognized all characters on 150 license plates, one character incorrectly on 24 license plates, and two or more incorrect characters on 26 license plates. The detection procedure took an average of 80 milliseconds, while the character segmentation and identification stages took an average of 95 milliseconds, resulting in an average processing time of 175 milliseconds per image, or 6 photos per second. The obtained results are suitable for real-time traffic applications.

Download Full-text

SELECTION TECHNIQUE FOR MULTIPLE OUTPUTS OF OPTICAL CHARACTER RECOGNITION

Eurasian Journal of Mathematical and Computer Applications ◽

10.32523/2306-6172-2020-8-2-41-51 ◽

2020 ◽

Vol 8 (2) ◽

pp. 41-51

Author(s):

I.Q. Habeeb ◽

Z.Q. Al-Zaydi ◽

H.N. Abdulkhudhur

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Selection Technique ◽

Multiple Outputs ◽

Optical Character

Download Full-text

A Structured Method for the Recognition of Complex Historical Tables

History and Computing ◽

10.3366/hac.1997.9.1-3.58 ◽

1997 ◽

Vol 9 (1-3) ◽

pp. 58-77

Author(s):

Vitaly Kliatskine ◽

Eugene Shchepin ◽

Gunnar Thorvaldsen ◽

Konstantin Zingerman ◽

Valery Lazarev

Keyword(s):

Nineteenth Century ◽

Character Recognition ◽

Optical Character Recognition ◽

Complex Structure ◽

Source Material ◽

Historical Sources ◽

Tax Assessment ◽

Optical Character ◽

Algorithmic Model ◽

Machine Readable

In principle, printed source material should be made machine-readable with systems for Optical Character Recognition, rather than being typed once more. Offthe-shelf commercial OCR programs tend, however, to be inadequate for lists with a complex layout. The tax assessment lists that assess most nineteenth century farms in Norway, constitute one example among a series of valuable sources which can only be interpreted successfully with specially designed OCR software. This paper considers the problems involved in the recognition of material with a complex table structure, outlining a new algorithmic model based on ‘linked hierarchies’. Within the scope of this model, a variety of tables and layouts can be described and recognized. The ‘linked hierarchies’ model has been implemented in the ‘CRIPT’ OCR software system, which successfully reads tables with a complex structure from several different historical sources.

Download Full-text