Scene Text Detection with Polygon Offsetting and Border Augmentation

Thananop Kobchaisawat; Thanarat H. Chalidabhongse; Shin’ichi Satoh

doi:10.3390/electronics9010117

Scene Text Detection with Polygon Offsetting and Border Augmentation

Electronics ◽

10.3390/electronics9010117 ◽

2020 ◽

Vol 9 (1) ◽

pp. 117 ◽

Cited By ~ 3

Author(s):

Thananop Kobchaisawat ◽

Thanarat H. Chalidabhongse ◽

Shin’ichi Satoh

Keyword(s):

Object Detection ◽

Global Illumination ◽

Experimental Results ◽

Text Recognition ◽

Text Localization ◽

Crucial Step ◽

Scene Text Detection ◽

Wide Range ◽

Scene Text ◽

Scene Text Recognition

Scene text localization is a very crucial step in the issue of scene text recognition. The major challenges—such as how there are various sizes, shapes, unpredictable orientations, a wide range of colors and styles, occlusion, and local and global illumination variations—make the problem different from generic object detection. Unlike existing scene text localization methods, here we present a segmentation-based text detector which can detect an arbitrary shaped scene text by using polygon offsetting, combined with the border augmentation. This technique better distinguishes contiguous and arbitrary shaped text instances from nearby non-text regions. The quantitative experimental results on public benchmarks, ICDAR2015, ICDAR2017-MLT, ICDAR2019-MLT, and Total-Text datasets demonstrate the performance and robustness of our proposed method, compared to previous approaches which have been proposed.

Download Full-text

Text Recognition in the Wild

ACM Computing Surveys ◽

10.1145/3440756 ◽

2021 ◽

Vol 54 (2) ◽

pp. 1-35

Author(s):

Xiaoxue Chen ◽

Lianwen Jin ◽

Yuanzhi Zhu ◽

Canjie Luo ◽

Tianwei Wang

Keyword(s):

Text Recognition ◽

Future Research ◽

Natural Scenes ◽

Wide Range ◽

Scene Text ◽

In The Wild ◽

History Of ◽

Active Research ◽

Future Work ◽

Scene Text Recognition

The history of text can be traced back over thousands of years. Rich and precise semantic information carried by text is important in a wide range of vision-based application scenarios. Therefore, text recognition in natural scenes has been an active research topic in computer vision and pattern recognition. In recent years, with the rise and development of deep learning, numerous methods have shown promising results in terms of innovation, practicality, and efficiency. This article aims to (1) summarize the fundamental problems and the state-of-the-art associated with scene text recognition, (2) introduce new insights and ideas, (3) provide a comprehensive review of publicly available resources, and (4) point out directions for future work. In summary, this literature review attempts to present an entire picture of the field of scene text recognition. It provides a comprehensive reference for people entering this field and could be helpful in inspiring future research. Related resources are available at our GitHub repository: https://github.com/HCIILAB/Scene-Text-Recognition.

Download Full-text

Multi-granularity Deep Local Representations for Irregular Scene Text Recognition

ACM/IMS Transactions on Data Science ◽

10.1145/3446971 ◽

2021 ◽

Vol 2 (2) ◽

pp. 1-18

Author(s):

Hongchao Gao ◽

Yujia Li ◽

Jiao Dai ◽

Xi Wang ◽

Jizhong Han ◽

...

Keyword(s):

State Of The Art ◽

Visual Representation ◽

Text Recognition ◽

Natural Scene ◽

Attention Network ◽

Training Time ◽

Scene Text ◽

Benchmark Datasets ◽

Local Representations ◽

Scene Text Recognition

Recognizing irregular text from natural scene images is challenging due to the unconstrained appearance of text, such as curvature, orientation, and distortion. Recent recognition networks regard this task as a text sequence labeling problem and most networks capture the sequence only from a single-granularity visual representation, which to some extent limits the performance of recognition. In this article, we propose a hierarchical attention network to capture multi-granularity deep local representations for recognizing irregular scene text. It consists of several hierarchical attention blocks, and each block contains a Local Visual Representation Module (LVRM) and a Decoder Module (DM). Based on the hierarchical attention network, we propose a scene text recognition network. The extensive experiments show that our proposed network achieves the state-of-the-art performance on several benchmark datasets including IIIT-5K, SVT, CUTE, SVT-Perspective, and ICDAR datasets under shorter training time.

Download Full-text

Arabic Cursive Text Recognition from Natural Scene Images

Applied Sciences ◽

10.3390/app9020236 ◽

2019 ◽

Vol 9 (2) ◽

pp. 236 ◽

Cited By ~ 6

Author(s):

Saad Ahmed ◽

Saeeda Naz ◽

Muhammad Razzak ◽

Rubiyah Yusof

Keyword(s):

Recognition System ◽

Document Image ◽

Text Recognition ◽

Chinese Script ◽

Challenging Problem ◽

Future Directions ◽

Scene Text ◽

Comprehensive Survey ◽

Recognition Systems ◽

Scene Text Recognition

This paper presents a comprehensive survey on Arabic cursive scene text recognition. The recent years’ publications in this field have witnessed the interest shift of document image analysis researchers from recognition of optical characters to recognition of characters appearing in natural images. Scene text recognition is a challenging problem due to the text having variations in font styles, size, alignment, orientation, reflection, illumination change, blurriness and complex background. Among cursive scripts, Arabic scene text recognition is contemplated as a more challenging problem due to joined writing, same character variations, a large number of ligatures, the number of baselines, etc. Surveys on the Latin and Chinese script-based scene text recognition system can be found, but the Arabic like scene text recognition problem is yet to be addressed in detail. In this manuscript, a description is provided to highlight some of the latest techniques presented for text classification. The presented techniques following a deep learning architecture are equally suitable for the development of Arabic cursive scene text recognition systems. The issues pertaining to text localization and feature extraction are also presented. Moreover, this article emphasizes the importance of having benchmark cursive scene text dataset. Based on the discussion, future directions are outlined, some of which may provide insight about cursive scene text to researchers.

Download Full-text

A discriminative semi-Markov model for robust scene text recognition

2008 19th International Conference on Pattern Recognition ◽

10.1109/icpr.2008.4761818 ◽

2008 ◽

Cited By ~ 3

Author(s):

Jerod J. Weinman ◽

Erik Learned-Miller ◽

Allen Hanson

Keyword(s):

Markov Model ◽

Text Recognition ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Random Projected Convolutional Feature for Scene Text Recognition

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) ◽

10.1109/icfhr.2016.0036 ◽

2016 ◽

Cited By ~ 2

Author(s):

Rui Wu ◽

Shuli Yang ◽

Dawei Leng ◽

Zhenbo Luo ◽

Yunhong Wang

Keyword(s):

Text Recognition ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

A Machine Learning Approach to Hypothesis Decoding in Scene Text Recognition

Computer Vision - ACCV 2014 Workshops - Lecture Notes in Computer Science ◽

10.1007/978-3-319-16631-5_13 ◽

2015 ◽

pp. 169-180

Author(s):

Jindřich Libovický ◽

Lukáš Neumann ◽

Pavel Pecina ◽

Jiří Matas

Keyword(s):

Machine Learning ◽

Text Recognition ◽

Learning Approach ◽

Machine Learning Approach ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling

2018 ACM Multimedia Conference on Multimedia Conference - MM '18 ◽

10.1145/3240508.3240571 ◽

2018 ◽

Cited By ~ 17

Author(s):

Shancheng Fang ◽

Hongtao Xie ◽

Zheng-Jun Zha ◽

Nannan Sun ◽

Jianlong Tan ◽

...

Keyword(s):

Text Recognition ◽

Sequence Modeling ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Ethiopic Natural Scene Text Recognition Using Deep Learning Approaches

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering - Advances of Science and Technology ◽

10.1007/978-3-030-43690-2_36 ◽

2020 ◽

pp. 502-511

Author(s):

Direselign Addis ◽

Chuan-Ming Liu ◽

Van-Dai Ta

Keyword(s):

Deep Learning ◽

Text Recognition ◽

Learning Approaches ◽

Natural Scene ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Scene text recognition algorithm based on faster RCNN

2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS) ◽

10.1109/eiis.2017.8298720 ◽

2017 ◽

Cited By ~ 2

Author(s):

Boya Wang ◽

Jianqing Xu ◽

Junbao Li ◽

Cong Hu ◽

Jeng-Shyang Pan

Keyword(s):

Recognition Algorithm ◽

Text Recognition ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Accurate Scene Text Recognition Based on Recurrent Neural Network

Computer Vision -- ACCV 2014 - Lecture Notes in Computer Science ◽

10.1007/978-3-319-16865-4_3 ◽

2015 ◽

pp. 35-48 ◽

Cited By ~ 27

Author(s):

Bolan Su ◽

Shijian Lu

Keyword(s):

Neural Network ◽

Recurrent Neural Network ◽

Text Recognition ◽

Scene Text ◽

Scene Text Recognition

Download Full-text