A deep learning-based character recognition system for multimedia documents

Author(s):  
Usha Yadav ◽  
Satya Verma ◽  
Deepak Kumar Xaxa ◽  
Chandrakant Mahobiya
2020 ◽  
Vol 17 (3) ◽  
pp. 299-305 ◽  
Author(s):  
Riaz Ahmad ◽  
Saeeda Naz ◽  
Muhammad Afzal ◽  
Sheikh Rashid ◽  
Marcus Liwicki ◽  
...  

This paper presents a deep learning benchmark on a complex dataset known as KFUPM Handwritten Arabic TexT (KHATT). The KHATT dataset consists of complex patterns of handwritten Arabic text-lines. The paper makes three main contributions: (1) pre-processing, (2) a deep learning based approach, and (3) data augmentation. The pre-processing step includes pruning extra white space and de-skewing the skewed text-lines. We deploy a deep learning approach based on Multi-Dimensional Long Short-Term Memory (MDLSTM) networks and Connectionist Temporal Classification (CTC). The MDLSTM has the advantage of scanning the Arabic text-lines in all directions (horizontal and vertical) to cover dots, diacritics, strokes and fine inflections. Data augmentation combined with the deep learning approach yields a promising improvement, raising the Character Recognition (CR) rate from the 75.08% baseline to 80.02%.
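To illustrate the decoding side of a CTC-trained network like the one described, the sketch below shows best-path (greedy) CTC decoding: take the most probable label at each timestep, merge repeats, and drop blanks. This is a minimal illustration, not the paper's implementation; the blank index and toy probabilities are assumptions.

```python
BLANK = 0  # CTC blank label (index 0 is a common convention, assumed here)

def ctc_greedy_decode(timestep_probs, blank=BLANK):
    """Pick the argmax label per timestep, merge repeats, remove blanks."""
    best_path = [max(range(len(p)), key=p.__getitem__) for p in timestep_probs]
    decoded, prev = [], None
    for label in best_path:
        if label != prev and label != blank:
            decoded.append(label)
        prev = label
    return decoded

# Toy per-timestep distributions over {blank=0, 'a'=1, 'b'=2}
probs = [
    [0.1, 0.8, 0.1],    # 'a'
    [0.1, 0.8, 0.1],    # 'a' again -> merged with the previous timestep
    [0.9, 0.05, 0.05],  # blank -> ends the run of 'a'
    [0.1, 0.1, 0.8],    # 'b'
]
print(ctc_greedy_decode(probs))  # → [1, 2]
```

Full CTC training additionally marginalizes over all alignments; greedy decoding is only the cheapest inference strategy.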


Author(s):  
Oyeniran Oluwashina Akinloye ◽  
Oyebode Ebenezer Olukunle

Numerous works have been proposed and implemented for the computerization of various human languages; nevertheless, only minuscule effort has been made to put Yorùbá handwritten characters on the map of Optical Character Recognition. This study presents a novel technique for developing a Yorùbá alphabet recognition system using deep learning. The developed model was implemented in the Matlab R2018a environment using the developed framework, with 10,500 dataset samples used for training and 2,100 samples used for testing. Training was conducted over 30 epochs at 164 iterations per epoch, giving a total of 4,920 iterations, and the training period was estimated at 11,296 minutes 41 seconds. The model yielded a network accuracy of 100%, while the accuracy on the test set was 97.97%, with an F1 score of 0.9800, a precision of 0.9803 and a recall of 0.9797.
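The reported F1 score is consistent with the stated precision and recall, since F1 is their harmonic mean. A quick check (the function name is ours; the values are from the abstract):

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)

# Precision and recall as reported in the abstract
print(round(f1_score(0.9803, 0.9797), 4))  # → 0.98
```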


Optical Character Recognition (OCR) is a computer vision technique that recognizes text present in any form of image, such as scanned documents and photos. In recent years, OCR has improved significantly in the precise recognition of text from images. Though many applications already exist, we explore the domain of deep learning and build an optical character recognition system using deep learning architectures. In a later stage, this OCR system is developed into a web application that exposes its functionality. The approach is to implement a hybrid model containing three components: the Convolutional Neural Network (CNN) component, the Recurrent Neural Network (RNN) component, and the transcription component, which decodes the output of the RNN into the corresponding label sequence. Solving text recognition problems requires the CNN to extract feature maps from images. These sequences of feature vectors undergo sequence modeling in the RNN component, which predicts label distributions that are then translated using the Connectionist Temporal Classification technique in the transcription layer. The implemented model acts as the backend of a web application developed using the Flask web framework. The complete application is then containerized into an image using Docker, which eases deployment of the application, together with its environment, on any system.
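The containerization step described above could be captured in a Dockerfile along these lines. This is a generic sketch, not the authors' configuration: the file names, port, and Python version are all assumptions.

```dockerfile
# Hypothetical Dockerfile for containerizing a Flask OCR backend.
# app.py, requirements.txt, and port 5000 are illustrative assumptions.
FROM python:3.9-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
EXPOSE 5000
CMD ["python", "app.py"]
```

Building the image (`docker build -t ocr-app .`) bundles the model, the Flask server, and their dependencies into one portable artifact.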


Author(s):  
Kedar R ◽  
Kaviraj A ◽  
Manish R ◽  
Niteesh B ◽  
Suthir S

Technology is growing in our day-to-day lives to satisfy human needs, and the system we propose makes this particular human job easier. Here the YOLO algorithm, a deep learning object-detection architecture, is used to detect the number plate of a vehicle. After detecting the number plate, the system converts the vehicle number to text format and then checks it against a database to see whether the vehicle is authorized to enter the premises. This system can be implemented in highly restricted areas such as military areas, government organizations, Parliament, etc. The proposed system has six stages: image capture, search for black pixels, image filtering, plate-region extraction, character extraction, and OCR for character recognition. The alphanumeric characters are identified using the OCR algorithm; the result obtained from the YOLO pipeline is then compared with the database to check whether the vehicle is allowed to enter the premises. The proposed system is simulated and implemented in Python, and it was also tested on real-time images to evaluate performance.
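The final authorization step, comparing the OCR-recognized plate string against a database of permitted vehicles, can be sketched as follows. This is an illustrative sketch using Python's built-in sqlite3 module; the table, column, and plate strings are invented for the example, not taken from the paper.

```python
import sqlite3

# In-memory allowlist of authorized plates (illustrative data)
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE authorized (plate TEXT PRIMARY KEY)")
conn.executemany("INSERT INTO authorized VALUES (?)",
                 [("KA01AB1234",), ("MH12XY9876",)])

def is_authorized(plate):
    """Return True if the OCR-recognized plate may enter the premises."""
    row = conn.execute(
        "SELECT 1 FROM authorized WHERE plate = ?", (plate,)
    ).fetchone()
    return row is not None

print(is_authorized("KA01AB1234"))  # → True
print(is_authorized("DL05ZZ0000"))  # → False
```

In a deployed system the in-memory table would be replaced by a persistent database, and the plate string would come from the YOLO + OCR stages.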


Author(s):  
Qiaokang Liang ◽  
◽  
Qiao Ge ◽  
Wei Sun ◽  
Dan Zhang ◽  
...  

In the food and beverage industry, existing recognition of code characters on the surface of complex packaging usually suffers from low accuracy and low speed. This work presents an efficient and accurate inkjet code recognition system based on a combination of deep learning and traditional image processing methods. The proposed system consists of three sequential modules: character-region extraction by a modified YOLOv3-tiny network; character processing by traditional image processing methods such as binarization and a modified character projection segmentation; and character recognition by a convolutional recurrent neural network (CRNN) model based on a modified version of MobileNetV3. In this system, only a small amount of labeled data was prepared, and an effective character data generator is designed to randomly generate different experimental data for CRNN model training. To the best of our knowledge, this report is the first to describe deep learning applied to the recognition of codes on complex backgrounds in a real-life industrial application. Experimental results verify the accuracy and effectiveness of the proposed model, demonstrating a recognition accuracy of 0.986 and a processing speed of 100 ms per bottle in the end-to-end character recognition system.
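A character data generator of the kind described might randomly compose inkjet-style code strings, which would then be rendered as training images. The sketch below generates only the text component; the date-plus-batch format is our assumption, not the paper's actual code layout.

```python
import random
import string

def random_code(rng=random):
    """Generate a synthetic inkjet-style code string, e.g. '2021/03/09 B4821'."""
    date = "%04d/%02d/%02d" % (rng.randint(2018, 2025),
                               rng.randint(1, 12),
                               rng.randint(1, 28))
    batch = "B" + "".join(rng.choice(string.digits) for _ in range(4))
    return date + " " + batch

sample = random_code()
print(sample)  # e.g. '2021/03/09 B4821'
```

Pairing each generated string with a rendered image (random font, blur, and background) is what lets a CRNN train with little hand-labeled data.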


Author(s):  
Manish M. Kayasth ◽  
Bharat C. Patel

The entire character recognition system is logically divided into different sections such as scanning, pre-processing, classification, processing, and post-processing. In the targeted system, the scanned image is first passed through pre-processing modules, then feature extraction and classification, in order to achieve a high recognition rate. This paper focuses mainly on feature extraction and classification techniques, the methodologies that play an important role in identifying offline handwritten characters, specifically in the Gujarati language. Feature extraction provides methods with which characters can be identified uniquely and with a high degree of accuracy; it helps to find the shape contained in the pattern. Several techniques are available for feature extraction and classification; however, the selection of an appropriate technique based on its input decides the degree of recognition accuracy.
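One widely used feature-extraction technique of the kind the abstract discusses is zoning: splitting the binary character image into a grid of zones and using the per-zone foreground-pixel density as the feature vector. The sketch below is a generic illustration (the 4x4 image and 2x2 grid are toy assumptions), not the paper's specific method.

```python
def zoning_features(image, zones=2):
    """Return per-zone foreground-pixel densities for a binary image."""
    h, w = len(image), len(image[0])
    zh, zw = h // zones, w // zones  # zone height and width
    features = []
    for zi in range(zones):
        for zj in range(zones):
            total = sum(image[r][c]
                        for r in range(zi * zh, (zi + 1) * zh)
                        for c in range(zj * zw, (zj + 1) * zw))
            features.append(total / (zh * zw))
    return features

# Toy 4x4 binary character image (1 = ink, 0 = background)
img = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 1],
]
print(zoning_features(img))  # → [1.0, 0.0, 0.0, 0.5]
```

The resulting fixed-length vector can be fed to any classifier; the choice of grid size trades spatial detail against robustness to writing variation.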

