OCR with the Deep CNN Model for Ligature Script-Based Languages like Manchu

Scientific Programming ◽

10.1155/2021/5520338 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Diandian Zhang ◽

Yan Liu ◽

Zhuowei Wang ◽

Depei Wang

Keyword(s):

Recognition Accuracy ◽

Sliding Window ◽

Recognition System ◽

Text Retrieval ◽

Text Recognition ◽

Manual Segmentation ◽

Low Resource ◽

Sample Data ◽

Deep Cnn

Manchu is a low-resource language that is rarely involved in text recognition technology. Because of the combination of typefaces, ordinary text recognition practice requires segmentation before recognition, which affects the recognition accuracy. In this paper, we propose a Manchu text recognition system divided into two parts: text recognition and text retrieval. First, a deep CNN model is used for text recognition, using a sliding window instead of manual segmentation. Second, text retrieval finds similarities within the image and locates the position of the recognized text in the database; this process is described in detail. We conducted comparative experiments on the FAST-NU dataset using different quantities of sample data, as well as comparisons with the latest model. The experiments revealed that the optimal results of the proposed deep CNN model reached 98.84%.

Download Full-text

On the Determination of the Chip Nozzle Recognition System by Using Machine Vision

Frontiers in Business, Economics and Management ◽

10.54097/fbem.v1i3.21 ◽

2021 ◽

Vol 1 (3) ◽

pp. 1-7

Author(s):

Jing Qiu ◽

Yun Xu ◽

Siyi Liu

Keyword(s):

Machine Vision ◽

Recognition Accuracy ◽

Recognition System ◽

Analysis Method ◽

Practical Application ◽

Average Value ◽

Blob Analysis ◽

Sample Data ◽

Semiconductor Chips

To solve the problem of chip damage caused by the using the wrong type of vacuum nozzle during the packaging of semiconductor chips. A recognition system of vacuum nozzle based on machine vision was proposed. In this research, 29 kinds of lifting nozzles are selected as test samples. The backlight intensity of two lifting nozzle images (one strong and one weak separately) is collected at the first beginning. Then, the Blob analysis method is using to analyze the weak backlighting image. The area of the lifting nozzle and the minimum outer rectangular feature can be obtained subsequently. To identify the shape of the liftin nozzle (round or square), the area ratio is calculated. At the same time, the minimum outer rectangular of the lifting nozzle is selected as the reference rectangle. Then, construct the measurement rectangle. The 2-dimensional size of the lifting nozzle is measured as well. Meanwhile, for the strong backlight image, the average value of the grayscale which located within the minimum outer rectangle is calculated. Therefore, the color (black, white, or beige) of the nozzle can be identified. Finally, the sample data is saved to the database as the sample database. During the recognition process, the shape, color, and size of the lifting nozzle being analyzing are using as the parameter to realize the condition inquire. The experimental results show that the recognition accuracy of this method is 98.85%, and the recognition time of one nozzle is around 1 second, which meets the requirements of practical application.

Download Full-text

OFFLINE YORÙBÁ HANDWRITTEN WORD RECOGNITION USING GEOMETRIC FEATURE EXTRACTION AND SUPPORT VECTOR MACHINE CLASSIFIER

MALAYSIAN JOURNAL OF COMPUTING ◽

10.24191/mjoc.v5i2.8947 ◽

2020 ◽

Vol 5 (2) ◽

pp. 504

Author(s):

Matthias Omotayo Oladele ◽

Temilola Morufat Adepoju ◽

Olaide ` Abiodun Olatoke ◽

Oluwaseun Adewale Ojo

Keyword(s):

Support Vector Machine ◽

Feature Extraction ◽

Word Recognition ◽

Support Vector Machine Classifier ◽

Recognition Accuracy ◽

Recognition System ◽

Support Vector ◽

Geometric Features ◽

Total Length ◽

Yoruba Language

Yorùbá language is one of the three main languages that is been spoken in Nigeria. It is a tonal language that carries an accent on the vowel alphabets. There are twenty-five (25) alphabets in Yorùbá language with one of the alphabets a digraph (GB). Due to the difficulty in typing handwritten Yorùbá documents, there is a need to develop a handwritten recognition system that can convert the handwritten texts to digital format. This study discusses the offline Yorùbá handwritten word recognition system (OYHWR) that recognizes Yorùbá uppercase alphabets. Handwritten characters and words were obtained from different writers using the paint application and M708 graphics tablets. The characters were used for training and the words were used for testing. Pre-processing was done on the images and the geometric features of the images were extracted using zoning and gradient-based feature extraction. Geometric features are the different line types that form a particular character such as the vertical, horizontal, and diagonal lines. The geometric features used are the number of horizontal lines, number of vertical lines, number of right diagonal lines, number of left diagonal lines, total length of all horizontal lines, total length of all vertical lines, total length of all right slanting lines, total length of all left-slanting lines and the area of the skeleton. The characters are divided into 9 zones and gradient feature extraction was used to extract the horizontal and vertical components and geometric features in each zone. The words were fed into the support vector machine classifier and the performance was evaluated based on recognition accuracy. Support vector machine is a two-class classifier, hence a multiclass SVM classifier least square support vector machine (LSSVM) was used for word recognition. The one vs one strategy and RBF kernel were used and the recognition accuracy obtained from the tested words ranges between 66.7%, 83.3%, 85.7%, 87.5%, and 100%. The low recognition rate for some of the words could be as a result of the similarity in the extracted features.

Download Full-text

TDNN-based Multilingual Speech Recognition System for Low Resource Indian Languages

10.21437/interspeech.2018-2117 ◽

2018 ◽

Cited By ~ 7

Author(s):

Noor Fathima ◽

Tanvina Patel ◽

Mahima C ◽

Anuroop Iyengar

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Indian Languages ◽

Low Resource ◽

Multilingual Speech Recognition

Download Full-text

A Novel Gesture Recognition System Based on CSI Extracted from a Smartphone with Nexmon Firmware

Sensors ◽

10.3390/s21010222 ◽

2020 ◽

Vol 21 (1) ◽

pp. 222

Author(s):

Tao Li ◽

Chenqi Shi ◽

Peihao Li ◽

Pengpeng Chen

Keyword(s):

Frequency Domain ◽

Gesture Recognition ◽

Channel State Information ◽

Cross Correlation ◽

Recognition Accuracy ◽

Correlation Method ◽

Bottom Layer ◽

Recognition System ◽

Channel State ◽

State Information

In this paper, we propose a novel gesture recognition system based on a smartphone. Due to the limitation of Channel State Information (CSI) extraction equipment, existing WiFi-based gesture recognition is limited to the microcomputer terminal equipped with Intel 5300 or Atheros 9580 network cards. Therefore, accurate gesture recognition can only be performed in an area relatively fixed to the transceiver link. The new gesture recognition system proposed by us breaks this limitation. First, we use nexmon firmware to obtain 256 CSI subcarriers from the bottom layer of the smartphone in IEEE 802.11ac mode on 80 MHz bandwidth to realize the gesture recognition system’s mobility. Second, we adopt the cross-correlation method to integrate the extracted CSI features in the time and frequency domain to reduce the influence of changes in the smartphone location. Third, we use a new improved DTW algorithm to classify and recognize gestures. We implemented vast experiments to verify the system’s recognition accuracy at different distances in different directions and environments. The results show that the system can effectively improve the recognition accuracy.

Download Full-text

Arabic Cursive Text Recognition from Natural Scene Images

Applied Sciences ◽

10.3390/app9020236 ◽

2019 ◽

Vol 9 (2) ◽

pp. 236 ◽

Cited By ~ 6

Author(s):

Saad Ahmed ◽

Saeeda Naz ◽

Muhammad Razzak ◽

Rubiyah Yusof

Keyword(s):

Recognition System ◽

Document Image ◽

Text Recognition ◽

Chinese Script ◽

Challenging Problem ◽

Future Directions ◽

Scene Text ◽

Comprehensive Survey ◽

Recognition Systems ◽

Scene Text Recognition

This paper presents a comprehensive survey on Arabic cursive scene text recognition. The recent years’ publications in this field have witnessed the interest shift of document image analysis researchers from recognition of optical characters to recognition of characters appearing in natural images. Scene text recognition is a challenging problem due to the text having variations in font styles, size, alignment, orientation, reflection, illumination change, blurriness and complex background. Among cursive scripts, Arabic scene text recognition is contemplated as a more challenging problem due to joined writing, same character variations, a large number of ligatures, the number of baselines, etc. Surveys on the Latin and Chinese script-based scene text recognition system can be found, but the Arabic like scene text recognition problem is yet to be addressed in detail. In this manuscript, a description is provided to highlight some of the latest techniques presented for text classification. The presented techniques following a deep learning architecture are equally suitable for the development of Arabic cursive scene text recognition systems. The issues pertaining to text localization and feature extraction are also presented. Moreover, this article emphasizes the importance of having benchmark cursive scene text dataset. Based on the discussion, future directions are outlined, some of which may provide insight about cursive scene text to researchers.

Download Full-text

Finger-Vein Recognition Using Heterogeneous Databases by Domain Adaption Based on a Cycle-Consistent Adversarial Network

Sensors ◽

10.3390/s21020524 ◽

2021 ◽

Vol 21 (2) ◽

pp. 524

Author(s):

Kyoung Jun Noh ◽

Jiho Choi ◽

Jin Seong Hong ◽

Kang Ryoung Park

Keyword(s):

Domain Adaptation ◽

Recognition Accuracy ◽

Recognition System ◽

Heterogeneous Databases ◽

Second Best ◽

Finger Vein ◽

Adversarial Network ◽

Image Characteristics ◽

Vein Recognition ◽

Finger Vein Recognition

The conventional finger-vein recognition system is trained using one type of database and entails the serious problem of performance degradation when tested with different types of databases. This degradation is caused by changes in image characteristics due to variable factors such as position of camera, finger, and lighting. Therefore, each database has varying characteristics despite the same finger-vein modality. However, previous researches on improving the recognition accuracy of unobserved or heterogeneous databases is lacking. To overcome this problem, we propose a method to improve the finger-vein recognition accuracy using domain adaptation between heterogeneous databases using cycle-consistent adversarial networks (CycleGAN), which enhances the recognition accuracy of unobserved data. The experiments were performed with two open databases—Shandong University homologous multi-modal traits finger-vein database (SDUMLA-HMT-DB) and Hong Kong Polytech University finger-image database (HKPolyU-DB). They showed that the equal error rate (EER) of finger-vein recognition was 0.85% in case of training with SDUMLA-HMT-DB and testing with HKPolyU-DB, which had an improvement of 33.1% compared to the second best method. The EER was 3.4% in case of training with HKPolyU-DB and testing with SDUMLA-HMT-DB, which also had an improvement of 4.8% compared to the second best method.

Download Full-text

Automatic Receipt Recognition System Based on Artificial Intelligence Technology

Applied Sciences ◽

10.3390/app12020853 ◽

2022 ◽

Vol 12 (2) ◽

pp. 853

Author(s):

Cheng-Jian Lin ◽

Yu-Cheng Liu ◽

Chin-Ling Lee

Keyword(s):

Character Recognition ◽

Template Matching ◽

Recognition Accuracy ◽

Recognition System ◽

Character Segmentation ◽

Small Object ◽

Labor Costs ◽

Accuracy Rate ◽

Artificial Intelligence Technology ◽

S Model

In this study, an automatic receipt recognition system (ARRS) is developed. First, a receipt is scanned for conversion into a high-resolution image. Receipt characters are automatically placed into two categories according to the receipt characteristics: printed and handwritten characters. Images of receipts with these characters are preprocessed separately. For handwritten characters, template matching and the fixed features of the receipts are used for text positioning, and projection is applied for character segmentation. Finally, a convolutional neural network is used for character recognition. For printed characters, a modified You Only Look Once (version 4) model (YOLOv4-s) executes precise text positioning and character recognition. The proposed YOLOv4-s model reduces downsampling, thereby enhancing small-object recognition. Finally, the system produces recognition results in a tax declaration format, which can upload to a tax declaration system. Experimental results revealed that the recognition accuracy of the proposed system was 80.93% for handwritten characters. Moreover, the YOLOv4-s model had a 99.39% accuracy rate for printed characters; only 33 characters were misjudged. The recognition accuracy of the YOLOv4-s model was higher than that of the traditional YOLOv4 model by 20.57%. Therefore, the proposed ARRS can considerably improve the efficiency of tax declaration, reduce labor costs, and simplify operating procedures.

Download Full-text

User-independent accelerometer-based gesture recognition for mobile devices

ADCAIJ ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL ◽

10.14201/adcaij20121311125 ◽

2013 ◽

Vol 1 (3) ◽

pp. 11-25 ◽

Cited By ~ 2

Author(s):

Xian Wang ◽

Paula Tarrío ◽

Ana María Bernardos ◽

Eduardo Metola ◽

José Ramón Casar

Keyword(s):

Computational Complexity ◽

Mobile Devices ◽

Gesture Recognition ◽

Inertial Sensors ◽

Recognition Accuracy ◽

State Of The Art ◽

Smart Phone ◽

Recognition System ◽

Independent Manner ◽

Wheeled Robot

Many mobile devices embed nowadays inertial sensors. This enables new forms of human-computer interaction through the use of gestures (movements performed with the mobile device) as a way of communication. This paper presents an accelerometer-based gesture recognition system for mobile devices which is able to recognize a collection of 10 different hand gestures. The system was conceived to be light and to operate in a user-independent manner in real time. The recognition system was implemented in a smart phone and evaluated through a collection of user tests, which showed a recognition accuracy similar to other state-of-the art techniques and a lower computational complexity. The system was also used to build a human-robot interface that enables controlling a wheeled robot with the gestures made with the mobile phone

Download Full-text

A Baybayin word recognition system

PeerJ Computer Science ◽

10.7717/peerj-cs.596 ◽

2021 ◽

Vol 7 ◽

pp. e596

Author(s):

Rodney Pino ◽

Renier Mendoza ◽

Rachelle Sambayan

Keyword(s):

Basic Education ◽

Recognition Accuracy ◽

Recognition System ◽

The Philippines ◽

Support Vector ◽

Writing System ◽

Review Of The Literature ◽

Word Level ◽

Latin Script ◽

Word Images

Baybayin is a pre-Hispanic Philippine writing system used in Luzon island. With the effort in reintroducing the script, in 2018, the Committee on Basic Education and Culture of the Philippine Congress approved House Bill 1022 or the ”National Writing System Act,” which declares the Baybayin script as the Philippines’ national writing system. Since then, Baybayin OCR has become a field of research interest. Numerous works have proposed different techniques in recognizing Baybayin scripts. However, all those studies anchored on the classification and recognition at the character level. In this work, we propose an algorithm that provides the Latin transliteration of a Baybayin word in an image. The proposed system relies on a Baybayin character classifier generated using the Support Vector Machine (SVM). The method involves isolation of each Baybayin character, then classifying each character according to its equivalent syllable in Latin script, and finally concatenate each result to form the transliterated word. The system was tested using a novel dataset of Baybayin word images and achieved a competitive 97.9% recognition accuracy. Based on our review of the literature, this is the first work that recognizes Baybayin scripts at the word level. The proposed system can be used in automated transliterations of Baybayin texts transcribed in old books, tattoos, signage, graphic designs, and documents, among others.

Download Full-text

A Performance Prediction Method Based on Sliding Window Grey Neural Network for Inertial Platform

Remote Sensing ◽

10.3390/rs13234864 ◽

2021 ◽

Vol 13 (23) ◽

pp. 4864

Author(s):

Langfu Cui ◽

Qingzhen Zhang ◽

Liman Yang ◽

Chenggang Bai

Keyword(s):

Neural Network ◽

Prediction Models ◽

Prediction Method ◽

Sliding Window ◽

Small Sample ◽

Grey Theory ◽

Sensing System ◽

Performance Change ◽

Sample Data ◽

Platform System

An inertial platform is the key component of a remote sensing system. During service, the performance of the inertial platform appears in degradation and accuracy reduction. For better maintenance, the inertial platform system is checked and maintained regularly. The performance change of an inertial platform can be evaluated by detection data. Due to limitations of detection conditions, inertial platform detection data belongs to small sample data. In this paper, in order to predict the performance of an inertial platform, a prediction model for an inertial platform is designed combining a sliding window, grey theory and neural network (SGMNN). The experiments results show that the SGMNN model performs best in predicting the inertial platform drift rate compared with other prediction models.

Download Full-text