scholarly journals Practical Recognition System for Text Printed on Clear Reflected Material

2012 ◽  
Vol 2012 ◽  
pp. 1-16 ◽  
Author(s):  
Khader Mohammad ◽  
Sos Agaian

Text embedded in an image contains useful information for applications in the medical, industrial, commercial, and research fields. While many systems have been designed to correctly identify text in images, no work addressing the recognition of degraded text on clear plastic has been found. This paper posits novel methods and an apparatus for extracting text from an image with the practical assumption: (a) poor background contrast, (b) white, curved, and/or differing fonts or character width between sets of images, (c) dotted text printed on curved reflective material, and/or (d) touching characters. Methods were evaluated using a total of 100 unique test images containing a variety of texts captured from water bottles. These tests averaged a processing time of ~10 seconds (using MATLAB R2008A on an HP 8510 W with 4 G of RAM and 2.3 GHz of processor speed), and experimental results yielded an average recognition rate of 90 to 93% using customized systems generated by the proposed development.

Author(s):  
Youssef Ouadid ◽  
Abderrahmane Elbalaoui ◽  
Mehdi Boutaounte ◽  
Mohamed Fakir ◽  
Brahim Minaoui

<p>In this paper, a graph based handwritten Tifinagh character recognition system is presented. In preprocessing Zhang Suen algorithm is enhanced. In features extraction, a novel key point extraction algorithm is presented. Images are then represented by adjacency matrices defining graphs where nodes represent feature points extracted by a novel algorithm. These graphs are classified using a graph matching method. Experimental results are obtained using two databases to test the effectiveness. The system shows good results in terms of recognition rate.</p>


2013 ◽  
Vol 753-755 ◽  
pp. 3064-3067
Author(s):  
Ju Zhong ◽  
Ye Zi Sheng ◽  
Chun Li Lin ◽  
Nai Dong Cui

Double-direction two-dimensional Maximum Scatter Difference (2D2MSD) based on Maximum Scatter Difference (MSD) was proposed,which overcame the small sample size problem of LDA, and data were more concise. In the Weizmann human action database, experimental results showed the algorithm was fast, the average recognition rate reached 92% and the highest recognition rate reached 100%.


2002 ◽  
Vol 14 (01) ◽  
pp. 12-19 ◽  
Author(s):  
DUU-TONG FUH ◽  
CHING-HSING LUO

The standard Morse code defines the tone ratio (dash/dot) and the silent ratio (dash-space/dotspace) as 3:1. Since human typing ratio can't keep this ratio precisely and the two ratios —tone ratio and silent ratio—are not equal, the Morse code can't be recognized automatically. The requirement of the standard ratio is difficult to satisfy even for an ordinary person. As for the unstable Morse code typing pattern, the auto-recognition algorithms in the literature are not good enough in applications. The disabled persons usually have difficulty in maintaining a stable typing speeds and typing ratios, we therefore adopted an Expert-Gating neural network model to implement in single chip and recognize online unstable Morse codes. Also, we used another method—a linear back propagation recalling algorithm, to implement in single chip and recognize unstable Morse codes. From three person tests: Test one is a cerebral palsy; Test two is a beginner: Test three is a skilled expert, we have the results: in the experiment of test one, we have 91.15% (use 6 characters average time series as thresholds) and 91.54% (learning 26 characters) online average recognition rate; test two have 95.77% and 96.15%, and test three have 98.46% and 99.23% respectively. As for linear back propagation recalling method online recognized rate, we have the results from test one: 92.31% online average recognition rate; test two: 96.15%; and test three 99.23% respectively. So, we concluded: The Expert-Gating neural network and the linear back propagation recalling algorithm have successfully overcome the difficulty of analyzing a severely online unstable Morse code time series and successfully implement in single chip to recognize online unstable Morse code.


Author(s):  
Binod Kumar Prasad

Purpose of the study: The purpose of this work is to present an offline Optical Character Recognition system to recognise handwritten English numerals to help automation of document reading. It helps to avoid tedious and time-consuming manual typing to key in important information in a computer system to preserve it for a longer time. Methodology: This work applies Curvature Features of English numeral images by encoding them in terms of distance and slope. The finer local details of images have been extracted by using Zonal features. The feature vectors obtained from the combination of these features have been fed to the KNN classifier. The whole work has been executed using the MatLab Image Processing toolbox. Main Findings: The system produces an average recognition rate of 96.67% with K=1 whereas, with K=3, the rate increased to 97% with corresponding errors of 3.33% and 3% respectively. Out of all the ten numerals, some numerals like ‘3’ and ‘8’ have shown respectively lower recognition rates. It is because of the similarity between their structures. Applications of this study: The proposed work is related to the recognition of English numerals. The model can be used widely for recognition of any pattern like signature verification, face recognition, character or word recognition in another language under Natural Language Processing, etc. Novelty/Originality of this study: The novelty of the work lies in the process of feature extraction. Curves present in the structure of a numeral sample have been encoded based on distance and slope thereby presenting Distance features and Slope features. Vertical Delta Distance Coding (VDDC) and Horizontal Delta Distance Coding (HDDC) encode a curve from vertical and horizontal directions to reveal concavity and convexity from different angles.


2021 ◽  
Vol 2021 ◽  
pp. 1-6
Author(s):  
Xueyan Chen ◽  
Xiaofei Zhong

In order to help pathologists quickly locate the lesion area, improve the diagnostic efficiency, and reduce missed diagnosis, a convolutional neural network algorithm for the optimization of emergency nursing rescue efficiency of critical patients was proposed. Specifically, three convolution layers and convolution kernels of different sizes are used to extract the features of patients’ posture behavior, and the classifier of patients’ posture behavior recognition system is used to learn the feature information by capturing the nonlinear relationship between the features to achieve accurate classification. By testing the accuracy of patient posture behavior feature extraction, the recognition rate of a certain action, and the average recognition rate of all actions in the patient body behavior recognition system, it is proved that the convolution neural network algorithm can greatly improve the efficiency of emergency nursing. The algorithm is applied to the patient posture behavior detection system, so as to realize the identification and monitoring of patients and improve the level of intelligent medical care. Finally, the open source framework platform is used to test the patient behavior detection system. The experimental results show that the larger the test data set is, the higher the accuracy of patient posture behavior feature extraction is, and the average recognition rate of patient posture behavior category is 97.6%, thus verifying the effectiveness and correctness of the system, to prove that the convolutional neural network algorithm has a very large improvement of emergency nursing rescue efficiency.


2014 ◽  
Vol 644-650 ◽  
pp. 4174-4177
Author(s):  
Xue Mei Wang ◽  
Jia Jun Zhang

In order to improve the accuracy of recognition system for fatigue facial expression of driver, driver fatigue expression of this paper, the detection method for key feature points in the fatigue facial expression image of driver is applied in the paper to establish a fatigue expression image recognition model based on attention mechanism. Experimental results show that the algorithm can improve the recognition rate of driver's expression image, so as to record fatigue expression image of driver more accurate.


2013 ◽  
Vol 846-847 ◽  
pp. 1380-1383
Author(s):  
Xian Yi Rui ◽  
Yi Biao Yu ◽  
Ying Jiang

Because of the single-syllable of Chinese words and the confusing nature of Chinese pronunciation, connected mandarin digit speech recognition (CMDSR) is a challenging task in the field of speech recognition. This paper applied a novel acoustic representation of speech, called the acoustic universal structure (AUS) where the non-linguistic variations such as vocal tract length, lines and noises are well removed. A two-layer matching strategy based on the AUS models of speech, including the digit and string AUS models, is proposed for connected mandarin digit speech recognition. The speech recognition system for connected mandarin digits is described in detail, and the experimental results show that the proposed method can obtain the higher recognition rate.


2013 ◽  
Vol 284-287 ◽  
pp. 2950-2954
Author(s):  
Ching Tang Hsieh ◽  
Chia Shing Hu ◽  
Meng Shian Shih

Conventional 2D face recognition methods often struggle when a subject's head is turned even slightly to the side. In this study, a face recognition system based on 3D head modeling that is able to tolerate facial rotation angles was constructed by leveraging the Open source graphic library (OpenGL) framework. To minimize the extensive angle searching time that often occurs in conventional 3D modeling, Particle Swarm Optimization (PSO) was used to determine the correct facial angle in 3D. This reduced the angle computation time to 6 seconds, which is significantly faster than other methods. Experimental results showed that successful ID recognition can be achieved with a high recognition rate of 90%.


2021 ◽  
Vol 2083 (4) ◽  
pp. 042049
Author(s):  
Lei Li ◽  
Fenggang Liu

Abstract This paper proposes an efficient and accurate method of two-dimensional code recognition for industrial actual projects, and develops a high-speed and batch two-dimensional code recognition system based on machine vision. Firstly, according to the position of the QR code in the target subspace, a method to locate the region of interest of each QR code by using geometric relationship and batch processing QR code is proposed. On this basis, Gaussian noise is added to simulate the possible noise in production practice, and the anti-noise ability of the system is evaluated. Finally, the relationship between system recognition rate and QR code movement speed is analyzed and the experimental results are compared. The experimental results show that the system can meet the requirement of real-time online detection.


Generally, pattern recognition considered a strong challenge in many information processing research fields. The aim of this paper is to propose a highly accurate model for recognizing a handwritten English numeral through efficiently extracting the most valuable features of a certain handwritten numeral or digit. The handwritten English Numerals Recognition Model (HENRM) is proposed in this paper. The features extraction of the proposal based on combining both statistical and structural features of the certain numeral sample image. Mainly, the proposed HENCM has four phases which are image acquisition, image preprocessing, features extraction, and classification. In fact, four feature extraction approaches are utilized in this paper, which are the number of intersection points, the number of open-end points, calculation of density feature, and determining the chain code for each of the English numerals. The latter phase gives a features vector of 26-element size to be fed into the classifier that uses the Multi-class Support Vector Machine (MSVM) for the classification process. The experimental results showed that the proposed HENCM exhibits an average recognition rate equals to 97%. Index Terms—Chain Code, Density feature, MSVM, Recognition.


Sign in / Sign up

Export Citation Format

Share Document