Efficient Caoshu Character Recognition Scheme and Service Using CNN-Based Recognition Model Optimization

Boseon Hong; Bongjae Kim

doi:10.3390/s20164641

Efficient Caoshu Character Recognition Scheme and Service Using CNN-Based Recognition Model Optimization

Sensors ◽

10.3390/s20164641 ◽

2020 ◽

Vol 20 (16) ◽

pp. 4641

Author(s):

Boseon Hong ◽

Bongjae Kim

Keyword(s):

Performance Optimization ◽

Character Recognition ◽

Data Augmentation ◽

Recognition Performance ◽

Android Application ◽

Recognition Model ◽

Intelligence Models ◽

Augmentation Techniques ◽

Artificial Intelligence Models ◽

Optimized Model

Deep learning-based artificial intelligence models are widely used in various computing fields. Especially, Convolutional Neural Network (CNN) models perform very well for image recognition and classification. In this paper, we propose an optimized CNN-based recognition model to recognize Caoshu characters. In the proposed scheme, an image pre-processing and data augmentation techniques for our Caoshu dataset were applied to optimize and enhance the CNN-based Caoshu character recognition model’s recognition performance. In the performance evaluation, Caoshu character recognition performance was compared and analyzed according to the proposed performance optimization. Based on the model validation results, the recognition accuracy was up to about 98.0% in the case of TOP-1. Based on the testing results of the optimized model, the accuracy, precision, recall, and F1 score are 88.12%, 81.84%, 84.20%, and 83.0%, respectively. Finally, we have designed and implemented a Caoshu recognition service as an Android application based on the optimized CNN based Cahosu recognition model. We have verified that the Caoshu recognition service could be performed in real-time.

Download Full-text

Convective Heat Transfer Characteristics of Low Concentration CuO-Water Nanofluids in the Turbulent Flow Regime Based on an Artificial Intelligence Models

Proceedings of the 15th International Heat Transfer Conference ◽

10.1615/ihtc15.fcv.008461 ◽

2014 ◽

Author(s):

Mehdi Mehrabi ◽

Mohsen Sharifpur ◽

Josua P. Meyer

Keyword(s):

Heat Transfer ◽

Artificial Intelligence ◽

Turbulent Flow ◽

Convective Heat Transfer ◽

Flow Regime ◽

Convective Heat ◽

Heat Transfer Characteristics ◽

Turbulent Flow Regime ◽

Intelligence Models ◽

Artificial Intelligence Models

Download Full-text

A Deep Learning based Arabic Script Recognition System: Benchmark on KHAT

The International Arab Journal of Information Technology ◽

10.34028/iajit/17/3/3 ◽

2020 ◽

Vol 17 (3) ◽

pp. 299-305 ◽

Cited By ~ 1

Author(s):

Riaz Ahmad ◽

Saeeda Naz ◽

Muhammad Afzal ◽

Sheikh Rashid ◽

Marcus Liwicki ◽

...

Keyword(s):

Deep Learning ◽

Character Recognition ◽

Data Augmentation ◽

Short Term Memory ◽

Recognition System ◽

Learning Approach ◽

Arabic Text ◽

Data Set ◽

Processing Step ◽

Handwritten Arabic

This paper presents a deep learning benchmark on a complex dataset known as KFUPM Handwritten Arabic TexT (KHATT). The KHATT data-set consists of complex patterns of handwritten Arabic text-lines. This paper contributes mainly in three aspects i.e., (1) pre-processing, (2) deep learning based approach, and (3) data-augmentation. The pre-processing step includes pruning of white extra spaces plus de-skewing the skewed text-lines. We deploy a deep learning approach based on Multi-Dimensional Long Short-Term Memory (MDLSTM) networks and Connectionist Temporal Classification (CTC). The MDLSTM has the advantage of scanning the Arabic text-lines in all directions (horizontal and vertical) to cover dots, diacritics, strokes and fine inflammation. The data-augmentation with a deep learning approach proves to achieve better and promising improvement in results by gaining 80.02% Character Recognition (CR) over 75.08% as baseline.

Download Full-text

The Data-augmentation Techniques in Item Response Modeling: Current Approaches and New Developments

Advances in Psychological Science ◽

10.3724/sp.j.1042.2014.01036 ◽

2014 ◽

Vol 22 (6) ◽

pp. 1036

Author(s):

Wei TIAN ◽

Tao XIN ◽

Chunhua KANG

Keyword(s):

Item Response ◽

Data Augmentation ◽

New Developments ◽

Item Response Modeling ◽

Augmentation Techniques ◽

Response Modeling

Download Full-text

Automatic Labeled Dialogue Generation for Nursing Record Systems

Journal of Personalized Medicine ◽

10.3390/jpm10030062 ◽

2020 ◽

Vol 10 (3) ◽

pp. 62

Author(s):

Tittaya Mairittha ◽

Nattaya Mairittha ◽

Sozo Inoue

Keyword(s):

Data Augmentation ◽

Short Term Memory ◽

Generative Models ◽

Abstract Knowledge ◽

Augmentation Techniques ◽

Nursing Record ◽

Long Short Term Memory ◽

The Individual ◽

High Level ◽

Embedding Methods

The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively.

Download Full-text

Generative Adversarial Networks to Improve the Robustness of Visual Defect Segmentation by Semantic Networks in Manufacturing Components

Applied Sciences ◽

10.3390/app11146368 ◽

2021 ◽

Vol 11 (14) ◽

pp. 6368

Author(s):

Fátima A. Saiz ◽

Garazi Alfaro ◽

Iñigo Barandiaran ◽

Manuel Graña

Keyword(s):

Ad Hoc ◽

Data Augmentation ◽

Semantic Network ◽

Semantic Networks ◽

Stereo Image ◽

Generative Adversarial Networks ◽

Specific Class ◽

Adversarial Networks ◽

Augmentation Techniques ◽

Image Acquisition System

This paper describes the application of Semantic Networks for the detection of defects in images of metallic manufactured components in a situation where the number of available samples of defects is small, which is rather common in real practical environments. In order to overcome this shortage of data, the common approach is to use conventional data augmentation techniques. We resort to Generative Adversarial Networks (GANs) that have shown the capability to generate highly convincing samples of a specific class as a result of a game between a discriminator and a generator module. Here, we apply the GANs to generate samples of images of metallic manufactured components with specific defects, in order to improve training of Semantic Networks (specifically DeepLabV3+ and Pyramid Attention Network (PAN) networks) carrying out the defect detection and segmentation. Our process carries out the generation of defect images using the StyleGAN2 with the DiffAugment method, followed by a conventional data augmentation over the entire enriched dataset, achieving a large balanced dataset that allows robust training of the Semantic Network. We demonstrate the approach on a private dataset generated for an industrial client, where images are captured by an ad-hoc photometric-stereo image acquisition system, and a public dataset, the Northeastern University surface defect database (NEU). The proposed approach achieves an improvement of 7% and 6% in an intersection over union (IoU) measure of detection performance on each dataset over the conventional data augmentation.

Download Full-text

Data Augmentation Techniques on Arabic Data for Named Entity Recognition

Procedia Computer Science ◽

10.1016/j.procs.2021.05.092 ◽

2021 ◽

Vol 189 ◽

pp. 292-299

Author(s):

Caroline Sabty ◽

Islam Omar ◽

Fady Wasfalla ◽

Mohamed Islam ◽

Slim Abdennadher

Keyword(s):

Data Augmentation ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

Augmentation Techniques

Download Full-text

Stress testing reveals gaps in clinic readiness of image-based diagnostic artificial intelligence models

npj Digital Medicine ◽

10.1038/s41746-020-00380-6 ◽

2021 ◽

Vol 4 (1) ◽

Author(s):

Albert T. Young ◽

Kristen Fernandez ◽

Jacob Pfau ◽

Rasika Reddy ◽

Nhat Anh Cao ◽

...

Keyword(s):

Artificial Intelligence ◽

Real World ◽

Stress Testing ◽

Operating Characteristic ◽

Characteristic Curve ◽

Skin Lesions ◽

Stress Tests ◽

Performance Metric ◽

Intelligence Models ◽

Artificial Intelligence Models

AbstractArtificial intelligence models match or exceed dermatologists in melanoma image classification. Less is known about their robustness against real-world variations, and clinicians may incorrectly assume that a model with an acceptable area under the receiver operating characteristic curve or related performance metric is ready for clinical use. Here, we systematically assessed the performance of dermatologist-level convolutional neural networks (CNNs) on real-world non-curated images by applying computational “stress tests”. Our goal was to create a proxy environment in which to comprehensively test the generalizability of off-the-shelf CNNs developed without training or evaluation protocols specific to individual clinics. We found inconsistent predictions on images captured repeatedly in the same setting or subjected to simple transformations (e.g., rotation). Such transformations resulted in false positive or negative predictions for 6.5–22% of skin lesions across test datasets. Our findings indicate that models meeting conventionally reported metrics need further validation with computational stress tests to assess clinic readiness.

Download Full-text

Artificial intelligence models derived from 2D transperineal ultrasound images in the clinical diagnosis of stress urinary incontinence

International Urogynecology Journal ◽

10.1007/s00192-021-04859-y ◽

2021 ◽

Author(s):

Man Zhang ◽

Xin Lin ◽

Zhijuan Zheng ◽

Ying Chen ◽

Yong Ren ◽

...

Keyword(s):

Artificial Intelligence ◽

Urinary Incontinence ◽

Stress Urinary Incontinence ◽

Clinical Diagnosis ◽

Transperineal Ultrasound ◽

Ultrasound Images ◽

Intelligence Models ◽

Artificial Intelligence Models

Download Full-text

Proposition of new computer artificial intelligence models for shear strength prediction of reinforced concrete beams

Engineering With Computers ◽

10.1007/s00366-021-01400-z ◽

2021 ◽

Author(s):

Hayder Riyadh Mohammed Mohammed ◽

Sumarni Ismail

Keyword(s):

Artificial Intelligence ◽

Shear Strength ◽

Reinforced Concrete ◽

Concrete Beams ◽

Reinforced Concrete Beams ◽

Strength Prediction ◽

Intelligence Models ◽

Artificial Intelligence Models

Download Full-text

Neural Data Augmentation Techniques for Time Series Data and its Benefits

2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA) ◽

10.1109/icmla51294.2020.00026 ◽

2020 ◽

Author(s):

Anindya Sarkar ◽

Anirudh Sunder Raj ◽

Raghu Sesha Iyengar

Keyword(s):

Time Series ◽

Data Augmentation ◽

Time Series Data ◽

Series Data ◽

Neural Data ◽

Augmentation Techniques

Download Full-text