Novel Deep Convolutional Neural Network-Based Contextual Recognition of Arabic Handwritten Scripts

Rami Ahmed; Mandar Gogate; Ahsen Tahir; Kia Dashtipour; Bassam Al-tamimi; Ahmad Hawalah; Mohammed A. El-Affendi; Amir Hussain

doi:10.3390/e23030340

Novel Deep Convolutional Neural Network-Based Contextual Recognition of Arabic Handwritten Scripts

Entropy ◽

10.3390/e23030340 ◽

2021 ◽

Vol 23 (3) ◽

pp. 340

Author(s):

Rami Ahmed ◽

Mandar Gogate ◽

Ahsen Tahir ◽

Kia Dashtipour ◽

Bassam Al-tamimi ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Deep Neural Networks ◽

State Of The Art ◽

Handwriting Recognition ◽

Arabic Text ◽

Context Aware ◽

Proposed Model ◽

Regularization Parameters ◽

Deep Cnn

Offline Arabic Handwriting Recognition (OAHR) has recently become instrumental in the areas of pattern recognition and image processing due to its application in several fields, such as office automation and document processing. However, OAHR continues to face several challenges, including high variability of the Arabic script and its intrinsic characteristics such as cursiveness, ligatures, and diacritics, the unlimited variation in human handwriting, and the lack of large public databases. In this paper, we introduce a novel context-aware model based on deep neural networks to address the challenges of recognizing offline handwritten Arabic text, including isolated digits, characters, and words. Specifically, we propose a supervised Convolutional Neural Network (CNN) model that contextually extracts optimal features and employs batch normalization and dropout regularization parameters. This aims to prevent overfitting and further enhance generalization performance when compared to conventional deep learning models. We employ a number of deep stacked-convolutional layers to design the proposed Deep CNN (DCNN) architecture. The model is extensively evaluated and shown to demonstrate excellent classification accuracy when compared to conventional OAHR approaches on a diverse set of six benchmark databases, including MADBase (Digits), CMATERDB (Digits), HACDB (Characters), SUST-ALT (Digits), SUST-ALT (Characters), and SUST-ALT (Names). A further experimental study is conducted on the benchmark Arabic databases by exploiting transfer learning (TL)-based feature extraction which demonstrates the superiority of our proposed model in relation to state-of-the-art VGGNet-19 and MobileNet pre-trained models. Finally, experiments are conducted to assess comparative generalization capabilities of the models using another language database , specifically the benchmark MNIST English isolated Digits database, which further confirm the superiority of our proposed DCNN model.

Download Full-text

Identifying Ethnics of People through Face Recognition: A Deep CNN Approach

Scientific Programming ◽

10.1155/2020/6385281 ◽

2020 ◽

Vol 2020 ◽

pp. 1-7

Author(s):

Ahmed Jawad A. AlBdairi ◽

Zhu Xiao ◽

Mohammed Alghaili

Keyword(s):

Neural Network ◽

Deep Learning ◽

Face Recognition ◽

Convolutional Neural Network ◽

State Of The Art ◽

Research Community ◽

Facial Features ◽

New Model ◽

Image Dataset ◽

Deep Cnn

The interest in face recognition studies has grown rapidly in the last decade. One of the most important problems in face recognition is the identification of ethnics of people. In this study, a new deep learning convolutional neural network is designed to create a new model that can recognize the ethnics of people through their facial features. The new dataset for ethnics of people consists of 3141 images collected from three different nationalities. To the best of our knowledge, this is the first image dataset collected for the ethnics of people and that dataset will be available for the research community. The new model was compared with two state-of-the-art models, VGG and Inception V3, and the validation accuracy was calculated for each convolutional neural network. The generated models have been tested through several images of people, and the results show that the best performance was achieved by our model with a verification accuracy of 96.9%.

Download Full-text

Identification of Tomato Leaf Disease Detection using Pretrained Deep Convolutional Neural Network Models

Scalable Computing Practice and Experience ◽

10.12694/scpe.v21i4.1780 ◽

2020 ◽

Vol 21 (4) ◽

pp. 625-635

Author(s):

Anandhakrishnan T ◽

Jaisakthi S.M Murugaiyan

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Early Stage ◽

Computing Time ◽

Network Models ◽

Deep Convolutional Neural Network ◽

Neural Network Models ◽

Leaf Disease ◽

Proposed Model ◽

Deep Cnn

In this paper, we proposed a plant leaf disease identification model based on a Pretrained deep convolutional neural network (Deep CNN). The Deep CNN model is trained using an open dataset with 10 different classes of tomato leaves We observed that overall architectures which can increase the best performance of the model. The proposed model was trained using different training epochs, batch sizes and dropouts. The Xception has attained maximum accuracy compare with all other approaches. After an extensive simulation, the proposed model achieves classification accuracy better. This accuracy of the proposed work is greater than the accuracy of all other Pretrained approaches. The proposed model is also tested with respect to its consistency and reliability. The set of data used for this work was collected from the plant village dataset, including sick and healthy images. Models for detection of plant disease should predict the disease quickly and accurately in the early stage itself so that a proper precautionary measures can be applied to avoid further spread of the diseases. So, to reduce the main issue about the leaf diseases, we can analyze distinct kinds of deep neural network architectures in this research. From the outcomes, Xception has a constantly improving more to enhance the accuracy by increasing the number of epochs, without any indications of overfitting and decreasein quality. And Xception also generated a fine 99.45% precision in less computing time.

Download Full-text

A modified Bayesian Convolutional Neural Network for Breast Histopathology Image Classification and Uncertainty Quantification

10.31224/osf.io/5xf8c ◽

2020 ◽

Author(s):

Pushkar Khairnar ◽

Ponkrshnan Thiagarajan ◽

Susanta Ghosh

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Uncertainty Quantification ◽

State Of The Art ◽

False Negative ◽

Activation Function ◽

Ensemble Averaging ◽

Proposed Model ◽

Improve Accuracy ◽

Histopathological Images

Convolutional neural network (CNN) based classification models have been successfully used on histopathological images for the detection of diseases. Despite its success, CNN may yield erroneous or overfitted results when the data is not sufficiently large or is biased. To overcome these limitations of CNN and to provide uncertainty quantification Bayesian CNN is recently proposed. However, we show that Bayesian-CNN still suffers from inaccuracies, especially in negative predictions. In the present work, we extend the Bayesian-CNN to improve accuracy and the rate of convergence. The proposed model is called modified Bayesian-CNN. The novelty of the proposed model lies in an adaptive activation function that contains a learnable parameter for each of the neurons. This adaptive activation function dynamically changes the loss function thereby providing faster convergence and better accuracy. The uncertainties associated with the predictions are obtained since the model learns a probability distribution on the network parameters. It reduces overfitting through an ensemble averaging over networks, which in turn improves accuracy on the unknown data. The proposed model demonstrates significant improvement by nearly eliminating overfitting and remarkably reducing (about 38%) the number of false-negative predictions. We found that the proposed model predicts higher uncertainty for images having features of both the classes. The uncertainty in the predictions of individual images can be used to decide when further human-expert intervention is needed. These findings have the potential to advance the state-of-the-art machine learning-based automatic classification for histopathological images.

Download Full-text

ThriftyNets: Convolutional Neural Networks with Tiny Parameter Budget

IoT ◽

10.3390/iot2020012 ◽

2021 ◽

Vol 2 (2) ◽

pp. 222-235

Author(s):

Guillaume Coiffier ◽

Ghouthi Boukli Hacene ◽

Vincent Gripon

Keyword(s):

Neural Network ◽

Machine Learning ◽

Neural Networks ◽

Convolutional Neural Network ◽

Spatial Resolution ◽

Network Architecture ◽

Deep Neural Networks ◽

State Of The Art ◽

Feature Maps ◽

Neural Network Architecture

Deep Neural Networks are state-of-the-art in a large number of challenges in machine learning. However, to reach the best performance they require a huge pool of parameters. Indeed, typical deep convolutional architectures present an increasing number of feature maps as we go deeper in the network, whereas spatial resolution of inputs is decreased through downsampling operations. This means that most of the parameters lay in the final layers, while a large portion of the computations are performed by a small fraction of the total parameters in the first layers. In an effort to use every parameter of a network at its maximum, we propose a new convolutional neural network architecture, called ThriftyNet. In ThriftyNet, only one convolutional layer is defined and used recursively, leading to a maximal parameter factorization. In complement, normalization, non-linearities, downsamplings and shortcut ensure sufficient expressivity of the model. ThriftyNet achieves competitive performance on a tiny parameters budget, exceeding 91% accuracy on CIFAR-10 with less than 40 k parameters in total, 74.3% on CIFAR-100 with less than 600 k parameters, and 67.1% On ImageNet ILSVRC 2012 with no more than 4.15 M parameters. However, the proposed method typically requires more computations than existing counterparts.

Download Full-text

An Optimized Approach for Intra-Class Fruit Classification Using Deep Convolutional Neural Network

International Journal of Image and Graphics ◽

10.1142/s0219467821400143 ◽

2021 ◽

pp. 2140014

Author(s):

Rishipal Singh ◽

Rajneesh Rani ◽

Aman Kamboj

Keyword(s):

Neural Network ◽

Computer Vision ◽

Convolutional Neural Network ◽

State Of The Art ◽

Deep Convolutional Neural Network ◽

Classification Models ◽

Proposed Model ◽

Traditional Classification

Fruits classification is one of the influential applications of computer vision. Traditional classification models are trained by considering various features such as color, shape, texture, etc. These features are common for different varieties of the same fruit. Therefore, a new set of features is required to classify the fruits belonging to the same class. In this paper, we have proposed an optimized method to classify intra-class fruits using deep convolutional layers. The proposed architecture is capable of solving the challenges of a commercial tray-based system in the supermarket. As the research in intra-class classification is still in its infancy, there are challenges that have not been tackled. So, the proposed method is specifically designed to overcome the challenges related to intra-class fruits classification. The proposed method showcases an impressive performance for intra-class classification, which is achieved using a few parameters than the existing methods. The proposed model consists of Inception block, Residual connections and various other layers in very precise order. To validate its performance, the proposed method is compared with state-of-the-art models and performs best in terms of accuracy, loss, parameters, and depth.

Download Full-text

A deep fully residual convolutional neural network for segmentation in EM images

International Journal of Wavelets Multiresolution and Information Processing ◽

10.1142/s0219691320500071 ◽

2020 ◽

Vol 18 (03) ◽

pp. 2050007

Author(s):

Juanjuan He ◽

Song Xiang ◽

Ziqi Zhu

Keyword(s):

Neural Network ◽

Image Segmentation ◽

Convolutional Neural Network ◽

Medical Image ◽

Spatial Information ◽

State Of The Art ◽

Ct Images ◽

Medical Image Segmentation ◽

Post Process ◽

Proposed Model

In standard U-net, researchers only use long skip connections to skip features from the encoding path to the decoding path in order to recover spatial information loss during downsampling. However, it would result in gradient vanishing and limit the depth of the network. To address this issue, we propose a novel deep fully residual convolutional neural network that combines the U-net with the ResNet for medical image segmentation. By applying short skip connections, this new extension of U-net decreases the amount of parameters compared to the standard U-net, although the depth of the layer is increased. We evaluate the performance of the proposed model and other state-of-the-art models on the Electron Microscopy (EM) images dataset and the Computed Tomography (CT) images dataset. The result shows that our model achieves competitive accuracy on the EM benchmark without any further post-process. Moreover, the performance of image segmentation on CT images of the lungs is improved in contrast to the standard U-net.

Download Full-text

Environment Sound Classification System Based on Hybrid Feature and Convolutional Neural Network

Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University ◽

10.1051/jnwpu/20203810162 ◽

2020 ◽

Vol 38 (1) ◽

pp. 162-169

Author(s):

Ke Zhang ◽

Yu Su ◽

Jingyu Wang ◽

Sanyu Wang ◽

Yanhua Zhang

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Deep Neural Networks ◽

Recognition System ◽

Sound Recognition ◽

Environmental Sound ◽

Sound Classification ◽

Proposed Model ◽

Auditory Features

At present, the environment sound recognition system mainly identifies environment sounds with deep neural networks and a wide variety of auditory features. Therefore, it is necessary to analyze which auditory features are more suitable for deep neural networks based ESCR systems. In this paper, we chose three sound features which based on two widely used filters:the Mel and Gammatone filter banks. Subsequently, the hybrid feature MGCC is presented. Finally, a deep convolutional neural network is proposed to verify which features are more suitable for environment sound classification and recognition tasks. The experimental results show that the signal processing features are better than the spectrogram features in the deep neural network based environmental sound recognition system. Among all the acoustic features, the MGCC feature achieves the best performance than other features. Finally, the MGCC-CNN model proposed in this paper is compared with the state-of-the-art environmental sound classification models on the UrbanSound 8K dataset. The results show that the proposed model has the best classification accuracy.

Download Full-text

RecPOID: POI Recommendation with Friendship Aware and Deep CNN

Future Internet ◽

10.3390/fi13030079 ◽

2021 ◽

Vol 13 (3) ◽

pp. 79

Author(s):

Sadaf Safavi ◽

Mehrdad Jalali

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

State Of The Art ◽

Spatial Features ◽

Points Of Interest ◽

Poi Recommendation ◽

Time Distance ◽

Deep Cnn ◽

Temporal And Spatial ◽

Location Based Social Networks

In location-based social networks (LBSNs), exploit several key features of points-of-interest (POIs) and users on precise POI recommendation be significant. In this work, a novel POI recommendation pipeline based on the convolutional neural network named RecPOID is proposed, which can recommend an accurate sequence of top-k POIs and considers only the effect of the most similar pattern friendship rather than all user’s friendship. We use the fuzzy c-mean clustering method to find the similarity. Temporal and spatial features of similar friends are fed to our Deep CNN model. The 10-layer convolutional neural network can predict longitude and latitude and the Id of the next proper locations; after that, based on the shortest time distance from a similar pattern’s friendship, select the smallest distance locations. The proposed structure uses six features, including user’s ID, month, day, hour, minute, and second of visiting time by each user as inputs. RecPOID based on two accessible LBSNs datasets is evaluated. Experimental outcomes illustrate considering most similar friendship could improve the accuracy of recommendations and the proposed RecPOID for POI recommendation outperforms state-of-the-art approaches.

Download Full-text

Multi-scale 3D-convolutional neural network for hyperspectral image classification

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v25.i1.pp307-316 ◽

2022 ◽

Vol 25 (1) ◽

pp. 307

Author(s):

Murali Kanthi ◽

Thogarcheti Hitendra Sarma ◽

Chigarapalle Shoba Bindu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Classification Accuracy ◽

Hyperspectral Image ◽

State Of The Art ◽

Spatial Dimension ◽

Multi Scale ◽

Proposed Model ◽

Spectral Channels

Deep Learning methods are state-of-the-art approaches for pixel-based hyperspectral images (HSI) classification. High classification accuracy has been achieved by extracting deep features from both spatial-spectral channels. However, the efficiency of such spatial-spectral approaches depends on the spatial dimension of each patch and there is no theoretically valid approach to find the optimum spatial dimension to be considered. It is more valid to extract spatial features by considering varying neighborhood scales in spatial dimensions. In this regard, this article proposes a deep convolutional neural network (CNN) model wherein three different multi-scale spatial-spectral patches are used to extract the features in both the spatial and spectral channels. In order to extract these potential features, the proposed deep learning architecture takes three patches various scales in spatial dimension. 3D convolution is performed on each selected patch and the process runs through entire image. The proposed is named as multi-scale three-dimensional convolutional neural network (MS-3DCNN). The efficiency of the proposed model is being verified through the experimental studies on three publicly available benchmark datasets including Pavia University, Indian Pines, and Salinas. It is empirically proved that the classification accuracy of the proposed model is improved when compared with the remaining state-of-the-art methods.

Download Full-text

Enhanced context-aware recommendation using topic modeling and particle swarm optimization

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-210331 ◽

2021 ◽

pp. 1-16

Author(s):

Ibtissem Gasmi ◽

Mohamed Walid Azizi ◽

Hassina Seridi-Bouchelaghem ◽

Nabiha Azizi ◽

Samir Brahim Belhaouari

Keyword(s):

Topic Modeling ◽

Latent Dirichlet Allocation ◽

State Of The Art ◽

Weighting Function ◽

Contextual Factors ◽

Pearson Correlation ◽

Correlation Coefficients ◽

Pso Algorithm ◽

Context Aware ◽

Proposed Model

Context-Aware Recommender System (CARS) suggests more relevant services by adapting them to the user’s specific context situation. Nevertheless, the use of many contextual factors can increase data sparsity while few context parameters fail to introduce the contextual effects in recommendations. Moreover, several CARSs are based on similarity algorithms, such as cosine and Pearson correlation coefficients. These methods are not very effective in the sparse datasets. This paper presents a context-aware model to integrate contextual factors into prediction process when there are insufficient co-rated items. The proposed algorithm uses Latent Dirichlet Allocation (LDA) to learn the latent interests of users from the textual descriptions of items. Then, it integrates both the explicit contextual factors and their degree of importance in the prediction process by introducing a weighting function. Indeed, the PSO algorithm is employed to learn and optimize weights of these features. The results on the Movielens 1 M dataset show that the proposed model can achieve an F-measure of 45.51% with precision as 68.64%. Furthermore, the enhancement in MAE and RMSE can respectively reach 41.63% and 39.69% compared with the state-of-the-art techniques.

Download Full-text