Metallographic image segmentation of GCr15 bearing steel based on CGAN

Yuanyuan Chen; Wuyin Jin; Meng Wang

doi:10.3233/jae-209441

Metallographic image segmentation of GCr15 bearing steel based on CGAN

International Journal of Applied Electromagnetics and Mechanics ◽

10.3233/jae-209441 ◽

2020 ◽

Vol 64 (1-4) ◽

pp. 1237-1243 ◽

Cited By ~ 1

Author(s):

Yuanyuan Chen ◽

Wuyin Jin ◽

Meng Wang

Keyword(s):

Deep Learning ◽

Carbide Particle ◽

State Of The Art ◽

Bearing Steel ◽

Segmentation Method ◽

Low Contrast ◽

Gcr15 Bearing Steel ◽

Proposed Model ◽

Metallographic Images ◽

Particle Segmentation

A novel deep learning segmentation method based on Conditional Generative Adversarial Nets (CGAN) is proposed, being U-GAN in this paper to overtake shortcomings of the metallographic images of GCr15 bearing steel, such as multi-noise, low contrast and difficult to segment. The results of experiment indicate that the proposed model is the most accurate comparing with the digital image processing methods and deep learning methods on carbide particle segmentation. The average Dice’s coefficient of similarity measure function is 0.9158, which is the state-of-the-art performance on dataset.

Download Full-text

Learning to Combine Local and Global Image Information for Contactless Palmprint Recognition

Sensors ◽

10.3390/s22010073 ◽

2021 ◽

Vol 22 (1) ◽

pp. 73

Author(s):

Marjan Stoimchev ◽

Marija Ivanovska ◽

Vitomir Štruc

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Input Image ◽

Palmprint Recognition ◽

Learning Approaches ◽

Elastic Deformations ◽

Feature Representations ◽

Palmar Surface ◽

Proposed Model ◽

Visual Artifacts

In the past few years, there has been a leap from traditional palmprint recognition methodologies, which use handcrafted features, to deep-learning approaches that are able to automatically learn feature representations from the input data. However, the information that is extracted from such deep-learning models typically corresponds to the global image appearance, where only the most discriminative cues from the input image are considered. This characteristic is especially problematic when data is acquired in unconstrained settings, as in the case of contactless palmprint recognition systems, where visual artifacts caused by elastic deformations of the palmar surface are typically present in spatially local parts of the captured images. In this study we address the problem of elastic deformations by introducing a new approach to contactless palmprint recognition based on a novel CNN model, designed as a two-path architecture, where one path processes the input in a holistic manner, while the second path extracts local information from smaller image patches sampled from the input image. As elastic deformations can be assumed to most significantly affect the global appearance, while having a lesser impact on spatially local image areas, the local processing path addresses the issues related to elastic deformations thereby supplementing the information from the global processing path. The model is trained with a learning objective that combines the Additive Angular Margin (ArcFace) Loss and the well-known center loss. By using the proposed model design, the discriminative power of the learned image representation is significantly enhanced compared to standard holistic models, which, as we show in the experimental section, leads to state-of-the-art performance for contactless palmprint recognition. Our approach is tested on two publicly available contactless palmprint datasets—namely, IITD and CASIA—and is demonstrated to perform favorably against state-of-the-art methods from the literature. The source code for the proposed model is made publicly available.

Download Full-text

A Tweet Sentiment Classification Approach Using a Hybrid Stacked Ensemble Technique

Information ◽

10.3390/info12090374 ◽

2021 ◽

Vol 12 (9) ◽

pp. 374

Author(s):

Babacar Gaye ◽

Dezheng Zhang ◽

Aziguli Wulamu

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Deep Learning ◽

Sentiment Analysis ◽

Language Processing ◽

Short Term Memory ◽

State Of The Art ◽

Accuracy Score ◽

Learning Models ◽

Proposed Model

With the extensive availability of social media platforms, Twitter has become a significant tool for the acquisition of peoples’ views, opinions, attitudes, and emotions towards certain entities. Within this frame of reference, sentiment analysis of tweets has become one of the most fascinating research areas in the field of natural language processing. A variety of techniques have been devised for sentiment analysis, but there is still room for improvement where the accuracy and efficacy of the system are concerned. This study proposes a novel approach that exploits the advantages of the lexical dictionary, machine learning, and deep learning classifiers. We classified the tweets based on the sentiments extracted by TextBlob using a stacked ensemble of three long short-term memory (LSTM) as base classifiers and logistic regression (LR) as a meta classifier. The proposed model proved to be effective and time-saving since it does not require feature extraction, as LSTM extracts features without any human intervention. We also compared our proposed approach with conventional machine learning models such as logistic regression, AdaBoost, and random forest. We also included state-of-the-art deep learning models in comparison with the proposed model. Experiments were conducted on the sentiment140 dataset and were evaluated in terms of accuracy, precision, recall, and F1 Score. Empirical results showed that our proposed approach manifested state-of-the-art results by achieving an accuracy score of 99%.

Download Full-text

Segmentation of Overlapping Cervical Cells with Mask Region Convolutional Neural Network

Computational and Mathematical Methods in Medicine ◽

10.1155/2021/3890988 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Jiajia Chen ◽

Baocan Zhang

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

State Of The Art ◽

Cytological Analysis ◽

Segmentation Method ◽

Challenging Tasks ◽

Diagnostic Technology ◽

Cervical Cells ◽

Bounding Boxes

The task of segmenting cytoplasm in cytology images is one of the most challenging tasks in cervix cytological analysis due to the presence of fuzzy and highly overlapping cells. Deep learning-based diagnostic technology has proven to be effective in segmenting complex medical images. We present a two-stage framework based on Mask RCNN to automatically segment overlapping cells. In stage one, candidate cytoplasm bounding boxes are proposed. In stage two, pixel-to-pixel alignment is used to refine the boundary and category classification is also presented. The performance of the proposed method is evaluated on publicly available datasets from ISBI 2014 and 2015. The experimental results demonstrate that our method outperforms other state-of-the-art approaches with DSC 0.92 and FPRp 0.0008 at the DSC threshold of 0.8. Those results indicate that our Mask RCNN-based segmentation method could be effective in cytological analysis.

Download Full-text

A Deep Learning Framework to Predict Routability for FPGA Circuit Placement

ACM Transactions on Reconfigurable Technology and Systems ◽

10.1145/3465373 ◽

2021 ◽

Vol 14 (3) ◽

pp. 1-28

Author(s):

Abeer Al-Hyari ◽

Hannah Szentimrey ◽

Ahmed Shamli ◽

Timothy Martin ◽

Gary Gréwal ◽

...

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Parameter Tuning ◽

Learning Framework ◽

Proposed Model ◽

Field Programmable ◽

Circuit Placement ◽

Detailed Placement ◽

Place And Route ◽

Deep Learning Model

The ability to accurately and efficiently estimate the routability of a circuit based on its placement is one of the most challenging and difficult tasks in the Field Programmable Gate Array (FPGA) flow. In this article, we present a novel, deep learning framework based on a Convolutional Neural Network (CNN) model for predicting the routability of a placement. Since the performance of the CNN model is strongly dependent on the hyper-parameters selected for the model, we perform an exhaustive parameter tuning that significantly improves the model’s performance and we also avoid overfitting the model. We also incorporate the deep learning model into a state-of-the-art placement tool and show how the model can be used to (1) avoid costly, but futile, place-and-route iterations, and (2) improve the placer’s ability to produce routable placements for hard-to-route circuits using feedback based on routability estimates generated by the proposed model. The model is trained and evaluated using over 26K placement images derived from 372 benchmarks supplied by Xilinx Inc. We also explore several opportunities to further improve the reliability of the predictions made by the proposed DLRoute technique by splitting the model into two separate deep learning models for (a) global and (b) detailed placement during the optimization process. Experimental results show that the proposed framework achieves a routability prediction accuracy of 97% while exhibiting runtimes of only a few milliseconds.

Download Full-text

Diversity-Generated Image Inpainting with Style Extraction

10.20944/preprints201912.0028.v1 ◽

2019 ◽

Author(s):

Weiwei Cai ◽

Zhanguo Wei

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Image Inpainting ◽

Ground Truth ◽

Generative Model ◽

Input Noise ◽

Latent Vector ◽

Proposed Model ◽

Ground Truth Image

The latest methods based on deep learning have achieved amazing results regarding the complex work of inpainting large missing areas in an image. This type of method generally attempts to generate one single "optimal" inpainting result, ignoring many other plausible results. However, considering the uncertainty of the inpainting task, one sole result can hardly be regarded as a desired regeneration of the missing area. In view of this weakness, which is related to the design of the previous algorithms, we propose a novel deep generative model equipped with a brand new style extractor which can extract the style noise (a latent vector) from the ground truth image. Once obtained, the extracted style noise and the ground truth image are both input into the generator. We also craft a consistency loss that guides the generated image to approximate the ground truth. Meanwhile, the same extractor captures the style noise from the generated image, which is forced to approach the input noise according to the consistency loss. After iterations, our generator is able to learn the styles corresponding to multiple sets of noise. The proposed model can generate a (sufficiently large) number of inpainting results consistent with the context semantics of the image. Moreover, we check the effectiveness of our model on three databases, i.e., CelebA, Agricultural Disease, and MauFlex. Compared to state-of-the-art inpainting methods, this model is able to offer desirable inpainting results with both a better quality and higher diversity. The code and model will be made available on https://github.com/vivitsai/SEGAN.

Download Full-text

A Deep Learning Based Approach for Localization and Recognition of Pakistani Vehicle License Plates

Sensors ◽

10.3390/s21227696 ◽

2021 ◽

Vol 21 (22) ◽

pp. 7696

Author(s):

Umair Yousaf ◽

Ahmad Khan ◽

Hazrat Ali ◽

Fiaz Gul Khan ◽

Zia ur Rehman ◽

...

Keyword(s):

Deep Learning ◽

State Of The Art ◽

The Other ◽

License Plate ◽

Standard Size ◽

Current State ◽

Bounding Box ◽

Proposed Model ◽

Plate Area

License plate localization is the process of finding the license plate area and drawing a bounding box around it, while recognition is the process of identifying the text within the bounding box. The current state-of-the-art license plate localization and recognition approaches require license plates of standard size, style, fonts, and colors. Unfortunately, in Pakistan, license plates are non-standard and vary in terms of the characteristics mentioned above. This paper presents a deep-learning-based approach to localize and recognize Pakistani license plates with non-uniform and non-standardized sizes, fonts, and styles. We developed a new Pakistani license plate dataset (PLPD) to train and evaluate the proposed model. We conducted extensive experiments to compare the accuracy of the proposed approach with existing techniques. The results show that the proposed method outperformed the other methods to localize and recognize non-standard license plates.

Download Full-text

Hybrid Local and Global Deep-Learning Architecture for Salient-Object Detection

Applied Sciences ◽

10.3390/app10238754 ◽

2020 ◽

Vol 10 (23) ◽

pp. 8754

Author(s):

Wajeeha Sultan ◽

Nadeem Anjum ◽

Mark Stansfield ◽

Naeem Ramzan

Keyword(s):

Deep Learning ◽

Object Detection ◽

State Of The Art ◽

Salient Object Detection ◽

Salient Object ◽

Global Level ◽

Low Contrast ◽

Qualitative And Quantitative ◽

Exact Boundary ◽

Quantitative Analyses

Salient-object detection is a fundamental and the most challenging problem in computer vision. This paper focuses on the detection of salient objects, especially in low-contrast images. To this end, a hybrid deep-learning architecture is proposed where features are extracted on both the local and global level. These features are then integrated to extract the exact boundary of the object of interest in an image. Experimentation was performed on five standard datasets, and results were compared with state-of-the-art approaches. Both qualitative and quantitative analyses showed the robustness of the proposed architecture.

Download Full-text

Improved optic disc and cup segmentation in Glaucomatic images using deep learning architecture

Multimedia Tools and Applications ◽

10.1007/s11042-020-10430-6 ◽

2021 ◽

Author(s):

Partha Sarathi Mangipudi ◽

Hari Mohan Pandey ◽

Ankur Choudhary

Keyword(s):

Deep Learning ◽

Optic Disc ◽

Vision Loss ◽

State Of The Art ◽

Key Factors ◽

Glaucoma Diagnosis ◽

Proposed Model ◽

Effective System ◽

Benchmark Datasets ◽

Multiple Experts

AbstractGlaucoma is an ailment causing permanent vision loss but can be prevented through the early detection. Optic disc to cup ratio is one of the key factors for glaucoma diagnosis. But accurate segmentation of disc and cup is still a challenge. To mitigate this challenge, an effective system for optic disc and cup segmentation using deep learning architecture is presented in this paper. Modified Groundtruth is utilized to train the proposed model. It works as fused segmentation marking by multiple experts that helps in improving the performance of the system. Extensive computer simulations are conducted to test the efficiency of the proposed system. For the implementation three standard benchmark datasets such as DRISHTI-GS, DRIONS-DB and RIM-ONE v3 are used. The performance of the proposed system is validated against the state-of-the-art methods. Results indicate an average overlapping score of 96.62%, 96.15% and 98.42% respectively for optic disc segmentation and an average overlapping score of 94.41% is achieved on DRISHTI-GS which is significant for optic cup segmentation.

Download Full-text

Performance Analysis of Hyperparameters on a Sentiment Analysis Model

Engineering, Technology & Applied Science Research ◽

10.48084/etasr.3549 ◽

2020 ◽

Vol 10 (4) ◽

pp. 6016-6020

Author(s):

I. A. Kandhro ◽

S. Z. Jumani ◽

F. Ali ◽

Z. U. Shaikh ◽

M. A. Arain ◽

...

Keyword(s):

Deep Learning ◽

Performance Analysis ◽

Sentiment Analysis ◽

State Of The Art ◽

Course Evaluation ◽

Experimental Results ◽

Activation Functions ◽

Analysis Model ◽

Proposed Model ◽

Evaluation Dataset

This paper focuses on the performance analysis of hyperparameters of the Sentiment Analysis (SA) model of a course evaluation dataset. The performance was analyzed regarding hyperparameters such as activation, optimization, and regularization. In this paper, the activation functions used were adam, adagrad, nadam, adamax, and hard_sigmoid, the optimization functions were softmax, softplus, sigmoid, and relu, and the dropout values were 0.1, 0.2, 0.3, and 0.4. The results indicate that parameters adam and softmax with dropout value 2.0 are effective when compared to other combinations of the SA model. The experimental results reveal that the proposed model outperforms the state-of-the-art deep learning classifiers.

Download Full-text

Deep Learning-Based Instance Segmentation Method of Litchi Canopy from UAV-Acquired Images

Remote Sensing ◽

10.3390/rs13193919 ◽

2021 ◽

Vol 13 (19) ◽

pp. 3919

Author(s):

Jiawei Mo ◽

Yubin Lan ◽

Dongzi Yang ◽

Fei Wen ◽

Hongbin Qiu ◽

...

Keyword(s):

Deep Learning ◽

Image Annotation ◽

Strong Dependence ◽

Training Data ◽

Complex Data ◽

Segmentation Method ◽

Proposed Model ◽

Tree Canopies ◽

Digital Orthophoto ◽

Instance Segmentation

Instance segmentation of fruit tree canopies from images acquired by unmanned aerial vehicles (UAVs) is of significance for the precise management of orchards. Although deep learning methods have been widely used in the fields of feature extraction and classification, there are still phenomena of complex data and strong dependence on software performances. This paper proposes a deep learning-based instance segmentation method of litchi trees, which has a simple structure and lower requirements for data form. Considering that deep learning models require a large amount of training data, a labor-friendly semi-auto method for image annotation is introduced. The introduction of this method allows for a significant improvement in the efficiency of data pre-processing. Facing the high requirement of a deep learning method for computing resources, a partition-based method is presented for the segmentation of high-resolution digital orthophoto maps (DOMs). Citrus data is added to the training set to alleviate the lack of diversity of the original litchi dataset. The average precision (AP) is selected to evaluate the metric of the proposed model. The results show that with the help of training with the litchi-citrus datasets, the best AP on the test set reaches 96.25%.

Download Full-text