EnCaps: Clothing Image Classification Based on Enhanced Capsule Network

Feng Yu; Chenghu Du; Ailing Hua; Minghua Jiang; Xiong Wei; Tao Peng; Xinrong Hu

doi:10.3390/app112211024

EnCaps: Clothing Image Classification Based on Enhanced Capsule Network

Applied Sciences ◽

10.3390/app112211024 ◽

2021 ◽

Vol 11 (22) ◽

pp. 11024

Author(s):

Feng Yu ◽

Chenghu Du ◽

Ailing Hua ◽

Minghua Jiang ◽

Xiong Wei ◽

...

Keyword(s):

Spatial Structure ◽

Image Classification ◽

Classification Accuracy ◽

High Performance ◽

Great Difficulty ◽

Image Feature ◽

Structure Information ◽

Structure Feature ◽

Feature Based ◽

Extraction Model

Clothing image classification is more and more important in the development of online clothing shopping. The clothing category marking, clothing commodity retrieval, and similar clothing recommendations are the popular applications in current clothing shopping, which are based on the technology of accurate clothing image classification. Wide varieties and various styles of clothing lead to great difficulty for the accurate clothing image classification. The traditional neural network can not obtain the spatial structure information of clothing images, which leads to poor classification accuracy. In order to reach the high accuracy, the enhanced capsule (EnCaps) network is proposed with the image feature and spatial structure feature. First, the spatial structure extraction model is proposed to obtain the clothing structure feature based on the EnCaps network. Second, the enhanced feature extraction model is proposed to extract more robust clothing features based on deeper network structure and attention mechanism. Third, parameter optimization is used to reduce the computation in the proposed network based on inception mechanism. Experimental results indicate that the proposed EnCaps network achieves high performance in terms of classification accuracy and computational efficiency.

Download Full-text

PolSAR Image Feature Extraction via Co-Regularized Graph Embedding

Remote Sensing ◽

10.3390/rs12111738 ◽

2020 ◽

Vol 12 (11) ◽

pp. 1738

Author(s):

Xiayuan Huang ◽

Xiangli Nie ◽

Hong Qiao

Keyword(s):

Feature Extraction ◽

Image Classification ◽

Weighted Graph ◽

Graph Embedding ◽

Image Feature ◽

Weighted Graphs ◽

Image Feature Extraction ◽

Structure Information ◽

Low Dimensional ◽

Low Dimensional Features

Dimensionality reduction (DR) methods based on graph embedding are widely used for feature extraction. For these methods, the weighted graph plays a vital role in the process of DR because it can characterize the data’s structure information. Moreover, the similarity measurement is a crucial factor for constructing a weighted graph. Wishart distance of covariance matrices and Euclidean distance of polarimetric features are two important similarity measurements for polarimetric synthetic aperture radar (PolSAR) image classification. For obtaining a satisfactory PolSAR image classification performance, a co-regularized graph embedding (CRGE) method by combing the two distances is proposed for PolSAR image feature extraction in this paper. Firstly, two weighted graphs are constructed based on the two distances to represent the data’s local structure information. Specifically, the neighbouring samples are sought in a local patch to decrease computation cost and use spatial information. Next the DR model is constructed based on the two weighted graphs and co-regularization. The co-regularization aims to minimize the dissimilarity of low-dimensional features corresponding to two weighted graphs. We employ two types of co-regularization and the corresponding algorithms are proposed. Ultimately, the obtained low-dimensional features are used for PolSAR image classification. Experiments are implemented on three PolSAR datasets and results show that the co-regularized graph embedding can enhance the performance of PolSAR image classification.

Download Full-text

COVID-19 image classification using deep features and fractional-order marine predators algorithm

Scientific Reports ◽

10.1038/s41598-020-71294-2 ◽

2020 ◽

Vol 10 (1) ◽

Cited By ~ 3

Author(s):

Ahmed T. Sahlol ◽

Dalia Yousri ◽

Ahmed A. Ewees ◽

Mohammed A. A. Al-qaness ◽

Robertas Damasevicius ◽

...

Keyword(s):

Feature Selection ◽

Image Classification ◽

Fractional Order ◽

Classification Accuracy ◽

High Performance ◽

Computational Cost ◽

X Ray ◽

Marine Predators ◽

Hybrid Classification ◽

Selection Algorithms

Abstract Currently, we witness the severe spread of the pandemic of the new Corona virus, COVID-19, which causes dangerous symptoms to humans and animals, its complications may lead to death. Although convolutional neural networks (CNNs) is considered the current state-of-the-art image classification technique, it needs massive computational cost for deployment and training. In this paper, we propose an improved hybrid classification approach for COVID-19 images by combining the strengths of CNNs (using a powerful architecture called Inception) to extract features and a swarm-based feature selection algorithm (Marine Predators Algorithm) to select the most relevant features. A combination of fractional-order and marine predators algorithm (FO-MPA) is considered an integration among a robust tool in mathematics named fractional-order calculus (FO). The proposed approach was evaluated on two public COVID-19 X-ray datasets which achieves both high performance and reduction of computational complexity. The two datasets consist of X-ray COVID-19 images by international Cardiothoracic radiologist, researchers and others published on Kaggle. The proposed approach selected successfully 130 and 86 out of 51 K features extracted by inception from dataset 1 and dataset 2, while improving classification accuracy at the same time. The results are the best achieved on these datasets when compared to a set of recent feature selection algorithms. By achieving 98.7%, 98.2% and 99.6%, 99% of classification accuracy and F-Score for dataset 1 and dataset 2, respectively, the proposed approach outperforms several CNNs and all recent works on COVID-19 images.

Download Full-text

Structure-Feature based Graph Self-adaptive Pooling

Proceedings of The Web Conference 2020 ◽

10.1145/3366423.3380083 ◽

2020 ◽

Cited By ~ 2

Author(s):

Liang Zhang ◽

Xudong Wang ◽

Hongsheng Li ◽

Guangming Zhu ◽

Peiyi Shen ◽

...

Keyword(s):

Structure Feature ◽

Feature Based ◽

Self Adaptive

Download Full-text

An Imbalanced Image Classification Method for the Cell Cycle Phase

Information ◽

10.3390/info12060249 ◽

2021 ◽

Vol 12 (6) ◽

pp. 249

Author(s):

Xin Jin ◽

Yuanwen Zou ◽

Zhongbing Huang

Keyword(s):

Cell Cycle ◽

Deep Learning ◽

Image Classification ◽

Classification Accuracy ◽

Data Augmentation ◽

Cycle Phase ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Cellular Life

The cell cycle is an important process in cellular life. In recent years, some image processing methods have been developed to determine the cell cycle stages of individual cells. However, in most of these methods, cells have to be segmented, and their features need to be extracted. During feature extraction, some important information may be lost, resulting in lower classification accuracy. Thus, we used a deep learning method to retain all cell features. In order to solve the problems surrounding insufficient numbers of original images and the imbalanced distribution of original images, we used the Wasserstein generative adversarial network-gradient penalty (WGAN-GP) for data augmentation. At the same time, a residual network (ResNet) was used for image classification. ResNet is one of the most used deep learning classification networks. The classification accuracy of cell cycle images was achieved more effectively with our method, reaching 83.88%. Compared with an accuracy of 79.40% in previous experiments, our accuracy increased by 4.48%. Another dataset was used to verify the effect of our model and, compared with the accuracy from previous results, our accuracy increased by 12.52%. The results showed that our new cell cycle image classification system based on WGAN-GP and ResNet is useful for the classification of imbalanced images. Moreover, our method could potentially solve the low classification accuracy in biomedical images caused by insufficient numbers of original images and the imbalanced distribution of original images.

Download Full-text

Deep Learning-Based Hepatocellular Carcinoma Histopathology Image Classification: Accuracy versus Training Dataset Size

IEEE Access ◽

10.1109/access.2021.3060765 ◽

2021 ◽

pp. 1-1

Author(s):

Yu-Shiang Lin ◽

Pei-Hsin Huang ◽

Yung-Yaw Chen

Keyword(s):

Hepatocellular Carcinoma ◽

Deep Learning ◽

Image Classification ◽

Classification Accuracy ◽

Training Dataset ◽

Dataset Size

Download Full-text

Hyperspectral Image Classification Based on Multi-Scale Residual Network with Attention Mechanism

Remote Sensing ◽

10.3390/rs13030335 ◽

2021 ◽

Vol 13 (3) ◽

pp. 335

Author(s):

Yuhao Qing ◽

Wenyi Liu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Image Classification ◽

Classification Accuracy ◽

Hyperspectral Image ◽

Principal Component ◽

Hyperspectral Image Classification ◽

Deep Network ◽

Multi Scale

In recent years, image classification on hyperspectral imagery utilizing deep learning algorithms has attained good results. Thus, spurred by that finding and to further improve the deep learning classification accuracy, we propose a multi-scale residual convolutional neural network model fused with an efficient channel attention network (MRA-NET) that is appropriate for hyperspectral image classification. The suggested technique comprises a multi-staged architecture, where initially the spectral information of the hyperspectral image is reduced into a two-dimensional tensor, utilizing a principal component analysis (PCA) scheme. Then, the constructed low-dimensional image is input to our proposed ECA-NET deep network, which exploits the advantages of its core components, i.e., multi-scale residual structure and attention mechanisms. We evaluate the performance of the proposed MRA-NET on three public available hyperspectral datasets and demonstrate that, overall, the classification accuracy of our method is 99.82 %, 99.81%, and 99.37, respectively, which is higher compared to the corresponding accuracy of current networks such as 3D convolutional neural network (CNN), three-dimensional residual convolution structure (RES-3D-CNN), and space–spectrum joint deep network (SSRN).

Download Full-text

Image feature based video object description and tracking

2012 IEEE International Conference on Vehicular Electronics and Safety (ICVES 2012) ◽

10.1109/icves.2012.6294255 ◽

2012 ◽

Cited By ~ 2

Author(s):

Ramazan Yildiz ◽

Tankut Acarman

Keyword(s):

Image Feature ◽

Video Object ◽

Object Description ◽

Feature Based

Download Full-text

An New Image Feature Based on ROLD

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.774-776.1625 ◽

2013 ◽

Vol 774-776 ◽

pp. 1625-1628 ◽

Cited By ~ 1

Author(s):

Kai Hu ◽

Wei Feng Chen ◽

Dan Mao ◽

Zi Chen Zheng ◽

Jing Yi Duan

Keyword(s):

Experimental Data ◽

Image Recognition ◽

Visual Navigation ◽

Image Feature ◽

Fundamental Analysis ◽

Feature Based

To make robot more intelligence, this paper proposed a new image feature named as ROLD-map which based on Rank-Ordered Logarithmic Difference (ROLD), and this feature enable researchers understand images complication directly and accuracy. Experimental data show that it can recognize the sky, tree and road obviously with very little time through proposed feature. It provides the fundamental analysis for improving the precision of image recognition, and also gives the reference research for improving the precision of image recognition for the process of visual navigation of robot.

Download Full-text

A Novel Feature Extraction Model to Enhance Underwater Image Classification

Intelligent Computing Systems - Communications in Computer and Information Science ◽

10.1007/978-3-030-43364-2_8 ◽

2020 ◽

pp. 78-91

Author(s):

Muhammad Irfan ◽

Jiangbin Zheng ◽

Muhammad Iqbal ◽

Muhammad Hassan Arif

Keyword(s):

Feature Extraction ◽

Image Classification ◽

Underwater Image ◽

Extraction Model

Download Full-text

Extracting image features for classification by two-tier genetic programming

10.26686/wgtn.13150940 ◽

2020 ◽

Author(s):

Harith Al-Sahaf ◽

A Song ◽

K Neshatian ◽

Mengjie Zhang

Keyword(s):

Genetic Programming ◽

Image Classification ◽

Domain Knowledge ◽

Extraction Process ◽

High Accuracy ◽

Classification Performance ◽

Image Features ◽

Classification Methods ◽

Feature Based ◽

Second Tier

Image classification is a complex but important task especially in the areas of machine vision and image analysis such as remote sensing and face recognition. One of the challenges in image classification is finding an optimal set of features for a particular task because the choice of features has direct impact on the classification performance. However the goodness of a feature is highly problem dependent and often domain knowledge is required. To address these issues we introduce a Genetic Programming (GP) based image classification method, Two-Tier GP, which directly operates on raw pixels rather than features. The first tier in a classifier is for automatically defining features based on raw image input, while the second tier makes decision. Compared to conventional feature based image classification methods, Two-Tier GP achieved better accuracies on a range of different tasks. Furthermore by using the features defined by the first tier of these Two-Tier GP classifiers, conventional classification methods obtained higher accuracies than classifying on manually designed features. Analysis on evolved Two-Tier image classifiers shows that there are genuine features captured in the programs and the mechanism of achieving high accuracy can be revealed. The Two-Tier GP method has clear advantages in image classification, such as high accuracy, good interpretability and the removal of explicit feature extraction process. © 2012 IEEE.

Download Full-text