Double Additive Margin Softmax Loss for Face Recognition

Shengwei Zhou; Caikou Chen; Guojiang Han; Xielian Hou

doi:10.3390/app10010060

Double Additive Margin Softmax Loss for Face Recognition

Applied Sciences ◽

10.3390/app10010060 ◽

2019 ◽

Vol 10 (1) ◽

pp. 60 ◽

Cited By ~ 1

Author(s):

Shengwei Zhou ◽

Caikou Chen ◽

Guojiang Han ◽

Xielian Hou

Keyword(s):

Neural Networks ◽

Face Recognition ◽

Loss Function ◽

State Of The Art ◽

Feature Learning ◽

Loss Functions ◽

Deep Convolutional Neural Networks ◽

Large Margin ◽

Face Features ◽

Geometrical Explanation

Learning large-margin face features whose intra-class variance is small and inter-class diversity is one of important challenges in feature learning applying Deep Convolutional Neural Networks (DCNNs) for face recognition. Recently, an appealing line of research is to incorporate an angular margin in the original softmax loss functions for obtaining discriminative deep features during the training of DCNNs. In this paper we propose a novel loss function, termed as double additive margin Softmax loss (DAM-Softmax). The presented loss has a clearer geometrical explanation and can obtain highly discriminative features for face recognition. Extensive experimental evaluation of several recent state-of-the-art softmax loss functions are conducted on the relevant face recognition benchmarks, CASIA-Webface, LFW, CALFW, CPLFW, and CFP-FP. We show that the proposed loss function consistently outperforms the state-of-the-art.

Download Full-text

Mis-Classified Vector Guided Softmax Loss for Face Recognition

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6906 ◽

2020 ◽

Vol 34 (07) ◽

pp. 12241-12248 ◽

Cited By ~ 3

Author(s):

Xiaobo Wang ◽

Shifeng Zhang ◽

Shuo Wang ◽

Tianyu Fu ◽

Hailin Shi ◽

...

Keyword(s):

Face Recognition ◽

Loss Function ◽

State Of The Art ◽

Feature Learning ◽

Ground Truth ◽

Significant Progress ◽

Deep Convolutional Neural Networks ◽

Face Features ◽

Discriminative Feature ◽

Feature Mining

Face recognition has witnessed significant progress due to the advances of deep convolutional neural networks (CNNs), the central task of which is how to improve the feature discrimination. To this end, several margin-based (e.g., angular, additive and additive angular margins) softmax loss functions have been proposed to increase the feature margin between different classes. However, despite great achievements have been made, they mainly suffer from three issues: 1) Obviously, they ignore the importance of informative features mining for discriminative learning; 2) They encourage the feature margin only from the ground truth class, without realizing the discriminability from other non-ground truth classes; 3) The feature margin between different classes is set to be same and fixed, which may not adapt the situations very well. To cope with these issues, this paper develops a novel loss function, which adaptively emphasizes the mis-classified feature vectors to guide the discriminative feature learning. Thus we can address all the above issues and achieve more discriminative face features. To the best of our knowledge, this is the first attempt to inherit the advantages of feature margin and feature mining into a unified loss function. Experimental results on several benchmarks have demonstrated the effectiveness of our method over state-of-the-art alternatives. Our code is available at http://www.cbsr.ia.ac.cn/users/xiaobowang/.

Download Full-text

Illumination-robust face recognition based on deep convolutional neural networks architectures

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v18.i2.pp1015-1027 ◽

2020 ◽

Vol 18 (2) ◽

pp. 1015

Author(s):

Ridha Ilyas Bendjillali ◽

Mohammed Beladgham ◽

Khaled Merit ◽

Abdelmalik Taleb-Ahmed

Keyword(s):

Neural Networks ◽

Face Recognition ◽

Convolutional Neural Networks ◽

Feature Learning ◽

Histogram Equalization ◽

Detection Algorithm ◽

Deep Convolutional Neural Networks ◽

Biometric Technology ◽

Equalization Algorithm ◽

Robust Face Recognition

<p><span>In the last decade, facial recognition techniques are considered the most important fields of research in biometric technology. In this research paper, we present a Face Recognition (FR) system divided into three steps: The Viola-Jones face detection algorithm, facial image enhancement using Modified Contrast Limited Adaptive Histogram Equalization algorithm (M-CLAHE), and feature learning for classiﬁcation. For learning the features followed by classiﬁcation we used VGG16, ResNet50 and Inception-v3 Convolutional Neural Networks (CNN) architectures for the proposed system. Our experimental work was performed on the Extended Yale B database and CMU PIE face database. Finally, the comparison with the other methods on both databases shows the robustness and effectiveness of the proposed approach. Where the Inception-v3 architecture has achieved a rate of 99, 44% and 99, 89% respectively.</span></p>

Download Full-text

A New Dataset and Deep Residual Spectral Spatial Network for Hyperspectral Image Classification

Symmetry ◽

10.3390/sym12040561 ◽

2020 ◽

Vol 12 (4) ◽

pp. 561 ◽

Cited By ~ 1

Author(s):

Yiming Xue ◽

Dan Zeng ◽

Fansheng Chen ◽

Yueming Wang ◽

Zhijiang Zhang

Keyword(s):

Neural Networks ◽

Loss Function ◽

Hyperspectral Image ◽

State Of The Art ◽

Feature Learning ◽

Spatial Network ◽

Hyperspectral Image Classification ◽

Classification Framework ◽

Prediction Confidence ◽

Art Methods

Due to the limited varieties and sizes of existing public hyperspectral image (HSI) datasets, the classification accuracies are higher than 99% with convolutional neural networks (CNNs). In this paper, we presented a new HSI dataset named Shandong Feicheng, whose size and pixel quantity are much larger. It also has a larger intra-class variance and a smaller inter-class variance. State-of-the-art methods were compared on it to verify its diversity. Otherwise, to reduce overfitting caused by the imbalance between high dimension and small quantity of labeled HSI data, existing CNNs for HSI classification are relatively shallow and suffer from low capacity of feature learning. To solve this problem, we proposed an HSI classification framework named deep residual spectral spatial setwork (DRSSN). By using shortcut connection structure, which is an asymmetry structure, DRSSN can be deeper to extract features with better discrimination. In addition, to alleviate insufficient training caused by unbalanced sample sizes between easily and hard classified samples, we proposed a novel training loss function named sample balanced loss, which allocated weights to the losses of samples according to their prediction confidence. Experimental results on two popular datasets and our proposed dataset showed that our proposed network could provide competitive results compared with state-of-the-art methods.

Download Full-text

Assessing the Impact of the Loss Function, Architecture and Image Type for Deep Learning-Based Wildfire Segmentation

Applied Sciences ◽

10.3390/app11157046 ◽

2021 ◽

Vol 11 (15) ◽

pp. 7046

Author(s):

Jorge Francisco Ciprián-Sánchez ◽

Gilberto Ochoa-Ruiz ◽

Lucile Rossi ◽

Frédéric Morandini

Keyword(s):

Deep Learning ◽

Loss Function ◽

State Of The Art ◽

Fire Detection ◽

Loss Functions ◽

Wildfire Spread ◽

Combine Information ◽

The Impact ◽

Image Type ◽

Segmentation Models

Wildfires stand as one of the most relevant natural disasters worldwide, particularly more so due to the effect of climate change and its impact on various societal and environmental levels. In this regard, a significant amount of research has been done in order to address this issue, deploying a wide variety of technologies and following a multi-disciplinary approach. Notably, computer vision has played a fundamental role in this regard. It can be used to extract and combine information from several imaging modalities in regard to fire detection, characterization and wildfire spread forecasting. In recent years, there has been work pertaining to Deep Learning (DL)-based fire segmentation, showing very promising results. However, it is currently unclear whether the architecture of a model, its loss function, or the image type employed (visible, infrared, or fused) has the most impact on the fire segmentation results. In the present work, we evaluate different combinations of state-of-the-art (SOTA) DL architectures, loss functions, and types of images to identify the parameters most relevant to improve the segmentation results. We benchmark them to identify the top-performing ones and compare them to traditional fire segmentation techniques. Finally, we evaluate if the addition of attention modules on the best performing architecture can further improve the segmentation results. To the best of our knowledge, this is the first work that evaluates the impact of the architecture, loss function, and image type in the performance of DL-based wildfire segmentation models.

Download Full-text

Learning Large Logic Programs By Going Beyond Entailment

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/287 ◽

2020 ◽

Author(s):

Andrew Cropper ◽

Sebastijan Dumančic

Keyword(s):

Logic Programming ◽

Loss Function ◽

Inductive Logic Programming ◽

State Of The Art ◽

Inductive Logic ◽

Program Synthesis ◽

Loss Functions ◽

Logic Programs ◽

Binary Decision ◽

Best First Search

A major challenge in inductive logic programming (ILP) is learning large programs. We argue that a key limitation of existing systems is that they use entailment to guide the hypothesis search. This approach is limited because entailment is a binary decision: a hypothesis either entails an example or does not, and there is no intermediate position. To address this limitation, we go beyond entailment and use 'example-dependent' loss functions to guide the search, where a hypothesis can partially cover an example. We implement our idea in Brute, a new ILP system which uses best-first search, guided by an example-dependent loss function, to incrementally build programs. Our experiments on three diverse program synthesis domains (robot planning, string transformations, and ASCII art), show that Brute can substantially outperform existing ILP systems, both in terms of predictive accuracies and learning times, and can learn programs 20 times larger than state-of-the-art systems.

Download Full-text

AI-driven deep CNN approach for multi-label pathology classification using chest X-Rays

PeerJ Computer Science ◽

10.7717/peerj-cs.495 ◽

2021 ◽

Vol 7 ◽

pp. e495

Author(s):

Saleh Albahli ◽

Hafiz Tayyab Rauf ◽

Abdulelah Algosaibi ◽

Valentina Emilia Balas

Keyword(s):

Neural Networks ◽

Data Augmentation ◽

State Of The Art ◽

Synthetic Data ◽

X Rays ◽

Deep Convolutional Neural Networks ◽

Current State ◽

Pathology Classification ◽

Wide Range ◽

Multi Class Classification

Artificial intelligence (AI) has played a significant role in image analysis and feature extraction, applied to detect and diagnose a wide range of chest-related diseases. Although several researchers have used current state-of-the-art approaches and have produced impressive chest-related clinical outcomes, specific techniques may not contribute many advantages if one type of disease is detected without the rest being identified. Those who tried to identify multiple chest-related diseases were ineffective due to insufficient data and the available data not being balanced. This research provides a significant contribution to the healthcare industry and the research community by proposing a synthetic data augmentation in three deep Convolutional Neural Networks (CNNs) architectures for the detection of 14 chest-related diseases. The employed models are DenseNet121, InceptionResNetV2, and ResNet152V2; after training and validation, an average ROC-AUC score of 0.80 was obtained competitive as compared to the previous models that were trained for multi-class classification to detect anomalies in x-ray images. This research illustrates how the proposed model practices state-of-the-art deep neural networks to classify 14 chest-related diseases with better accuracy.

Download Full-text

Crop disease identification using state-of-the-art deep convolutional neural networks

Smart Computing ◽

10.1201/9781003167488-21 ◽

2021 ◽

pp. 160-169

Author(s):

P.S. Thakur ◽

T. Sheorey ◽

Aparajita Ojha

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

State Of The Art ◽

Deep Convolutional Neural Networks ◽

Disease Identification ◽

Crop Disease

Download Full-text

A Coupling Support Vector Machines with the Feature Learning of Deep Convolutional Neural Networks for Classifying Microarray Gene Expression Data

Modern Approaches for Intelligent Information and Database Systems - Studies in Computational Intelligence ◽

10.1007/978-3-319-76081-0_20 ◽

2018 ◽

pp. 233-243 ◽

Cited By ~ 4

Author(s):

Phuoc-Hai Huynh ◽

Van-Hoa Nguyen ◽

Thanh-Nghi Do

Keyword(s):

Gene Expression ◽

Neural Networks ◽

Support Vector Machines ◽

Feature Learning ◽

Microarray Gene Expression Data ◽

Support Vector ◽

Deep Convolutional Neural Networks ◽

Microarray Gene Expression ◽

Vector Machines ◽

Microarray Gene

Download Full-text

Self-Supervised Learning for Generalizable Out-of-Distribution Detection

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.5966 ◽

2020 ◽

Vol 34 (04) ◽

pp. 5216-5223 ◽

Cited By ~ 1

Author(s):

Sina Mohseni ◽

Mandar Pitale ◽

JBS Yadawa ◽

Zhangyang Wang

Keyword(s):

Neural Networks ◽

Autonomous Vehicles ◽

Deep Neural Networks ◽

State Of The Art ◽

Feature Learning ◽

Detection Methods ◽

Training Set ◽

Safety Critical ◽

Multiple Image ◽

A New Technique

The real-world deployment of Deep Neural Networks (DNNs) in safety-critical applications such as autonomous vehicles needs to address a variety of DNNs' vulnerabilities, one of which being detecting and rejecting out-of-distribution outliers that might result in unpredictable fatal errors. We propose a new technique relying on self-supervision for generalizable out-of-distribution (OOD) feature learning and rejecting those samples at the inference time. Our technique does not need to pre-know the distribution of targeted OOD samples and incur no extra overheads compared to other methods. We perform multiple image classification experiments and observe our technique to perform favorably against state-of-the-art OOD detection methods. Interestingly, we witness that our method also reduces in-distribution classification risk via rejecting samples near the boundaries of the training set distribution.

Download Full-text

NROI based feature learning for automated tumor stage classification of pulmonary lung nodules using deep convolutional neural networks

Journal of King Saud University - Computer and Information Sciences ◽

10.1016/j.jksuci.2019.11.013 ◽

2019 ◽

Cited By ~ 2

Author(s):

Supriya Suresh ◽

Subaji Mohan

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Feature Learning ◽

Tumor Stage ◽

Lung Nodules ◽

Deep Convolutional Neural Networks ◽

Stage Classification

Download Full-text