Fusion of Machine Learning and Privacy Preserving for Secure Facial Expression Recognition

2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Asad Ullah ◽  
Jing Wang ◽  
M. Shahid Anwar ◽  
Arshad Ahmad ◽  
Shah Nazir ◽  
...  

The interest in Facial Expression Recognition (FER) is increasing day by day due to its practical and potential applications, such as human physiological interaction diagnosis and mental disease detection. This area has received much attention from the research community in recent years and achieved remarkable results; however, significant improvement is still required on spatial problems. This research work presents a novel framework and proposes an effective and robust solution for FER in an unconstrained environment; it also classifies facial images in a client/server model while preserving privacy. Many cryptographic techniques are available, but they are computationally expensive; instead, we implement a lightweight method that ensures secure communication through randomization. Initially, we apply preprocessing techniques to handle the unconstrained environment: face detection removes excessive background and locates the face in real-world conditions, and data augmentation compensates for the insufficient-data regime. A dual-enhanced capsule network is used to handle the spatial problem. Traditional capsule networks are unable to sufficiently extract features, as the distance varies greatly between facial features. The proposed network is therefore capable of spatial transformation due to its action-unit-aware mechanism and thus forwards the most desirable features for dynamic routing between capsules. The squashing function is used for classification. Simple classification is performed by a single party, but we also implement the client/server model with privacy measures; the two parties do not trust each other, and neither learns the other's input. We show that the effectiveness of our method is unchanged by privacy preservation, validating the results on four popular and versatile databases, where it outperforms all homomorphic cryptographic techniques.
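The abstract does not define the squashing function; assuming it follows the standard capsule-network formulation of Sabour et al., v = (‖s‖² / (1 + ‖s‖²)) · (s / ‖s‖), a minimal NumPy sketch looks like:

```python
import numpy as np

def squash(s, axis=-1, eps=1e-8):
    """Capsule squashing non-linearity: preserves a vector's direction but
    maps its length into [0, 1), so the length can act as a class probability."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    scale = sq_norm / (1.0 + sq_norm)
    return scale * s / np.sqrt(sq_norm + eps)

v = squash(np.array([3.0, 4.0]))  # input norm 5 -> output norm 25/26 ≈ 0.96
```

This is an illustrative sketch of the standard formulation, not the paper's exact implementation.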

2021 ◽  
Vol 2021 ◽  
pp. 1-13
Author(s):  
Asad Ullah ◽  
Jing Wang ◽  
M. Shahid Anwar ◽  
Taeg Keun Whangbo ◽  
Yaping Zhu

The interest in facial expression recognition (FER) is increasing day by day due to its practical and potential applications, such as human physiological interaction diagnosis and mental disease detection. This area has received much attention from the research community in recent years and achieved remarkable results; however, significant improvement is still required on spatial problems. This research work presents a novel framework and proposes an effective and robust solution for FER in an unconstrained environment. Face detection is performed using the supervision of facial attributes: Faceness-Net uses deep facial part responses to detect faces under severe unconstrained variations. To improve generalization and avoid the insufficient-data regime, a Deep Convolutional Generative Adversarial Network (DC-GAN) is utilized. Due to the challenging environmental factors faced in the wild, a large amount of noise disrupts feature extraction, making it hard to capture the ground truth. We therefore leverage multimodal sensors alongside a camera, which aid data acquisition, extract features more accurately, and improve the overall performance of FER. These intelligent sensors are used to tackle significant challenges such as illumination variance, subject dependence, and head pose. A dual-enhanced capsule network is used to handle the spatial problem. Traditional capsule networks are unable to sufficiently extract features, as the distance varies greatly between facial features. The proposed network is therefore capable of spatial transformation due to its action-unit-aware mechanism and thus forwards the most desirable features for dynamic routing between capsules. A squashing function is used for classification. We demonstrate the effectiveness of our method by validating the results on four popular and versatile databases, where it outperforms all state-of-the-art methods.


Sensors ◽  
2020 ◽  
Vol 20 (4) ◽  
pp. 1087
Author(s):  
Muhammad Naveed Riaz ◽  
Yao Shen ◽  
Muhammad Sohail ◽  
Minyi Guo

Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and the social sciences. With the evolution of deep learning, there have been significant advances in this area that even surpass human-level accuracy. Although these methods achieve good accuracy, they still suffer from two constraints, high computational power and memory requirements, which are critical for small hardware-constrained devices. To alleviate this issue, we propose a new Convolutional Neural Network (CNN) architecture, eXnet (Expression Net), based on parallel feature extraction, which surpasses current methods in accuracy and contains far fewer parameters (eXnet: 4.57 million, VGG19: 14.72 million), making it more efficient and lightweight for real-time systems. Several modern data augmentation techniques are applied for the generalization of eXnet; these techniques improve the accuracy of the network by overcoming overfitting while keeping the network size unchanged. We provide an extensive evaluation of our network against key methods on the Facial Expression Recognition 2013 (FER-2013), Extended Cohn-Kanade (CK+), and Real-world Affective Faces Database (RAF-DB) benchmark datasets. We also perform an ablation evaluation to show the importance of the different components of our architecture. To evaluate the efficiency of eXnet on embedded systems, we deploy it on a Raspberry Pi 4B. All these evaluations show the superiority of eXnet for emotion recognition in the wild in terms of accuracy, number of parameters, and size on disk.
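The abstract does not specify eXnet's branch design, so the following is only a hedged illustration of the general idea of parallel feature extraction: several extractors run on the same input and their outputs are concatenated along the channel axis. The two toy "branches" here (identity and a simple smoothing) are placeholders, not eXnet's layers:

```python
import numpy as np

def parallel_features(x, branches):
    """Apply each feature extractor to the same input and concatenate
    the resulting feature maps along the channel axis."""
    return np.concatenate([b(x) for b in branches], axis=-1)

# Toy branches on an (H, W, C) feature map: identity and a shifted average.
x = np.random.rand(8, 8, 3)
smooth = lambda t: (t + np.roll(t, 1, axis=0) + np.roll(t, 1, axis=1)) / 3.0
f = parallel_features(x, [lambda t: t, smooth])  # shape (8, 8, 6)
```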


Electronics ◽  
2020 ◽  
Vol 9 (11) ◽  
pp. 1892
Author(s):  
Simone Porcu ◽  
Alessandro Floris ◽  
Luigi Atzori

Most Facial Expression Recognition (FER) systems rely on machine learning approaches that require large databases for effective training. As these are not easily available, a good solution is to augment the databases with appropriate data augmentation (DA) techniques, which are typically based on either geometric transformations or oversampling augmentations (e.g., generative adversarial networks (GANs)). However, it is not always easy to determine which DA technique is most suitable for FER systems, because most state-of-the-art experiments use different settings, making the impact of DA techniques hard to compare. To advance in this respect, in this paper we evaluate and compare the impact of well-established DA techniques on the emotion recognition accuracy of a FER system based on the well-known VGG16 convolutional neural network (CNN). In particular, we consider both geometric transformations and a GAN to increase the amount of training images. We performed cross-database evaluations: training with the "augmented" KDEF database and testing with two different databases (CK+ and ExpW). The best results were obtained by combining horizontal reflection, translation, and GAN augmentation, bringing an accuracy increase of approximately 30%. This outperforms alternative approaches, except for one technique which, however, relied on a considerably larger database.
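The two best-performing geometric transformations, horizontal reflection and translation, can be sketched in NumPy as follows. This is a minimal illustration, not the authors' implementation; zero-padding the border exposed by the translation is an assumption:

```python
import numpy as np

def horizontal_flip(img):
    """Mirror the image left-right (horizontal reflection)."""
    return img[:, ::-1]

def translate(img, dx, dy):
    """Shift the image by (dx, dy) pixels, zero-padding the exposed border."""
    out = np.zeros_like(img)
    h, w = img.shape[:2]
    src_y = slice(max(0, -dy), min(h, h - dy))
    src_x = slice(max(0, -dx), min(w, w - dx))
    dst_y = slice(max(0, dy), min(h, h + dy))
    dst_x = slice(max(0, dx), min(w, w + dx))
    out[dst_y, dst_x] = img[src_y, src_x]
    return out
```

Applied to each training image (and possibly combined), such transformations multiply the effective size of the training set without collecting new data.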


2021 ◽  
Vol 25 (1) ◽  
pp. 139-154
Author(s):  
Yongxiang Cai ◽  
Jingwen Gao ◽  
Gen Zhang ◽  
Yuangang Liu

The goal of research in Facial Expression Recognition (FER) is to build a robust model with strong recognition capability. In this paper, we propose a new scheme for FER systems based on a convolutional neural network. Part of the regular convolution operation is replaced by depthwise separable convolution to reduce the number of parameters and the computational workload, and a self-adaptive joint loss function is adopted to improve classification performance. In addition, we balance our training set through data augmentation and preprocess the input images through illumination processing, face detection, and other methods, effectively maximizing the expression recognition rate. Experiments to validate our methods are conducted on the TensorFlow platform with the Fer2013 dataset. We analyze the experimental results before and after training-set balancing and network model modification, and we compare our results with those of other researchers. The results show that our method effectively increases the expression recognition rate under the same experimental conditions. We further conduct an experiment on our own expression dataset relevant to driving safety, and it yields similar results.
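The parameter savings from depthwise separable convolution can be checked directly: a standard k×k layer needs k·k·C_in·C_out weights, while the depthwise-plus-pointwise factorization needs only k·k·C_in + C_in·C_out. A small sketch (the 64→128 layer is an illustrative choice, not taken from the paper; biases are omitted):

```python
def conv_params(k, c_in, c_out):
    """Weights in a standard k×k convolution layer (no bias)."""
    return k * k * c_in * c_out

def dw_separable_params(k, c_in, c_out):
    """Depthwise k×k filter per input channel + 1×1 pointwise projection."""
    return k * k * c_in + c_in * c_out

# A 3×3 layer mapping 64 -> 128 channels:
# standard: 73,728 weights; depthwise separable: 8,768 (~8.4× fewer)
```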


2021 ◽  
Vol 11 (19) ◽  
pp. 9174
Author(s):  
Sanoar Hossain ◽  
Saiyed Umer ◽  
Vijayan Asari ◽  
Ranjeet Kumar Rout

This work proposes a facial expression recognition system for a diversified field of applications. The purpose of the proposed system is to predict the type of expression in a human face region. The implementation of the proposed method comprises three components. In the first component, a tree-structured part model is applied to the input image to predict landmark points and detect facial regions. The detected face region is normalized to a fixed size and then down-sampled to varying sizes so that the advantages of multi-resolution images can be exploited. In the second component, several convolutional neural network (CNN) architectures are proposed to analyze the texture patterns in the facial regions. To enhance the proposed CNN models' performance, advanced techniques such as data augmentation, progressive image resizing, transfer learning, and fine-tuning of the parameters are employed in the third component to extract more distinctive and discriminant features for the proposed facial expression recognition system. The outputs of the different CNN models are fused to achieve better performance than the existing state-of-the-art methods, and for this reason extensive experimentation has been carried out on the Karolinska Directed Emotional Faces (KDEF), GENKI-4k, Cohn-Kanade (CK+), and Static Facial Expressions in the Wild (SFEW) benchmark databases. Performance has been compared with existing methods on these databases, showing that the proposed facial expression recognition system outperforms the competing methods.
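The multi-resolution down-sampling step can be illustrated with a simple image pyramid. The 2×2 block averaging and the choice of three levels here are assumptions for illustration, not the paper's exact procedure:

```python
import numpy as np

def downsample2x(img):
    """Halve height and width by 2×2 block averaging."""
    h, w = img.shape[:2]
    blocks = img[:h // 2 * 2, :w // 2 * 2].reshape(h // 2, 2, w // 2, 2, *img.shape[2:])
    return blocks.mean(axis=(1, 3))

def multi_resolution(face, levels=3):
    """Normalized face crop -> pyramid of progressively smaller copies,
    each of which can feed a CNN operating at that resolution."""
    pyramid = [face]
    for _ in range(levels - 1):
        pyramid.append(downsample2x(pyramid[-1]))
    return pyramid
```

Each pyramid level would then be passed to a CNN, with the per-model predictions fused at the end.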

