scholarly journals Robust face recognition based on multi-task convolutional neural network

2021 ◽  
Vol 18 (5) ◽  
pp. 6638-6651
Author(s):  
Huilin Ge ◽  
◽  
Yuewei Dai ◽  
Zhiyu Zhu ◽  
Biao Wang

<abstract> <sec><title>Purpose</title><p>Due to the lack of prior knowledge of face images, large illumination changes, and complex backgrounds, the accuracy of face recognition is low. To address this issue, we propose a face detection and recognition algorithm based on multi-task convolutional neural network (MTCNN).</p> </sec> <sec><title>Methods</title><p>In our paper, MTCNN mainly uses three cascaded networks, and adopts the idea of candidate box plus classifier to perform fast and efficient face recognition. The model is trained on a database of 50 faces we have collected, and Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index Measurement (SSIM), and receiver operating characteristic (ROC) curve are used to analyse MTCNN, Region-CNN (R-CNN) and Faster R-CNN.</p> </sec> <sec><title>Results</title><p>The average PSNR of this technique is 1.24 dB higher than that of R-CNN and 0.94 dB higher than that of Faster R-CNN. The average SSIM value of MTCNN is 10.3% higher than R-CNN and 8.7% higher than Faster R-CNN. The Area Under Curve (AUC) of MTCNN is 97.56%, the AUC of R-CNN is 91.24%, and the AUC of Faster R-CNN is 92.01%. MTCNN has the best comprehensive performance in face recognition. For the face images with defective features, MTCNN still has the best effect.</p> </sec> <sec><title>Conclusions</title><p>This algorithm can effectively improve face recognition to a certain extent. The accuracy rate and the reduction of the false detection rate of face detection can not only be better used in key places, ensure the safety of property and security of the people, improve safety, but also better reduce the waste of human resources and improve efficiency.</p> </sec> </abstract>

2020 ◽  
Vol 21 (2) ◽  
pp. 217-232
Author(s):  
Reshma V K ◽  
Vinod Kumar R S ◽  
Shahi D ◽  
Shyjith M B

Image steganography is considered as one of the promising and popular techniques utilized to maintain the confidentiality of the secret message that is embedded in an image. Even though there are various techniques available in the previous works, an approach providing better results is still the challenge. Therefore, an effective pixel prediction based on image stegonography is developed, which employs error dependent Deep Convolutional Neural Network (DCNN) classifier for pixel identification. Here, the best pixels are identified from the medical image based on DCNN classifier using pixel features, like texture, wavelet energy, Gabor, scattering features, and so on. The DCNN is optimally trained using Chicken-Moth search optimization (CMSO). The CMSO is designed by integrating Chicken Swarm Optimization (CSO) and Moth Search Optimization (MSO) algorithm based on limited error. Subsequently, the Tetrolet transform is fed to the predicted pixel for the embedding process. At last, the inverse tetrolet transform is used for extracting the secret message from an embedded image. The experimentation is carried out using BRATS dataset, and the performance of image stegonography based on CMSO-DCNN+tetrolet is evaluated based on correlation coefficient, Structural Similarity Index, and Peak Signal to Noise Ratio, which attained 0.85, 46.981dB, and 0.6388, for the image with noise.  


Author(s):  
LIANG-HUA CHEN ◽  
SHAO-HUA DENG ◽  
HONG-YUAN LIAO

This paper proposes a complete procedure for the extraction and recognition of human faces in complex scenes. The morphology-based face detection algorithm can locate multiple faces oriented in any direction. The recognition algorithm is based on the minimum classification error (MCE) criterion. In our work, the minimum classification error formulation is incorporated into a multilayer perceptron neural network. Experimental results show that our system is robust to noisy images and complex background.


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Shahab U. Ansari ◽  
Kamran Javed ◽  
Saeed Mian Qaisar ◽  
Rashad Jillani ◽  
Usman Haider

Multiple sclerosis (MS) is a chronic and autoimmune disease that forms lesions in the central nervous system. Quantitative analysis of these lesions has proved to be very useful in clinical trials for therapies and assessing disease prognosis. However, the efficacy of these quantitative analyses greatly depends on how accurately the MS lesions have been identified and segmented in brain MRI. This is usually carried out by radiologists who label 3D MR images slice by slice using commonly available segmentation tools. However, such manual practices are time consuming and error prone. To circumvent this problem, several automatic segmentation techniques have been investigated in recent years. In this paper, we propose a new framework for automatic brain lesion segmentation that employs a novel convolutional neural network (CNN) architecture. In order to segment lesions of different sizes, we have to pick a specific filter or size 3 × 3 or 5 × 5. Sometimes, it is hard to decide which filter will work better to get the best results. Google Net has solved this problem by introducing an inception module. An inception module uses 3 × 3 , 5 × 5 , 1 × 1 and max pooling filters in parallel fashion. Results show that incorporating inception modules in a CNN has improved the performance of the network in the segmentation of MS lesions. We compared the results of the proposed CNN architecture for two loss functions: binary cross entropy (BCE) and structural similarity index measure (SSIM) using the publicly available ISBI-2015 challenge dataset. A score of 93.81 which is higher than the human rater with BCE loss function is achieved.


Author(s):  
Asma Abdulelah Abdulrahman ◽  
Fouad Shaker Tahir

<p>In this work, it was proposed to compress the color image after de-noise by proposing a coding for the discrete transport of new wavelets called discrete chebysheve wavelet transduction (DCHWT) and linking it to a neural network that relies on the convolutional neural network to compress the color image. The aim of this work is to find an effective method for face recognition, which is to raise the noise and compress the image in convolutional neural networks to remove the noise that caused the image while it was being transmitted in the communication network. The work results of the algorithm were calculated by calculating the peak signal to noise ratio (PSNR), mean square error (MSE), compression ratio (CR) and bit-per-pixel (BPP) of the compressed image after a color image (256×256) was entered to demonstrate the quality and efficiency of the proposed algorithm in this work. The result obtained by using a convolutional neural network with new wavelets is to provide a better CR with the ratio of PSNR to be a high value that increases the high-quality ratio of the compressed image to be ready for face recognition.</p>


Author(s):  
Zhengqiu Lu ◽  
Chunliang Zhou ◽  
Xuyang Xuyang ◽  
Weipeng Zhang

with rapid development of deep learning technology, face recognition based on deep convolutional neural network becomes one of the main research methods. In order to solve the problems of information loss and equal treatment of each element in the input feature graph in the traditional pooling method of convolutional neural network, a face recognition algorithm based on convolutional neural network is proposed in this paper. First, MTCNN algorithm is used to detect the faces and do gray processing, and then a local weighted average pooling method based on local concern strategy is designed and a convolutional neural network based on VGG16 to recognize faces is constructed which is finally compared with common convolutional neural network. The experimental results show that this method has good face recognition accuracy in common face databases.


Sign in / Sign up

Export Citation Format

Share Document