Spotting DeepFakes and Face Manipulations by Fusing Features from Multi-Stream CNNs Models

Symmetry ◽  
2021 ◽  
Vol 13 (8) ◽  
pp. 1352
Author(s):  
Semih Yavuzkilic ◽  
Abdulkadir Sengur ◽  
Zahid Akhtar ◽  
Kamran Siddique

Deepfake is one application of neural networks that is deemed harmful. Deepfakes are a form of image or video manipulation in which a person's face is altered or swapped with another person's face using artificial neural networks. Deepfake manipulations may be produced with a variety of techniques and applications. The quintessential countermeasure against deepfakes and face manipulation is a deepfake detection method. Most existing detection methods perform well under symmetric data distributions but are still not robust to asymmetric dataset variations and novel deepfake/manipulation types. In this paper, a new multi-stream deep learning algorithm is developed for identifying fake faces in videos, where three streams are merged at the feature level using a fusion layer. After the fusion layer, fully connected, Softmax, and classification layers are used to classify the data. The pre-trained VGG16 model is adopted for the transferred CNN1 stream. In transfer learning, the weights of a pre-trained CNN model are reused for training on the new classification problem. In the second stream (transferred CNN2), the pre-trained VGG19 model is used, whereas in the third stream, the pre-trained ResNet18 model is considered. In this paper, a new large-scale dataset, the World Politicians Deepfake Dataset (WPDD), is introduced to improve deepfake detection systems. The dataset was created by downloading videos of 20 different politicians from YouTube. Over 320,000 frames were retrieved after dividing the downloaded videos into short segments and extracting the frames. Finally, various manipulations were applied to these frames, resulting in seven separate manipulation classes for men and women. In the experiments, three fake face detection scenarios are investigated. First, fake and real face discrimination is studied.
Second, discrimination among seven face manipulations (age, beard, face swap, glasses, hair color, hairstyle, and smiling) and genuine faces is performed. Third, the performance of the deepfake detection system under a novel type of face manipulation is analyzed. The proposed strategy outperforms prior existing methods, with calculated performance metrics over 99%.
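The feature-level fusion the abstract describes can be sketched in plain Python. This is an illustrative simplification, not the authors' implementation: the function names and the toy softmax classifier head are assumptions, and real stream outputs would be high-dimensional CNN feature vectors.

```python
import math

def fuse_features(f_vgg16, f_vgg19, f_resnet18):
    """Feature-level fusion: concatenate the feature vectors produced
    by the three pre-trained streams into one vector that feeds the
    fully connected / Softmax layers."""
    return list(f_vgg16) + list(f_vgg19) + list(f_resnet18)

def softmax(logits):
    """Numerically stable softmax over the classifier logits of the
    fused head, producing class probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]
```

In a real pipeline the concatenation would happen inside the network's fusion layer, so gradients flow back into all three streams during training.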

Author(s):  
Kanushka Gajjar ◽  
Theo van Niekerk ◽  
Thomas Wilm ◽  
Paolo Mercorelli

Potholes on roads pose a major threat to motorists and autonomous vehicles. Driving over a pothole can cause serious damage to a vehicle, which in turn may result in fatal accidents. Many pothole detection methods currently exist; however, they do not use deep learning techniques to detect a pothole in real time, determine its location, and display that location on a map. The success of an effective pothole detection method incorporating such deep learning techniques depends on acquiring a large amount of data, including images of potholes. Once adequate data had been gathered, the images were processed and annotated. The next step was to determine which deep learning algorithms could be used. Three models, Faster R-CNN, SSD, and YOLOv3, were trained on the custom dataset of pothole images to determine which network produces the best results for real-time detection. YOLOv3 produced the most accurate results and performed best in real time, with an average detection time of only 0.836 s per image. The final results showed that a real-time pothole detection system, integrated with a cloud and maps service, can be created to allow drivers to avoid potholes.
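Detectors such as Faster R-CNN, SSD, and YOLOv3 are conventionally evaluated by matching predicted boxes to ground truth with intersection-over-union (IoU). The sketch below shows the standard IoU computation; it is generic evaluation code, not taken from this work.

```python
def iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes given as
    (x1, y1, x2, y2); a standard metric for matching predicted
    pothole boxes to ground-truth annotations."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Overlap rectangle (empty if the boxes do not intersect).
    ix1, iy1 = max(ax1, bx1), max(ay1, by1)
    ix2, iy2 = min(ax2, bx2), min(ay2, by2)
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

A prediction is typically counted as a true positive when its IoU with a ground-truth box exceeds a threshold such as 0.5.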


2020 ◽  
Vol 13 (1) ◽  
pp. 9
Author(s):  
Herminarto Nugroho ◽  
Meredita Susanty ◽  
Ade Irawan ◽  
Muhamad Koyimatu ◽  
Ariana Yunita

This paper proposes a fully convolutional variational autoencoder (VAE) for feature extraction from a large-scale dataset of fire images. The dataset will be used to train a deep learning algorithm to detect fire and smoke. Feature extraction is used to tackle the curse of dimensionality, a common issue in training deep learning models on huge datasets. It aims to reduce the dimensionality of the dataset significantly without losing too much essential information. Variational autoencoders (VAEs) are powerful generative models that can be used for dimensionality reduction. VAEs work better than other methods available for this purpose because they can explore variations in the data in a specific direction.
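Two steps are central to any VAE, including the fully convolutional one described above: the reparameterization trick that keeps latent sampling differentiable, and the KL-divergence term of the loss. A minimal pure-Python sketch, with illustrative function names and toy scalar latents rather than the paper's convolutional architecture:

```python
import math
import random

def reparameterize(mu, log_var, rng=random.Random(0)):
    """VAE reparameterization trick: sample z = mu + sigma * eps with
    eps ~ N(0, 1), so the sampling step stays differentiable with
    respect to the encoder outputs mu and log_var."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]

def kl_divergence(mu, log_var):
    """KL(q(z|x) || N(0, I)) term of the VAE loss; it regularizes the
    latent space so the low-dimensional codes stay well-behaved."""
    return -0.5 * sum(1 + lv - m * m - math.exp(lv)
                      for m, lv in zip(mu, log_var))
```

The latent vector z is the reduced-dimensionality feature representation that a downstream fire/smoke classifier would consume.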


Author(s):  
Pinki and Prof. Sachin Garg

In the present scenario due to Covid-19, there are no efficient face mask detection applications, which are now in high demand for transportation hubs, densely populated areas, residential districts, large-scale manufacturers, and other enterprises to ensure safety. This system can therefore be used in real-time applications that require face mask detection for safety purposes during the Covid-19 outbreak. The project can be integrated with embedded systems for deployment in airports, railway stations, offices, schools, and public places to ensure that public safety guidelines are followed. The goal is to identify whether a person in an image or video stream is wearing a face mask. If a person is not wearing a mask, a notification is sent to the designated administrator, implemented in Python with a deep learning algorithm using a Convolutional Neural Network, the Keras framework, and OpenCV.
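The detect-then-notify step of such a system can be sketched independently of the CNN itself. The detection format and the `notify` callback below are assumptions for illustration; a real deployment would feed per-face classifier verdicts from the Keras/OpenCV pipeline into a hook like this.

```python
def mask_alert(detections, notify):
    """For each detected face, check the classifier's verdict and call
    the notification hook when the face has no mask. `detections` is a
    list of (face_id, wearing_mask) pairs; both the format and the
    callback are illustrative, not the project's actual API."""
    alerts = 0
    for face_id, wearing_mask in detections:
        if not wearing_mask:
            notify(face_id)  # e.g. message the administrator
            alerts += 1
    return alerts
```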


2018 ◽  
Vol 7 (2.4) ◽  
pp. 10
Author(s):  
V Mala ◽  
K Meena

Traditional signature-based approaches fail to detect advanced malware such as Stuxnet, Flame, and Duqu; signature-based comparison and correlation are not up to the mark for such attacks. Hence, it is crucial to detect these kinds of attacks as early as possible. In this research, a novel data-mining-based approach is applied to detect such attacks. The main innovation lies in a misuse (signature) detection system based on a supervised learning algorithm. In the learning phase, labeled examples of network packets and system calls are provided, from which the algorithm can learn to recognize known attacks quickly and reliably. To detect advanced attacks, unsupervised learning methodologies are employed to detect the presence of zero-day/new attacks. The main objectives are to review different intruder detection methods and to study the role of data mining techniques in intrusion detection systems. A hybrid classification model is utilized to detect advanced attacks.
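The hybrid model combines a supervised misuse stage with an unsupervised anomaly stage. A minimal sketch of that two-stage decision, with made-up signature names, a toy centroid-distance anomaly score, and an arbitrary threshold standing in for whatever detectors the paper actually trains:

```python
# Illustrative signature set; real systems match byte patterns or rules.
KNOWN_SIGNATURES = {"stuxnet_beacon", "flame_c2", "duqu_dropper"}

def misuse_stage(event):
    """Supervised/misuse stage: flag events matching known signatures."""
    return event in KNOWN_SIGNATURES

def anomaly_stage(feature_vector, centroid, threshold=3.0):
    """Unsupervised stage: flag events far from the learned profile of
    normal traffic (here, Euclidean distance to a centroid), which is
    how zero-day attacks with no signature can be caught."""
    dist = sum((a - b) ** 2 for a, b in zip(feature_vector, centroid)) ** 0.5
    return dist > threshold

def hybrid_classify(event, feature_vector, centroid):
    """Hybrid model: known attack -> 'misuse', unusual behaviour ->
    'anomaly', otherwise 'normal'."""
    if misuse_stage(event):
        return "misuse"
    if anomaly_stage(feature_vector, centroid):
        return "anomaly"
    return "normal"
```

The ordering matters: cheap signature matching runs first, and only unmatched traffic pays for the anomaly computation.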


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Bangtong Huang ◽  
Hongquan Zhang ◽  
Zihong Chen ◽  
Lingling Li ◽  
Lihua Shi

Deep learning algorithms face limitations in virtual reality applications due to the costs of memory, computation, and real-time processing. Models with strong performance may suffer from enormous parameter counts and large-scale structures, making them hard to deploy on embedded devices. In this paper, inspired by GhostNet, we propose an efficient structure, ShuffleGhost, that exploits the redundancy in feature maps to reduce the cost of computation while also addressing some drawbacks of GhostNet. GhostNet suffers from the high computational cost of convolution in the Ghost module and shortcut, and its restriction on downsampling makes it difficult to apply the Ghost module and Ghost bottleneck to other backbones. This paper therefore proposes three new kinds of ShuffleGhost structure to tackle these drawbacks. The ShuffleGhost module and ShuffleGhost bottlenecks use the shuffle layer and group convolution from ShuffleNet, and are designed to redistribute the feature maps concatenated from the Ghost feature maps and primary feature maps, eliminating the gap between them and extracting the features. Then, an SENet layer is adopted to reduce the computational cost of group convolution, as well as to evaluate the importance of the concatenated feature maps and assign them proper weights. Experiments show that ShuffleGhostV3 has fewer trainable parameters and FLOPs while maintaining accuracy, and with proper design it can be more efficient on both GPU and CPU.
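The computational saving that Ghost-style modules exploit can be made concrete with a multiply-count comparison. The formulas below follow the published GhostNet design (primary convolution plus cheap depthwise operations); the default ratio s=2 and kernel d=3 are the GhostNet paper's typical choices, not values taken from this abstract.

```python
def conv_flops(c_in, c_out, k, h, w):
    """Multiply count of a standard k x k convolution producing a
    c_out x h x w output from a c_in-channel input."""
    return c_in * c_out * k * k * h * w

def ghost_flops(c_in, c_out, k, h, w, s=2, d=3):
    """Ghost-style module: a primary conv produces c_out/s intrinsic
    maps, then cheap d x d depthwise ops generate the remaining
    'ghost' maps, cutting computation roughly by the ratio s."""
    intrinsic = c_out // s
    primary = c_in * intrinsic * k * k * h * w       # ordinary conv
    cheap = (s - 1) * intrinsic * d * d * h * w      # depthwise ops
    return primary + cheap
```

For a 64-to-64-channel 3x3 layer on a 32x32 map, the ghost variant needs roughly half the multiplies, which is the redundancy ShuffleGhost then redistributes with shuffle layers and group convolution.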


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Liding Yao ◽  
Xiaojun Guan ◽  
Xiaowei Song ◽  
Yanbin Tan ◽  
Chun Wang ◽  
...  

Rib fracture detection is time-consuming and demanding work for radiologists. This study aimed to introduce a novel rib fracture detection system based on deep learning which can help radiologists diagnose rib fractures in chest computed tomography (CT) images conveniently and accurately. A total of 1707 patients from a single center were included in this study. We developed a novel rib fracture detection system on chest CT using a three-step algorithm. According to examination time, 1507, 100, and 100 patients were allocated to the training, validation, and testing sets, respectively. Free-response ROC analysis was performed to evaluate the sensitivity and false-positive rate of the deep learning algorithm. Precision, recall, F1-score, and negative predictive value (NPV) were selected as evaluation metrics to compare the detection and diagnosis efficiency of this system with that of radiologists. The radiologist-only study was used as a benchmark, and a radiologist-model collaboration study was evaluated to assess the model's clinical applicability. A total of 50,170,399 blocks (fracture blocks, 91,574; normal blocks, 50,078,825) were labelled for training. The F1-score of the Rib Fracture Detection System was 0.890, and its precision, recall, and NPV were 0.869, 0.913, and 0.969, respectively. By interacting with the detection system, the F1-scores of the junior and experienced radiologists improved from 0.796 to 0.925 and from 0.889 to 0.970, respectively; their recall scores increased from 0.693 to 0.920 and from 0.853 to 0.972, respectively. On average, the diagnosis time of a radiologist assisted by the detection system was reduced by 65.3 s.
The constructed Rib Fracture Detection System performs comparably with an experienced radiologist and can automatically detect rib fractures in the clinical setting with high efficacy, reducing diagnosis time and radiologists' workload in clinical practice.
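The metrics reported above are standard functions of the binary confusion matrix. A short sketch (generic evaluation code, not the study's own software) shows how precision, recall, F1-score, and NPV relate:

```python
def detection_metrics(tp, fp, fn, tn):
    """Precision, recall, F1-score, and negative predictive value
    (NPV) from the entries of a binary confusion matrix:
    tp/fp = true/false positives, fn/tn = false/true negatives."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)          # also called sensitivity
    f1 = 2 * precision * recall / (precision + recall)
    npv = tn / (tn + fn)
    return {"precision": precision, "recall": recall,
            "f1": f1, "npv": npv}
```

Note the heavy class imbalance in the training blocks (91,574 fracture vs. 50,078,825 normal) is exactly why NPV and recall are reported alongside precision.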


Author(s):  
Tong Wei ◽  
Yu-Feng Li

Large-scale multi-label learning annotates relevant labels for unseen data from a huge number of candidate labels. It is well known that in large-scale multi-label learning, labels exhibit a long-tailed distribution in which a significant fraction of labels are tail labels. Nonetheless, the impact of tail labels on performance metrics in large-scale multi-label learning had not been explicitly quantified. In this paper, we show that whether labels are randomly missing or misclassified, tail labels have much less impact than common labels on commonly used performance metrics (Top-$k$ precision and nDCG@$k$). With this observation, we develop a low-complexity large-scale multi-label learning algorithm that facilitates fast prediction and compact models by trimming tail labels adaptively. Experiments verify that both prediction time and model size are significantly reduced without sacrificing much predictive performance relative to state-of-the-art approaches.


Sensors ◽  
2020 ◽  
Vol 20 (12) ◽  
pp. 3424
Author(s):  
Xujia Liang ◽  
Zhonghua Huang ◽  
Liping Lu ◽  
Zhigang Tao ◽  
Bing Yang ◽  
...  

With the rapid development of autonomous vehicles and mobile robotics, the desire to advance robust light detection and ranging (Lidar) methods for real-world applications is increasing. However, this task still suffers in degraded visual environments (DVE), including smoke, dust, fog, and rain, as the aerosols lead to false alarms and sensor dysfunction. Therefore, a novel Lidar target echo signal recognition method based on multi-distance measurement and a deep learning algorithm is presented in this paper; neither backscatter suppression nor denoising functions are required. The 2-D spectrogram images are constructed using the frequency-distance relation derived from the 1-D echo signals of an individual Lidar sensor cell as the target approaches. The characteristics of the target echo signal and noise in the spectrogram images are analyzed, and the target recognition criterion is established accordingly. A customized deep learning algorithm is then developed to perform the recognition. Simulation and experimental results demonstrate that the proposed method can significantly improve Lidar detection performance in DVE.
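Turning a 1-D echo signal into a 2-D spectrogram image is the key preprocessing step described above. The toy sketch below uses a plain windowed DFT to build such an image; the window and hop sizes are arbitrary illustrative choices, and the paper's frequency-distance construction is more specialized.

```python
import math

def spectrogram(signal, win=8, hop=4):
    """Toy 2-D spectrogram: slide a window over the 1-D echo signal
    and take DFT magnitudes, giving a time(-distance) x frequency
    image that a CNN can classify. Returns a list of rows, one per
    window position, each holding win//2 + 1 magnitudes."""
    rows = []
    for start in range(0, len(signal) - win + 1, hop):
        frame = signal[start:start + win]
        row = []
        for k in range(win // 2 + 1):  # non-negative frequency bins
            re = sum(frame[n] * math.cos(2 * math.pi * k * n / win)
                     for n in range(win))
            im = -sum(frame[n] * math.sin(2 * math.pi * k * n / win)
                      for n in range(win))
            row.append(math.hypot(re, im))
        rows.append(row)
    return rows
```

A pure tone concentrates its energy in one frequency bin of every row, which is the kind of structure that separates a genuine target echo from broadband aerosol noise.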


2020 ◽  
Vol 498 (4) ◽  
pp. 5620-5628
Author(s):  
Y Su ◽  
Y Zhang ◽  
G Liang ◽  
J A ZuHone ◽  
D J Barnes ◽  
...  

The origin of the diverse population of galaxy clusters remains an unexplained aspect of large-scale structure formation and cluster evolution. We present a novel method of using X-ray images to identify cool core (CC), weak cool core (WCC), and non-cool core (NCC) clusters of galaxies that are defined by their central cooling times. We employ a convolutional neural network, ResNet-18, which is commonly used for image analysis, to classify clusters. We produce mock Chandra X-ray observations for a sample of 318 massive clusters drawn from the IllustrisTNG simulations. The network is trained and tested with low-resolution mock Chandra images covering a central 1 Mpc square for the clusters in our sample. Without any spectral information, the deep learning algorithm is able to identify CC, WCC, and NCC clusters, achieving balanced accuracies (BAcc) of 92 per cent, 81 per cent, and 83 per cent, respectively. The performance is superior to classification by conventional methods using central gas densities, with an average ${\rm BAcc}=81{{\ \rm per\ cent}}$, or surface brightness concentrations, giving ${\rm BAcc}=73{{\ \rm per\ cent}}$. We use class activation mapping to localize discriminative regions for the classification decision. From this analysis, we observe that the network has utilized regions from cluster centres out to r ≈ 300 kpc and r ≈ 500 kpc to identify CC and NCC clusters, respectively. It may have recognized features in the intracluster medium that are associated with AGN feedback and disruptive major mergers.
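Balanced accuracy (BAcc), the metric quoted per class above, is simply the mean per-class recall, which keeps a rare class (e.g. few NCC clusters) from being swamped by the majority. A generic sketch, not the study's own code:

```python
def balanced_accuracy(confusion):
    """Balanced accuracy: the mean of per-class recalls, which
    corrects for unequal class sizes (e.g. unequal numbers of CC,
    WCC, and NCC clusters). `confusion[i][j]` counts samples of true
    class i predicted as class j."""
    recalls = []
    for i, row in enumerate(confusion):
        total = sum(row)
        recalls.append(row[i] / total if total else 0.0)
    return sum(recalls) / len(recalls)
```

Unlike plain accuracy, BAcc stays at 50 per cent for a binary classifier that always predicts the majority class, regardless of how lopsided the classes are.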


Author(s):  
Jing-Wei Liu ◽  
Fang-Ling Zuo ◽  
Ying-Xiao Guo ◽  
Tian-Yue Li ◽  
Jia-Ming Chen

Convolutional neural networks (CNN) are recognized as a state-of-the-art deep learning algorithm with a strong ability for image classification and recognition. The problem addressed is that the precision, accuracy, and efficiency of CNNs need improvement to satisfy high-performance requirements. The main work is as follows. Firstly, a wavelet convolutional neural network (wCNN) is proposed, where a wavelet transform function is added to the convolutional layers of the CNN. Secondly, a wavelet convolutional wavelet neural network (wCwNN) is proposed, where the fully connected neural network (FCNN) of the wCNN and CNN is replaced by a wavelet neural network (wNN). Thirdly, image classification experiments using the CNN, wCNN, and wCwNN algorithms, together with a comparative analysis, are implemented on the MNIST dataset. The effects of the improved methods are as follows: (1) both precision and accuracy are improved; (2) the mean square error and the error rate are reduced; (3) the complexity of the improved algorithms is increased.
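The wavelet transform interleaved with convolution in wCNN can be illustrated with the simplest wavelet, the 1-level 1-D Haar transform. This is a generic sketch under the assumption of a Haar basis; the abstract does not specify which wavelet the authors used.

```python
import math

def haar_step(x):
    """One level of the 1-D Haar wavelet transform: split the signal
    into approximation (low-pass) and detail (high-pass) coefficients,
    halving its length. In a wCNN-style layer such a transform is
    applied to feature maps alongside the convolution."""
    s = math.sqrt(2.0)
    approx = [(x[2 * i] + x[2 * i + 1]) / s for i in range(len(x) // 2)]
    detail = [(x[2 * i] - x[2 * i + 1]) / s for i in range(len(x) // 2)]
    return approx, detail

def haar_inverse(approx, detail):
    """Inverse step: reconstructs the original signal exactly, since
    the Haar transform is orthogonal."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) / s, (a - d) / s])
    return out
```

The approximation channel carries the smoothed content while the detail channel carries edges, which is the multi-resolution information a wavelet layer adds on top of learned convolution filters.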

