Random Untargeted Adversarial Example on Deep Neural Network

Hyun Kwon; Yongchul Kim; Hyunsoo Yoon; Daeseon Choi

doi:10.3390/sym10120738

GAN-based classifier protection against adversarial attacks

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-200280 ◽

2020 ◽

Vol 39 (5) ◽

pp. 7085-7095

Author(s):

Shuqi Liu ◽

Mingwen Shao ◽

Xinping Liu

Keyword(s):

Neural Network ◽

Deep Neural Networks ◽

Significant Progress ◽

Security Issue ◽

Generative Adversarial Network ◽

Adversarial Network ◽

The Neural Network ◽

Benchmark Datasets ◽

Adversarial Examples ◽

Adversarial Example

In recent years, deep neural networks have made significant progress in image classification, object detection and face recognition. However, they still have the problem of misclassification when facing adversarial examples. In order to address security issue and improve the robustness of the neural network, we propose a novel defense network based on generative adversarial network (GAN). The distribution of clean - and adversarial examples are matched to solve the mentioned problem. This guides the network to remove invisible noise accurately, and restore the adversarial example to a clean example to achieve the effect of defense. In addition, in order to maintain the classification accuracy of clean examples and improve the fidelity of neural network, we input clean examples into proposed network for denoising. Our method can effectively remove the noise of the adversarial examples, so that the denoised adversarial examples can be correctly classified. In this paper, extensive experiments are conducted on five benchmark datasets, namely MNIST, Fashion-MNIST, CIFAR10, CIFAR100 and ImageNet. Moreover, six mainstream attack methods are adopted to test the robustness of our defense method including FGSM, PGD, MIM, JSMA, CW and Deep-Fool. Results show that our method has strong defensive capabilities against the tested attack methods, which confirms the effectiveness of the proposed method.

Download Full-text

Classification score approach for detecting adversarial example in deep neural network

Multimedia Tools and Applications ◽

10.1007/s11042-020-09167-z ◽

2020 ◽

Author(s):

Hyun Kwon ◽

Yongchul Kim ◽

Hyunsoo Yoon ◽

Daeseon Choi

Keyword(s):

Machine Learning ◽

Autonomous Vehicles ◽

Superior Performance ◽

Success Rates ◽

Learning Tasks ◽

Classification Score ◽

Adversarial Examples ◽

Additional Process ◽

Adversarial Example ◽

The Right

AbstractDeep neural networks (DNNs) provide superior performance on machine learning tasks such as image recognition, speech recognition, pattern analysis, and intrusion detection. However, an adversarial example, created by adding a little noise to an original sample, can cause misclassification by a DNN. This is a serious threat to the DNN because the added noise is not detected by the human eye. For example, if an attacker modifies a right-turn sign so that it misleads to the left, autonomous vehicles with the DNN will incorrectly classify the modified sign as pointing to the left, but a person will correctly classify the modified sign as pointing to the right. Studies are under way to defend against such adversarial examples. The existing method of defense against adversarial examples requires an additional process such as changing the classifier or modifying input data. In this paper, we propose a new method for detecting adversarial examples that does not invoke any additional process. The proposed scheme can detect adversarial examples by using a pattern feature of the classification scores of adversarial examples. We used MNIST and CIFAR10 as experimental datasets and Tensorflow as a machine learning library. The experimental results show that the proposed method can detect adversarial examples with success rates: 99.05% and 99.9% for the untargeted and targeted cases in MNIST, respectively, and 94.7% and 95.8% for the untargeted and targeted cases in CIFAR10, respectively.

Download Full-text

A Non-Global Disturbance Targeted Adversarial Example Algorithm Combined with C&W and Grad-Cam

10.21203/rs.3.rs-865960/v1 ◽

2021 ◽

Author(s):

Yinghui Zhu ◽

Yuzhen Jiang

Keyword(s):

Learning Systems ◽

Fine Tuning ◽

Generation Process ◽

Original Image ◽

Signal Features ◽

Adversarial Examples ◽

Salient Regions ◽

Adversarial Attack ◽

Adversarial Example ◽

Generation Control

Abstract Adversarial examples are artificially crafted to mislead deep learning systems into making wrong decisions. In the research of attack algorithms against multi-class image classifiers, an improved strategy of applying category explanation to the generation control of targeted adversarial example is proposed to reduce the perturbation noise and improve the adversarial robustness. On the basis of C&W adversarial attack algorithm, the method uses Grad-Cam, a category visualization explanation algorithm of CNN, to dynamically obtain the salient regions according to the signal features of source and target categories during the iterative generation process. The adversarial example of non-global perturbation is finally achieved by gradually shielding the non salient regions and fine-tuning the perturbation signals. Compared with other similar algorithms under the same conditions, the method enhances the effects of the original image category signal on the perturbation position. Experimental results show that, the improved adversarial examples have higher PSNR. In addition, in a variety of different defense processing tests, the examples can keep high adversarial performance and show strong attacking robustness.

Download Full-text

Robust Audio Adversarial Example for a Physical Attack

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/741 ◽

2019 ◽

Cited By ~ 7

Author(s):

Hiromu Yakura ◽

Jun Sakuma

Keyword(s):

Speech Recognition ◽

Process Evaluation ◽

State Of The Art ◽

Physical World ◽

Generation Process ◽

Recognition Model ◽

Physical Attack ◽

Adversarial Examples ◽

Adversarial Example ◽

Listening Experiment

We propose a method to generate audio adversarial examples that can attack a state-of-the-art speech recognition model in the physical world. Previous work assumes that generated adversarial examples are directly fed to the recognition model, and is not able to perform such a physical attack because of reverberation and noise from playback environments. In contrast, our method obtains robust adversarial examples by simulating transformations caused by playback or recording in the physical world and incorporating the transformations into the generation process. Evaluation and a listening experiment demonstrated that our adversarial examples are able to attack without being noticed by humans. This result suggests that audio adversarial examples generated by the proposed method may become a real threat.

Download Full-text

Universal Adversarial Attack via Conditional Sampling for Text Classification

Applied Sciences ◽

10.3390/app11209539 ◽

2021 ◽

Vol 11 (20) ◽

pp. 9539

Author(s):

Yu Zhang ◽

Kun Shao ◽

Junan Yang ◽

Hui Liu

Keyword(s):

Language Processing ◽

Text Classification ◽

Deep Neural Networks ◽

High Quality ◽

The Face ◽

Adversarial Examples ◽

Specific Prediction ◽

Novel Method ◽

Adversarial Attack ◽

Classification Tasks

Despite deep neural networks (DNNs) having achieved impressive performance in various domains, it has been revealed that DNNs are vulnerable in the face of adversarial examples, which are maliciously crafted by adding human-imperceptible perturbations to an original sample to cause the wrong output by the DNNs. Encouraged by numerous researches on adversarial examples for computer vision, there has been growing interest in designing adversarial attacks for Natural Language Processing (NLP) tasks. However, the adversarial attacking for NLP is challenging because text is discrete data and a small perturbation can bring a notable shift to the original input. In this paper, we propose a novel method, based on conditional BERT sampling with multiple standards, for generating universal adversarial perturbations: input-agnostic of words that can be concatenated to any input in order to produce a specific prediction. Our universal adversarial attack can create an appearance closer to natural phrases and yet fool sentiment classifiers when added to benign inputs. Based on automatic detection metrics and human evaluations, the adversarial attack we developed dramatically reduces the accuracy of the model on classification tasks, and the trigger is less easily distinguished from natural text. Experimental results demonstrate that our method crafts more high-quality adversarial examples as compared to baseline methods. Further experiments show that our method has high transferability. Our goal is to prove that adversarial attacks are more difficult to detect than previously thought and enable appropriate defenses.

Download Full-text

Opportunistic Use of Crowdsourced Workers for Online Relabeling of Potential Adversarial Examples

10.36227/techrxiv.17088941.v1 ◽

2021 ◽

Author(s):

Shawqi Al-Maliki ◽

Faissal El Bouanani ◽

Kashif Ahmad ◽

Mohamed Abdallah ◽

Dinh Hoang ◽

...

Keyword(s):

Language Processing ◽

Deep Neural Networks ◽

Black Box ◽

Threshold Selection ◽

Selection Algorithm ◽

Comparable Performance ◽

Adversarial Examples ◽

System Robustness ◽

Adversarial Example ◽

The Impact

<div>Deep Neural Networks (DDNs) have achieved tremendous success in handling various Machine Learning (ML) tasks, such as speech recognition, Natural Language Processing, and image classification. However, they have shown vulnerability to well-designed inputs called adversarial examples. Researchers in industry and academia have proposed many adversarial example defense techniques. However, none can provide complete robustness. The cutting-edge defense techniques offer partial reliability. Thus, complementing them with another layer of protection is a must, especially for mission-critical applications. This paper proposes a novel Online Selection and Relabeling Algorithm (OSRA) that opportunistically utilizes a limited number of crowdsourced workers (budget-constraint crowdsourcing) to maximize the ML system’s robustness. OSRA strives to use crowdsourced workers effectively by selecting the most suspicious inputs (the potential adversarial examples) and moving them to the crowdsourced workers to be validated and corrected (relabeled). As a result, the impact of adversarial examples gets reduced, and accordingly, the ML system becomes more robust. We also proposed a heuristic threshold selection method that contributes to enhancing the prediction system’s reliability. We empirically validated our proposed algorithm and found that it can efficiently and optimally utilize the allocated budget for crowdsourcing. It is also effectively integrated with a state-ofthe- art black-box (transfer-based) defense technique, resulting in a more robust system. Simulation results show that OSRA can outperform a random selection algorithm by 60% and achieve comparable performance to an optimal offline selection benchmark. They also show that OSRA’s performance has a positive correlation with system robustness.<br></div>

Download Full-text

Hardening Deep Neural Networks in Condition Monitoring Systems against Adversarial Example Attacks

Machine Learning for Cyber Physical Systems - Technologien für die intelligente Automation ◽

10.1007/978-3-662-62746-4_11 ◽

2020 ◽

pp. 103-111

Author(s):

Felix Specht ◽

Jens Otto

Keyword(s):

Neural Network ◽

Neural Networks ◽

Condition Monitoring ◽

Deep Neural Network ◽

Deep Neural Networks ◽

Production Systems ◽

Monitoring Systems ◽

Condition Monitoring Systems ◽

Adversarial Examples ◽

Adversarial Example

AbstractCondition monitoring systems based on deep neural networks are used for system failure detection in cyber-physical production systems. However, deep neural networks are vulnerable to attacks with adversarial examples. Adversarial examples are manipulated inputs, e.g. sensor signals, are able to mislead a deep neural network into misclassification. A consequence of such an attack may be the manipulation of the physical production process of a cyber-physical production system without being recognized by the condition monitoring system. This can result in a serious threat for production systems and employees. This work introduces an approach named CyberProtect to prevent misclassification caused by adversarial example attacks. The approach generates adversarial examples for retraining a deep neural network which results in a hardened variant of the deep neural network. The hardened deep neural network sustains a significant better classification rate (82% compared to 20%) while under attack with adversarial examples, as shown by empirical results.

Download Full-text

Opportunistic Use of Crowdsourced Workers for Online Relabeling of Potential Adversarial Examples

10.36227/techrxiv.17088941 ◽

2021 ◽

Author(s):

Shawqi Al-Maliki ◽

Faissal El Bouanani ◽

Kashif Ahmad ◽

Mohamed Abdallah ◽

Dinh Hoang ◽

...

Keyword(s):

Language Processing ◽

Deep Neural Networks ◽

Black Box ◽

Threshold Selection ◽

Selection Algorithm ◽

Comparable Performance ◽

Adversarial Examples ◽

System Robustness ◽

Adversarial Example ◽

The Impact

<div>Deep Neural Networks (DDNs) have achieved tremendous success in handling various Machine Learning (ML) tasks, such as speech recognition, Natural Language Processing, and image classification. However, they have shown vulnerability to well-designed inputs called adversarial examples. Researchers in industry and academia have proposed many adversarial example defense techniques. However, none can provide complete robustness. The cutting-edge defense techniques offer partial reliability. Thus, complementing them with another layer of protection is a must, especially for mission-critical applications. This paper proposes a novel Online Selection and Relabeling Algorithm (OSRA) that opportunistically utilizes a limited number of crowdsourced workers (budget-constraint crowdsourcing) to maximize the ML system’s robustness. OSRA strives to use crowdsourced workers effectively by selecting the most suspicious inputs (the potential adversarial examples) and moving them to the crowdsourced workers to be validated and corrected (relabeled). As a result, the impact of adversarial examples gets reduced, and accordingly, the ML system becomes more robust. We also proposed a heuristic threshold selection method that contributes to enhancing the prediction system’s reliability. We empirically validated our proposed algorithm and found that it can efficiently and optimally utilize the allocated budget for crowdsourcing. It is also effectively integrated with a state-ofthe- art black-box (transfer-based) defense technique, resulting in a more robust system. Simulation results show that OSRA can outperform a random selection algorithm by 60% and achieve comparable performance to an optimal offline selection benchmark. They also show that OSRA’s performance has a positive correlation with system robustness.<br></div>

Download Full-text

Adv-Plate Attack: Adversarially Perturbed Plate for License Plate Recognition System

Journal of Sensors ◽

10.1155/2021/6473833 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Hyun Kwon ◽

Jang-Woon Baek

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Deep Learning ◽

Deep Neural Networks ◽

Recognition System ◽

License Plate ◽

Learning Technology ◽

License Plate Recognition ◽

Adversarial Examples ◽

Adversarial Example

Deep learning technology has been used to develop improved license plate recognition (LPR) systems. In particular, deep neural networks have brought significant improvements in the LPR system. However, deep neural networks are vulnerable to adversarial examples. In the existing LPR system, adversarial examples study specific spots that are easily identifiable by humans or require human feedback. In this paper, we propose a method of generating adversarial examples in the license plate, which has no human feedback and is difficult to identify by humans. In the proposed method, adversarial noise is added only to the license plate among the entire image to create an adversarial example that is erroneously recognized by the LPR system without being identified by humans. Experiments were performed using the baza silka dataset, and TensorFlow was used as the machine learning library. When epsilon is 0.6 for the first type, and alpha and the iteration of the second type are 0.4 and 1000, respectively, the adversarial examples generated by the first and second type generation methods are reduced to 20% and 15% accuracy in the LPR system.

Download Full-text

Natural Scene Statistics for Detecting Adversarial Examples in Deep Neural Networks

2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP) ◽

10.1109/mmsp48831.2020.9287056 ◽

2020 ◽

Author(s):

Anouar Kherchouche ◽

Sid Ahmed Fezza ◽

Wassim Hamidouche ◽

Olivier Deforges

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Natural Scene ◽

Natural Scene Statistics ◽

Adversarial Examples

Download Full-text