Extended Spatially Localized Perturbation GAN (eSLP-GAN) for Robust Adversarial Camouflage Patches

Yongsu Kim; Hyoeun Kang; Naufal Suryanto; Harashta Tatimma Tatimma Larasati; Afifatul Mukaroh; Howon Kim

doi:10.3390/s21165323

Extended Spatially Localized Perturbation GAN (eSLP-GAN) for Robust Adversarial Camouflage Patches

Sensors ◽

10.3390/s21165323 ◽

2021 ◽

Vol 21 (16) ◽

pp. 5323

Author(s):

Yongsu Kim ◽

Hyoeun Kang ◽

Naufal Suryanto ◽

Harashta Tatimma Tatimma Larasati ◽

Afifatul Mukaroh ◽

...

Keyword(s):

Object Detection ◽

Real World ◽

Deep Neural Networks ◽

Target Object ◽

Human Vision ◽

Target System ◽

Detection Model ◽

Detection Systems ◽

Attack Scenario ◽

Representative Area

Deep neural networks (DNNs), especially those used in computer vision, are highly vulnerable to adversarial attacks, such as adversarial perturbations and adversarial patches. Adversarial patches, often considered more appropriate for a real-world attack, are attached to the target object or its surroundings to deceive the target system. However, most previous research employed adversarial patches that are conspicuous to human vision, making them easy to identify and counter. Previously, the spatially localized perturbation GAN (SLP-GAN) was proposed, in which the perturbation was only added to the most representative area of the input images, creating a spatially localized adversarial camouflage patch that excels in terms of visual fidelity and is, therefore, difficult to detect by human vision. In this study, the use of the method called eSLP-GAN was extended to deceive classifiers and object detection systems. Specifically, the loss function was modified for greater compatibility with an object-detection model attack and to increase robustness in the real world. Furthermore, the applicability of the proposed method was tested on the CARLA simulator for a more authentic real-world attack scenario.

Download Full-text

Exposing Vulnerabilities of Deepfake Detection Systems with Robust Attacks

Digital Threats: Research and Practice ◽

10.1145/3464307 ◽

2021 ◽

Author(s):

Shehzeen Hussain ◽

Paarth Neekhara ◽

Brian Dolhansky ◽

Joanna Bitton ◽

Cristian Canton Ferrer ◽

...

Keyword(s):

Neural Networks ◽

Video Compression ◽

Real World ◽

Deep Neural Networks ◽

Black Box ◽

Detection Methods ◽

Detection Systems ◽

Attack Scenario ◽

Video Manipulation ◽

Image And Video Compression

Recent advances in video manipulation techniques have made the generation of fake videos more accessible than ever before. Manipulated videos can fuel disinformation and reduce trust in media. Therefore detection of fake videos has garnered immense interest in academia and industry. Recently developed Deepfake detection methods rely on Deep Neural Networks (DNNs) to distinguish AI-generated fake videos from real videos. In this work, we demonstrate that it is possible to bypass such detectors by adversarially modifying fake videos synthesized using existing Deepfake generation methods. We further demonstrate that our adversarial perturbations are robust to image and video compression codecs, making them a real-world threat. We present pipelines in both white-box and black-box attack scenarios that can fool DNN based Deepfake detectors into classifying fake videos as real. Finally, we study the extent to which adversarial perturbations transfer across different Deepfake detectors and create more accessible attacks using universal adversarial perturbations that pose a very feasible attack scenario since they can be easily shared amongst attackers.

Download Full-text

Towards real-time object detection on edge with deep neural networks

10.32469/10355/67715 ◽

2018 ◽

Author(s):

◽

Zhi Zhang

Keyword(s):

Neural Network ◽

Neural Networks ◽

Object Detection ◽

Real Time ◽

Real World ◽

Deep Neural Networks ◽

Learning Algorithm ◽

Cloud Services ◽

Training Data ◽

Detection Model

Despite being a core topic for more than several decades, object detection is still receiving increasing attentions due to its irreplaceable importance in a wide variety of applications. Abundant object detectors based on deep neural networks have shown significantly revamped accuracies in recent years. However, it's still the day one for these models to be effectively deployed to real world. In this dissertation, we focus on object detection models which tackle real world problems that are unavailable few years ago. We also aim at making object detectors on the go, which means detectors are not longer required to be run on workstations and cloud services which is latency unfriendly. To achieve these goals, we addressed the problem in two phases: application and deployment. We have done thoughtful research on both areas. Our contribution involves inter-frame information fusing, model knowledge distillation, advanced model flow control for progressive inference, and hardware oriented model design and optimization. More specifically, we proposed a novel cross-frame verification scheme for spatial temporal fused object detection model for sequential images and videos in a proposal and reject favor. To compress model from a learning basis and resolve domain specific training data shortage, we improved the learning algorithm to handle insufficient labeled data by searching for optimal guidance paths from pre-trained models. To further reduce model inference cost, we designed a progressive neural network which run in flexible cost enabled by RNN style decision controller during runtime. We recognize the awkward model deployment problem, especially for object detection models that require excessive customized layers. In response, we propose to use end-to-end neural network which use pure neural network components to substitute traditional post-processing operations. We also applied operator decomposition and graph level and on-device optimization towards real-time object detection on low power edge devices. All these works have achieved state-of-the-art performances and converted to successful applications.

Download Full-text

Deep Active Learning for Surface Defect Detection

Sensors ◽

10.3390/s20061650 ◽

2020 ◽

Vol 20 (6) ◽

pp. 1650 ◽

Cited By ~ 2

Author(s):

Xiaoming Lv ◽

Fajie Duan ◽

Jia-Jia Jiang ◽

Xiao Fu ◽

Lin Gan

Keyword(s):

Active Learning ◽

Object Detection ◽

Defect Detection ◽

Surface Defect ◽

Candidate List ◽

Detection Model ◽

Detection Systems ◽

Learning Framework ◽

Surface Defect Detection ◽

Sampling Scale

Most of the current object detection approaches deliver competitive results with an assumption that a large number of labeled data are generally available and can be fed into a deep network at once. However, due to expensive labeling efforts, it is difficult to deploy the object detection systems into more complex and challenging real-world environments, especially for defect detection in real industries. In order to reduce the labeling efforts, this study proposes an active learning framework for defect detection. First, an Uncertainty Sampling is proposed to produce the candidate list for annotation. Uncertain images can provide more informative knowledge for the learning process. Then, an Average Margin method is designed to set the sampling scale for each defect category. In addition, an iterative pattern of training and selection is adopted to train an effective detection model. Extensive experiments demonstrate that the proposed method can render the required performance with fewer labeled data.

Download Full-text

Real-Time Object Detection Model

International Journal for Modern Trends in Science and Technology - RTT2020 ◽

10.46501/ijmtst061267 ◽

2020 ◽

Vol 6 (12) ◽

pp. 360-364

Author(s):

Akash Kumar, Dr. Amita Goel Prof. Vasudha Bahl and Prof. Nidhi Sengar

Keyword(s):

Neural Network ◽

Computer Vision ◽

Object Detection ◽

Real Time ◽

Real World ◽

The Real ◽

Detection Model ◽

Bounding Boxes ◽

The Given ◽

Single Neural Network

Object Detection is a study in the field of computer vision. An object detection model recognizes objects of the real world present either in a captured image or in real-time video where the object can belong to any class of objects namely humans, animals, objects, etc. This project is an implementation of an algorithm based on object detection called You Only Look Once (YOLO v3). The architecture of yolo model is extremely fast compared to all previous methods. Yolov3 model executes a single neural network to the given image and then divides the image into predetermined bounding boxes. These boxes are weighted by the predicted probabilities. After non max-suppression it gives the result of recognized objects together with bounding boxes. Yolo trains and directly executes object detection on full images.

Download Full-text

A Baseline for General Music Object Detection with Deep Learning

Applied Sciences ◽

10.3390/app8091488 ◽

2018 ◽

Vol 8 (9) ◽

pp. 1488 ◽

Cited By ~ 8

Author(s):

Alexander Pacha ◽

Jan Hajič ◽

Jorge Calvo-Zaragoza

Keyword(s):

Deep Learning ◽

Object Detection ◽

Evaluation Criteria ◽

Qualitative Comparison ◽

Evaluation Standard ◽

Detection Model ◽

Detection Systems ◽

Domain Specific ◽

Symbol Detection ◽

First Time

Deep learning is bringing breakthroughs to many computer vision subfields including Optical Music Recognition (OMR), which has seen a series of improvements to musical symbol detection achieved by using generic deep learning models. However, so far, each such proposal has been based on a specific dataset and different evaluation criteria, which made it difficult to quantify the new deep learning-based state-of-the-art and assess the relative merits of these detection models on music scores. In this paper, a baseline for general detection of musical symbols with deep learning is presented. We consider three datasets of heterogeneous typology but with the same annotation format, three neural models of different nature, and establish their performance in terms of a common evaluation standard. The experimental results confirm that the direct music object detection with deep learning is indeed promising, but at the same time illustrates some of the domain-specific shortcomings of the general detectors. A qualitative comparison then suggests avenues for OMR improvement, based both on properties of the detection model and how the datasets are defined. To the best of our knowledge, this is the first time that competing music object detection systems from the machine learning paradigm are directly compared to each other. We hope that this work will serve as a reference to measure the progress of future developments of OMR in music object detection.

Download Full-text

Semi-Automatic Dataset Generation for Object Detection and Recognition and its Evaluation on Domestic Service Robots

Journal of Robotics and Mechatronics ◽

10.20965/jrm.2020.p0245 ◽

2020 ◽

Vol 32 (1) ◽

pp. 245-253 ◽

Cited By ~ 2

Author(s):

Yutaro Ishida ◽

◽

Hakaru Tamukoh

Keyword(s):

Neural Network ◽

Object Detection ◽

Deep Neural Network ◽

Deep Neural Networks ◽

Automatic Generation ◽

Target Object ◽

Domestic Service ◽

Service Robots ◽

Service Robot ◽

Background Image

This paper proposes a method for the semi-automatic generation of a dataset for deep neural networks to perform end-to-end object detection and classification from images, which is expected to be applied to domestic service robots. In the proposed method, the background image of the floor or furniture is first captured. Subsequently, objects are captured from various viewpoints. Then, the background image and the object images are composited by the system (software) to generate images of the virtual scenes expected to be encountered by the robot. At this point, the annotation files, which will be used as teaching signals by the deep neural network, are automatically generated, as the region and category of the object composited with the background image are known. This reduces the human workload for dataset generation. Experiment results showed that the proposed method reduced the time taken to generate a data unit from 167 s, when performed manually, to 0.58 s, i.e., by a factor of approximately 1/287. The dataset generated using the proposed method was used to train a deep neural network, which was then applied to a domestic service robot for evaluation. The robot was entered into the World Robot Challenge, in which, out of ten trials, it succeeded in touching the target object eight times and grasping it four times.

Download Full-text

Domain-Specific On-Device Object Detection Method

Entropy ◽

10.3390/e24010077 ◽

2022 ◽

Vol 24 (1) ◽

pp. 77

Author(s):

Seongju Kang ◽

Jaegi Hwang ◽

Kwangsue Chung

Keyword(s):

Object Detection ◽

High Frequency ◽

Deep Neural Networks ◽

Detection Method ◽

Significant Activity ◽

Detection Model ◽

Domain Specific ◽

Conventional Models ◽

F Measure ◽

Constrained Devices

Object detection is a significant activity in computer vision, and various approaches have been proposed to detect varied objects using deep neural networks (DNNs). However, because DNNs are computation-intensive, it is difficult to apply them to resource-constrained devices. Here, we propose an on-device object detection method using domain-specific models. In the proposed method, we define object of interest (OOI) groups that contain objects with a high frequency of appearance in specific domains. Compared with the existing DNN model, the layers of the domain-specific models are shallower and narrower, reducing the number of trainable parameters; thus, speeding up the object detection. To ensure a lightweight network design, we combine various network structures to obtain the best-performing lightweight detection model. The experimental results reveal that the size of the proposed lightweight model is 21.7 MB, which is 91.35% and 36.98% smaller than those of YOLOv3-SPP and Tiny-YOLO, respectively. The f-measure achieved on the MS COCO 2017 dataset were 18.3%, 11.9% and 20.3% higher than those of YOLOv3-SPP, Tiny-YOLO and YOLO-Nano, respectively. The results demonstrated that the lightweight model achieved higher efficiency and better performance on non-GPU devices, such as mobile devices and embedded boards, than conventional models.

Download Full-text

Underwater Object Detection model based on YOLOv3 architecture using Deep Neural Networks

2021 7th International Conference on Advanced Computing and Communication Systems (ICACCS) ◽

10.1109/icaccs51430.2021.9441905 ◽

2021 ◽

Author(s):

P Athira. ◽

T.P. Mithun Haridas ◽

M.H. Supriya

Keyword(s):

Neural Networks ◽

Object Detection ◽

Deep Neural Networks ◽

Detection Model ◽

Model Based ◽

Underwater Object

Download Full-text

Evaluation of Deep Neural Networks for Advanced Intrusion Detection Systems

2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA) ◽

10.1109/iceca49313.2020.9297515 ◽

2020 ◽

Author(s):

Raj Kishore ◽

Anamika Chauhan

Keyword(s):

Neural Networks ◽

Intrusion Detection ◽

Deep Neural Networks ◽

Intrusion Detection Systems ◽

Detection Systems

Download Full-text

Automatic Roadway Features Detection with Oriented Object Detection

Applied Sciences ◽

10.3390/app11083531 ◽

2021 ◽

Vol 11 (8) ◽

pp. 3531

Author(s):

Hesham M. Eraqi ◽

Karim Soliman ◽

Dalia Said ◽

Omar R. Elezaby ◽

Mohamed N. Moustafa ◽

...

Keyword(s):

Object Detection ◽

Safety Evaluation ◽

Autonomous Driving ◽

Detection Accuracy ◽

The Road ◽

Detection Model ◽

Detection Approach ◽

Roadway Safety ◽

Safety Features ◽

Oriented Object

Extensive research efforts have been devoted to identify and improve roadway features that impact safety. Maintaining roadway safety features relies on costly manual operations of regular road surveying and data analysis. This paper introduces an automatic roadway safety features detection approach, which harnesses the potential of artificial intelligence (AI) computer vision to make the process more efficient and less costly. Given a front-facing camera and a global positioning system (GPS) sensor, the proposed system automatically evaluates ten roadway safety features. The system is composed of an oriented (or rotated) object detection model, which solves an orientation encoding discontinuity problem to improve detection accuracy, and a rule-based roadway safety evaluation module. To train and validate the proposed model, a fully-annotated dataset for roadway safety features extraction was collected covering 473 km of roads. The proposed method baseline results are found encouraging when compared to the state-of-the-art models. Different oriented object detection strategies are presented and discussed, and the developed model resulted in improving the mean average precision (mAP) by 16.9% when compared with the literature. The roadway safety feature average prediction accuracy is 84.39% and ranges between 91.11% and 63.12%. The introduced model can pervasively enable/disable autonomous driving (AD) based on safety features of the road; and empower connected vehicles (CV) to send and receive estimated safety features, alerting drivers about black spots or relatively less-safe segments or roads.

Download Full-text