Designing Trojan Detectors in Neural Networks Using Interactive Simulations

Peter Bajcsy; Nicholas J. Schaub; Michael Majurski

doi:10.3390/app11041865

Designing Trojan Detectors in Neural Networks Using Interactive Simulations

Applied Sciences ◽

10.3390/app11041865 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1865

Author(s):

Peter Bajcsy ◽

Nicholas J. Schaub ◽

Michael Majurski

Keyword(s):

Neural Networks ◽

Memory Storage ◽

Classification Models ◽

Class Label ◽

Model Sensitivity ◽

Kl Divergence ◽

Fully Connected ◽

Interactive Simulations

This paper addresses the problem of designing trojan detectors in neural networks (NNs) using interactive simulations. Trojans in NNs are defined as triggers in inputs that cause misclassification of such inputs into a class (or classes) unintended by the design of a NN-based model. The goal of our work is to understand encodings of a variety of trojan types in fully connected layers of neural networks. Our approach is: (1) to simulate nine types of trojan embeddings into dot patterns; (2) to devise measurements of NN states; and (3) to design trojan detectors in NN-based classification models. The interactive simulations are built on top of TensorFlow Playground with in-memory storage of data and NN coefficients. The simulations provide analytical, visualization, and output operations performed on training datasets and NN architectures. The measurements of a NN include: (a) model inefficiency using modified Kullback–Liebler (KL) divergence from uniformly distributed states; and (b) model sensitivity to variables related to data and NNs. Using the KL divergence measurements at each NN layer and per each predicted class label, a trojan detector is devised to discriminate NN models with or without trojans. To document robustness of such a trojan detector with respect to NN architectures, dataset perturbations, and trojan types, several properties of the KL divergence measurement are presented.

Download Full-text

Neural Layer Bypassing Network

10.36227/techrxiv.16806928.v2 ◽

2022 ◽

Author(s):

Amogh Palasamudram

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Classification Models ◽

Neural Network Architecture ◽

Network Layers ◽

Time Period ◽

Computationally Intensive ◽

Fully Connected ◽

The Given

<p>This research introduces and evaluates the Neural Layer Bypassing Network (NLBN), a new neural network architecture to improve the speed and effectiveness of forward propagation in deep learning. This architecture utilizes 1 additional (fully connected) neural network layer after every layer in the main network. This new layer determines whether finishing the rest of the forward propagation is required to predict the output of the given input. To test the effectiveness of the NLBN, I programmed coding examples for this architecture with 3 different image classification models trained on 3 different datasets: MNIST Handwritten Digits Dataset, Horses or Humans Dataset, and Colorectal Histology Dataset. After training 1 standard convolutional neural network (CNN) and 1 NLBN per dataset (both of equivalent architectures), I performed 5 trials per dataset to analyze the performance of these two architectures. For the NLBN, I also collected data regarding the accuracy, time period, and speed of the network with respect to the percentage of the model the inputs are passed through. It was found that this architecture increases the speed of forward propagation by 6% - 25% while the accuracy tended to decrease by 0% - 4%; the results vary based on the dataset and structure of the model, but the increase in speed was normally at least twice the decrease in accuracy. In addition to the NLBN’s performance during predictions, it takes roughly 40% longer to train and requires more memory due to its complexity. However, the architecture can be made more efficient if integrated into TensorFlow libraries. Overall, by being able to autonomously skip neural network layers, this architecture can potentially be a foundation for neural networks to teach themselves to become more efficient for applications that require fast, accurate, and less computationally intensive predictions.<br></p>

Download Full-text

gbt-HIPS: Explaining the Classifications of Gradient Boosted Tree Ensembles

Applied Sciences ◽

10.3390/app11062511 ◽

2021 ◽

Vol 11 (6) ◽

pp. 2511

Author(s):

Julian Hatwell ◽

Mohamed Medhat Gaber ◽

R. Muhammad Atif Azad

Keyword(s):

State Of The Art ◽

Heuristic Method ◽

Good Explanation ◽

Classification Rule ◽

Data Sets ◽

Classification Models ◽

Boundary Values ◽

Class Label ◽

Input Space ◽

Boosted Tree

This research presents Gradient Boosted Tree High Importance Path Snippets (gbt-HIPS), a novel, heuristic method for explaining gradient boosted tree (GBT) classification models by extracting a single classification rule (CR) from the ensemble of decision trees that make up the GBT model. This CR contains the most statistically important boundary values of the input space as antecedent terms. The CR represents a hyper-rectangle of the input space inside which the GBT model is, very reliably, classifying all instances with the same class label as the explanandum instance. In a benchmark test using nine data sets and five competing state-of-the-art methods, gbt-HIPS offered the best trade-off between coverage (0.16–0.75) and precision (0.85–0.98). Unlike competing methods, gbt-HIPS is also demonstrably guarded against under- and over-fitting. A further distinguishing feature of our method is that, unlike much prior work, our explanations also provide counterfactual detail in accordance with widely accepted recommendations for what makes a good explanation.

Download Full-text

An Adversarial Generative Network for Crop Classification from Remote Sensing Timeseries Images

Remote Sensing ◽

10.3390/rs13010065 ◽

2020 ◽

Vol 13 (1) ◽

pp. 65

Author(s):

Jingtao Li ◽

Yonglin Shen ◽

Chao Yang

Keyword(s):

Remote Sensing ◽

Neural Networks ◽

Short Term Memory ◽

Complete Classification ◽

Support Vector ◽

Classification Models ◽

Training Samples ◽

Agricultural Applications ◽

Crop Classification ◽

Increasing Demand

Due to the increasing demand for the monitoring of crop conditions and food production, it is a challenging and meaningful task to identify crops from remote sensing images. The state-of the-art crop classification models are mostly built on supervised classification models such as support vector machines (SVM), convolutional neural networks (CNN), and long- and short-term memory neural networks (LSTM). Meanwhile, as an unsupervised generative model, the adversarial generative network (GAN) is rarely used to complete classification tasks for agricultural applications. In this work, we propose a new method that combines GAN, CNN, and LSTM models to classify crops of corn and soybeans from remote sensing time-series images, in which GAN’s discriminator was used as the final classifier. The method is feasible on the condition that the training samples are small, and it fully takes advantage of spectral, spatial, and phenology features of crops from satellite data. The classification experiments were conducted on crops of corn, soybeans, and others. To verify the effectiveness of the proposed method, comparisons with models of SVM, SegNet, CNN, LSTM, and different combinations were also conducted. The results show that our method achieved the best classification results, with the Kappa coefficient of 0.7933 and overall accuracy of 0.86. Experiments in other study areas also demonstrate the extensibility of the proposed method.

Download Full-text

A novel structured sparse fully connected layer in convolutional neural networks

Concurrency and Computation Practice and Experience ◽

10.1002/cpe.6213 ◽

2021 ◽

Author(s):

Naoki Matsumura ◽

Yasuaki Ito ◽

Koji Nakano ◽

Akihiko Kasagi ◽

Tsuguchika Tabaru

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Fully Connected

Download Full-text

Structural Damage Identification of Composite Rotors Based on Fully Connected Neural Networks and Convolutional Neural Networks

Sensors ◽

10.3390/s21062005 ◽

2021 ◽

Vol 21 (6) ◽

pp. 2005

Author(s):

Veronika Scholz ◽

Peter Winkler ◽

Andreas Hornig ◽

Maik Gude ◽

Angelos Filippatos

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Composite Structures ◽

Damage Identification ◽

Vibration Response ◽

Damage Initiation ◽

Matrix Cracks ◽

Operational Life ◽

Multiple Data Sets ◽

Fully Connected

Damage identification of composite structures is a major ongoing challenge for a secure operational life-cycle due to the complex, gradual damage behaviour of composite materials. Especially for composite rotors in aero-engines and wind-turbines, a cost-intensive maintenance service has to be performed in order to avoid critical failure. A major advantage of composite structures is that they are able to safely operate after damage initiation and under ongoing damage propagation. Therefore, a robust, efficient diagnostic damage identification method would allow monitoring the damage process with intervention occurring only when necessary. This study investigates the structural vibration response of composite rotors by applying machine learning methods and the ability to identify, localise and quantify the present damage. To this end, multiple fully connected neural networks and convolutional neural networks were trained on vibration response spectra from damaged composite rotors with barely visible damage, mostly matrix cracks and local delaminations using dimensionality reduction and data augmentation. A databank containing 720 simulated test cases with different damage states is used as a basis for the generation of multiple data sets. The trained models are tested using k-fold cross validation and they are evaluated based on the sensitivity, specificity and accuracy. Convolutional neural networks perform slightly better providing a performance accuracy of up to 99.3% for the damage localisation and quantification.

Download Full-text

Classification of Approximal Caries in Bitewing Radiographs Using Convolutional Neural Networks

Sensors ◽

10.3390/s21155192 ◽

2021 ◽

Vol 21 (15) ◽

pp. 5192

Author(s):

Maira Moran ◽

Marcelo Faria ◽

Gilson Giraldi ◽

Luciana Bastos ◽

Larissa Oliveira ◽

...

Keyword(s):

Neural Networks ◽

Dental Caries ◽

Convolutional Neural Networks ◽

Data Augmentation ◽

Clinical Analysis ◽

Diagnostic Process ◽

Radiographic Evaluation ◽

Classification Models ◽

Approximal Caries ◽

Lesion Severity

Dental caries is an extremely common problem in dentistry that affects a significant part of the population. Approximal caries are especially difficult to identify because their position makes clinical analysis difficult. Radiographic evaluation—more specifically, bitewing images—are mostly used in such cases. However, incorrect interpretations may interfere with the diagnostic process. To aid dentists in caries evaluation, computational methods and tools can be used. In this work, we propose a new method that combines image processing techniques and convolutional neural networks to identify approximal dental caries in bitewing radiographic images and classify them according to lesion severity. For this study, we acquired 112 bitewing radiographs. From these exams, we extracted individual tooth images from each exam, applied a data augmentation process, and used the resulting images to train CNN classification models. The tooth images were previously labeled by experts to denote the defined classes. We evaluated classification models based on the Inception and ResNet architectures using three different learning rates: 0.1, 0.01, and 0.001. The training process included 2000 iterations, and the best results were achieved by the Inception model with a 0.001 learning rate, whose accuracy on the test set was 73.3%. The results can be considered promising and suggest that the proposed method could be used to assist dentists in the evaluation of bitewing images, and the definition of lesion severity and appropriate treatments.

Download Full-text

Structured (De)composable Representations Trained with Neural Networks

Computers ◽

10.3390/computers9040079 ◽

2020 ◽

Vol 9 (4) ◽

pp. 79

Author(s):

Graham Spinks ◽

Marie-Francine Moens

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Contextual Information ◽

Learning To Learn ◽

Class Label ◽

Impact Performance ◽

Novel Technique ◽

Language Data ◽

Concept Classes ◽

Generic Representation

This paper proposes a novel technique for representing templates and instances of concept classes. A template representation refers to the generic representation that captures the characteristics of an entire class. The proposed technique uses end-to-end deep learning to learn structured and composable representations from input images and discrete labels. The obtained representations are based on distance estimates between the distributions given by the class label and those given by contextual information, which are modeled as environments. We prove that the representations have a clear structure allowing decomposing the representation into factors that represent classes and environments. We evaluate our novel technique on classification and retrieval tasks involving different modalities (visual and language data). In various experiments, we show how the representations can be compressed and how different hyperparameters impact performance.

Download Full-text

Constructive algorithm for fully connected cascade feedforward neural networks

Neurocomputing ◽

10.1016/j.neucom.2015.12.003 ◽

2016 ◽

Vol 182 ◽

pp. 154-164 ◽

Cited By ~ 20

Author(s):

Junfei Qiao ◽

Fanjun Li ◽

Honggui Han ◽

Wenjing Li

Keyword(s):

Neural Networks ◽

Feedforward Neural Networks ◽

Constructive Algorithm ◽

Fully Connected

Download Full-text

On global asymptotic stability of fully connected recurrent neural networks

2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100) ◽

10.1109/icassp.2000.860132 ◽

2002 ◽

Cited By ~ 2

Author(s):

D.P. Mandic ◽

J.A. Chambers ◽

M.M. Bozic

Keyword(s):

Neural Networks ◽

Asymptotic Stability ◽

Global Asymptotic Stability ◽

Recurrent Neural Networks ◽

Fully Connected

Download Full-text

Attention-based deep learning networks for identification of human gait using radar micro-Doppler spectrograms

International Journal of Microwave and Wireless Technologies ◽

10.1017/s1759078721000830 ◽

2021 ◽

pp. 1-6

Author(s):

Hannah Garcia Doherty ◽

Roberto Arnaiz Burgueño ◽

Roeland P. Trommel ◽

Vasileios Papanastasiou ◽

Ronny I. A. Harmanny

Keyword(s):

Neural Networks ◽

Feature Vector ◽

Classification Performance ◽

Input Image ◽

Human Gait ◽

Learning Networks ◽

Class Label ◽

Deep Convolutional Neural Networks ◽

Network Layers ◽

Feature Dimension

Abstract Identification of human individuals within a group of 39 persons using micro-Doppler (μ-D) features has been investigated. Deep convolutional neural networks with two different training procedures have been used to perform classification. Visualization of the inner network layers revealed the sections of the input image most relevant when determining the class label of the target. A convolutional block attention module is added to provide a weighted feature vector in the channel and feature dimension, highlighting the relevant μ-D feature-filled areas in the image and improving classification performance.

Download Full-text