Improving Classification Performance of Deep Learning Models Using Bio-Inspired Computing

A Study on the Auxiliary Diagnosis of Thyroid Disease Images Based on Multiple Dimensional Deep Learning Algorithms

Current Medical Imaging Formerly Current Medical Imaging Reviews ◽

10.2174/1573405615666190115155223 ◽

2020 ◽

Vol 16 (3) ◽

pp. 199-205

Author(s):

Yuejun Liu ◽

Yifei Xu ◽

Xiangzheng Meng ◽

Xuguang Wang ◽

Tianxu Bai

Keyword(s):

Deep Learning ◽

Learning Algorithms ◽

Region Of Interest ◽

Classification Performance ◽

Thyroid Diseases ◽

Great Success ◽

Learning Models ◽

Good Classification Performance ◽

Spect Images

Background: Medical imaging plays an important role in the diagnosis of thyroid diseases. In the field of machine learning, multiple dimensional deep learning algorithms are widely used in image classification and recognition, and have achieved great success. Objective: The method based on multiple dimensional deep learning is employed for the auxiliary diagnosis of thyroid diseases based on SPECT images. The performances of different deep learning models are evaluated and compared. Methods: Thyroid SPECT images are collected with three types, they are hyperthyroidism, normal and hypothyroidism. In the pre-processing, the region of interest of thyroid is segmented and the amount of data sample is expanded. Four CNN models, including CNN, Inception, VGG16 and RNN, are used to evaluate deep learning methods. Results: Deep learning based methods have good classification performance, the accuracy is 92.9%-96.2%, AUC is 97.8%-99.6%. VGG16 model has the best performance, the accuracy is 96.2% and AUC is 99.6%. Especially, the VGG16 model with a changing learning rate works best. Conclusion: The standard CNN, Inception, VGG16, and RNN four deep learning models are efficient for the classification of thyroid diseases with SPECT images. The accuracy of the assisted diagnostic method based on deep learning is higher than that of other methods reported in the literature.

Download Full-text

Vulnerability in Deep Transfer Learning Models to Adversarial Fast Gradient Sign Attack for COVID-19 Prediction from Chest Radiography Images

Applied Sciences ◽

10.3390/app11094233 ◽

2021 ◽

Vol 11 (9) ◽

pp. 4233

Author(s):

Biprodip Pal ◽

Debashis Gupta ◽

Md. Rashed-Al-Mahfuz ◽

Salem A. Alyami ◽

Mohammad Ali Moni

Keyword(s):

Deep Learning ◽

Transfer Learning ◽

Chest Radiography ◽

High Sensitivity ◽

Ct Images ◽

Classification Performance ◽

Learning Models ◽

X Ray ◽

Image Capturing ◽

Fast Gradient

The COVID-19 pandemic requires the rapid isolation of infected patients. Thus, high-sensitivity radiology images could be a key technique to diagnose patients besides the polymerase chain reaction approach. Deep learning algorithms are proposed in several studies to detect COVID-19 symptoms due to the success in chest radiography image classification, cost efficiency, lack of expert radiologists, and the need for faster processing in the pandemic area. Most of the promising algorithms proposed in different studies are based on pre-trained deep learning models. Such open-source models and lack of variation in the radiology image-capturing environment make the diagnosis system vulnerable to adversarial attacks such as fast gradient sign method (FGSM) attack. This study therefore explored the potential vulnerability of pre-trained convolutional neural network algorithms to the FGSM attack in terms of two frequently used models, VGG16 and Inception-v3. Firstly, we developed two transfer learning models for X-ray and CT image-based COVID-19 classification and analyzed the performance extensively in terms of accuracy, precision, recall, and AUC. Secondly, our study illustrates that misclassification can occur with a very minor perturbation magnitude, such as 0.009 and 0.003 for the FGSM attack in these models for X-ray and CT images, respectively, without any effect on the visual perceptibility of the perturbation. In addition, we demonstrated that successful FGSM attack can decrease the classification performance to 16.67% and 55.56% for X-ray images, as well as 36% and 40% in the case of CT images for VGG16 and Inception-v3, respectively, without any human-recognizable perturbation effects in the adversarial images. Finally, we analyzed that correct class probability of any test image which is supposed to be 1, can drop for both considered models and with increased perturbation; it can drop to 0.24 and 0.17 for the VGG16 model in cases of X-ray and CT images, respectively. Thus, despite the need for data sharing and automated diagnosis, practical deployment of such program requires more robustness.

Download Full-text

Validating Deep Neural Networks for Online Decoding of Motor Imagery Movements from EEG Signals

Sensors ◽

10.3390/s19010210 ◽

2019 ◽

Vol 19 (1) ◽

pp. 210 ◽

Cited By ~ 32

Author(s):

Zied Tayeb ◽

Juri Fedjaev ◽

Nejla Ghaboosi ◽

Christoph Richter ◽

Lukas Everding ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Convolutional Neural Network ◽

Motor Imagery ◽

Classification Performance ◽

Feature Engineering ◽

Learning Models ◽

Eeg Signals ◽

Learning Methods

Non-invasive, electroencephalography (EEG)-based brain-computer interfaces (BCIs) on motor imagery movements translate the subject’s motor intention into control signals through classifying the EEG patterns caused by different imagination tasks, e.g., hand movements. This type of BCI has been widely studied and used as an alternative mode of communication and environmental control for disabled patients, such as those suffering from a brainstem stroke or a spinal cord injury (SCI). Notwithstanding the success of traditional machine learning methods in classifying EEG signals, these methods still rely on hand-crafted features. The extraction of such features is a difficult task due to the high non-stationarity of EEG signals, which is a major cause by the stagnating progress in classification performance. Remarkable advances in deep learning methods allow end-to-end learning without any feature engineering, which could benefit BCI motor imagery applications. We developed three deep learning models: (1) A long short-term memory (LSTM); (2) a spectrogram-based convolutional neural network model (CNN); and (3) a recurrent convolutional neural network (RCNN), for decoding motor imagery movements directly from raw EEG signals without (any manual) feature engineering. Results were evaluated on our own publicly available, EEG data collected from 20 subjects and on an existing dataset known as 2b EEG dataset from “BCI Competition IV”. Overall, better classification performance was achieved with deep learning models compared to state-of-the art machine learning techniques, which could chart a route ahead for developing new robust techniques for EEG signal decoding. We underpin this point by demonstrating the successful real-time control of a robotic arm using our CNN based BCI.

Download Full-text

A Mask-guided Attention Deep Learning Model for COVID-19 Diagnosis based on an Integrated CT Scan Images Database

10.36227/techrxiv.18166667.v1 ◽

2022 ◽

Author(s):

Maede Maftouni ◽

Bo Shen ◽

Andrew Chung Chee Law ◽

Niloofar Ayoobi Yazdi ◽

Zhenyu Kong

Keyword(s):

Deep Learning ◽

Ct Scan ◽

Imaging Modality ◽

Learning Model ◽

Classification Performance ◽

Computer Assisted ◽

Learning Approach ◽

Learning Models ◽

Task Learning ◽

Data Efficiency

The global extent of COVID-19 mutations and the consequent depletion of hospital resources highlighted the necessity of effective computer-assisted medical diagnosis. COVID-19 detection mediated by deep learning models can help diagnose this highly contagious disease and lower infectivity and mortality rates. Computed tomography (CT) is the preferred imaging modality for building automatic COVID-19 screening and diagnosis models. It is well-known that the training set size significantly impacts the performance and generalization of deep learning models. However, accessing a large dataset of CT scan images from an emerging disease like COVID-19 is challenging. Therefore, data efficiency becomes a significant factor in choosing a learning model. To this end, we present a multi-task learning approach, namely, a mask-guided attention (MGA) classifier, to improve the generalization and data efficiency of COVID-19 classification on lung CT scan images.The novelty of this method is compensating for the scarcity of data by employing more supervision with lesion masks, increasing the sensitivity of the model to COVID-19 manifestations, and helping both generalization and classification performance. Our proposed model achieves better overall performance than the single-task baseline and state-of-the-art models, as measured by various popular metrics. In our experiment with different percentages of data from our curated dataset, the classification performance gain from this multi-task learning approach is more significant for the smaller training sizes. Furthermore, experimental results demonstrate that our method enhances the focus on the lesions, as witnessed by bothattention and attribution maps, resulting in a more interpretable model.

Download Full-text

On the Depth of Deep Learning Models for Splice Site Identification

10.1101/380667 ◽

2018 ◽

Author(s):

Reem Elsousy ◽

Nagarajan Kathiresan ◽

Sabri Boughorbel

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Splice Site ◽

Dna Sequences ◽

Classification Performance ◽

Model Parameters ◽

Learning Models ◽

Trade Off ◽

Good Trade ◽

Site Identification

AbstractThe success of deep learning has been shown in various fields including computer vision, speech recognition, natural language processing and bioinformatics. The advance of Deep Learning in Computer Vision has been an important source of inspiration for other research fields. The objective of this work is to adapt known deep learning models borrowed from computer vision such as VGGNet, Resnet and AlexNet for the classification of biological sequences. In particular, we are interested by the task of splice site identification based on raw DNA sequences. We focus on the role of model architecture depth on model training and classification performance.We show that deep learning models outperform traditional classification methods (SVM, Random Forests, and Logistic Regression) for large training sets of raw DNA sequences. Three model families are analyzed in this work namely VGGNet, AlexNet and ResNet. Three depth levels are defined for each model family. The models are benchmarked using the following metrics: Area Under ROC curve (AUC), Number of model parameters, number of floating operations. Our extensive experimental evaluation show that shallow architectures have an overall better performance than deep models. We introduced a shallow version of ResNet, named S-ResNet. We show that it gives a good trade-off between model complexity and classification performance.Author summaryDeep Learning has been widely applied to various fields in research and industry. It has been also succesfully applied to genomics and in particular to splice site identification. We are interested in the use of advanced neural networks borrowed from computer vision. We explored well-known models and their usability for the problem of splice site identification from raw sequences. Our extensive experimental analysis shows that shallow models outperform deep models. We introduce a new model called S-ResNet, which gives a good trade-off between computational complexity and classification accuracy.

Download Full-text

An Interpretation Architecture for Deep Learning Models with the Application of COVID-19 Diagnosis

Entropy ◽

10.3390/e23020204 ◽

2021 ◽

Vol 23 (2) ◽

pp. 204

Author(s):

Yuchai Wan ◽

Hongen Zhou ◽

Xun Zhang

Keyword(s):

Deep Learning ◽

Predictive Value ◽

Classification Performance ◽

Learning Approaches ◽

Learning Models ◽

Visual Interpretation ◽

Deep Model ◽

Diagnostic Decision ◽

Sensitivity Specificity ◽

Learned Features

The Coronavirus disease 2019 (COVID-19) has become one of the threats to the world. Computed tomography (CT) is an informative tool for the diagnosis of COVID-19 patients. Many deep learning approaches on CT images have been proposed and brought promising performance. However, due to the high complexity and non-transparency of deep models, the explanation of the diagnosis process is challenging, making it hard to evaluate whether such approaches are reliable. In this paper, we propose a visual interpretation architecture for the explanation of the deep learning models and apply the architecture in COVID-19 diagnosis. Our architecture designs a comprehensive interpretation about the deep model from different perspectives, including the training trends, diagnostic performance, learned features, feature extractors, the hidden layers, the support regions for diagnostic decision, and etc. With the interpretation architecture, researchers can make a comparison and explanation about the classification performance, gain insight into what the deep model learned from images, and obtain the supports for diagnostic decisions. Our deep model achieves the diagnostic result of 94.75%, 93.22%, 96.69%, 97.27%, and 91.88% in the criteria of accuracy, sensitivity, specificity, positive predictive value, and negative predictive value, which are 8.30%, 4.32%, 13.33%, 10.25%, and 6.19% higher than that of the compared traditional methods. The visualized features in 2-D and 3-D spaces provide the reasons for the superiority of our deep model. Our interpretation architecture would allow researchers to understand more about how and why deep models work, and can be used as interpretation solutions for any deep learning models based on convolutional neural network. It can also help deep learning methods to take a step forward in the clinical COVID-19 diagnosis field.

Download Full-text

Identification of pancreatic adenocarcinoma molecular subtypes on histology slides using deep learning models.

Journal of Clinical Oncology ◽

10.1200/jco.2021.39.15_suppl.4141 ◽

2021 ◽

Vol 39 (15_suppl) ◽

pp. 4141-4141

Author(s):

Charlie Saillard ◽

Flore Delecourt ◽

Benoit Schmauch ◽

Olivier Moindrot ◽

Magali Svrcek ◽

...

Keyword(s):

Clinical Trials ◽

Deep Learning ◽

Pancreatic Adenocarcinoma ◽

Validation Cohort ◽

Molecular Subtypes ◽

Routine Care ◽

Classification Performance ◽

Molecular Heterogeneity ◽

Learning Models ◽

Detection Model

4141 Background: Pancreatic adenocarcinoma (PAC) is predicted to be the second cause of death by cancer in 2030 and its prognosis has seen little improvement in the last decades. PAC is a very heterogeneous tumor with preeminent stroma and multiple histological aspects. Omic studies confirmed its molecular heterogeneity, possibly one of the main factors explaining the failure of most clinical trials. Two and three transcriptomic subtypes of tumor cells and stroma respectively, were described with major prognostic and predictive implications. The tumor subtypes, Basal-like and Classical, have been shown by several groups to be predictive of the response to first line chemotherapy. As of today, these subtypes can only be defined by RNA profiling which is limited by the quantity and quality of the samples (formalin fixation and low cellularity) as well as by the analytical delay that may restrict its application in routine care. In addition, tumors may harbor a mixture of several subtypes limiting their interpretation using bulk transcriptomic approaches and thereby their clinical use. Here, we propose a multistep approach using deep learning models to predict tumor components and their molecular subtypes on routine histological preparations. Methods: 728 whole-slide digitized histological slides corresponding to 350 consecutive resected PAC from four centers with clinical and transcriptomic data were assembled and used as a discovery set. PAC from TCGA (n = 134) was used as a validation set. Tumor regions from slides of the discovery set were annotated to train a multistep deep learning model that first recognizes tumor tissue and then predicts tumor and stroma cells molecular subtypes assessed by the published PurIST algorithm. Results: The tumor detection model was very efficient (AUC = 0.98 in the TCGA validation cohort). In the discovery set, the Basal-like/Classical classification performance of the model by cross validation was 0.79 (AUC) and reached 0.86 when restricted to samples with a high-confidence RNA-defined molecular subtype.Subtypes defined by the model were independently associated with overall survival in multivariate analysis (HR = 2.56 [1.87 - 3.49], pval < 0.001), and association was higher relatively to PurIST RNA subtypes (HR = 1.60 [1.17 - 2.19] pval < 0.001). In the validation cohort, the model had an overall AUC of 0.82, and 0.89 in the subset of “subtype-pure” tumors. In addition to demonstrating the value of histology-based deep learning models for tumor subtyping in PAC, these results also show the limit of molecular-based subtyping in highly heterogeneous samples. Conclusions: This study provides the first PAC subtyping tool usable worldwide in clinical practice, finally opening the possibility of patient molecular stratification in routine care and clinical trials.

Download Full-text

Payload-Based Traffic Classification Using Multi-Layer LSTM in Software Defined Networks

Applied Sciences ◽

10.3390/app9122550 ◽

2019 ◽

Vol 9 (12) ◽

pp. 2550 ◽

Cited By ~ 1

Author(s):

Lim ◽

Kim ◽

Hong ◽

Han

Keyword(s):

Deep Learning ◽

Network Traffic ◽

Short Term Memory ◽

Single Layer ◽

Classification Performance ◽

Traffic Classification ◽

Software Defined Networks ◽

Learning Models ◽

Network Operator ◽

Network Traffic Classification

Recently, with the advent of various Internet of Things (IoT) applications, a massive amount of network traffic is being generated. A network operator must provide different quality of service, according to the service provided by each application. Toward this end, many studies have investigated how to classify various types of application network traffic accurately. Especially, since many applications use temporary or dynamic IP or Port numbers in the IoT environment, only payload-based network traffic classification technology is more suitable than the classification using the packet header information as well as payload. Furthermore, to automatically respond to various applications, it is necessary to classify traffic using deep learning without the network operator intervention. In this study, we propose a traffic classification scheme using a deep learning model in software defined networks. We generate flow-based payload datasets through our own network traffic pre-processing, and train two deep learning models: 1) the multi-layer long short-term memory (LSTM) model and 2) the combination of convolutional neural network and single-layer LSTM models, to perform network traffic classification. We also execute a model tuning procedure to find the optimal hyper-parameters of the two deep learning models. Lastly, we analyze the network traffic classification performance on the basis of the F1-score for the two deep learning models, and show the superiority of the multi-layer LSTM model for network packet classification.

Download Full-text

Validating Deep Neural Networks for Online Decoding of Motor Imagery Movements from EEG Signals

10.20944/preprints201809.0481.v1 ◽

2018 ◽

Author(s):

Zied Tayeb ◽

Juri Fedjaev ◽

Nejla Ghaboosi ◽

Christoph Richter ◽

Lukas Everding ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Convolutional Neural Network ◽

Motor Imagery ◽

Classification Performance ◽

Feature Engineering ◽

Learning Models ◽

Eeg Signals ◽

Learning Methods

Non-invasive, electroencephalography (EEG)-based brain-computer interfaces (BCIs) on motor imagery movements translate the subject’s motor intention into control signals through classifying the EEG patterns caused by different imagination tasks, e.g. hand movements. This type of BCI has been widely studied and used as an alternative mode of communication and environmental control for disabled patients, such as those suffering from a brainstem stroke or a spinal cord injury (SCI). Notwithstanding the success of traditional machine learning methods in classifying EEG signals, these methods still rely on hand-crafted features. The extraction of such features is a difficult task due to the high non-stationarity of EEG signals, which is a major cause for the stagnating progress in classification performance. Remarkable advances in deep learning methods allow end-to-end learning without any feature engineering, which could benefit BCI motor imagery applications. We developed three deep learning models: 1) a long short-term memory (LSTM); 2) a proposed spectrogram-based convolutional neural network model (pCNN); and 3) a recurrent convolutional neural network (RCNN), for decoding motor imagery movements directly from raw EEG signals without (manual) feature engineering. Results were evaluated on our own, publicly available, EEG data collected from 20 subjects and on an existing dataset known as 2b EEG dataset from "BCI Competition IV". Overall, better classification performance was achieved with deep learning models compared to state-of-the art machine learning techniques, which could chart a route ahead for developing new robust techniques for EEG signal decoding. We underpin this point by demonstrating the successful real-time control of a robotic arm using our CNN based BCI.

Download Full-text

CLASSIFICATION PERFORMANCE COMPARISONS OF DEEP LEARNING MODELS IN PNEUMONIA DIAGNOSIS USING CHEST X-RAY IMAGES

Turkish Journal of Engineering ◽

10.31127/tuje.652358 ◽

2019 ◽

Author(s):

Osman Doğuş Gülgün ◽

Prof. Dr. Hamza EROL

Keyword(s):

Deep Learning ◽

Classification Performance ◽

Learning Models ◽

X Ray ◽

Chest X Ray ◽

Pneumonia Diagnosis ◽

Performance Comparisons

Download Full-text