SMOTE-Based Weighted Deep Rotation Forest for the Imbalanced Hyperspectral Data Classification

Yinghui Quan; Xian Zhong; Wei Feng; Jonathan Cheung-Wai Chan; Qiang Li; Mengdao Xing

doi:10.3390/rs13030464

SMOTE-Based Weighted Deep Rotation Forest for the Imbalanced Hyperspectral Data Classification

Remote Sensing ◽

10.3390/rs13030464 ◽

2021 ◽

Vol 13 (3) ◽

pp. 464

Author(s):

Yinghui Quan ◽

Xian Zhong ◽

Wei Feng ◽

Jonathan Cheung-Wai Chan ◽

Qiang Li ◽

...

Keyword(s):

Deep Learning ◽

Spatial Information ◽

Fundamental Problem ◽

Data Classification ◽

Hyperspectral Data ◽

Support Vector ◽

Great Success ◽

Learning Approaches ◽

Rotation Forest ◽

Training Time

Conventional classification algorithms have shown great success in balanced hyperspectral data classification. However, the imbalanced class distribution is a fundamental problem of hyperspectral data, and it is regarded as one of the great challenges in classification tasks. To solve this problem, a non-ANN based deep learning, namely SMOTE-Based Weighted Deep Rotation Forest (SMOTE-WDRoF) is proposed in this paper. First, the neighboring pixels of instances are introduced as the spatial information and balanced datasets are created by using the SMOTE algorithm. Second, these datasets are fed into the WDRoF model that consists of the rotation forest and the multi-level cascaded random forests. Specifically, the rotation forest is used to generate rotation feature vectors, which are input into the subsequent cascade forest. Furthermore, the output probability of each level and the original data are stacked as the dataset of the next level. And the sample weights are automatically adjusted according to the dynamic weight function constructed by the classification results of each level. Compared with the traditional deep learning approaches, the proposed method consumes much less training time. The experimental results on four public hyperspectral data demonstrate that the proposed method can get better performance than support vector machine, random forest, rotation forest, SMOTE combined rotation forest, convolutional neural network, and rotation-based deep forest in multiclass imbalance learning.

Download Full-text

Literature review and analysis on big data stream classification techniques

International Journal of Knowledge-based and Intelligent Engineering Systems ◽

10.3233/kes-200042 ◽

2020 ◽

Vol 24 (3) ◽

pp. 205-215

Author(s):

B. Srivani ◽

N. Sandhya ◽

B. Padmaja Rani

Keyword(s):

Neural Network ◽

Big Data ◽

Deep Learning ◽

Data Classification ◽

Support Vector ◽

Complex Data ◽

Learning Approaches ◽

K Nearest Neighbor ◽

Data Stream Classification ◽

Big Data Classification

Rapid growth in technology and information lead the human to witness the improved growth in velocity, volume of data, and variety. The data in the business organizations demonstrate the development of big data applications. Because of the improving demand of applications, analysis of sophisticated streaming big data tends to become a significant area in data mining. One of the significant aspects of the research is employing deep learning approaches for effective extraction of complex data representations. Accordingly, this survey provides the detailed review of big data classification methodologies, like deep learning based techniques, Convolutional Neural Network (CNN) based techniques, K-Nearest Neighbor (KNN) based techniques, Neural Network (NN) based techniques, fuzzy based techniques, and Support vector based techniques, and so on. Moreover, a detailed study is made by concerning the parameters, like evaluation metrics, implementation tool, employed framework, datasets utilized, adopted classification methods, and accuracy range obtained by various techniques. Eventually, the research gaps and issues of various big data classification schemes are presented.

Download Full-text

Unsupervised Multi-Level Feature Extraction for Improvement of Hyperspectral Classification

Remote Sensing ◽

10.3390/rs13081602 ◽

2021 ◽

Vol 13 (8) ◽

pp. 1602

Author(s):

Qiaoqiao Sun ◽

Xuefeng Liu ◽

Salah Bourennane

Keyword(s):

Feature Extraction ◽

Deep Learning ◽

Spatial Information ◽

Hyperspectral Data ◽

Great Promise ◽

Learning Models ◽

Single Level ◽

Multiple Networks ◽

Multi Level ◽

Hyperspectral Classification

Deep learning models have strong abilities in learning features and they have been successfully applied in hyperspectral images (HSIs). However, the training of most deep learning models requires labeled samples and the collection of labeled samples are labor-consuming in HSI. In addition, single-level features from a single layer are usually considered, which may result in the loss of some important information. Using multiple networks to obtain multi-level features is a solution, but at the cost of longer training time and computational complexity. To solve these problems, a novel unsupervised multi-level feature extraction framework that is based on a three dimensional convolutional autoencoder (3D-CAE) is proposed in this paper. The designed 3D-CAE is stacked by fully 3D convolutional layers and 3D deconvolutional layers, which allows for the spectral-spatial information of targets to be mined simultaneously. Besides, the 3D-CAE can be trained in an unsupervised way without involving labeled samples. Moreover, the multi-level features are directly obtained from the encoded layers with different scales and resolutions, which is more efficient than using multiple networks to get them. The effectiveness of the proposed multi-level features is verified on two hyperspectral data sets. The results demonstrate that the proposed method has great promise in unsupervised feature learning and can help us to further improve the hyperspectral classification when compared with single-level features.

Download Full-text

Detection of Malicious Software by Analyzing Distinct Artifacts Using Machine Learning and Deep Learning Algorithms

Electronics ◽

10.3390/electronics10141694 ◽

2021 ◽

Vol 10 (14) ◽

pp. 1694

Author(s):

Mathew Ashik ◽

A. Jyothish ◽

S. Anandaram ◽

P. Vinod ◽

Francesco Mercaldo ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Support Vector ◽

Malware Analysis ◽

Learning Approaches ◽

Dynamic Features ◽

System Calls ◽

Prevention Methods ◽

Structural Aspects

Malware is one of the most significant threats in today’s computing world since the number of websites distributing malware is increasing at a rapid rate. Malware analysis and prevention methods are increasingly becoming necessary for computer systems connected to the Internet. This software exploits the system’s vulnerabilities to steal valuable information without the user’s knowledge, and stealthily send it to remote servers controlled by attackers. Traditionally, anti-malware products use signatures for detecting known malware. However, the signature-based method does not scale in detecting obfuscated and packed malware. Considering that the cause of a problem is often best understood by studying the structural aspects of a program like the mnemonics, instruction opcode, API Call, etc. In this paper, we investigate the relevance of the features of unpacked malicious and benign executables like mnemonics, instruction opcodes, and API to identify a feature that classifies the executable. Prominent features are extracted using Minimum Redundancy and Maximum Relevance (mRMR) and Analysis of Variance (ANOVA). Experiments were conducted on four datasets using machine learning and deep learning approaches such as Support Vector Machine (SVM), Naïve Bayes, J48, Random Forest (RF), and XGBoost. In addition, we also evaluate the performance of the collection of deep neural networks like Deep Dense network, One-Dimensional Convolutional Neural Network (1D-CNN), and CNN-LSTM in classifying unknown samples, and we observed promising results using APIs and system calls. On combining APIs/system calls with static features, a marginal performance improvement was attained comparing models trained only on dynamic features. Moreover, to improve accuracy, we implemented our solution using distinct deep learning methods and demonstrated a fine-tuned deep neural network that resulted in an F1-score of 99.1% and 98.48% on Dataset-2 and Dataset-3, respectively.

Download Full-text

Analysis of the Nosema Cells Identification for Microscopic Images

Sensors ◽

10.3390/s21093068 ◽

2021 ◽

Vol 21 (9) ◽

pp. 3068

Author(s):

Soumaya Dghim ◽

Carlos M. Travieso-González ◽

Radim Burget

Keyword(s):

Neural Network ◽

Machine Learning ◽

Image Processing ◽

Deep Learning ◽

The Other ◽

Support Vector ◽

Learning Approaches ◽

Microscopic Images ◽

Trained Neural Network ◽

Nosema Disease

The use of image processing tools, machine learning, and deep learning approaches has become very useful and robust in recent years. This paper introduces the detection of the Nosema disease, which is considered to be one of the most economically significant diseases today. This work shows a solution for recognizing and identifying Nosema cells between the other existing objects in the microscopic image. Two main strategies are examined. The first strategy uses image processing tools to extract the most valuable information and features from the dataset of microscopic images. Then, machine learning methods are applied, such as a neural network (ANN) and support vector machine (SVM) for detecting and classifying the Nosema disease cells. The second strategy explores deep learning and transfers learning. Several approaches were examined, including a convolutional neural network (CNN) classifier and several methods of transfer learning (AlexNet, VGG-16 and VGG-19), which were fine-tuned and applied to the object sub-images in order to identify the Nosema images from the other object images. The best accuracy was reached by the VGG-16 pre-trained neural network with 96.25%.

Download Full-text

Early Detection of Plant Viral Disease Using Hyperspectral Imaging and Deep Learning

Sensors ◽

10.3390/s21030742 ◽

2021 ◽

Vol 21 (3) ◽

pp. 742

Author(s):

Canh Nguyen ◽

Vasit Sagan ◽

Matthew Maimaitiyiming ◽

Maitiniyazi Maimaitijiang ◽

Sourav Bhadra ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Early Detection ◽

Convolutional Neural Network ◽

Near Infrared ◽

Hyperspectral Data ◽

Viral Diseases ◽

Support Vector ◽

Spectral Features ◽

Feature Spaces

Early detection of grapevine viral diseases is critical for early interventions in order to prevent the disease from spreading to the entire vineyard. Hyperspectral remote sensing can potentially detect and quantify viral diseases in a nondestructive manner. This study utilized hyperspectral imagery at the plant level to identify and classify grapevines inoculated with the newly discovered DNA virus grapevine vein-clearing virus (GVCV) at the early asymptomatic stages. An experiment was set up at a test site at South Farm Research Center, Columbia, MO, USA (38.92 N, −92.28 W), with two grapevine groups, namely healthy and GVCV-infected, while other conditions were controlled. Images of each vine were captured by a SPECIM IQ 400–1000 nm hyperspectral sensor (Oulu, Finland). Hyperspectral images were calibrated and preprocessed to retain only grapevine pixels. A statistical approach was employed to discriminate two reflectance spectra patterns between healthy and GVCV vines. Disease-centric vegetation indices (VIs) were established and explored in terms of their importance to the classification power. Pixel-wise (spectral features) classification was performed in parallel with image-wise (joint spatial–spectral features) classification within a framework involving deep learning architectures and traditional machine learning. The results showed that: (1) the discriminative wavelength regions included the 900–940 nm range in the near-infrared (NIR) region in vines 30 days after sowing (DAS) and the entire visual (VIS) region of 400–700 nm in vines 90 DAS; (2) the normalized pheophytization index (NPQI), fluorescence ratio index 1 (FRI1), plant senescence reflectance index (PSRI), anthocyanin index (AntGitelson), and water stress and canopy temperature (WSCT) measures were the most discriminative indices; (3) the support vector machine (SVM) was effective in VI-wise classification with smaller feature spaces, while the RF classifier performed better in pixel-wise and image-wise classification with larger feature spaces; and (4) the automated 3D convolutional neural network (3D-CNN) feature extractor provided promising results over the 2D convolutional neural network (2D-CNN) in learning features from hyperspectral data cubes with a limited number of samples.

Download Full-text

Deep learning para la clasificación de usos de suelo agrícola con Sentinel-2

Revista de Teledetección ◽

10.4995/raet.2020.13337 ◽

2020 ◽

pp. 35

Author(s):

M. Campos-Taberner ◽

F.J. García-Haro ◽

B. Martínez ◽

M.A. Gilabert

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Near Infrared ◽

Spatial Information ◽

Learning Algorithm ◽

Support Vector ◽

Sensing Applications ◽

Learning Techniques ◽

Remote Sensing Applications ◽

Sentinel 2

The use of deep learning techniques for remote sensing applications has recently increased. These algorithms have proven to be successful in estimation of parameters and classification of images. However, little effort has been made to make them understandable, leading to their implementation as “black boxes”. This work aims to evaluate the performance and clarify the operation of a deep learning algorithm, based on a bi-directional recurrent network of long short-term memory (2-BiLSTM). The land use classification in the Valencian Community based on Sentinel-2 image time series in the framework of the common agricultural policy (CAP) is used as an example. It is verified that the accuracy of the deep learning techniques is superior (98.6 % overall success) to that other algorithms such as decision trees (DT), k-nearest neighbors (k-NN), neural networks (NN), support vector machines (SVM) and random forests (RF). The performance of the classifier has been studied as a function of time and of the predictors used. It is concluded that, in the study area, the most relevant information used by the network in the classification are the images corresponding to summer and the spectral and spatial information derived from the red and near infrared bands. These results open the door to new studies in the field of the explainable deep learning in remote sensing applications.

Download Full-text

SPECTRAL-SPATIAL CLASSIFICATION OF HYPERSPECTRAL IMAGERY USING NEURAL NETWORK ALGORITHM AND HIERARCHICAL SEGMENTATION

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-2-w12-1-2019 ◽

2019 ◽

Vol XLII-2/W12 ◽

pp. 1-5

Author(s):

D. Akbari ◽

M. Moradizadeh ◽

M. Akbari

Keyword(s):

Neural Network ◽

Spatial Information ◽

Hyperspectral Data ◽

Support Vector ◽

Washington Dc ◽

Hierarchical Segmentation ◽

Vector Machines ◽

Neural Network Algorithm ◽

New Framework

Abstract. This paper describes a new framework for classification of hyperspectral images, based on both spectral and spatial information. The spatial information is obtained by an enhanced Marker-based Hierarchical Segmentation (MHS) algorithm. The hyperspectral data is first fed into the Multi-Layer Perceptron (MLP) neural network classification algorithm. Then, the MHS algorithm is applied in order to increase the accuracy of less-accurately classified land-cover types. In the proposed approach, the markers are extracted from the classification maps obtained by MLP and Support Vector Machines (SVM) classifiers. Experimental results on Washington DC Mall hyperspectral dataset, demonstrate the superiority of proposed approach compared to the MLP and the original MHS algorithms.

Download Full-text

Comparison of partial least square regression, support vector machine, and deep-learning techniques for estimating soil salinity from hyperspectral data

Journal of Applied Remote Sensing ◽

10.1117/1.jrs.12.022204 ◽

2018 ◽

Vol 12 (02) ◽

pp. 1 ◽

Cited By ~ 10

Author(s):

Wenzhi Zeng ◽

Dongying Zhang ◽

Yuanhao Fang ◽

Jingwei Wu ◽

Jiesheng Huang

Keyword(s):

Support Vector Machine ◽

Deep Learning ◽

Soil Salinity ◽

Hyperspectral Data ◽

Partial Least Square ◽

Least Square ◽

Partial Least Square Regression ◽

Support Vector ◽

Least Square Regression ◽

Learning Techniques

Download Full-text

Evaluation of Hyperparameter Optimization in Machine and Deep Learning Methods for Decoding Imagined Speech EEG

Sensors ◽

10.3390/s20164629 ◽

2020 ◽

Vol 20 (16) ◽

pp. 4629 ◽

Cited By ~ 1

Author(s):

Ciaran Cooney ◽

Attila Korik ◽

Raffaella Folli ◽

Damien Coyle

Keyword(s):

Deep Learning ◽

Support Vector ◽

Great Success ◽

Hyperparameter Optimization ◽

Linear Discriminant ◽

Classifier Performance ◽

Open Question ◽

The Impact ◽

Regularized Linear Discriminant Analysis

Classification of electroencephalography (EEG) signals corresponding to imagined speech production is important for the development of a direct-speech brain–computer interface (DS-BCI). Deep learning (DL) has been utilized with great success across several domains. However, it remains an open question whether DL methods provide significant advances over traditional machine learning (ML) approaches for classification of imagined speech. Furthermore, hyperparameter (HP) optimization has been neglected in DL-EEG studies, resulting in the significance of its effects remaining uncertain. In this study, we aim to improve classification of imagined speech EEG by employing DL methods while also statistically evaluating the impact of HP optimization on classifier performance. We trained three distinct convolutional neural networks (CNN) on imagined speech EEG using a nested cross-validation approach to HP optimization. Each of the CNNs evaluated was designed specifically for EEG decoding. An imagined speech EEG dataset consisting of both words and vowels facilitated training on both sets independently. CNN results were compared with three benchmark ML methods: Support Vector Machine, Random Forest and regularized Linear Discriminant Analysis. Intra- and inter-subject methods of HP optimization were tested and the effects of HPs statistically analyzed. Accuracies obtained by the CNNs were significantly greater than the benchmark methods when trained on both datasets (words: 24.97%, p < 1 × 10–7, chance: 16.67%; vowels: 30.00%, p < 1 × 10–7, chance: 20%). The effects of varying HP values, and interactions between HPs and the CNNs were both statistically significant. The results of HP optimization demonstrate how critical it is for training CNNs to decode imagined speech.

Download Full-text

Mixed 2D/3D Convolutional Network for Hyperspectral Image Super-Resolution

Remote Sensing ◽

10.3390/rs12101660 ◽

2020 ◽

Vol 12 (10) ◽

pp. 1660 ◽

Cited By ~ 2

Author(s):

Qiang Li ◽

Qi Wang ◽

Xuelong Li

Keyword(s):

Spatial Information ◽

Hyperspectral Image ◽

Feature Fusion ◽

Super Resolution ◽

Superior Performance ◽

Spectral Information ◽

Great Success ◽

Convolutional Network ◽

Training Time ◽

Image Super Resolution

Deep learning-based hyperspectral image super-resolution (SR) methods have achieved great success recently. However, there are two main problems in the previous works. One is to use the typical three-dimensional convolution analysis, resulting in more parameters of the network. The other is not to pay more attention to the mining of hyperspectral image spatial information, when the spectral information can be extracted. To address these issues, in this paper, we propose a mixed convolutional network (MCNet) for hyperspectral image super-resolution. We design a novel mixed convolutional module (MCM) to extract the potential features by 2D/3D convolution instead of one convolution, which enables the network to more mine spatial features of hyperspectral image. To explore the effective features from 2D unit, we design the local feature fusion to adaptively analyze from all the hierarchical features in 2D units. In 3D unit, we employ spatial and spectral separable 3D convolution to extract spatial and spectral information, which reduces unaffordable memory usage and training time. Extensive evaluations and comparisons on three benchmark datasets demonstrate that the proposed approach achieves superior performance in comparison to existing state-of-the-art methods.

Download Full-text