Spatial-Spectral Transformer for Hyperspectral Image Classification

Xin He; Yushi Chen; Zhouhan Lin

doi:10.3390/rs13030498

Spatial-Spectral Transformer for Hyperspectral Image Classification

Remote Sensing ◽

10.3390/rs13030498 ◽

2021 ◽

Vol 13 (3) ◽

pp. 498

Author(s):

Xin He ◽

Yushi Chen ◽

Zhouhan Lin

Keyword(s):

Hyperspectral Image ◽

Sequential Data ◽

Dynamic Feature ◽

Long Distance ◽

Classification Framework ◽

Training Samples ◽

Feature Augmentation ◽

Limited Training Samples ◽

Transformer Model ◽

Overfitting Problem

Recently, a great many deep convolutional neural network (CNN)-based methods have been proposed for hyperspectral image (HSI) classification. Although the proposed CNN-based methods have the advantages of spatial feature extraction, they are difficult to handle the sequential data with and CNNs are not good at modeling the long-range dependencies. However, the spectra of HSI are a kind of sequential data, and HSI usually contains hundreds of bands. Therefore, it is difficult for CNNs to handle HSI processing well. On the other hand, the Transformer model, which is based on an attention mechanism, has proved its advantages in processing sequential data. To address the issue of capturing relationships of sequential spectra in HSI in a long distance, in this study, Transformer is investigated for HSI classification. Specifically, in this study, a new classification framework titled spatial-spectral Transformer (SST) is proposed for HSI classification. In the proposed SST, a well-designed CNN is used to extract the spatial features, and a modified Transformer (a Transformer with dense connection, i.e., DenseTransformer) is proposed to capture sequential spectra relationships, and multilayer perceptron is used to finish the final classification task. Furthermore, dynamic feature augmentation, which aims to alleviate the overfitting problem and therefore generalize the model well, is proposed and added to the SST (SST-FA). In addition, to address the issue of limited training samples in HSI classification, transfer learning is combined with SST, and another classification framework titled transferring-SST (T-SST) is proposed. At last, to mitigate the overfitting problem and improve the classification accuracy, label smoothing is introduced for the T-SST-based classification framework (T-SST-L). The proposed SST, SST-FA, T-SST, and T-SST-L are tested on three widely used hyperspectral datasets. The obtained results reveal that the proposed models provide competitive results compared to the state-of-the-art methods, which shows that the concept of Transformer opens a new window for HSI classification.

Download Full-text

Dual Graph Convolutional Network for Hyperspectral Image Classification With Limited Training Samples

IEEE Transactions on Geoscience and Remote Sensing ◽

10.1109/tgrs.2021.3061088 ◽

2021 ◽

pp. 1-18

Author(s):

Xin He ◽

Yushi Chen ◽

Pedram Ghamisi

Keyword(s):

Image Classification ◽

Hyperspectral Image ◽

Dual Graph ◽

Hyperspectral Image Classification ◽

Convolutional Network ◽

Training Samples ◽

Limited Training Samples

Download Full-text

A DIVERSIFIED DEEP BELIEF NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xli-b7-443-2016 ◽

2016 ◽

Vol XLI-B7 ◽

pp. 443-449

Author(s):

P. Zhong ◽

Z. Q. Gong ◽

C. Schönlieb

Keyword(s):

Image Classification ◽

Real World ◽

Hyperspectral Image ◽

Classification Performance ◽

Fine Tuning ◽

Hyperspectral Image Classification ◽

Latent Factors ◽

Deep Model ◽

Training Samples ◽

Limited Training Samples

In recent years, researches in remote sensing demonstrated that deep architectures with multiple layers can potentially extract abstract and invariant features for better hyperspectral image classification. Since the usual real-world hyperspectral image classification task cannot provide enough training samples for a supervised deep model, such as convolutional neural networks (CNNs), this work turns to investigate the deep belief networks (DBNs), which allow unsupervised training. The DBN trained over limited training samples usually has many “dead” (never responding) or “potential over-tolerant” (always responding) latent factors (neurons), which decrease the DBN’s description ability and thus finally decrease the hyperspectral image classification performance. This work proposes a new diversified DBN through introducing a diversity promoting prior over the latent factors during the DBN pre-training and fine-tuning procedures. The diversity promoting prior in the training procedures will encourage the latent factors to be uncorrelated, such that each latent factor focuses on modelling unique information, and all factors will be summed up to capture a large proportion of information and thus increase description ability and classification performance of the diversified DBNs. The proposed method was evaluated over the well-known real-world hyperspectral image dataset. The experiments demonstrate that the diversified DBNs can obtain much better results than original DBNs and comparable or even better performances compared with other recent hyperspectral image classification methods.

Download Full-text

Adaptive Spatial Pyramid Constraint for Hyperspectral Image Classification With Limited Training Samples

IEEE Transactions on Geoscience and Remote Sensing ◽

10.1109/tgrs.2021.3095056 ◽

2021 ◽

pp. 1-14

Author(s):

Jun Yue ◽

Dingshun Zhu ◽

Leyuan Fang ◽

Pedram Ghamisi ◽

Yaowei Wang

Keyword(s):

Image Classification ◽

Hyperspectral Image ◽

Hyperspectral Image Classification ◽

Training Samples ◽

Limited Training Samples ◽

Spatial Pyramid

Download Full-text

Semi-Supervised Classification for Hyperspectral Images Based on Multiple Classifiers and Relaxation Strategy

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi7070284 ◽

2018 ◽

Vol 7 (7) ◽

pp. 284 ◽

Cited By ~ 1

Author(s):

Fuding Xie ◽

Dongcui Hu ◽

Fangfei Li ◽

Jun Yang ◽

Deshan Liu

Keyword(s):

Supervised Classification ◽

Nearest Neighbor ◽

Hyperspectral Image ◽

Multinomial Logistic Regression ◽

Hyperspectral Data ◽

Classification Result ◽

Training Samples ◽

Comparison Results ◽

Local Mean ◽

Limited Training Samples

Hyperspectral image (HSI) classification is a fundamental and challenging problem in remote sensing and its various applications. However, it is difficult to perfectly classify remotely sensed hyperspectral data by directly using classification techniques developed in pattern recognition. This is partially owing to a multitude of noise points and the limited training samples. Based on multinomial logistic regression (MLR), the local mean-based pseudo nearest neighbor (LMPNN) rule, and the discontinuity preserving relaxation (DPR) method, in this paper, a semi-supervised method for HSI classification is proposed. In pre-processing and post-processing, the DPR strategy is adopted to denoise the original hyperspectral data and improve the classification accuracy, respectively. The application of two classifiers, MLR and LMPNN, can automatically acquire more labeled samples in terms of a few labeled instances per class. This is termed the pre-classification procedure. The final classification result of the HSI is obtained by employing the MLRsub approach. The effectiveness of the proposal is experimentally evaluated by two real hyperspectral datasets, which are widely used to test the performance of the HSI classification algorithm. The comparison results using several competing methods confirm that the proposed method is effective, even for limited training samples.

Download Full-text

Spatial–Spectral Feature Refinement for Hyperspectral Image Classification Based on Attention-Dense 3D-2D-CNN

Sensors ◽

10.3390/s20185191 ◽

2020 ◽

Vol 20 (18) ◽

pp. 5191

Author(s):

Jin Zhang ◽

Fengyuan Wei ◽

Fan Feng ◽

Chunyang Wang

Keyword(s):

Spatial Information ◽

Hyperspectral Image ◽

Spectral Feature ◽

Small Sample ◽

Training Data ◽

Spectral Features ◽

Training Samples ◽

Proposed Model ◽

Limited Training Samples ◽

Hyperspectral Classification

Convolutional neural networks provide an ideal solution for hyperspectral image (HSI) classification. However, the classification effect is not satisfactory when limited training samples are available. Focused on “small sample” hyperspectral classification, we proposed a novel 3D-2D-convolutional neural network (CNN) model named AD-HybridSN (Attention-Dense-HybridSN). In our proposed model, a dense block was used to reuse shallow features and aimed at better exploiting hierarchical spatial–spectral features. Subsequent depth separable convolutional layers were used to discriminate the spatial information. Further refinement of spatial–spectral features was realized by the channel attention method and spatial attention method, which were performed behind every 3D convolutional layer and every 2D convolutional layer, respectively. Experiment results indicate that our proposed model can learn more discriminative spatial–spectral features using very few training data. In Indian Pines, Salinas and the University of Pavia, AD-HybridSN obtain 97.02%, 99.59% and 98.32% overall accuracy using only 5%, 1% and 1% labeled data for training, respectively, which are far better than all the contrast models.

Download Full-text

Spatial–Spectral Relation Network for Hyperspectral Image Classification With Limited Training Samples

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ◽

10.1109/jstars.2019.2957047 ◽

2019 ◽

Vol 12 (12) ◽

pp. 5086-5100

Author(s):

Mengbin Rao ◽

Ping Tang ◽

Zheng Zhang

Keyword(s):

Image Classification ◽

Hyperspectral Image ◽

Hyperspectral Image Classification ◽

Training Samples ◽

Limited Training Samples

Download Full-text

CSR-Net: Camera Spectral Response Network for Dimensionality Reduction and Classification in Hyperspectral Imagery

Remote Sensing ◽

10.3390/rs12203294 ◽

2020 ◽

Vol 12 (20) ◽

pp. 3294

Author(s):

Yunhao Zou ◽

Ying Fu ◽

Yinqiang Zheng ◽

Wei Li

Keyword(s):

Hyperspectral Image ◽

State Of The Art ◽

Spectral Response ◽

Curse Of Dimensionality ◽

Classification Problems ◽

Spectral Bands ◽

Spatial Features ◽

Response Network ◽

Training Samples ◽

Limited Training Samples

Hyperspectral image (HSI) classification has become one of the most significant tasks in the field of hyperspectral analysis. However, classifying each pixel in HSI accurately is challenging due to the curse of dimensionality and limited training samples. In this paper, we present an HSI classification architecture called camera spectral response network (CSR-Net), which can learn the optimal camera spectral response (CSR) function for HSI classification problems and effectively reduce the spectral dimensions of HSI. Specifically, we design a convolutional layer to simulate the capturing process of cameras, which learns the optimal CSR function for HSI classification. Then, spectral and spatial features are further extracted by spectral and spatial attention modules. On one hand, the learned CSR can be implemented physically and directly used to capture scenes, which makes the image acquisition process more convenient. On the other hand, compared with ordinary HSIs, we only need images with far fewer bands, without sacrificing the classification precision and avoiding the curse of dimensionality. The experimental results of four popular public hyperspectral datasets show that our method, with only a few image bands, outperforms state-of-the-art HSI classification methods which utilize the full spectral bands of images.

Download Full-text

Collaborative learning of lightweight convolutional neural network and deep clustering for hyperspectral image semi-supervised classification with limited training samples

ISPRS Journal of Photogrammetry and Remote Sensing ◽

10.1016/j.isprsjprs.2020.01.015 ◽

2020 ◽

Vol 161 ◽

pp. 164-178 ◽

Cited By ~ 10

Author(s):

Bei Fang ◽

Ying Li ◽

Haokui Zhang ◽

Jonathan Cheung-Wai Chan

Keyword(s):

Neural Network ◽

Collaborative Learning ◽

Convolutional Neural Network ◽

Supervised Classification ◽

Hyperspectral Image ◽

Training Samples ◽

Limited Training Samples

Download Full-text

A novel non-parametric weighted feature extraction method for classification of hyperspectral image with limited training samples

2007 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2007.4423106 ◽

2007 ◽

Cited By ~ 1

Author(s):

Jinn-Min Yang ◽

Pao-Ta Yu ◽

Bor-Chen Kuo ◽

Hsiao-Yun Huang

Keyword(s):

Feature Extraction ◽

Extraction Method ◽

Hyperspectral Image ◽

Feature Extraction Method ◽

Training Samples ◽

Limited Training Samples ◽

Non Parametric

Download Full-text

Dual-Triple Attention Network for Hyperspectral Image Classification Using Limited Training Samples

IEEE Geoscience and Remote Sensing Letters ◽

10.1109/lgrs.2021.3067348 ◽

2021 ◽

pp. 1-5

Author(s):

Ying Cui ◽

Zikun Yu ◽

Jiacheng Han ◽

Shan Gao ◽

Liguo Wang

Keyword(s):

Image Classification ◽

Hyperspectral Image ◽

Hyperspectral Image Classification ◽

Attention Network ◽

Training Samples ◽

Limited Training Samples

Download Full-text