A Two-Stream Deep Fusion Framework for High-Resolution Aerial Scene Classification

Computational Intelligence and Neuroscience ◽

10.1155/2018/8639367 ◽

2018 ◽

Vol 2018 ◽

pp. 1-13 ◽

Cited By ~ 25

Author(s):

Yunlong Yu ◽

Fuxian Liu

Keyword(s):

High Resolution ◽

Classification Accuracy ◽

Feature Fusion ◽

Saliency Detection ◽

Feature Representation ◽

Aerial Image ◽

Scene Classification ◽

Representation Method ◽

Learning Machine ◽

Elm Classifier

One of the challenging problems in understanding high-resolution remote sensing images is aerial scene classification. A well-designed feature representation method and classifier can improve classification accuracy. In this paper, we construct a new two-stream deep architecture for aerial scene classification. First, we use two pretrained convolutional neural networks (CNNs) as feature extractor to learn deep features from the original aerial image and the processed aerial image through saliency detection, respectively. Second, two feature fusion strategies are adopted to fuse the two different types of deep convolutional features extracted by the original RGB stream and the saliency stream. Finally, we use the extreme learning machine (ELM) classifier for final classification with the fused features. The effectiveness of the proposed architecture is tested on four challenging datasets: UC-Merced dataset with 21 scene categories, WHU-RS dataset with 19 scene categories, AID dataset with 30 scene categories, and NWPU-RESISC45 dataset with 45 challenging scene categories. The experimental results demonstrate that our architecture gets a significant classification accuracy improvement over all state-of-the-art references.

Download Full-text

Hierarchical Deep Feature Representation for High-Resolution Scene Classification

IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2019.8898849 ◽

2019 ◽

Author(s):

Xiaoyong Bian ◽

Chunfang Chen ◽

Chunhua Deng ◽

Ruiyao Liu ◽

Qian Du

Keyword(s):

High Resolution ◽

Feature Representation ◽

Scene Classification ◽

Deep Feature

Download Full-text

A Multi-Branch Feature Fusion Strategy Based on an Attention Mechanism for Remote Sensing Image Scene Classification

Remote Sensing ◽

10.3390/rs13101950 ◽

2021 ◽

Vol 13 (10) ◽

pp. 1950

Author(s):

Cuiping Shi ◽

Xin Zhao ◽

Liguo Wang

Keyword(s):

Remote Sensing ◽

Feature Extraction ◽

Classification Accuracy ◽

Feature Fusion ◽

State Of The Art ◽

Rapid Development ◽

Remote Sensing Image ◽

Classification Performance ◽

Attention Mechanism ◽

Scene Classification

In recent years, with the rapid development of computer vision, increasing attention has been paid to remote sensing image scene classification. To improve the classification performance, many studies have increased the depth of convolutional neural networks (CNNs) and expanded the width of the network to extract more deep features, thereby increasing the complexity of the model. To solve this problem, in this paper, we propose a lightweight convolutional neural network based on attention-oriented multi-branch feature fusion (AMB-CNN) for remote sensing image scene classification. Firstly, we propose two convolution combination modules for feature extraction, through which the deep features of images can be fully extracted with multi convolution cooperation. Then, the weights of the feature are calculated, and the extracted deep features are sent to the attention mechanism for further feature extraction. Next, all of the extracted features are fused by multiple branches. Finally, depth separable convolution and asymmetric convolution are implemented to greatly reduce the number of parameters. The experimental results show that, compared with some state-of-the-art methods, the proposed method still has a great advantage in classification accuracy with very few parameters.

Download Full-text

Dense Connectivity Based Two-Stream Deep Feature Fusion Framework for Aerial Scene Classification

Remote Sensing ◽

10.3390/rs10071158 ◽

2018 ◽

Vol 10 (7) ◽

pp. 1158 ◽

Cited By ~ 27

Author(s):

Yunlong Yu ◽

Fuxian Liu

Keyword(s):

Remote Sensing ◽

Feature Fusion ◽

Local Binary Patterns ◽

Aerial Image ◽

Great Success ◽

Scene Classification ◽

Fusion Model ◽

Deep Architecture ◽

Deep Feature ◽

Training Samples

Aerial scene classification is an active and challenging problem in high-resolution remote sensing imagery understanding. Deep learning models, especially convolutional neural networks (CNNs), have achieved prominent performance in this field. The extraction of deep features from the layers of a CNN model is widely used in these CNN-based methods. Although the CNN-based approaches have obtained great success, there is still plenty of room to further increase the classification accuracy. As a matter of fact, the fusion with other features has great potential for leading to the better performance of aerial scene classification. Therefore, we propose two effective architectures based on the idea of feature-level fusion. The first architecture, i.e., texture coded two-stream deep architecture, uses the raw RGB network stream and the mapped local binary patterns (LBP) coded network stream to extract two different sets of features and fuses them using a novel deep feature fusion model. In the second architecture, i.e., saliency coded two-stream deep architecture, we employ the saliency coded network stream as the second stream and fuse it with the raw RGB network stream using the same feature fusion model. For sake of validation and comparison, our proposed architectures are evaluated via comprehensive experiments with three publicly available remote sensing scene datasets. The classification accuracies of saliency coded two-stream architecture with our feature fusion model achieve 97.79%, 98.90%, 94.09%, 95.99%, 85.02%, and 87.01% on the UC-Merced dataset (50% and 80% training samples), the Aerial Image Dataset (AID) (20% and 50% training samples), and the NWPU-RESISC45 dataset (10% and 20% training samples), respectively, overwhelming state-of-the-art methods.

Download Full-text

Nonlinear Analysis of Electrocardiography Signals for Atrial Fibrillation

The Scientific World JOURNAL ◽

10.1155/2013/509784 ◽

2013 ◽

Vol 2013 ◽

pp. 1-4 ◽

Cited By ~ 4

Author(s):

Necmettin Sezgin

Keyword(s):

Atrial Fibrillation ◽

Classification Accuracy ◽

Heart Beat ◽

Ecg Signal ◽

Bispectral Analysis ◽

Classification Result ◽

Ecg Signals ◽

Learning Machine ◽

Elm Classifier ◽

Normal Ecgs

This paper aims to analyze the electrocardiography (ECG) signals for patient with atrial fibrillation (AF) by using bispectrum and extreme learning machine (ELM). AF is the most common irregular heart beat disease which may cause many cardiac diseases as well. Bispectral analysis was used to extract the nonlinear information in the ECG signals. The bispectral features of each ECG episode were determined and fed to the ELM classifier. The classification accuracy of ELM to distinguish nonterminating, terminating AF, and terminating immediately AF was 96.25%. In this study, the normal ECG signal was also compared with AF ECG signal due to the nonlinearity which was determined by bispectrum. The classification result of ELM was 99.15% to distinguish AF ECGs from normal ECGs.

Download Full-text

Retraction: Zhu R. et al. Attention-Based Deep Feature Fusion for the Scene Classification of High-Resolution Remote Sensing Images. Remote Sensing. 2019, 11(17), 1996

Remote Sensing ◽

10.3390/rs12040742 ◽

2020 ◽

Vol 12 (4) ◽

pp. 742 ◽

Cited By ~ 1

Author(s):

Ruixi Zhu ◽

Li Yan ◽

Nan Mo ◽

Yi Liu

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Research Method ◽

Feature Fusion ◽

Scene Classification ◽

Remote Sensing Images ◽

Deep Feature

We have been made aware that the innovative contributions, research method and the majority of the content of this article [...]

Download Full-text

HROM: Learning High-Resolution Representation and Object-Aware Masks for Visual Object Tracking

Sensors ◽

10.3390/s20174807 ◽

2020 ◽

Vol 20 (17) ◽

pp. 4807

Author(s):

Dawei Zhang ◽

Zhonglong Zheng ◽

Tianxiang Wang ◽

Yiran He

Keyword(s):

High Resolution ◽

High Performance ◽

Feature Fusion ◽

Feature Representation ◽

Visual Object ◽

Low Resolution ◽

Geometric Transformations ◽

Siamese Network ◽

Target Template ◽

Efficiency And Effectiveness

Siamese network-based trackers consider tracking as features cross-correlation between the target template and the search region. Therefore, feature representation plays an important role for constructing a high-performance tracker. However, all existing Siamese networks extract the deep but low-resolution features of the entire patch, which is not robust enough to estimate the target bounding box accurately. In this work, to address this issue, we propose a novel high-resolution Siamese network, which connects the high-to-low resolution convolution streams in parallel as well as repeatedly exchanges the information across resolutions to maintain high-resolution representations. The resulting representation is semantically richer and spatially more precise by a simple yet effective multi-scale feature fusion strategy. Moreover, we exploit attention mechanisms to learn object-aware masks for adaptive feature refinement, and use deformable convolution to handle complex geometric transformations. This makes the target more discriminative against distractors and background. Without bells and whistles, extensive experiments on popular tracking benchmarks containing OTB100, UAV123, VOT2018 and LaSOT demonstrate that the proposed tracker achieves state-of-the-art performance and runs in real time, confirming its efficiency and effectiveness.

Download Full-text

Rich CNN Features for Water-Body Segmentation from Very High Resolution Aerial and Satellite Imagery

Remote Sensing ◽

10.3390/rs13101912 ◽

2021 ◽

Vol 13 (10) ◽

pp. 1912

Author(s):

Zhili Zhang ◽

Meng Lu ◽

Shunping Ji ◽

Huafen Yu ◽

Chenhui Nie

Keyword(s):

Remote Sensing ◽

Feature Extraction ◽

High Resolution ◽

Satellite Imagery ◽

Water Body ◽

Feature Fusion ◽

Feature Representation ◽

Water Bodies ◽

Feature Maps ◽

Very High

Extracting water-bodies accurately is a great challenge from very high resolution (VHR) remote sensing imagery. The boundaries of a water body are commonly hard to identify due to the complex spectral mixtures caused by aquatic vegetation, distinct lake/river colors, silts near the bank, shadows from the surrounding tall plants, and so on. The diversity and semantic information of features need to be increased for a better extraction of water-bodies from VHR remote sensing images. In this paper, we address these problems by designing a novel multi-feature extraction and combination module. This module consists of three feature extraction sub-modules based on spatial and channel correlations in feature maps at each scale, which extract the complete target information from the local space, larger space, and between-channel relationship to achieve a rich feature representation. Simultaneously, to better predict the fine contours of water-bodies, we adopt a multi-scale prediction fusion module. Besides, to solve the semantic inconsistency of feature fusion between the encoding stage and the decoding stage, we apply an encoder-decoder semantic feature fusion module to promote fusion effects. We carry out extensive experiments in VHR aerial and satellite imagery respectively. The result shows that our method achieves state-of-the-art segmentation performance, surpassing the classic and recent methods. Moreover, our proposed method is robust in challenging water-body extraction scenarios.

Download Full-text

Residual Dense Network Based on Channel-Spatial Attention for the Scene Classification of a High-Resolution Remote Sensing Image

Remote Sensing ◽

10.3390/rs12111887 ◽

2020 ◽

Vol 12 (11) ◽

pp. 1887 ◽

Cited By ~ 5

Author(s):

Xiaolei Zhao ◽

Jing Zhang ◽

Jimiao Tian ◽

Li Zhuo ◽

Jie Zhang

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Spatial Attention ◽

Remote Sensing Image ◽

Learning Ability ◽

Aerial Image ◽

Dense Network ◽

Scene Classification ◽

Training Requirements

The scene classification of a remote sensing image has been widely used in various fields as an important task of understanding the content of a remote sensing image. Specially, a high-resolution remote sensing scene contains rich information and complex content. Considering that the scene content in a remote sensing image is very tight to the spatial relationship characteristics, how to design an effective feature extraction network directly decides the quality of classification by fully mining the spatial information in a high-resolution remote sensing image. In recent years, convolutional neural networks (CNNs) have achieved excellent performance in remote sensing image classification, especially the residual dense network (RDN) as one of the representative networks of CNN, which shows a stronger feature learning ability as it fully utilizes all the convolutional layer information. Therefore, we design an RDN based on channel-spatial attention for scene classification of a high-resolution remote sensing image. First, multi-layer convolutional features are fused with residual dense blocks. Then, a channel-spatial attention module is added to obtain more effective feature representation. Finally, softmax classifier is applied to classify the scene after adopting data augmentation strategy for meeting the training requirements of the network parameters. Five experiments are conducted on the UC Merced Land-Use Dataset (UCM) and Aerial Image Dataset (AID), and the competitive results demonstrate that our method can extract more effective features and is more conducive to classifying a scene.

Download Full-text

FEATURE FUSION FOR CROSS-MODAL SCENE CLASSIFICATION OF REMOTE SENSING IMAGE

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliv-m-3-2021-63-2021 ◽

2021 ◽

Vol XLIV-M-3-2021 ◽

pp. 63-66

Author(s):

W. Geng ◽

W. Zhou ◽

S. Jin

Keyword(s):

Remote Sensing ◽

Feature Fusion ◽

Remote Sensing Image ◽

Aerial Images ◽

Aerial Image ◽

Svm Classifier ◽

Scene Classification ◽

Street View ◽

Modal Model

Abstract. Scene classification plays an important role in remote sensing field. Traditional approaches use high-resolution remote sensing images as data source to extract powerful features. Although these kind of methods are common, the model performance is severely affected by the image quality of the dataset, and the single modal (source) of images tend to cause the mission of some scene semantic information, which eventually degrade the classification accuracy. Nowadays, multi-modal remote sensing data become easy to obtain since the development of remote sensing technology. How to carry out scene classification of cross-modal data has become an interesting topic in the field. To solve the above problems, this paper proposes using feature fusion for cross-modal scene classification of remote sensing image, i.e., aerial and ground street view images, expecting to use the advantages of aerial images and ground street view data to complement each other. Our cross- modal model is based on Siamese Network. Specifically, we first train the cross-modal model by pairing different sources of data with aerial image and ground data. Then, the trained model is used to extract the deep features of the aerial and ground image pair, and the features of the two perspectives are fused to train a SVM classifier for scene classification. Our approach has been demonstrated using two public benchmark datasets, AiRound and CV-BrCT. The preliminary results show that the proposed method achieves state-of-the-art performance compared with the traditional methods, indicating that the information from ground data can contribute to aerial image classification.

Download Full-text

Deep Feature Fusion for High-Resolution Aerial Scene Classification

Neural Processing Letters ◽

10.1007/s11063-019-10119-4 ◽

2019 ◽

Vol 51 (1) ◽

pp. 853-865

Author(s):

Heng Wang ◽

Yunlong Yu

Keyword(s):

High Resolution ◽

Feature Fusion ◽

Scene Classification ◽

Deep Feature

Download Full-text