Real-Time Dense Semantic Labeling with Dual-Path Framework for High-Resolution Remote Sensing Image

Yuhao Wang; Chen Chen; Meng Ding; Jiangyun Li

doi:10.3390/rs11243020

Remote Sensing ◽

10.3390/rs11243020 ◽

2019 ◽

Vol 11 (24) ◽

pp. 3020 ◽

Cited By ~ 2

Author(s):

Yuhao Wang ◽

Chen Chen ◽

Meng Ding ◽

Jiangyun Li

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Spatial Information ◽

Feature Fusion ◽

Remote Sensing Image ◽

Light Weight ◽

Fiber Network ◽

Recent Success ◽

Semantic Labeling ◽

Learning Procedure

Dense semantic labeling plays a pivotal role in high-resolution remote sensing image research. It provides pixel-level classification, which is crucial in land cover mapping and urban planning. With the recent success of the convolutional neural network (CNN), accuracy has been greatly improved by previous works. However, most networks boost performance by adding parameters and computational overhead, which increases inference time and hardware requirements, while some attempts with light-weight networks do not achieve satisfactory results because of their insufficient feature extraction ability. In this work, we propose an efficient light-weight CNN based on a dual-path architecture to address this issue. Our model utilizes three convolution layers as the spatial path to enhance the extraction of spatial information. Meanwhile, we develop the context path with the multi-fiber network (MFNet) followed by the pyramid pooling module (PPM) to obtain a sufficient receptive field. On top of these two paths, we adopt the channel attention block to refine the features from the context path and apply a feature fusion module to combine spatial information with context information. Moreover, a weighted cascade loss function is employed to enhance the learning procedure. With all these components, the performance can be significantly improved. Experiments on the Potsdam and Vaihingen datasets demonstrate that our network performs better than other light-weight networks, and even better than some classic networks. Compared to the state-of-the-art U-Net, our model achieves higher accuracy on the two datasets with 2.5 times fewer network parameters and 22 times fewer floating point operations (FLOPs).
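
The abstract names a channel attention block and a feature fusion module but does not specify them. The following is a minimal NumPy sketch of one common form of each, assuming a squeeze-and-gate style attention (global average pooling, a 1x1 convolution, a sigmoid) and fusion by channel concatenation. The function names, the `weight` and `bias` parameters, and the tensor sizes are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def channel_attention(features, weight, bias):
    """Re-weight the channels of a (C, H, W) feature map.

    weight (C, C) and bias (C,) stand in for a learned 1x1 convolution;
    they are hypothetical parameters, not values from the paper.
    """
    # Global average pooling: one descriptor per channel.
    pooled = features.mean(axis=(1, 2))            # shape (C,)
    # A 1x1 convolution on a 1x1 map is a matrix-vector product.
    logits = weight @ pooled + bias                # shape (C,)
    # Sigmoid gate in (0, 1) for each channel.
    gate = 1.0 / (1.0 + np.exp(-logits))
    # Broadcast the gate over the spatial dimensions.
    return features * gate[:, None, None]

def fuse(spatial, context):
    """Assumed fusion step: concatenate along the channel axis."""
    return np.concatenate([spatial, context], axis=0)

rng = np.random.default_rng(0)
context = rng.standard_normal((4, 8, 8))   # stand-in for context-path features
spatial = rng.standard_normal((2, 8, 8))   # stand-in for spatial-path features
refined = channel_attention(context, rng.standard_normal((4, 4)), np.zeros(4))
fused = fuse(spatial, refined)
print(fused.shape)  # (6, 8, 8)
```

In this sketch only the context features pass through the gate, mirroring the abstract's statement that attention refines the context path before fusion.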

Full Convolutional Neural Network Based on Multi-Scale Feature Fusion for the Class Imbalance Remote Sensing Image Classification

Remote Sensing ◽

10.3390/rs12213547 ◽

2020 ◽

Vol 12 (21) ◽

pp. 3547 ◽

Cited By ~ 2

Author(s):

Yuanyuan Ren ◽

Xianfeng Zhang ◽

Yongjian Ma ◽

Qiyuan Yang ◽

Chuanjian Wang ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

High Resolution ◽

Spatial Information ◽

Feature Fusion ◽

Remote Sensing Image ◽

Machine Learning Algorithms ◽

Image Feature ◽

Remote Sensing Images ◽

Land Covers

Sample imbalance remains one of the most important issues in remote sensing image segmentation. Typically, a high-resolution remote sensing image is characterized by high spatial resolution and low spectral resolution, complex large-scale land covers, small differences between some land cover classes, vague foreground, and an imbalanced distribution of samples. However, traditional machine learning algorithms are limited in extracting deep image features and in dealing with sample imbalance. In this paper, we propose an improved fully convolutional neural network based on DeepLab V3+, with a loss-function-based solution to sample imbalance. In addition, we select Sentinel-2 remote sensing images covering Yuli County, Bayingolin Mongol Autonomous Prefecture, Xinjiang Uygur Autonomous Region, China as data sources, and build a typical region image dataset by data augmentation. The experimental results show that the improved DeepLab V3+ model can not only utilize the spectral information of high-resolution remote sensing images, but also consider their rich spatial information. The classification accuracy of the proposed method on the test dataset reaches 97.97%, the mean Intersection-over-Union reaches 87.74%, and the Kappa coefficient reaches 0.9587. The work provides methodological guidance for sample imbalance correction, and the established data resource can serve as a reference for future studies.
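
The three figures reported above (classification accuracy, mean Intersection-over-Union, and the Kappa coefficient) can all be derived from a single confusion matrix. The sketch below uses the standard definitions; the function name and the small example matrix are illustrative assumptions and do not reproduce the paper's data.

```python
import numpy as np

def segmentation_metrics(conf):
    """Return (overall accuracy, mean IoU, Cohen's kappa).

    conf[i, j] counts pixels of true class i predicted as class j.
    """
    conf = np.asarray(conf, dtype=float)
    total = conf.sum()
    tp = np.diag(conf)
    row = conf.sum(axis=1)   # pixels per true class
    col = conf.sum(axis=0)   # pixels per predicted class

    overall_acc = tp.sum() / total
    # IoU per class = TP / (TP + FP + FN); classes absent from both
    # the labels and the predictions are skipped.
    union = row + col - tp
    valid = union > 0
    mean_iou = (tp[valid] / union[valid]).mean()
    # Kappa compares observed agreement with chance agreement.
    expected = (row * col).sum() / total**2
    kappa = (overall_acc - expected) / (1.0 - expected)
    return overall_acc, mean_iou, kappa

# Made-up two-class confusion matrix, for illustration only.
oa, miou, kappa = segmentation_metrics([[50, 10], [5, 35]])
print(oa, miou, kappa)
```

Because mean IoU averages over classes rather than pixels, it penalizes errors on rare classes more heavily than overall accuracy does, which is why it is commonly reported alongside accuracy on imbalanced data.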

Change Detection Method of High Resolution Remote Sensing Image Based on D-S Evidence Theory Feature Fusion

IEEE Access ◽

10.1109/access.2020.3047915 ◽

2021 ◽

Vol 9 ◽

pp. 4673-4687

Author(s):

Jixiang Zhao ◽

Shanwei Liu ◽

Jianhua Wan ◽

Muhammad Yasir ◽

Huayu Li

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Change Detection ◽

Detection Method ◽

Feature Fusion ◽

Evidence Theory ◽

Remote Sensing Image

High resolution remote sensing image segmentation based on graph theory and fractal net evolution approach

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprsarchives-xl-7-w4-197-2015 ◽

2015 ◽

Vol XL-7/W4 ◽

pp. 197-201 ◽

Cited By ~ 3

Author(s):

Y. Yang ◽

H. T. Li ◽

Y. S. Han ◽

H. Y. Gu

Keyword(s):

Remote Sensing ◽

Image Segmentation ◽

Graph Theory ◽

High Resolution ◽

Spatial Information ◽

Remote Sensing Image ◽

Sensing Applications ◽

Initial Object ◽

Object Layer ◽

Evolution Approach

Image segmentation is the foundation of further object-oriented image analysis, understanding and recognition. It is one of the key technologies in high resolution remote sensing applications. In this paper, a new fast image segmentation algorithm for high resolution remote sensing imagery is proposed, which is based on graph theory and the fractal net evolution approach (FNEA). Firstly, an image is modelled as a weighted undirected graph, where nodes correspond to pixels, and edges connect adjacent pixels. An initial object layer can be obtained efficiently from graph-based segmentation, which runs in time nearly linear in the number of image pixels. Then FNEA starts from the initial object layer and merges neighbouring objects pairwise, with the aim of minimizing the resulting summed heterogeneity. Furthermore, according to the characteristics of different features in high resolution remote sensing images, three different merging criteria for image objects based on spectral and spatial information are adopted. Finally, a comparison with the commercial remote sensing software eCognition shows that the efficiency of the algorithm is significantly improved, while the results maintain good feature boundaries.
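
The first stage described above (pixels as nodes, edges between adjacent pixels, a near-linear-time merge) can be sketched with a sorted edge list and a union-find structure. The merge test used here follows the well-known Felzenszwalb-Huttenlocher criterion, which is an assumption: the abstract does not state which graph-based criterion the authors use. The parameter `k` and the tiny test image are also illustrative, and the later FNEA merging stage is not covered.

```python
def segment(image, k=10.0):
    """Label a 2-D grayscale image (list of rows) by greedy edge merging."""
    h, w = len(image), len(image[0])
    # Edges between 4-connected neighbours, weighted by intensity difference.
    edges = []
    for y in range(h):
        for x in range(w):
            if x + 1 < w:
                edges.append((abs(image[y][x] - image[y][x + 1]),
                              y * w + x, y * w + x + 1))
            if y + 1 < h:
                edges.append((abs(image[y][x] - image[y + 1][x]),
                              y * w + x, (y + 1) * w + x))
    edges.sort()

    parent = list(range(h * w))
    size = [1] * (h * w)
    internal = [0.0] * (h * w)   # largest edge weight merged into each component

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]   # path halving
            i = parent[i]
        return i

    for weight, a, b in edges:
        ra, rb = find(a), find(b)
        if ra == rb:
            continue
        # Merge only if the edge is no heavier than both components tolerate.
        if weight <= min(internal[ra] + k / size[ra],
                         internal[rb] + k / size[rb]):
            if size[ra] < size[rb]:
                ra, rb = rb, ra
            parent[rb] = ra
            size[ra] += size[rb]
            internal[ra] = max(internal[ra], internal[rb], weight)
    return [[find(y * w + x) for x in range(w)] for y in range(h)]

# A dark left half and a bright right half should form two objects.
labels = segment([[10, 10, 200, 200],
                  [10, 11, 200, 201]])
print(labels)
```

Sorting the edges dominates the running time, and the union-find operations are close to constant time each, which is consistent with the near-linear behaviour mentioned in the abstract.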

Object-oriented multi-feature fusion change detection method for high resolution remote sensing image

2009 17th International Conference on Geoinformatics ◽

10.1109/geoinformatics.2009.5293463 ◽

2009 ◽

Cited By ~ 1

Author(s):

Wen-jie Wang ◽

Zhong-ming Zhao ◽

Hai-qing Zhu

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Change Detection ◽

Detection Method ◽

Feature Fusion ◽

Object Oriented ◽

Remote Sensing Image

HA-MPPNet: Height Aware-Multi Path Parallel Network for High Spatial Resolution Remote Sensing Image Semantic Segmentation

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10100672 ◽

2021 ◽

Vol 10 (10) ◽

pp. 672

Author(s):

Suting Chen ◽

Chaoqun Wu ◽

Mithun Mukherjee ◽

Yujie Zheng

Keyword(s):

Remote Sensing ◽

Spatial Resolution ◽

Spatial Information ◽

Feature Fusion ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Surface Model ◽

Semantic Features ◽

Low Level ◽

Parallel Network

Semantic segmentation of remote sensing images (RSI) plays a significant role in urban management and land cover classification. Due to the richer spatial information in RSI, existing convolutional neural network (CNN)-based methods cannot segment images accurately and lose some edge information of objects. In addition, recent studies have shown that combining additional 3D geometric data with 2D appearance helps to distinguish pixel categories. However, most of these methods require height maps as additional inputs, which severely limits their applications. To alleviate the above issues, we propose a height aware-multi path parallel network (HA-MPPNet). Our proposed MPPNet first obtains multi-level semantic features while maintaining the spatial resolution in each path to preserve detailed image information. Afterward, gated high-low level feature fusion is utilized to complement the lack of low-level semantics. Then, we design the height feature decode branch to learn the height features under the supervision of digital surface model (DSM) images and use the learned embeddings to improve semantic context by height feature guide propagation. Note that our module does not need a DSM image as additional input after training and is end-to-end. Our method outperforms other state-of-the-art methods for semantic segmentation on publicly available remote sensing image datasets.

Dense Semantic Labeling with Atrous Spatial Pyramid Pooling and Decoder for High-Resolution Remote Sensing Imagery

Remote Sensing ◽

10.3390/rs11010020 ◽

2018 ◽

Vol 11 (1) ◽

pp. 20 ◽

Cited By ~ 32

Author(s):

Yuhao Wang ◽

Binxiu Liang ◽

Meng Ding ◽

Jiangyun Li

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Spatial Information ◽

Conditional Random Field ◽

Convolutional Network ◽

Remote Sensing Imagery ◽

Multi Scale ◽

Semantic Labeling ◽

Spatial Pyramid Pooling ◽

Spatial Pyramid

Dense semantic labeling is significant in high-resolution remote sensing imagery research and it has been widely used in land-use analysis and environment protection. With the recent success of fully convolutional networks (FCN), various types of network architectures have largely improved performance. Among them, atrous spatial pyramid pooling (ASPP) and encoder-decoder are two successful ones. The former structure is able to extract multi-scale contextual information with multiple effective fields of view, while the latter structure can recover the spatial information to obtain sharper object boundaries. In this study, we propose a more efficient fully convolutional network by combining the advantages of both structures. Our model utilizes the deep residual network (ResNet) followed by ASPP as the encoder and combines two scales of high-level features with corresponding low-level features as the decoder at the upsampling stage. We further develop a multi-scale loss function to enhance the learning procedure. In the postprocessing, a novel superpixel-based dense conditional random field is employed to refine the predictions. We evaluate the proposed method on the Potsdam and Vaihingen datasets and the experimental results demonstrate that our method performs better than other machine learning or deep learning methods. Compared with the state-of-the-art DeepLab_v3+, our model gains 0.4% and 0.6% improvements in overall accuracy on these two datasets, respectively.
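
To illustrate how ASPP obtains several fields of view from the same small kernel, the sketch below applies one kernel at different atrous (dilation) rates and stacks the responses. It is a one-dimensional simplification under stated assumptions: the rates `(1, 2, 4)`, the function names, and the odd-length kernel requirement are illustrative choices, and real ASPP operates on 2-D feature maps with learned kernels.

```python
import numpy as np

def atrous_conv1d(signal, kernel, rate):
    """Same-size 1-D correlation with taps spaced `rate` samples apart.

    Assumes an odd-length kernel so the output stays centred.
    """
    k = len(kernel)
    pad = rate * (k - 1) // 2
    padded = np.pad(signal, pad)          # zero padding on both sides
    out = np.zeros(len(signal))
    for i in range(len(signal)):
        # Sample the input every `rate` positions.
        taps = padded[i : i + rate * (k - 1) + 1 : rate]
        out[i] = np.dot(taps, kernel)
    return out

def aspp1d(signal, kernel, rates=(1, 2, 4)):
    """Stack the responses of several atrous rates as separate channels."""
    return np.stack([atrous_conv1d(signal, kernel, r) for r in rates])

impulse = np.zeros(9)
impulse[4] = 1.0
response = aspp1d(impulse, np.ones(3))
print(response)
```

Each row of `response` shows where a three-tap kernel "sees" the impulse: the taps spread further apart as the rate grows, so the field of view widens without adding parameters.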

Semantic Multigranularity Feature Learning for High-Resolution Remote Sensing Image Scene Classification

Applied Sciences ◽

10.3390/app11199204 ◽

2021 ◽

Vol 11 (19) ◽

pp. 9204

Author(s):

Xinyi Ma ◽

Zhifeng Xiao ◽

Hong-sik Yun ◽

Seung-Jun Lee

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Spatial Information ◽

Feature Learning ◽

Remote Sensing Image ◽

Input Image ◽

Training Data ◽

Aerial Image ◽

Scene Classification ◽

Feature Maps

High-resolution remote sensing image scene classification is a challenging visual task due to the large intra-class variance and small inter-class variance of the categories. To accurately recognize the scene categories, it is essential to learn discriminative features from both global and local critical regions. Recent efforts focus on encouraging the network to learn multigranularity features by destroying the spatial information of the input image at different scales, which leads to meaningless edges that are harmful to training. In this study, we propose a novel method named Semantic Multigranularity Feature Learning Network (SMGFL-Net) for remote sensing image scene classification. The core idea is to learn both global and multigranularity local features from rearranged intermediate feature maps, thus eliminating the meaningless edges. These features are then fused for the final prediction. Our proposed framework is compared with a collection of state-of-the-art (SOTA) methods on two fine-grained remote sensing image scene datasets, NWPU-RESISC45 and the Aerial Image Dataset (AID). We justify several design choices, including the branch granularities, fusion strategies, pooling operations, and necessity of feature map rearrangement, through a comparative study. Moreover, the overall performance results show that SMGFL-Net consistently outperforms other peer methods in classification accuracy, and the superiority is more apparent with less training data, demonstrating the efficacy of feature learning of our approach.

A Multi-Level Feature Fusion Network for Remote Sensing Image Segmentation

Sensors ◽

10.3390/s21041267 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1267

Author(s):

Sijun Dong ◽

Zhengchao Chen

Keyword(s):

Remote Sensing ◽

Image Segmentation ◽

High Resolution ◽

Feature Fusion ◽

Remote Sensing Image ◽

Remote Sensing Images ◽

Environmental Disaster ◽

Forestry Industry ◽

Multi Level ◽

Industrial Level

High-resolution remote sensing image segmentation is a mature component of many industrial-level image applications, and it also has military and civil uses. Scene analysis of high-resolution remote sensing images needs to be automated as much as possible, since it plays a significant role in environmental disaster monitoring, the forestry industry, agricultural farming, urban planning, and road analysis. To address the large differences in scale among target objects in remote sensing images and the poor recognition results for small objects, this study proposes a multi-level feature fusion network (MFNet) that integrates the multi-level features in the backbone to obtain different types of image information. The experiments in this study demonstrate that the proposed network achieves good segmentation results on the Vaihingen and Potsdam datasets. This investigation improves the recognition results of remote sensing image segmentation to a certain extent.

Building Extraction from High Resolution Remote Sensing Image by Attribute Filtering and Context Analysis

JOURNAL OF ELECTRONICS INFORMATION TECHNOLOGY ◽

10.3724/sp.j.1146.2012.00615 ◽

2013 ◽

Vol 34 (12) ◽

pp. 2985-2991

Author(s):

Li-bing Jiang ◽

Zhuang Wang ◽

Wen-xian Yu

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Remote Sensing Image ◽

Building Extraction ◽

Context Analysis

Road Centreline Extraction of High-Resolution Remote Sensing Image with Improved Beamlet Transform and K-Means Clustering

Arabian Journal for Science and Engineering ◽

10.1007/s13369-021-05412-1 ◽

2021 ◽

Vol 46 (4) ◽

pp. 4153-4162

Author(s):

Hancong Fu ◽

Fan Deng ◽

Yanlin Shao ◽

Yuangang Liu ◽

Jiahua Zhang

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Remote Sensing Image ◽

Beamlet Transform
