Multi-Scale Context Aggregation for Semantic Segmentation of Remote Sensing Images

Jing Zhang; Shaofu Lin; Lei Ding; Lorenzo Bruzzone

doi:10.3390/rs12040701

Multi-Scale Context Aggregation for Semantic Segmentation of Remote Sensing Images

Remote Sensing ◽

10.3390/rs12040701 ◽

2020 ◽

Vol 12 (4) ◽

pp. 701 ◽

Cited By ~ 4

Author(s):

Jing Zhang ◽

Shaofu Lin ◽

Lei Ding ◽

Lorenzo Bruzzone

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Semantic Information ◽

Contextual Information ◽

Semantic Segmentation ◽

Spatial Correlations ◽

Remote Sensing Images ◽

Localization Accuracy ◽

Spatial Pooling ◽

Spatial Size

The semantic segmentation of remote sensing images (RSIs) is important in a variety of applications. Conventional encoder-decoder-based convolutional neural networks (CNNs) use cascade pooling operations to aggregate the semantic information, which results in a loss of localization accuracy and in the preservation of spatial details. To overcome these limitations, we introduce the use of the high-resolution network (HRNet) to produce high-resolution features without the decoding stage. Moreover, we enhance the low-to-high features extracted from different branches separately to strengthen the embedding of scale-related contextual information. The low-resolution features contain more semantic information and have a small spatial size; thus, they are utilized to model the long-term spatial correlations. The high-resolution branches are enhanced by introducing an adaptive spatial pooling (ASP) module to aggregate more local contexts. By combining these context aggregation designs across different levels, the resulting architecture is capable of exploiting spatial context at both global and local levels. The experimental results obtained on two RSI datasets show that our approach significantly improves the accuracy with respect to the commonly used CNNs and achieves state-of-the-art performance.

Download Full-text

Knowledge and Geo-Object Based Graph Convolutional Network for Remote Sensing Semantic Segmentation

Sensors ◽

10.3390/s21113848 ◽

2021 ◽

Vol 21 (11) ◽

pp. 3848

Author(s):

Wei Cui ◽

Meng Yao ◽

Yuanjie Hao ◽

Ziwei Wang ◽

Xin He ◽

...

Keyword(s):

Remote Sensing ◽

Prior Knowledge ◽

Contextual Information ◽

Information Aggregation ◽

Semantic Segmentation ◽

Spatial Correlations ◽

Convolutional Network ◽

Object Based ◽

Graph Neural Networks ◽

Salt And Pepper

Pixel-based semantic segmentation models fail to effectively express geographic objects and their topological relationships. Therefore, in semantic segmentation of remote sensing images, these models fail to avoid salt-and-pepper effects and cannot achieve high accuracy either. To solve these problems, object-based models such as graph neural networks (GNNs) are considered. However, traditional GNNs directly use similarity or spatial correlations between nodes to aggregate nodes’ information, which rely too much on the contextual information of the sample. The contextual information of the sample is often distorted, which results in a reduction in the node classification accuracy. To solve this problem, a knowledge and geo-object-based graph convolutional network (KGGCN) is proposed. The KGGCN uses superpixel blocks as nodes of the graph network and combines prior knowledge with spatial correlations during information aggregation. By incorporating the prior knowledge obtained from all samples of the study area, the receptive field of the node is extended from its sample context to the study area. Thus, the distortion of the sample context is overcome effectively. Experiments demonstrate that our model is improved by 3.7% compared with the baseline model named Cluster GCN and 4.1% compared with U-Net.

Download Full-text

Conditional Generative Adversarial Network-Based Training Sample Set Improvement Model for the Semantic Segmentation of High-Resolution Remote Sensing Images

IEEE Transactions on Geoscience and Remote Sensing ◽

10.1109/tgrs.2020.3033816 ◽

2020 ◽

pp. 1-17

Author(s):

Xin Pan ◽

Jian Zhao ◽

Jun Xu

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Semantic Segmentation ◽

Training Sample ◽

Remote Sensing Images ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Sample Set

Download Full-text

Semantic Segmentation of High Resolution Remote Sensing Images with Extra Context Attention Mechanism

2020 IEEE 20th International Conference on Communication Technology (ICCT) ◽

10.1109/icct50939.2020.9295814 ◽

2020 ◽

Author(s):

Weifu Fu ◽

Qing Peng ◽

Yanxiang Gong ◽

Mei Xie ◽

Shicheng Wang ◽

...

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Semantic Segmentation ◽

Attention Mechanism ◽

Remote Sensing Images

Download Full-text

HRCNet: High-Resolution Context Extraction Network for Semantic Segmentation of Remote Sensing Images

Remote Sensing ◽

10.3390/rs13010071 ◽

2020 ◽

Vol 13 (1) ◽

pp. 71

Author(s):

Zhiyong Xu ◽

Weicun Zhang ◽

Tianxiang Zhang ◽

Jiangyun Li

Keyword(s):

Remote Sensing ◽

Feature Extraction ◽

High Resolution ◽

Spatial Information ◽

Semantic Segmentation ◽

Context Information ◽

Remote Sensing Images ◽

Global Context ◽

Boundary Information ◽

Extraction Stage

Semantic segmentation is a significant method in remote sensing image (RSIs) processing and has been widely used in various applications. Conventional convolutional neural network (CNN)-based semantic segmentation methods are likely to lose the spatial information in the feature extraction stage and usually pay little attention to global context information. Moreover, the imbalance of category scale and uncertain boundary information meanwhile exists in RSIs, which also brings a challenging problem to the semantic segmentation task. To overcome these problems, a high-resolution context extraction network (HRCNet) based on a high-resolution network (HRNet) is proposed in this paper. In this approach, the HRNet structure is adopted to keep the spatial information. Moreover, the light-weight dual attention (LDA) module is designed to obtain global context information in the feature extraction stage and the feature enhancement feature pyramid (FEFP) structure is promoted and employed to fuse the contextual information of different scales. In addition, to achieve the boundary information, we design the boundary aware (BA) module combined with the boundary aware loss (BAloss) function. The experimental results evaluated on Potsdam and Vaihingen datasets show that the proposed approach can significantly improve the boundary and segmentation performance up to 92.0% and 92.3% on overall accuracy scores, respectively. As a consequence, it is envisaged that the proposed HRCNet model will be an advantage in remote sensing images segmentation.

Download Full-text

Efficient Patch-Wise Semantic Segmentation for Large-Scale Remote Sensing Images

Sensors ◽

10.3390/s18103232 ◽

2018 ◽

Vol 18 (10) ◽

pp. 3232 ◽

Cited By ~ 17

Author(s):

Yan Liu ◽

Qirui Ren ◽

Jiahui Geng ◽

Meng Ding ◽

Jiangyun Li

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Large Scale ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Training Data ◽

Land Resources ◽

Remote Sensing Images ◽

Training Strategy ◽

The Impact

Efficient and accurate semantic segmentation is the key technique for automatic remote sensing image analysis. While there have been many segmentation methods based on traditional hand-craft feature extractors, it is still challenging to process high-resolution and large-scale remote sensing images. In this work, a novel patch-wise semantic segmentation method with a new training strategy based on fully convolutional networks is presented to segment common land resources. First, to handle the high-resolution image, the images are split as local patches and then a patch-wise network is built. Second, training data is preprocessed in several ways to meet the specific characteristics of remote sensing images, i.e., color imbalance, object rotation variations and lens distortion. Third, a multi-scale training strategy is developed to solve the severe scale variation problem. In addition, the impact of conditional random field (CRF) is studied to improve the precision. The proposed method was evaluated on a dataset collected from a capital city in West China with the Gaofen-2 satellite. The dataset contains ten common land resources (Grassland, Road, etc.). The experimental results show that the proposed algorithm achieves 54.96% in terms of mean intersection over union (MIoU) and outperforms other state-of-the-art methods in remote sensing image segmentation.

Download Full-text

A Semantic Segmentation Approach Based on DeepLab Network in High-Resolution Remote Sensing Images

Lecture Notes in Computer Science - Image and Graphics ◽

10.1007/978-3-030-34113-8_25 ◽

2019 ◽

pp. 292-304

Author(s):

Hangtao Hu ◽

Shuo Cai ◽

Wei Wang ◽

Peng Zhang ◽

Zhiyong Li

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Semantic Segmentation ◽

Remote Sensing Images ◽

Segmentation Approach

Download Full-text

Semantic Segmentation of High-resolution Remote Sensing Images using Multiscale Skip Connection Network

IEEE Sensors Journal ◽

10.1109/jsen.2021.3139629 ◽

2021 ◽

pp. 1-1

Author(s):

Bifang Ma ◽

Chih-Yung Chang

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Semantic Segmentation ◽

Remote Sensing Images

Download Full-text

Fully convolutional DenseNet with adversarial training for semantic segmentation of high-resolution remote sensing images

Journal of Applied Remote Sensing ◽

10.1117/1.jrs.15.016520 ◽

2021 ◽

Vol 15 (01) ◽

Author(s):

Xuejun Guo ◽

Zehua Chen ◽

Chengyi Wang

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Semantic Segmentation ◽

Remote Sensing Images ◽

Adversarial Training

Download Full-text

U-net Network for Building Information Extraction of Remote-Sensing Imagery

International Journal of Online and Biomedical Engineering (iJOE) ◽

10.3991/ijoe.v14i12.9335 ◽

2018 ◽

Vol 14 (12) ◽

pp. 179

Author(s):

Jingtan Li ◽

Maolin Xu ◽

Hongling Xiu

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Information Extraction ◽

Image Data ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Remote Sensing Images ◽

Training Set ◽

Building Information ◽

The Face

With the resolution of remote sensing images is getting higher and higher, high-resolution remote sensing images are widely used in many areas. Among them, image information extraction is one of the basic applications of remote sensing images. In the face of massive high-resolution remote sensing image data, the traditional method of target recognition is difficult to cope with. Therefore, this paper proposes a remote sensing image extraction based on U-net network. Firstly, the U-net semantic segmentation network is used to train the training set, and the validation set is used to verify the training set at the same time, and finally the test set is used for testing. The experimental results show that U-net can be applied to the extraction of buildings.

Download Full-text

SCAttNet: Semantic Segmentation Network With Spatial and Channel Attention Mechanism for High-Resolution Remote Sensing Images

IEEE Geoscience and Remote Sensing Letters ◽

10.1109/lgrs.2020.2988294 ◽

2020 ◽

pp. 1-5 ◽

Cited By ~ 1

Author(s):

Haifeng Li ◽

Kaijian Qiu ◽

Li Chen ◽

Xiaoming Mei ◽

Liang Hong ◽

...

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Semantic Segmentation ◽

Attention Mechanism ◽

Remote Sensing Images

Download Full-text