Building Footprint Extraction from High-Resolution Images via Spatial Residual Inception Convolutional Neural Network

Penghua Liu; Xiaoping Liu; Mengxi Liu; Qian Shi; Jinxing Yang; Xiaocong Xu; Yuanying Zhang

doi:10.3390/rs11070830

Building Footprint Extraction from High-Resolution Images via Spatial Residual Inception Convolutional Neural Network

Remote Sensing ◽

10.3390/rs11070830 ◽

2019 ◽

Vol 11 (7) ◽

pp. 830 ◽

Cited By ~ 26

Author(s):

Penghua Liu ◽

Xiaoping Liu ◽

Mengxi Liu ◽

Qian Shi ◽

Jinxing Yang ◽

...

Keyword(s):

Remote Sensing ◽

Large Scale ◽

Rapid Development ◽

Morphological Characteristics ◽

Aerial Image ◽

Model Parameters ◽

Building Detection ◽

Remote Sensing Images ◽

Convolutional Network ◽

Proposed Model

The rapid development in deep learning and computer vision has introduced new opportunities and paradigms for building extraction from remote sensing images. In this paper, we propose a novel fully convolutional network (FCN), in which a spatial residual inception (SRI) module is proposed to capture and aggregate multi-scale contexts for semantic understanding by successively fusing multi-level features. The proposed SRI-Net is capable of accurately detecting large buildings that might be easily omitted while retaining global morphological characteristics and local details. On the other hand, to improve computational efficiency, depthwise separable convolutions and convolution factorization are introduced to significantly decrease the number of model parameters. The proposed model is evaluated on the Inria Aerial Image Labeling Dataset and the Wuhan University (WHU) Aerial Building Dataset. The experimental results show that the proposed methods exhibit significant improvements compared with several state-of-the-art FCNs, including SegNet, U-Net, RefineNet, and DeepLab v3+. The proposed model shows promising potential for building detection from remote sensing images on a large scale.

Download Full-text

Boundary-Assisted Learning for Building Extraction from Optical Remote Sensing Imagery

Remote Sensing ◽

10.3390/rs13040760 ◽

2021 ◽

Vol 13 (4) ◽

pp. 760

Author(s):

Sheng He ◽

Wanshou Jiang

Keyword(s):

Remote Sensing ◽

Receptive Fields ◽

Morphological Characteristics ◽

Learning Task ◽

Aerial Image ◽

Model Parameters ◽

Optical Remote Sensing ◽

Building Extraction ◽

Convolutional Network ◽

Remote Sensing Imagery

Deep learning methods have been shown to significantly improve the performance of building extraction from optical remote sensing imagery. However, keeping the morphological characteristics, especially the boundaries, is still a challenge that requires further study. In this paper, we propose a novel fully convolutional network (FCN) for accurately extracting buildings, in which a boundary learning task is embedded to help maintain the boundaries of buildings. Specifically, in the training phase, our framework simultaneously learns the extraction of buildings and boundary detection and only outputs extraction results while testing. In addition, we introduce spatial variation fusion (SVF) to establish an association between the two tasks, thus coupling them and making them share the latent semantics and interact with each other. On the other hand, we utilize separable convolution with a larger kernel to enlarge the receptive fields while reducing the number of model parameters and adopt the convolutional block attention module (CBAM) to boost the network. The proposed framework was extensively evaluated on the WHU Building Dataset and the Inria Aerial Image Labeling Dataset. The experiments demonstrate that our method achieves state-of-the-art performance on building extraction. With the assistance of boundary learning, the boundary maintenance of buildings is ameliorated.

Download Full-text

EU-Net: An Efficient Fully Convolutional Network for Building Extraction from Optical Remote Sensing Images

Remote Sensing ◽

10.3390/rs11232813 ◽

2019 ◽

Vol 11 (23) ◽

pp. 2813 ◽

Cited By ~ 4

Author(s):

Wenchao Kang ◽

Yuming Xiang ◽

Feng Wang ◽

Hongjian You

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Ground Truth ◽

Aerial Image ◽

Optical Remote Sensing ◽

Building Extraction ◽

Remote Sensing Images ◽

Convolutional Network ◽

Practical Applications ◽

The Impact

Automatic building extraction from high-resolution remote sensing images has many practical applications, such as urban planning and supervision. However, fine details and various scales of building structures in high-resolution images bring new challenges to building extraction. An increasing number of neural network-based models have been proposed to handle these issues, while they are not efficient enough, and still suffer from the error ground truth labels. To this end, we propose an efficient end-to-end model, EU-Net, in this paper. We first design the dense spatial pyramid pooling (DSPP) to extract dense and multi-scale features simultaneously, which facilitate the extraction of buildings at all scales. Then, the focal loss is used in reverse to suppress the impact of the error labels in ground truth, making the training stage more stable. To assess the universality of the proposed model, we tested it on three public aerial remote sensing datasets: WHU aerial imagery dataset, Massachusetts buildings dataset, and Inria aerial image labeling dataset. Experimental results show that the proposed EU-Net is superior to the state-of-the-art models of all three datasets and increases the prediction efficiency by two to four times.

Download Full-text

Cohesion Intensive Deep Hashing for Remote Sensing Image Retrieval

Remote Sensing ◽

10.3390/rs12010101 ◽

2019 ◽

Vol 12 (1) ◽

pp. 101 ◽

Cited By ~ 3

Author(s):

Lirong Han ◽

Peng Li ◽

Xiao Bai ◽

Christos Grecos ◽

Xiaoyu Zhang ◽

...

Keyword(s):

Remote Sensing ◽

Image Retrieval ◽

Large Scale ◽

Remote Sensing Data ◽

Optimization Method ◽

Remote Sensing Image ◽

Model Parameters ◽

Remote Sensing Images ◽

Deep Hashing ◽

Deep Model

Recently, the demand for remote sensing image retrieval is growing and attracting the interest of many researchers because of the increasing number of remote sensing images. Hashing, as a method of retrieving images, has been widely applied to remote sensing image retrieval. In order to improve hashing performance, we develop a cohesion intensive deep hashing model for remote sensing image retrieval. The underlying architecture of our deep model is motivated by the state-of-the-art residual net. Residual nets aim at avoiding gradient vanishing and gradient explosion when the net reaches a certain depth. However, different from the residual net which outputs multiple class-labels, we present a residual hash net that is terminated by a Heaviside-like function for binarizing remote sensing images. In this scenario, the representational power of the residual net architecture is exploited to establish an end-to-end deep hashing model. The residual hash net is trained subject to a weighted loss strategy that intensifies the cohesiveness of image hash codes within one class. This effectively addresses the data imbalance problem normally arising in remote sensing image retrieval tasks. Furthermore, we adopted a gradualness optimization method for obtaining optimal model parameters in order to favor accurate binary codes with little quantization error. We conduct comparative experiments on large-scale remote sensing data sets such as UCMerced and AID. The experimental results validate the hypothesis that our method improves the performance of current remote sensing image retrieval.

Download Full-text

MILL: Channel Attention–based Deep Multiple Instance Learning for Landslide Recognition

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3454009 ◽

2021 ◽

Vol 17 (2s) ◽

pp. 1-11

Author(s):

Xiaochuan Tang ◽

Mingzhe Liu ◽

Hao Zhong ◽

Yuanzhen Ju ◽

Weile Li ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Large Scale ◽

Remote Sensing Image ◽

Disaster Risk ◽

Multiple Instance Learning ◽

Remote Sensing Images ◽

Loess Area ◽

Remote Sensing Image Classification ◽

Natural Disaster Risk

Landslide recognition is widely used in natural disaster risk management. Traditional landslide recognition is mainly conducted by geologists, which is accurate but inefficient. This article introduces multiple instance learning (MIL) to perform automatic landslide recognition. An end-to-end deep convolutional neural network is proposed, referred to as Multiple Instance Learning–based Landslide classification (MILL). First, MILL uses a large-scale remote sensing image classification dataset to build pre-train networks for landslide feature extraction. Second, MILL extracts instances and assign instance labels without pixel-level annotations. Third, MILL uses a new channel attention–based MIL pooling function to map instance-level labels to bag-level label. We apply MIL to detect landslides in a loess area. Experimental results demonstrate that MILL is effective in identifying landslides in remote sensing images.

Download Full-text

Building Change Detection for Remote Sensing Images Using a Dual-Task Constrained Deep Siamese Convolutional Network Model

IEEE Geoscience and Remote Sensing Letters ◽

10.1109/lgrs.2020.2988032 ◽

2021 ◽

pp. 1-5

Author(s):

Yi Liu ◽

Chao Pang ◽

Zongqian Zhan ◽

Xiaomeng Zhang ◽

Xue Yang

Keyword(s):

Remote Sensing ◽

Change Detection ◽

Network Model ◽

Dual Task ◽

Remote Sensing Images ◽

Convolutional Network

Download Full-text

Semantic Relation Model and Dataset for Remote Sensing Scene Understanding

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10070488 ◽

2021 ◽

Vol 10 (7) ◽

pp. 488

Author(s):

Peng Li ◽

Dezheng Zhang ◽

Aziguli Wulamu ◽

Xin Liu ◽

Peng Chen

Keyword(s):

Remote Sensing ◽

Scene Understanding ◽

Deep Understanding ◽

Remote Sensing Images ◽

Convolutional Network ◽

Scene Graph ◽

Multi Scale ◽

Relationship Extraction ◽

High Level ◽

Graph Generation

A deep understanding of our visual world is more than an isolated perception on a series of objects, and the relationships between them also contain rich semantic information. Especially for those satellite remote sensing images, the span is so large that the various objects are always of different sizes and complex spatial compositions. Therefore, the recognition of semantic relations is conducive to strengthen the understanding of remote sensing scenes. In this paper, we propose a novel multi-scale semantic fusion network (MSFN). In this framework, dilated convolution is introduced into a graph convolutional network (GCN) based on an attentional mechanism to fuse and refine multi-scale semantic context, which is crucial to strengthen the cognitive ability of our model Besides, based on the mapping between visual features and semantic embeddings, we design a sparse relationship extraction module to remove meaningless connections among entities and improve the efficiency of scene graph generation. Meanwhile, to further promote the research of scene understanding in remote sensing field, this paper also proposes a remote sensing scene graph dataset (RSSGD). We carry out extensive experiments and the results show that our model significantly outperforms previous methods on scene graph generation. In addition, RSSGD effectively bridges the huge semantic gap between low-level perception and high-level cognition of remote sensing images.

Download Full-text

UAV Image Multi-Labeling with Data-Efficient Transformers

Applied Sciences ◽

10.3390/app11093974 ◽

2021 ◽

Vol 11 (9) ◽

pp. 3974

Author(s):

Laila Bashmal ◽

Yakoub Bazi ◽

Mohamad Mahmoud Al Rahhal ◽

Haikel Alhichri ◽

Naif Al Ajlan

Keyword(s):

Data Augmentation ◽

Feature Representation ◽

Aerial Image ◽

Remote Sensing Images ◽

Training Set ◽

Proposed Model ◽

Class Labels ◽

Using Data ◽

Uav Image

In this paper, we present an approach for the multi-label classification of remote sensing images based on data-efficient transformers. During the training phase, we generated a second view for each image from the training set using data augmentation. Then, both the image and its augmented version were reshaped into a sequence of flattened patches and then fed to the transformer encoder. The latter extracts a compact feature representation from each image with the help of a self-attention mechanism, which can handle the global dependencies between different regions of the high-resolution aerial image. On the top of the encoder, we mounted two classifiers, a token and a distiller classifier. During training, we minimized a global loss consisting of two terms, each corresponding to one of the two classifiers. In the test phase, we considered the average of the two classifiers as the final class labels. Experiments on two datasets acquired over the cities of Trento and Civezzano with a ground resolution of two-centimeter demonstrated the effectiveness of the proposed model.

Download Full-text

The use of remote sensing satellite using deep learning in emergency monitoring of high-level landslides disaster in Jinsha River

The Journal of Supercomputing ◽

10.1007/s11227-020-03604-4 ◽

2021 ◽

Author(s):

Leijin Long ◽

Feng He ◽

Hongjiang Liu

Keyword(s):

Remote Sensing ◽

Southwest China ◽

Influence Factors ◽

Classification Error ◽

Model Parameters ◽

Detection Accuracy ◽

Remote Sensing Images ◽

Jinsha River ◽

Detection Model ◽

High Level

AbstractIn order to monitor the high-level landslides frequently occurring in Jinsha River area of Southwest China, and protect the lives and property safety of people in mountainous areas, the data of satellite remote sensing images are combined with various factors inducing landslides and transformed into landslide influence factors, which provides data basis for the establishment of landslide detection model. Then, based on the deep belief networks (DBN) and convolutional neural network (CNN) algorithm, two landslide detection models DBN and convolutional neural-deep belief network (CDN) are established to monitor the high-level landslide in Jinsha River. The influence of the model parameters on the landslide detection results is analyzed, and the accuracy of DBN and CDN models in dealing with actual landslide problems is compared. The results show that when the number of neurons in the DBN is 100, the overall error is the minimum, and when the number of learning layers is 3, the classification error is the minimum. The detection accuracy of DBN and CDN is 97.56% and 97.63%, respectively, which indicates that both DBN and CDN models are feasible in dealing with landslides from remote sensing images. This exploration provides a reference for the study of high-level landslide disasters in Jinsha River.

Download Full-text

Cloud-based intelligent self-diagnosis and department recommendation service using Chinese medical BERT

Journal of Cloud Computing Advances Systems and Applications ◽

10.1186/s13677-020-00218-2 ◽

2021 ◽

Vol 10 (1) ◽

Author(s):

Junshu Wang ◽

Guoming Zhang ◽

Wei Wang ◽

Ka Zhang ◽

Yehua Sheng

Keyword(s):

Cloud Computing ◽

Large Scale ◽

Medical Service ◽

Rapid Development ◽

Medical Knowledge ◽

Language Models ◽

Computing Environment ◽

Computing Power ◽

Cloud Computing Environment ◽

Proposed Model

AbstractWith the rapid development of hospital informatization and Internet medical service in recent years, most hospitals have launched online hospital appointment registration systems to remove patient queues and improve the efficiency of medical services. However, most of the patients lack professional medical knowledge and have no idea of how to choose department when registering. To instruct the patients to seek medical care and register effectively, we proposed CIDRS, an intelligent self-diagnosis and department recommendation framework based on Chinese medical Bidirectional Encoder Representations from Transformers (BERT) in the cloud computing environment. We also established a Chinese BERT model (CHMBERT) trained on a large-scale Chinese medical text corpus. This model was used to optimize self-diagnosis and department recommendation tasks. To solve the limited computing power of terminals, we deployed the proposed framework in a cloud computing environment based on container and micro-service technologies. Real-world medical datasets from hospitals were used in the experiments, and results showed that the proposed model was superior to the traditional deep learning models and other pre-trained language models in terms of performance.

Download Full-text

Multi-Scale Graph Convolutional Network and Dynamic Iterative Class Loss for Ship Segmentation in Remote Sensing Images

10.1145/3469877.3497699 ◽

2021 ◽

Author(s):

Yanru Jiang ◽

Chengyu Zheng ◽

Zhaoxin Wang ◽

Rui Wang ◽

Min Ye ◽

...

Keyword(s):

Remote Sensing ◽

Remote Sensing Images ◽

Convolutional Network ◽

Multi Scale

Download Full-text