UAV Remote Sensing Image Automatic Registration Based on Deep Residual Features

Xin Luo; Guangling Lai; Xiao Wang; Yuwei Jin; Xixu He; Wenbo Xu; Weimin Hou

doi:10.3390/rs13183605

UAV Remote Sensing Image Automatic Registration Based on Deep Residual Features

Remote Sensing ◽

10.3390/rs13183605 ◽

2021 ◽

Vol 13 (18) ◽

pp. 3605

Author(s):

Xin Luo ◽

Guangling Lai ◽

Xiao Wang ◽

Yuwei Jin ◽

Xixu He ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Rapid Development ◽

Image Features ◽

Automatic Registration ◽

Remote Sensing Images ◽

Registration Accuracy ◽

Registration Method ◽

Uav Images ◽

High Level

With the rapid development of unmanned aerial vehicle (UAV) technology, UAV remote sensing images are increasing sharply. However, due to the limitation of the perspective of UAV remote sensing, the UAV images obtained from different viewpoints of a same scene need to be stitched together for further applications. Therefore, an automatic registration method of UAV remote sensing images based on deep residual features is proposed in this work. It needs no additional training and does not depend on image features, such as points, lines and shapes, or on specific image contents. This registration framework is built as follows: Aimed at the problem that most of traditional registration methods only use low-level features for registration, we adopted deep residual neural network features extracted by an excellent deep neural network, ResNet-50. Then, a tensor product was employed to construct feature description vectors through exacted high-level abstract features. At last, the progressive consistency algorithm (PROSAC) was exploited to remove false matches and fit a geometric transform model so as to enhance registration accuracy. The experimental results for different typical scene images with different resolutions acquired by different UAV image sensors indicate that the improved algorithm can achieve higher registration accuracy than a state-of-the-art deep learning registration algorithm and other popular registration algorithms.

Download Full-text

An automatic registration method for multitemporal remote sensing images using land cover patches in rural regoins

2012 First International Conference on Agro- Geoinformatics (Agro-Geoinformatics) ◽

10.1109/agro-geoinformatics.2012.6311721 ◽

2012 ◽

Author(s):

Sen Cao ◽

Qiuyan Yu ◽

Jinshui Zhang

Keyword(s):

Remote Sensing ◽

Land Cover ◽

Automatic Registration ◽

Remote Sensing Images ◽

Registration Method ◽

Multitemporal Remote Sensing

Download Full-text

Remote Sensing Image Registration Using Multiple Image Features

10.20944/preprints201705.0027.v2 ◽

2017 ◽

Author(s):

Kun Yang ◽

Anning Pan ◽

Yang Yang ◽

Su Zhang ◽

Sim Heng Ong ◽

...

Keyword(s):

Remote Sensing ◽

Image Registration ◽

Mixture Model ◽

Damage Assessment ◽

Remote Sensing Image ◽

Image Features ◽

Google Earth ◽

Geometric Distortion ◽

Remote Sensing Images ◽

Registration Method

Remote sensing image registration plays an important role in military and civilian fields, such as natural disaster damage assessment, military damage assessment and ground targets identification, etc. However, due to the ground relief variations and imaging viewpoint changes, non-rigid geometric distortion occurs between remote sensing images with different viewpoint, which further increases the difficulty of remote sensing image registration. To address the problem, we propose a multi-viewpoint remote sensing image registration method which contains the following contributions. (i) A multiple features based finite mixture model is constructed for dealing with different types of image features. (ii) Three features are combined and substituted into the mixture model to form a feature complementation, i.e., the Euclidean distance and shape context are used to measure the similarity of geometric structure, and the SIFT (scale-invariant feature transform) distance which is endowed with the intensity information is used to measure the scale space extrema. (iii) To prevent the ill-posed problem, a geometric constraint term is introduced into the L2E-based energy function for better behaving the non-rigid transformation. We evaluated the performances of the proposed method by three series of remote sensing images obtained from the unmanned aerial vehicle (UAV) and Google Earth, and compared with five state-of-the-art methods where our method shows the best alignments in most cases.

Download Full-text

Automatic Registration Method for Optical Remote Sensing Images with Large Background Variations Using Line Segments

Remote Sensing ◽

10.3390/rs8050426 ◽

2016 ◽

Vol 8 (5) ◽

pp. 426 ◽

Cited By ~ 10

Author(s):

Xiaolong Shi ◽

Jie Jiang

Keyword(s):

Remote Sensing ◽

Optical Remote Sensing ◽

Automatic Registration ◽

Remote Sensing Images ◽

Registration Method ◽

Line Segments ◽

Large Background

Download Full-text

A NOVEL FRAMEWORK FOR REMOTE SENSING IMAGE SCENE CLASSIFICATION

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-3-657-2018 ◽

2018 ◽

Vol XLII-3 ◽

pp. 657-663 ◽

Cited By ~ 5

Author(s):

S. Jiang ◽

H. Zhao ◽

W. Wu ◽

Q. Tan

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Semantic Category ◽

Distribution Patterns ◽

Remote Sensing Image ◽

Scene Classification ◽

Remote Sensing Images ◽

Training Stage ◽

Feature Extractor ◽

High Level

High resolution remote sensing (HRRS) images scene classification aims to label an image with a specific semantic category. HRRS images contain more details of the ground objects and their spatial distribution patterns than low spatial resolution images. Scene classification can bridge the gap between low-level features and high-level semantics. It can be applied in urban planning, target detection and other fields. This paper proposes a novel framework for HRRS images scene classification. This framework combines the convolutional neural network (CNN) and XGBoost, which utilizes CNN as feature extractor and XGBoost as a classifier. Then, this framework is evaluated on two different HRRS images datasets: UC-Merced dataset and NWPU-RESISC45 dataset. Our framework achieved satisfying accuracies on two datasets, which is 95.57&thinsp;% and 83.35&thinsp;% respectively. From the experiments result, our framework has been proven to be effective for remote sensing images classification. Furthermore, we believe this framework will be more practical for further HRRS scene classification, since it costs less time on training stage.

Download Full-text

Semantic Segmentation of Remote Sensing Image Based on Convolutional Neural Network and Mask Generation

Mathematical Problems in Engineering ◽

10.1155/2021/2472726 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Binglin Niu

Keyword(s):

Neural Network ◽

Remote Sensing ◽

High Resolution ◽

Convolutional Neural Network ◽

Semantic Segmentation ◽

Layer By Layer ◽

Foreground Object ◽

Remote Sensing Images ◽

Training Time ◽

High Level

High-resolution remote sensing images usually contain complex semantic information and confusing targets, so their semantic segmentation is an important and challenging task. To resolve the problem of inadequate utilization of multilayer features by existing methods, a semantic segmentation method for remote sensing images based on convolutional neural network and mask generation is proposed. In this method, the boundary box is used as the initial foreground segmentation profile, and the edge information of the foreground object is obtained by using the multilayer feature of the convolutional neural network. In order to obtain the rough object segmentation mask, the general shape and position of the foreground object are estimated by using the high-level features in the process of layer-by-layer iteration. Then, based on the obtained rough mask, the mask is updated layer by layer using the neural network characteristics to obtain a more accurate mask. In order to solve the difficulty of deep neural network training and the problem of degeneration after convergence, a framework based on residual learning was adopted, which can simplify the training of those very deep networks and improve the accuracy of the network. For comparison with other advanced algorithms, the proposed algorithm was tested on the Potsdam and Vaihingen datasets. Experimental results show that, compared with other algorithms, the algorithm in this article can effectively improve the overall precision of semantic segmentation of high-resolution remote sensing images and shorten the overall training time and segmentation time.

Download Full-text

An Automatic Registration Method for AVHRR Remote Sensing Images

International Journal of Multimedia and Ubiquitous Engineering ◽

10.14257/ijmue.2014.9.8.33 ◽

2014 ◽

Vol 9 (8) ◽

pp. 355-366 ◽

Cited By ~ 1

Author(s):

Ying Xia ◽

Linjun Zhu ◽

Xiaobo Luo ◽

Hae Young Bae

Keyword(s):

Remote Sensing ◽

Automatic Registration ◽

Remote Sensing Images ◽

Registration Method

Download Full-text

An automatic registration method for different temporal remote sensing images based on improved Fourier-Mellin algorithm

10.1117/12.855154 ◽

2009 ◽

Cited By ~ 2

Author(s):

Rui Li ◽

Jiulin Sun ◽

Fang Yin ◽

Fusheng Guo ◽

Wantong Wang

Keyword(s):

Remote Sensing ◽

Automatic Registration ◽

Remote Sensing Images ◽

Registration Method

Download Full-text

MILL: Channel Attention–based Deep Multiple Instance Learning for Landslide Recognition

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3454009 ◽

2021 ◽

Vol 17 (2s) ◽

pp. 1-11

Author(s):

Xiaochuan Tang ◽

Mingzhe Liu ◽

Hao Zhong ◽

Yuanzhen Ju ◽

Weile Li ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Large Scale ◽

Remote Sensing Image ◽

Disaster Risk ◽

Multiple Instance Learning ◽

Remote Sensing Images ◽

Loess Area ◽

Remote Sensing Image Classification ◽

Natural Disaster Risk

Landslide recognition is widely used in natural disaster risk management. Traditional landslide recognition is mainly conducted by geologists, which is accurate but inefficient. This article introduces multiple instance learning (MIL) to perform automatic landslide recognition. An end-to-end deep convolutional neural network is proposed, referred to as Multiple Instance Learning–based Landslide classification (MILL). First, MILL uses a large-scale remote sensing image classification dataset to build pre-train networks for landslide feature extraction. Second, MILL extracts instances and assign instance labels without pixel-level annotations. Third, MILL uses a new channel attention–based MIL pooling function to map instance-level labels to bag-level label. We apply MIL to detect landslides in a loess area. Experimental results demonstrate that MILL is effective in identifying landslides in remote sensing images.

Download Full-text

Hyperspectral Remote Sensing Images Classification Using Fully Convolutional Neural Network

2021 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (ElConRus) ◽

10.1109/elconrus51938.2021.9396673 ◽

2021 ◽

Author(s):

Nyan Linn Tun ◽

Alexander Gavrilov ◽

Naing Min Tun ◽

Do Minh Trieu ◽

Htet Aung

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Convolutional Neural Network ◽

Hyperspectral Remote Sensing ◽

Remote Sensing Images ◽

Hyperspectral Remote Sensing Images

Download Full-text

Semantic Relation Model and Dataset for Remote Sensing Scene Understanding

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10070488 ◽

2021 ◽

Vol 10 (7) ◽

pp. 488

Author(s):

Peng Li ◽

Dezheng Zhang ◽

Aziguli Wulamu ◽

Xin Liu ◽

Peng Chen

Keyword(s):

Remote Sensing ◽

Scene Understanding ◽

Deep Understanding ◽

Remote Sensing Images ◽

Convolutional Network ◽

Scene Graph ◽

Multi Scale ◽

Relationship Extraction ◽

High Level ◽

Graph Generation

A deep understanding of our visual world is more than an isolated perception on a series of objects, and the relationships between them also contain rich semantic information. Especially for those satellite remote sensing images, the span is so large that the various objects are always of different sizes and complex spatial compositions. Therefore, the recognition of semantic relations is conducive to strengthen the understanding of remote sensing scenes. In this paper, we propose a novel multi-scale semantic fusion network (MSFN). In this framework, dilated convolution is introduced into a graph convolutional network (GCN) based on an attentional mechanism to fuse and refine multi-scale semantic context, which is crucial to strengthen the cognitive ability of our model Besides, based on the mapping between visual features and semantic embeddings, we design a sparse relationship extraction module to remove meaningless connections among entities and improve the efficiency of scene graph generation. Meanwhile, to further promote the research of scene understanding in remote sensing field, this paper also proposes a remote sensing scene graph dataset (RSSGD). We carry out extensive experiments and the results show that our model significantly outperforms previous methods on scene graph generation. In addition, RSSGD effectively bridges the huge semantic gap between low-level perception and high-level cognition of remote sensing images.

Download Full-text