A Two-Stage Deep Learning Registration Method for Remote Sensing Images Based on Sub-Image Matching

Yuan Chen; Jie Jiang

doi:10.3390/rs13173443

Remote Sensing ◽

10.3390/rs13173443 ◽

2021 ◽

Vol 13 (17) ◽

pp. 3443

Author(s):

Yuan Chen ◽

Jie Jiang

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Image Matching ◽

Google Earth ◽

Geometric Transformation ◽

Remote Sensing Images ◽

Two Stage ◽

Registration Method ◽

Transformation Parameters ◽

The Impact

The registration of multi-temporal remote sensing images with abundant information and complex changes is an important preprocessing step for subsequent applications. This paper presents a novel two-stage deep learning registration method based on sub-image matching. Unlike the conventional registration framework, the proposed network learns the mapping between matched sub-images and the geometric transformation parameters directly. In the first stage, the matching of sub-images (MSI), sub-images cropped from the images are matched through the corresponding heatmaps, which are built from the predicted similarity of each sub-image pair. In the second stage, the estimation of transformation parameters (ETP), a network with weight structure and position embedding estimates the global transformation parameters from the matched pairs. The network can handle a variable number of matched sub-image inputs and reduce the impact of outliers. Furthermore, a sample sharing training strategy and an augmentation based on the bounding rectangle are introduced. We evaluated our method by comparing it with conventional and deep learning methods, qualitatively and quantitatively, on the Google Earth, ISPRS, and WHU Building datasets. The experiments showed that our method obtained a probability of correct keypoints (PCK) of over 99% at α = 0.05 (α: the normalized distance threshold) and achieved a maximum increase of 16.8% at α = 0.01 compared with the latest method. The results demonstrated that our method is robust and improves precision in the registration of optical remote sensing images with great variation.
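To make the PCK metric above concrete, here is a minimal sketch of how it could be computed. It assumes the distance threshold is α times the larger image dimension, which the abstract does not state. The function name `pck` and the sample points are hypothetical and are not taken from the paper.

```python
import math


def pck(pred_pts, true_pts, image_size, alpha):
    """Fraction of keypoints whose position error is within the threshold.

    Assumption: the threshold is alpha * max(height, width). The paper may
    normalize the distance differently.
    """
    if len(pred_pts) != len(true_pts) or not true_pts:
        raise ValueError("point lists must be non-empty and of equal length")
    height, width = image_size
    threshold = alpha * max(height, width)
    correct = sum(
        1
        for (px, py), (tx, ty) in zip(pred_pts, true_pts)
        if math.hypot(px - tx, py - ty) <= threshold
    )
    return correct / len(true_pts)


# Hypothetical keypoints in a 240 x 240 image; the errors are 1, 5, 15 and 1 pixels.
true_pts = [(10.0, 10.0), (100.0, 50.0), (200.0, 150.0), (30.0, 220.0)]
pred_pts = [(11.0, 10.0), (104.0, 53.0), (200.0, 165.0), (30.0, 221.0)]

print(pck(pred_pts, true_pts, (240, 240), 0.05))  # threshold 12 px  -> 0.75
print(pck(pred_pts, true_pts, (240, 240), 0.01))  # threshold 2.4 px -> 0.5
```

A smaller α sets a stricter threshold, so the same set of predictions scores lower at α = 0.01 than at α = 0.05.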

An Oil Well Dataset Derived from Satellite-Based Remote Sensing

Remote Sensing ◽

10.3390/rs13061132 ◽

2021 ◽

Vol 13 (6) ◽

pp. 1132

Author(s):

Zhibao Wang ◽

Lu Bai ◽

Guangfu Song ◽

Jie Zhang ◽

Jinhua Tao ◽

...

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Object Detection ◽

State Of The Art ◽

Google Earth ◽

Oil Wells ◽

Oil Well ◽

Optical Remote Sensing ◽

Learning Models ◽

Remote Sensing Images

Estimating the number and geo-location of oil wells is important for policy makers, given the impact of oil wells on energy resource planning. With recent developments in optical remote sensing, it is possible to identify oil wells from satellite images. Moreover, recent advances in deep learning frameworks for object detection in remote sensing make it possible to detect oil wells automatically from remote sensing images. In this paper, we collected a dataset named Northeast Petroleum University–Oil Well Object Detection Version 1.0 (NEPU–OWOD V1.0) based on high-resolution remote sensing images from Google Earth Imagery. Our database includes 1192 oil wells in 432 images from Daqing City, which has the largest oilfield in China. In this study, we compared nine different state-of-the-art deep learning models based on algorithms for object detection from optical remote sensing images. Experimental results show that the state-of-the-art deep learning models achieve high precision on our collected dataset, which demonstrates the great potential for oil well detection in remote sensing.

A Multi-Level Approach for Change Detection of Buildings Using Satellite Imagery

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213018500318 ◽

2018 ◽

Vol 27 (08) ◽

pp. 1850031 ◽

Cited By ~ 1

Author(s):

Md. Abdul Alim Sheikh ◽

Alok Kole ◽

Tanmoy Maity

Keyword(s):

Remote Sensing ◽

Mutual Information ◽

Change Detection ◽

Satellite Imagery ◽

Image Matching ◽

Google Earth ◽

Remote Sensing Images ◽

Normalized Mutual Information ◽

Novel Technique ◽

Multi Level

In this paper, a novel technique for building change detection from remote sensing imagery is presented. It includes two main stages: (1) Object-specific discriminative features are extracted using the Morphological Building Index (MBI) to automatically detect the existence of buildings in remote sensing images. (2) Pixel-based image matching is measured on the basis of the Mutual Information (MI) of the images by Normalized Mutual Information (NMI). Here, the MBI feature values are computed for each image of a pair taken over the same region at two different times, and the changes between these two MBI images are then measured to indicate building change. MI is estimated locally for all pixels for image matching, and thresholding is then applied to eliminate the pixels that are responsible for strong similarity. Finally, the MBI and NMI images are fused to refine the change result. For evaluation, the experiments are carried out on QuickBird and IKONOS images and on images taken from Google Earth. The results show that the proposed technique can attain acceptable correctness rates above 90% with an Overall Accuracy (OA) of 89.52%.
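As an illustration of the NMI similarity measure mentioned above, the following sketch computes normalized mutual information between two equally sized intensity patches given as flat lists. The abstract does not state which normalization is used. This example assumes the symmetric form 2·I(A;B) / (H(A) + H(B)), which lies in [0, 1], and the function names are hypothetical.

```python
import math
from collections import Counter


def entropy(counts, total):
    """Shannon entropy in bits of a distribution given as raw counts."""
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def normalized_mutual_information(a, b):
    """NMI of two equal-length sequences of discrete intensity values.

    Assumption: NMI = 2 * I(A;B) / (H(A) + H(B)). Other normalizations exist
    and the paper may use a different one.
    """
    if len(a) != len(b) or not a:
        raise ValueError("patches must be non-empty and of equal length")
    n = len(a)
    h_a = entropy(Counter(a).values(), n)
    h_b = entropy(Counter(b).values(), n)
    h_ab = entropy(Counter(zip(a, b)).values(), n)  # joint entropy
    if h_a + h_b == 0:
        return 0.0  # both patches are constant; defined as 0 in this sketch
    mutual_information = h_a + h_b - h_ab
    return 2 * mutual_information / (h_a + h_b)


patch = [0, 0, 1, 1]
print(normalized_mutual_information(patch, patch))         # identical patches
print(normalized_mutual_information(patch, [0, 1, 0, 1]))  # independent patches
```

Applied to a small window around each pixel, a function of this kind would yield a local similarity map that could then be thresholded, in the spirit of the locally estimated MI described in the abstract.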

Remote Sensing Image Registration Using Multiple Image Features

10.20944/preprints201705.0027.v2 ◽

2017 ◽

Author(s):

Kun Yang ◽

Anning Pan ◽

Yang Yang ◽

Su Zhang ◽

Sim Heng Ong ◽

...

Keyword(s):

Remote Sensing ◽

Image Registration ◽

Mixture Model ◽

Damage Assessment ◽

Remote Sensing Image ◽

Image Features ◽

Google Earth ◽

Geometric Distortion ◽

Remote Sensing Images ◽

Registration Method

Remote sensing image registration plays an important role in military and civilian fields, such as natural disaster damage assessment, military damage assessment, and ground target identification. However, due to ground relief variations and imaging viewpoint changes, non-rigid geometric distortion occurs between remote sensing images taken from different viewpoints, which further increases the difficulty of remote sensing image registration. To address the problem, we propose a multi-viewpoint remote sensing image registration method with the following contributions. (i) A multiple-feature-based finite mixture model is constructed for dealing with different types of image features. (ii) Three features are combined and substituted into the mixture model to complement one another, i.e., the Euclidean distance and shape context are used to measure the similarity of geometric structure, and the SIFT (scale-invariant feature transform) distance, which is endowed with intensity information, is used to measure the scale space extrema. (iii) To prevent the problem from becoming ill-posed, a geometric constraint term is introduced into the L2E-based energy function so that the non-rigid transformation is better behaved. We evaluated the performance of the proposed method on three series of remote sensing images obtained from an unmanned aerial vehicle (UAV) and Google Earth, and compared it with five state-of-the-art methods; our method shows the best alignments in most cases.

Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection

Remote Sensing ◽

10.3390/rs13112187 ◽

2021 ◽

Vol 13 (11) ◽

pp. 2187

Author(s):

Liegang Xia ◽

Xiongbo Zhang ◽

Junxia Zhang ◽

Haiping Yang ◽

Tingting Chen

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

High Resolution ◽

Edge Detection ◽

Network Architecture ◽

Semantic Segmentation ◽

Google Earth ◽

Remote Sensing Images ◽

Building Roof ◽

Social Applications

The automated detection of buildings in remote sensing images makes it possible to understand the distribution of buildings, which is indispensable for many geographic and social applications, such as urban planning, change monitoring, and population estimation. The performance of deep learning on images often depends on a large number of manually labeled samples, the production of which is time-consuming and expensive. Thus, this study focuses on reducing the number of labeled samples used and proposes a semi-supervised deep learning approach based on an edge detection network (SDLED), which is the first to introduce semi-supervised learning to an edge detection neural network for extracting building roof boundaries from high-resolution remote sensing images. This approach uses a small number of labeled samples and abundant unlabeled images for joint training. An expert-level semantic edge segmentation model is trained on the labeled samples and guides the automatic generation of pseudo-labels for the unlabeled images. The inaccurate pseudo-label sets and the manually labeled samples are then used together to update the semantic edge model. In particular, we modified the semantic segmentation network D-LinkNet to obtain high-quality pseudo-labels. Specifically, the main network architecture of D-LinkNet is retained, while multi-scale fusion is added in its second half to improve its performance on edge detection. The SDLED was tested on high-spatial-resolution remote sensing images taken from Google Earth. Results show that the SDLED performs better than the fully supervised method. Moreover, when the trained models were used to predict buildings in the neighboring counties, our approach was superior to the supervised approach, with a line IoU improvement of at least 6.47% and an F1 score improvement of at least 7.49%.

Change detection based on dimension reduction SLIC and image matching for remote sensing images

Journal of Physics Conference Series ◽

10.1088/1742-6596/2078/1/012040 ◽

2021 ◽

Vol 2078 (1) ◽

pp. 012040

Author(s):

Zhenliang Chang ◽

Xiaogang Yang ◽

Ruitao Lu ◽

Hao Zhuang ◽

Pan Huang

Keyword(s):

Remote Sensing ◽

Image Segmentation ◽

Change Detection ◽

Dimension Reduction ◽

Error Detection ◽

Image Matching ◽

Detection Accuracy ◽

Remote Sensing Images ◽

Image Change Detection ◽

The Impact

The detection accuracy of traditional change detection algorithms is seriously affected by low accuracy and a high omission rate, by the radiometric correction accuracy, and by the classification threshold for the difference image. A change detection method based on image segmentation and image matching is proposed for remote sensing images. The method uses a super-pixel-based dimension reduction SLIC image segmentation algorithm and the SURF algorithm. The homogeneous region is used as the segmentation standard, and a homogeneity method is proposed to suppress the impact of inconsistent image segmentation on the change detection results. The experimental results show that this method improves the accuracy of remote sensing image change detection, is robust to redundant data, significantly reduces the error detection rate of image change detection, and can effectively speed up change detection.

Application of Remote Sensing Data for Slum Identification Using Geography Information System (Case: Former Harbor, Singaraja City)

Media Komunikasi FPIPS ◽

10.23887/mkfis.v20i1.30421 ◽

2021 ◽

Vol 20 (1) ◽

pp. 8

Author(s):

Ruhilatul Janah ◽

A Sediyo Adi Nugraha

Keyword(s):

Remote Sensing ◽

Information Systems ◽

Geographic Information Systems ◽

Remote Sensing Data ◽

Geographic Information ◽

Google Earth ◽

Remote Sensing Images ◽

Geography Information ◽

The Impact ◽

Slum Settlements

This research was conducted in Singaraja City using high-resolution remote sensing images and geographic information systems. The purpose is to use remote sensing images and geographic information systems to identify slum settlements, especially in the former harbor area of Singaraja City. Slum settlements are a consequence of population growth that is difficult to control. The results show that the remote sensing images can identify three features: slums, non-slum settlement, and non-slum areas. Most slum settlements are located in coastal areas, and non-slum settlements are located in areas close to economic locations, tourist sites, and offices. The most significant introduction to slum identification comes from the building area. Based on these results, it can be concluded that slum settlements can be identified through images obtained from Google Earth and recognized visually through interpretation keys.

Deep Learning Approaches Applied to Remote Sensing Datasets for Road Extraction: A State-Of-The-Art Review

Remote Sensing ◽

10.3390/rs12091444 ◽

2020 ◽

Vol 12 (9) ◽

pp. 1444 ◽

Cited By ~ 9

Author(s):

Abolfazl Abdollahi ◽

Biswajeet Pradhan ◽

Nagesh Shukla ◽

Subrata Chakraborty ◽

Abdullah Alamri

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Traffic Management ◽

Google Earth ◽

Future Research ◽

Learning Approaches ◽

Road Extraction ◽

Research Subjects ◽

Remote Sensing Images ◽

Generative Adversarial Network

One of the most challenging research subjects in remote sensing is feature extraction, such as the extraction of road features, from remote sensing images. Such extraction influences multiple scenes, including map updating, traffic management, emergency tasks, road monitoring, and others. Therefore, a systematic review of deep learning techniques applied to common remote sensing benchmarks for road extraction is conducted in this study. The research is organized around four main types of deep learning methods, namely, GAN models, deconvolutional networks, FCNs, and patch-based CNN models. We also compare these deep learning models applied to remote sensing datasets to show which method performs well in extracting road parts from high-resolution remote sensing images. Moreover, we describe future research directions and research gaps. Results indicate that the highest reported performance is related to the deconvolutional nets applied to remote sensing images, and the F1 scores of the generative adversarial network model, DenseNet method, and FCN-32 applied to UAV and Google Earth images are high: 96.08%, 95.72%, and 94.59%, respectively.
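For readers unfamiliar with the F1 score quoted above, the following is a minimal sketch of the standard definition, the harmonic mean of precision and recall, computed from true-positive, false-positive, and false-negative counts. The counts in the example are hypothetical and are not taken from any of the reviewed studies.

```python
def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall from detection counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical road-pixel counts: precision = 0.9, recall = 0.75.
print(f1_score(tp=90, fp=10, fn=30))
```

Because it is a harmonic mean, F1 is pulled toward the lower of precision and recall, so a method cannot score highly by over-predicting or under-predicting road pixels.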

Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images

Mobile Networks and Applications ◽

10.1007/s11036-020-01703-3 ◽

2021 ◽

Vol 26 (1) ◽

pp. 200-215

Author(s):

Muhammad Alam ◽

Jian-Feng Wang ◽

Cong Guangpei ◽

LV Yunrong ◽

Yuanfang Chen

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Image Processing ◽

Deep Learning ◽

Semantic Segmentation ◽

Natural Scene ◽

Remote Sensing Images ◽

Advantages And Disadvantages ◽

Target Segmentation

In recent years, the success of deep learning in natural scene image processing has boosted its application in the analysis of remote sensing images. In this paper, we apply Convolutional Neural Networks (CNN) to the semantic segmentation of remote sensing images. We improve the Encoder-Decoder CNN structure SegNet with index pooling and U-net to make them suitable for multi-target semantic segmentation of remote sensing images. The results show that these two models have their own advantages and disadvantages in the segmentation of different objects. In addition, we propose an integrated algorithm that combines these two models. Experimental results show that the presented integrated algorithm can exploit the advantages of both models for multi-target segmentation and achieve better segmentation than either model alone.

Self-Attention in Reconstruction Bias U-Net for Semantic Segmentation of Building Rooftops in Optical Remote Sensing Images

Remote Sensing ◽

10.3390/rs13132524 ◽

2021 ◽

Vol 13 (13) ◽

pp. 2524

Author(s):

Ziyi Chen ◽

Dilong Li ◽

Wentao Fan ◽

Haiyan Guan ◽

Cheng Wang ◽

...

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Semantic Segmentation ◽

Extraction Methods ◽

The Self ◽

Optical Remote Sensing ◽

Building Extraction ◽

Learning Models ◽

Remote Sensing Images ◽

Segmentation Methods

Deep learning models have brought great breakthroughs in building extraction from high-resolution optical remote sensing images. In recent research, the self-attention module has attracted great attention in many fields, including building extraction. However, most current deep learning models equipped with the self-attention module still overlook the effectiveness of reconstruction bias. By tipping the balance between the encoding and decoding abilities, i.e., making the decoding network much more complex than the encoding network, the semantic segmentation ability can be reinforced. To address the lack of research on combining self-attention and reconstruction-bias modules for building extraction, this paper presents a U-Net architecture that combines both modules. In the encoding part, a self-attention module is added to learn the attention weights of the inputs. Through the self-attention module, the network pays more attention to positions where there may be salient regions. In the decoding part, multiple large convolutional up-sampling operations are used to increase the reconstruction ability. We test our model on two openly available datasets: the WHU and Massachusetts Building datasets. We achieve IoU scores of 89.39% and 73.49% for the WHU and Massachusetts Building datasets, respectively. Compared with several recent well-known semantic segmentation methods and representative building extraction methods, our method’s results are satisfactory.
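The IoU scores reported above measure the overlap between predicted and reference building masks. A minimal sketch of the standard pixel-wise definition follows, assuming binary masks stored as nested lists. The function name and the sample masks are hypothetical.

```python
def mask_iou(pred, truth):
    """Intersection over union of two binary masks of identical shape."""
    intersection = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Two empty masks agree perfectly; this sketch defines that case as 1.0.
    return intersection / union if union else 1.0


# Hypothetical 2 x 3 masks: 2 pixels overlap, 4 pixels are set in either mask.
pred = [[1, 1, 0],
        [0, 1, 0]]
truth = [[1, 0, 0],
         [0, 1, 1]]
print(mask_iou(pred, truth))  # 2 / 4 = 0.5
```

IoU penalizes both missed building pixels and false detections, since each enlarges the union without adding to the intersection.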

A Public Dataset for Fine-Grained Ship Classification in Optical Remote Sensing Images

Remote Sensing ◽

10.3390/rs13040747 ◽

2021 ◽

Vol 13 (4) ◽

pp. 747

Author(s):

Yanghua Di ◽

Zhiguo Jiang ◽

Haopeng Zhang

Keyword(s):

Remote Sensing ◽

Image Data ◽

Remote Sensing Image ◽

Google Earth ◽

Optical Remote Sensing ◽

Remote Sensing Images ◽

Visual Categorization ◽

Class Differences ◽

Fine Grained ◽

Ship Classification

Fine-grained visual categorization (FGVC) is an important and challenging problem due to large intra-class differences and small inter-class differences caused by deformation, illumination, angles, etc. Although major advances have been achieved on natural images in the past few years thanks to the release of popular datasets such as the CUB-200-2011, Stanford Cars, and Aircraft datasets, fine-grained ship classification in remote sensing images has rarely been studied because of the relative scarcity of publicly available datasets. In this paper, we investigate a large amount of remote sensing image data of sea ships and determine the 42 most common categories for fine-grained visual categorization. Based on our previous DSCR dataset, a dataset for ship classification in remote sensing images, we collect more remote sensing images containing warships and civilian ships of various scales from Google Earth and other popular remote sensing image datasets, including DOTA, HRSC2016, and NWPU VHR-10. We call our dataset FGSCR-42, meaning a dataset for Fine-Grained Ship Classification in Remote sensing images with 42 categories. The whole FGSCR-42 dataset contains 9320 images of the most common types of ships. We evaluate popular object classification algorithms and fine-grained visual categorization algorithms to build a benchmark. Our FGSCR-42 dataset is publicly available at our webpages.
