Exploit Direction Information for Remote Ship Detection

2021 ◽  
Vol 13 (11) ◽  
pp. 2155
Author(s):  
Zhenbiao Tan ◽  
Zekun Zhang ◽  
Tingzhuang Xing ◽  
Xiao Huang ◽  
Junbin Gong ◽  
...  

Ship detection in remote sensing has gained increasing significance in recent years. In remote sensing images, ships are arbitrarily oriented, so the detector has to learn the object features of arbitrary orientations by rote, which demands a large amount of training data to prevent overfitting. In addition, many ships have a distinct direction from the center point to the head point. However, little attention has been paid to this direction information, and few previous studies predict the bow directions of ships. In this paper, we propose to further exploit ship direction information to solve the arbitrary-orientation problem through direction augmentation, direction prediction, and direction normalization. A Variable-Direction Rotated RoI Align (VDR RoI Align) module is designed for direction augmentation and normalization, taking an additional feature extraction direction as input. The direction augmentation method directly augments the features of ship RRoIs and brings greater diversity to the training data set. The direction prediction introduces additional direction information for learning and helps to reduce noise. In the direction normalization method, the predicted ship directions are used to normalize the directions of ship features from stern to bow through the VDR RoI Align module, so that ship features are presented in a single orientation and are easier for the detector to identify. On the L1 task of the HRSC2016 data set, the direction augmentation method and direction normalization method boost the RoI Transformer baseline from 86.2% to 90.4% and 90.6%, respectively, achieving state-of-the-art performance.
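The direction normalization idea can be pictured as rotating each ship's RRoI feature crop by its predicted stern-to-bow angle so that every ship is presented in one canonical orientation. Below is a minimal PyTorch sketch of that rotation using a plain affine grid; the function, feature sizes, and angles are illustrative assumptions, not the paper's VDR RoI Align implementation.

```python
# Hedged sketch of direction normalization: rotate each ship's RRoI
# feature crop by its predicted direction so every ship is bow-up.
# Illustration of the idea only, not the paper's VDR RoI Align.
import torch
import torch.nn.functional as F

def normalize_direction(feat, theta):
    """feat: (N, C, H, W) RRoI features; theta: (N,) predicted
    stern-to-bow direction in radians. Rotates each feature map by
    -theta so all ships share one canonical orientation."""
    n = feat.size(0)
    cos, sin = torch.cos(-theta), torch.sin(-theta)
    # 2x3 affine matrices for a pure rotation (no translation/scale).
    mat = torch.zeros(n, 2, 3, device=feat.device)
    mat[:, 0, 0], mat[:, 0, 1] = cos, -sin
    mat[:, 1, 0], mat[:, 1, 1] = sin, cos
    grid = F.affine_grid(mat, feat.shape, align_corners=False)
    return F.grid_sample(feat, grid, align_corners=False)

feats = torch.randn(4, 256, 7, 7)             # four ship RRoI features
angles = torch.tensor([0.0, 1.2, -0.5, 3.0])  # predicted directions
aligned = normalize_direction(feats, angles)  # all in one orientation
```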

2020 ◽  
Vol 12 (18) ◽  
pp. 3007 ◽  
Author(s):  
Bo Liu ◽  
Shihong Du ◽  
Shouji Du ◽  
Xiuyuan Zhang

The fast and accurate creation of land use/land cover maps from very-high-resolution (VHR) remote sensing imagery is crucial for urban planning and environmental monitoring. Geographic object-based image analysis (GEOBIA) methods provide an effective solution by using image objects instead of individual pixels in VHR remote sensing imagery analysis. Meanwhile, convolutional neural networks (CNN) have been widely used in image processing because of their powerful feature extraction capabilities. This study presents a patch-based strategy for integrating deep features into GEOBIA for VHR remote sensing imagery classification. To extract deep features from irregular image objects through a CNN, a patch-based approach is proposed for representing image objects and learning patch-based deep features, and a deep-feature aggregation method is proposed for aggregating patch-based deep features into object-based deep features. Finally, both object and deep features are integrated into a GEOBIA paradigm for classifying image objects. We explored the influence of segmentation scales and patch sizes in our method and the effectiveness of deep and object features in classification. Moreover, we performed 50 repetitions of 5-fold stratified cross-validation to explore the uncertainty of our method. Additionally, we examined the importance of deep feature aggregation and evaluated our method against three state-of-the-art methods on a Beijing dataset and a Zurich dataset. The results indicate that smaller segmentation scales were more conducive to VHR remote sensing imagery classification, and that patches should be neither too large nor too small, as the appropriate patch size depends on the imagery and its resolution. Moreover, we found that deep features are more effective than object features, while object features still matter for image classification, and that deep feature aggregation is a critical step in our method. Finally, our method achieved the highest overall accuracies among the compared methods: 91.21% on the Beijing dataset and 99.05% on the Zurich dataset.
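As a rough picture of the patch-based strategy, the sketch below represents an image object by fixed-size patches sampled inside it, extracts CNN features per patch, and aggregates them by averaging into one object-level vector that is then fused with hand-crafted object features. The backbone, patch size, and mean-pooling aggregation are placeholder choices, not the paper's exact design.

```python
# Hedged sketch: represent an irregular image object by fixed-size
# patches, pool their CNN features into one object-level vector, and
# append hand-crafted object features before classification.
import numpy as np
import torch
import torchvision.models as models

backbone = models.resnet18(weights=None)   # stand-in CNN extractor
backbone.fc = torch.nn.Identity()          # expose 512-d features
backbone.eval()

def object_deep_feature(image, pixel_coords, patch=32):
    """Crop a patch around each sampled pixel of the object, extract
    deep features, and aggregate by averaging (one simple choice)."""
    h, w = image.shape[1:]
    crops = []
    for (r, c) in pixel_coords:
        r0 = np.clip(r - patch // 2, 0, h - patch)
        c0 = np.clip(c - patch // 2, 0, w - patch)
        crops.append(image[:, r0:r0 + patch, c0:c0 + patch])
    with torch.no_grad():
        feats = backbone(torch.stack(crops))   # (n_patches, 512)
    return feats.mean(dim=0)                   # aggregated object feature

img = torch.randn(3, 256, 256)                 # one VHR tile (toy data)
coords = [(40, 60), (44, 70), (52, 64)]        # samples inside an object
deep_feat = object_deep_feature(img, coords)
object_feat = torch.randn(16)                  # e.g. spectral/shape stats
fused = torch.cat([deep_feat, object_feat])    # input to the classifier
```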


2019 ◽  
Vol 12 (1) ◽  
pp. 106 ◽  
Author(s):  
Romulus Costache ◽  
Quoc Bao Pham ◽  
Ehsan Sharifi ◽  
Nguyen Thi Thuy Linh ◽  
S.I. Abba ◽  
...  

Given the significant increase in the negative effects of flash-floods worldwide, the main goal of this research is to evaluate the power of the Analytical Hierarchy Process (AHP), k-Nearest Neighbors (kNN), and K-Star (KS) algorithms and their ensembles in flash-flood susceptibility mapping. To train the two stand-alone models and their ensembles, in the first stage the areas affected by torrential phenomena in the past are identified using remote sensing techniques. Approximately 70% of these areas are used as a training data set along with 10 flash-flood predictors. It should be noted that remote sensing techniques play a crucial role in obtaining eight of the 10 flash-flood conditioning factors. The predictive capability of the predictors is evaluated through the Information Gain Ratio (IGR) method. As expected, the slope angle proves to be the factor with the highest predictive capability. The application of the AHP model involves the construction of pair-wise comparison matrices to calculate the normalized weights of the ten flash-flood predictors. The computed weights are used as input data in the kNN–AHP and KS–AHP ensemble models for calculating the Flash-Flood Potential Index (FFPI). The FFPI is also determined with the kNN and KS stand-alone models. The performance of the models is evaluated using statistical metrics (i.e., sensitivity, specificity, and accuracy), while the results are validated by constructing Receiver Operating Characteristic (ROC) curves, computing Area Under Curve (AUC) values, and calculating the density of torrential pixels within FFPI classes. Overall, the best performance is obtained by the kNN–AHP ensemble model.
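One simple way to picture the kNN–AHP coupling is to let the AHP weights rescale the conditioning factors so that kNN's distance metric emphasizes the more important predictors; the FFPI is then read off as the predicted probability of the torrential class. The weights and data below are placeholders, not the paper's values.

```python
# Hedged sketch of one way an AHP-kNN ensemble could be wired: AHP
# weights rescale the 10 conditioning factors before kNN, so nearer
# neighbors agree on the predictors AHP deems important.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(0)
X_train = rng.random((500, 10))          # 10 flash-flood predictors
y_train = rng.integers(0, 2, 500)        # 1 = past torrential pixel
ahp_w = np.array([.22, .15, .13, .11, .10, .08, .07, .06, .05, .03])

knn = KNeighborsClassifier(n_neighbors=7)
knn.fit(X_train * ahp_w, y_train)        # AHP-weighted feature space

X_test = rng.random((100, 10))
ffpi = knn.predict_proba(X_test * ahp_w)[:, 1]   # flash-flood potential
```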


2021 ◽  
Vol 87 (8) ◽  
pp. 577-591
Author(s):  
Fengpeng Li ◽  
Jiabao Li ◽  
Wei Han ◽  
Ruyi Feng ◽  
Lizhe Wang

Inspired by the outstanding achievements of deep learning, supervised deep learning representation methods for high-spatial-resolution remote sensing image scene classification have obtained state-of-the-art performance. However, supervised deep learning representation methods need a considerable amount of labeled data to capture class-specific features, which limits their application when only a few labeled training samples are available. To address this issue, an unsupervised deep learning representation method for high-resolution remote sensing image scene classification is proposed in this work. The proposed method, based on contrastive learning, narrows the distance between positive view pairs (color channels belonging to the same image) and widens the gap between negative view pairs (color channels from different images) to obtain class-specific representations of the input data without any supervised information. The classifier uses the features extracted by the convolutional neural network (CNN)-based feature extractor, together with the label information of the training data, to establish the feature space of each category, and then makes predictions on the test data using linear regression. Compared with existing unsupervised deep learning representation methods for high-resolution remote sensing image scene classification, the contrastive learning CNN achieves state-of-the-art performance on three benchmark data sets of different scales: the small-scale RSSCN7 data set, a midscale aerial image data set, and the large-scale NWPU-RESISC45 data set.
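The channel-based contrastive objective can be sketched with a standard NT-Xent loss in which embeddings of two color channels of the same image form the positive pair and channels from other images act as negatives. The toy encoder and temperature below are assumptions for illustration, not the paper's architecture.

```python
# Hedged sketch of the channel-based contrastive idea: two color
# channels from the same image form a positive pair; channels from
# different images are negatives. A standard NT-Xent loss is one
# common choice; the encoder here is a placeholder.
import torch
import torch.nn as nn
import torch.nn.functional as F

def nt_xent(z1, z2, tau=0.5):
    """z1, z2: (N, D) embeddings of paired views of the same images."""
    n = z1.size(0)
    z = F.normalize(torch.cat([z1, z2]), dim=1)          # (2N, D)
    sim = z @ z.t() / tau                                # cosine / tau
    mask = torch.eye(2 * n, dtype=torch.bool)
    sim = sim.masked_fill(mask, float('-inf'))           # drop self-pairs
    targets = torch.cat([torch.arange(n, 2 * n),         # z1_i <-> z2_i
                         torch.arange(0, n)])
    return F.cross_entropy(sim, targets)

imgs = torch.randn(8, 3, 64, 64)
encoder = nn.Sequential(nn.Flatten(), nn.Linear(64 * 64, 128))
view_r = encoder(imgs[:, 0:1])    # red channel as one view
view_g = encoder(imgs[:, 1:2])    # green channel as the positive view
loss = nt_xent(view_r, view_g)
loss.backward()
```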


2021 ◽  
Vol 13 (2) ◽  
pp. 294
Author(s):  
Meng Chen ◽  
Jianjun Wu ◽  
Leizhen Liu ◽  
Wenhui Zhao ◽  
Feng Tian ◽  
...  

At present, convolutional neural networks (CNN) are widely used for building extraction from remote sensing imagery (RSI), but some bottlenecks remain. On the one hand, previous networks with complex structures contain very many parameters, which occupy large amounts of memory and consume much time during training. On the other hand, the low-level features extracted by shallow layers and the abstract features extracted by deep layers of an artificial neural network cannot be fully fused, which leads to inaccurate building extraction from RSI. To alleviate these disadvantages, a dense residual neural network (DR-Net) is proposed in this paper. DR-Net uses a DeepLabv3+ encoder/decoder backbone in combination with densely connected convolutional network (DCNN) and residual network (ResNet) structures. Compared with DeepLabv3+ (about 41 million parameters) and BRRNet (about 17 million parameters), DR-Net contains only about 9 million parameters, a substantial reduction. In experiments on both the WHU Building Dataset and the Massachusetts Building Dataset, DR-Net showed better building extraction performance than the other two state-of-the-art methods: on the WHU Building Dataset, Intersection over Union (IoU) increased by 2.4% and the F1 score by 1.4%; on the Massachusetts Building Dataset, IoU increased by 3.8% and the F1 score by 2.9%.
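A minimal sketch of the kind of unit DR-Net combines is a block that accumulates DenseNet-style concatenated features and closes with a ResNet-style identity skip. Channel widths, growth rate, and depth below are illustrative, not the paper's configuration.

```python
# Hedged sketch of a block mixing DenseNet-style concatenation with a
# ResNet-style skip, to illustrate the kind of unit DR-Net combines.
import torch
import torch.nn as nn

class DenseResidualBlock(nn.Module):
    def __init__(self, channels, growth=32, layers=3):
        super().__init__()
        self.convs = nn.ModuleList()
        c = channels
        for _ in range(layers):
            self.convs.append(nn.Sequential(
                nn.Conv2d(c, growth, 3, padding=1, bias=False),
                nn.BatchNorm2d(growth), nn.ReLU(inplace=True)))
            c += growth                        # dense: inputs accumulate
        self.fuse = nn.Conv2d(c, channels, 1)  # back to the input width

    def forward(self, x):
        feats = [x]
        for conv in self.convs:
            feats.append(conv(torch.cat(feats, dim=1)))
        return x + self.fuse(torch.cat(feats, dim=1))   # residual skip

block = DenseResidualBlock(64)
out = block(torch.randn(1, 64, 128, 128))   # same shape as the input
```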


2021 ◽  
Vol 13 (16) ◽  
pp. 3192
Author(s):  
Yuxin Dong ◽  
Fukun Chen ◽  
Shuang Han ◽  
Hao Liu

At present, reliable and precise ship detection in high-resolution optical remote sensing images affected by wave clutter, thin clouds, and islands under complex sea conditions is still challenging. At the same time, object detection algorithms for satellite remote sensing images are challenged by variability in color, aspect ratio, background complexity, and angle. Even the results obtained with the latest convolutional neural network (CNN) methods are not satisfactory. To obtain more accurate ship detection results, this paper proposes a remote sensing image ship detection method based on a brain-like visual attention mechanism. Drawing on the robust representations of the human brain, we design a vector field filter with active rotation capability and explicitly encode the direction information of the remote sensing object in the neural network. A progressive enhancement learning model guided by the visual attention mechanism is used to solve the problem dynamically, and objects can be discovered and detected through spatio-temporal information. To verify the effectiveness of the proposed method, a remote sensing ship detection data set is established, and the proposed method is compared with other state-of-the-art methods on it. Experiments show that the detection accuracy of this method and its ability to capture image details are improved. Compared with other models, it achieves an average Intersection over Union (IoU) of 80.12%, a clear advantage. The proposed method is also fast enough to meet the needs of ship detection in remote sensing images.
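The actively rotating vector field filter can be approximated, for intuition, by a bank of rotated copies of one base kernel whose strongest response is kept per pixel, yielding a rotation-tolerant response plus a dominant-orientation map. The kernel, angle step, and toy image below are assumptions; this is not the paper's exact filter.

```python
# Hedged sketch of an actively rotating filter: one base kernel is
# rotated to several orientations and the strongest response is kept,
# making the response roughly rotation-tolerant while also exposing
# the dominant direction at each pixel.
import numpy as np
from scipy.ndimage import rotate, convolve

base = np.zeros((9, 9))
base[4, :] = 1.0                    # a horizontal "ship-like" bar kernel

angles = range(0, 180, 15)          # orientation bank, 15-degree steps
bank = [rotate(base, a, reshape=False, order=1) for a in angles]

image = np.random.rand(128, 128)    # toy remote sensing tile
responses = np.stack([convolve(image, k) for k in bank])
orientation_map = responses.argmax(axis=0)   # dominant direction index
strength_map = responses.max(axis=0)         # rotation-tolerant response
```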


2021 ◽  
Author(s):  
Christoph Brandl ◽  
Jens Albrecht ◽  
Renato Budinich

The task of relation extraction aims at classifying the semantic relations between entities in a text. When coupled with named-entity recognition, these relations can serve as the building blocks of an information extraction pipeline that results in the construction of a knowledge graph. While many NLP libraries support named-entity recognition, there is no off-the-shelf solution for relation extraction. In this paper, we evaluate and compare several state-of-the-art approaches on a subset of the FewRel data set as well as a manually annotated corpus. The custom corpus contains six relations from the area of market research and is available for public use. Our approach provides guidance for the selection of models and training data for relation extraction in real-world projects.
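For context on what such comparisons are measured against, a minimal relation-classification baseline can be set up with entity-marker tokens and a linear model; the tiny corpus, labels, and marker convention below are invented purely for illustration.

```python
# Hedged sketch of a minimal relation-classification baseline:
# sentences with marked entity spans, a bag-of-words encoder, and a
# linear classifier. The data here is invented for illustration only.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.pipeline import make_pipeline

# Inline entity markers [E1]..[/E1], [E2]..[/E2], a common convention.
train_texts = [
    "[E1]Acme Corp[/E1] acquired [E2]Widget Ltd[/E2] last year.",
    "[E1]Jane Doe[/E1] is the CEO of [E2]Acme Corp[/E2].",
    "[E1]Acme Corp[/E1] bought [E2]Foo Inc[/E2] in 2019.",
    "[E1]John Roe[/E1] heads [E2]Bar GmbH[/E2].",
]
train_labels = ["acquired", "ceo_of", "acquired", "ceo_of"]

clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                    LogisticRegression())
clf.fit(train_texts, train_labels)

test_texts = ["[E1]Baz AG[/E1] acquired [E2]Qux Oy[/E2]."]
print(clf.predict(test_texts))          # -> ['acquired']
print(f1_score(train_labels, clf.predict(train_texts), average="macro"))
```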


Telematika ◽  
2021 ◽  
Vol 18 (1) ◽  
pp. 37
Author(s):  
Rismiyati Rismiyati ◽  
Ardytha Luthfiarta

Purpose: This study aims to differentiate the quality of salak fruit with machine learning. Salak is classified into two classes, good and bad.
Design/methodology/approach: The algorithm used in this research is transfer learning with the VGG16 architecture. The data set used in this research consists of 370 images of salak: 190 from the good class and 180 from the bad class. The images are preprocessed by resizing and normalizing their pixel values, then split into 80% training data and 20% testing data. The training data are used to train a pretrained VGG16 model. The parameters varied during training are the epoch count, momentum, and learning rate. The resulting model is then used for testing. Accuracy, precision, and recall are monitored to determine the best model for classifying the images.
Findings/result: The highest accuracy obtained in this study is 95.83%, achieved with a learning rate of 0.0001 and momentum of 0.9. The precision and recall for this model are 97.2% and 94.6%, respectively.
Originality/value/state of the art: Transfer learning has not previously been applied to salak quality classification.
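A minimal sketch of the described setup, assuming a 224x224 input and a small dense head (neither is stated in the abstract): a frozen ImageNet VGG16 base trained with SGD at the reported best learning rate 0.0001 and momentum 0.9.

```python
# Hedged sketch of the transfer-learning setup: a frozen ImageNet
# VGG16 base with a small classification head for two salak classes.
# Input size and head layout are illustrative assumptions.
import tensorflow as tf

base = tf.keras.applications.VGG16(weights="imagenet",
                                   include_top=False,
                                   input_shape=(224, 224, 3))
base.trainable = False                     # transfer learning: freeze base

model = tf.keras.Sequential([
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(256, activation="relu"),
    tf.keras.layers.Dense(2, activation="softmax"),   # good vs. bad salak
])
model.compile(optimizer=tf.keras.optimizers.SGD(learning_rate=1e-4,
                                                momentum=0.9),
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(train_ds, validation_data=val_ds, epochs=30)  # 80/20 split
```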


2020 ◽  
Vol 12 (24) ◽  
pp. 4162
Author(s):  
Anna Hu ◽  
Zhong Xie ◽  
Yongyang Xu ◽  
Mingyu Xie ◽  
Liang Wu ◽  
...  

One major limitation of remote-sensing imagery is bad weather conditions, such as haze, which significantly reduce the accuracy of satellite image interpretation. To solve this problem, this paper proposes a novel unsupervised method to remove haze from high-resolution optical remote-sensing images. The proposed method, based on cycle generative adversarial networks, is called the edge-sharpening cycle-consistent adversarial network (ES-CCGAN). Most importantly, unlike existing methods, this approach does not require prior information; training is unsupervised, which eases the burden of preparing the training data set. To enhance the ability to extract ground-object information, the generative network replaces a residual neural network (ResNet) with a dense convolutional network (DenseNet). An edge-sharpening loss function is designed for the deep-learning model to recover clear ground-object edges and obtain more detailed information from hazy images. In the high-frequency information extraction model, this study re-trained the Visual Geometry Group (VGG) network using remote-sensing images. Experimental results show that the proposed method successfully recovers different kinds of scenes from hazy images with excellent color consistency. Moreover, its ability to recover clear edges and rich texture information makes it superior to existing methods.
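The edge-sharpening term can be sketched as an L1 penalty between Sobel gradient maps of the dehazed output and a clear reference; the exact formulation, weighting, and image pairing in ES-CCGAN may differ, so the code below is only an illustration of the idea.

```python
# Hedged sketch of an edge-sharpening term: Sobel gradients of the
# dehazed output are pulled toward those of a clear reference with an
# L1 penalty. The full ES-CCGAN adds cycle and adversarial losses.
import torch
import torch.nn.functional as F

def sobel_edges(x):
    """x: (N, 1, H, W) grayscale; returns gradient-magnitude maps."""
    kx = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]])
    ky = kx.t().contiguous()
    kx = kx.view(1, 1, 3, 3)
    ky = ky.view(1, 1, 3, 3)
    gx = F.conv2d(x, kx, padding=1)
    gy = F.conv2d(x, ky, padding=1)
    return torch.sqrt(gx ** 2 + gy ** 2 + 1e-6)

def edge_sharpening_loss(dehazed, clear):
    gray = lambda t: t.mean(dim=1, keepdim=True)   # RGB -> grayscale
    return F.l1_loss(sobel_edges(gray(dehazed)), sobel_edges(gray(clear)))

fake_clear = torch.randn(2, 3, 64, 64, requires_grad=True)
real_clear = torch.randn(2, 3, 64, 64)
loss = edge_sharpening_loss(fake_clear, real_clear)
loss.backward()
```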


Author(s):  
M. Moscadelli ◽  
M. Diani ◽  
G. Corsini

In this paper, a methodology for evaluating the effectiveness of different temperature and emissivity separation (TES) strategies is presented. The methodology takes into account the specific material of interest in the monitored scenario, the sensor characteristics, and the errors introduced in the atmospheric compensation step. It is proposed in order to predict and analyse algorithm performance during the planning of a remote sensing mission aimed at discovering specific materials of interest in the monitored scenario. As a case study, the proposed methodology is applied to a real airborne data set of a suburban scenario. To address the TES problem, three state-of-the-art algorithms and a recently proposed one are investigated: the Temperature-Emissivity Separation '98 (TES-98) algorithm, the Stepwise Refining TES (SRTES) algorithm, the Linear Piecewise TES (LTES) algorithm, and the Optimized Smoothing TES (OSTES) algorithm. Finally, the accuracies obtained with real data and those predicted by the proposed methodology are compared and discussed.
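The prediction side of the methodology can be pictured as a Monte Carlo loop: perturb a material's simulated at-sensor radiance with sensor noise and an atmospheric-compensation error, run a TES routine on each draw, and summarize the emissivity error. Everything below, including the run_tes placeholder, is a toy stand-in for the paper's models.

```python
# Hedged Monte Carlo sketch of performance prediction: perturb a
# simulated radiance with sensor noise and an atmospheric-compensation
# bias, run a TES routine per draw, and summarize the emissivity error.
import numpy as np

def run_tes(radiance):
    """Placeholder TES: stands in for TES-98 / SRTES / LTES / OSTES."""
    return radiance / radiance.max()        # toy 'emissivity' estimate

rng = np.random.default_rng(1)
true_emis = np.array([0.92, 0.95, 0.97, 0.96, 0.94])  # material of interest
clean_rad = true_emis * 10.0                # toy at-sensor radiance

errors = []
for _ in range(1000):                       # Monte Carlo trials
    noisy = clean_rad + rng.normal(0, 0.05, clean_rad.shape)  # sensor noise
    noisy = noisy * rng.normal(1.0, 0.02)   # atmospheric-compensation bias
    errors.append(np.abs(run_tes(noisy) - true_emis).mean())

print(f"predicted mean emissivity error: {np.mean(errors):.4f}")
```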

