Building Corner Detection in Aerial Images with Fully Convolutional Networks

Sensors ◽  
2019 ◽  
Vol 19 (8) ◽  
pp. 1915 ◽  
Author(s):  
Weigang Song ◽  
Baojiang Zhong ◽  
Xun Sun

In aerial images, corner points can be detected to describe the structural information of buildings for city modeling, geo-localization, and other applications. For this specific vision task, existing generic corner detectors perform poorly, as they cannot distinguish corner points on buildings from those on other objects such as trees and shadows. Recently, fully convolutional networks (FCNs) have been developed for semantic image segmentation; they are able to recognize a designated kind of object through training on a manually labeled dataset. Motivated by this achievement, an FCN-based approach is proposed in the present work to detect building corners in aerial images. First, a DeepLab model composed of improved FCNs and fully connected conditional random fields (CRFs) is trained end-to-end for building region segmentation. The segmentation accuracy is then further improved with a morphological opening operation. Corner points are finally detected on the contour curves of building regions using a scale-space detector. Experimental results show that the proposed building corner detection approach achieves an F-measure of 0.83 on the test image set and outperforms a number of state-of-the-art corner detectors by a large margin.
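The mask-cleaning step of such a pipeline can be illustrated with a minimal NumPy/SciPy sketch. This is not the paper's implementation; the mask, structuring-element size, and shapes are illustrative only:

```python
import numpy as np
from scipy import ndimage

def clean_segmentation(mask, size=3):
    """Morphological opening (erosion then dilation): removes speckle
    smaller than the structuring element while leaving larger building
    regions essentially intact."""
    structure = np.ones((size, size), dtype=bool)
    return ndimage.binary_opening(mask, structure=structure)

# Synthetic segmentation: one 6x6 "building" plus a spurious pixel.
mask = np.zeros((12, 12), dtype=bool)
mask[3:9, 3:9] = True   # building region
mask[0, 11] = True      # isolated false positive (e.g. a shadow fragment)

cleaned = clean_segmentation(mask)
```

The isolated pixel is erased by the erosion pass, while the dilation pass restores the building region to its original footprint; corner detection then runs on the contours of the cleaned regions.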

Sensors ◽  
2020 ◽  
Vol 20 (2) ◽  
pp. 563 ◽  
Author(s):  
Daliana Lobo Torres ◽  
Raul Queiroz Feitosa ◽  
Patrick Nigri Happ ◽  
Laura Elena Cué La Rosa ◽  
José Marcato Junior ◽  
...  

This study proposes and evaluates five deep fully convolutional networks (FCNs) for the semantic segmentation of a single tree species: SegNet, U-Net, FC-DenseNet, and two DeepLabv3+ variants. The performance of the FCN designs is evaluated experimentally in terms of classification accuracy and computational load. We also verify the benefits of fully connected conditional random fields (CRFs) as a post-processing step to improve the segmentation maps. The analysis is conducted on a set of images captured by an RGB camera aboard a UAV flying over an urban area. The dataset also contains a mask that indicates the occurrence of an endangered species called Dipteryx alata Vogel, also known as cumbaru, taken as the species to be identified. The experimental analysis shows the effectiveness of each design and reports average overall accuracy ranging from 88.9% to 96.7%, an F1-score between 87.0% and 96.1%, and IoU from 77.1% to 92.5%. We also find that the CRF consistently improves performance, but at a high computational cost.
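The overall-accuracy and IoU figures quoted above follow the standard definitions for binary masks; as a reminder, they can be computed as below (toy arrays for illustration, not the study's data):

```python
import numpy as np

def overall_accuracy(pred, ref):
    """Fraction of pixels whose predicted label matches the reference."""
    return np.mean(pred == ref)

def iou(pred, ref):
    """Intersection over union of two binary masks."""
    inter = np.sum(pred & ref)
    union = np.sum(pred | ref)
    return inter / union if union else 0.0

pred = np.array([True, True, False, False])
ref  = np.array([True, False, True, False])
```

For these toy masks, accuracy counts two matching pixels out of four, while IoU counts one overlapping positive out of three positives in the union.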


Electronics ◽  
2020 ◽  
Vol 9 (4) ◽  
pp. 583 ◽  
Author(s):  
Khang Nguyen ◽  
Nhut T. Huynh ◽  
Phat C. Nguyen ◽  
Khanh-Duy Nguyen ◽  
Nguyen D. Vo ◽  
...  

Unmanned aircraft systems, or drones, enable us to record or capture many scenes from a bird's-eye view, and they have been rapidly deployed to a wide range of practical domains, e.g., agriculture, aerial photography, fast delivery, and surveillance. Object detection is one of the core steps in understanding videos collected from drones. However, this task is very challenging due to the unconstrained viewpoints and low resolution of the captured videos. While modern deep-learning object detectors have recently achieved great success on general benchmarks, e.g., PASCAL VOC and MS COCO, the robustness of these detectors on aerial images captured by drones is not well studied. In this paper, we present an evaluation of state-of-the-art deep-learning detectors, including Faster R-CNN (Faster Region-based CNN), R-FCN (Region-based Fully Convolutional Networks), SNIPER (Scale Normalization for Image Pyramids with Efficient Resampling), Single-Shot Detector (SSD), YOLO (You Only Look Once), RetinaNet, and CenterNet, for object detection in videos captured by drones. We conduct experiments on the VisDrone2019 dataset, which contains 96 videos with 39,988 annotated frames, and provide insights into efficient object detectors for aerial images.
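Detector benchmarks of this kind match predictions to ground truth by bounding-box intersection over union (IoU); a minimal sketch of that matching criterion, with corner-format boxes chosen for illustration:

```python
def box_iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    iw, ih = max(0.0, ix2 - ix1), max(0.0, iy2 - iy1)
    inter = iw * ih
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)
```

A prediction is typically counted as a true positive when its IoU with an unmatched ground-truth box exceeds a threshold such as 0.5; averaging precision over recall levels then yields the usual mAP score.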


2019 ◽  
Vol 11 (23) ◽  
pp. 2844 ◽  
Author(s):  
Ruoyun Liu ◽  
Monika Kuffer ◽  
Claudio Persello

Along with rapid urbanization, the growth and persistence of slums is a global challenge. While remote sensing imagery is increasingly used for producing slum maps, only a few studies have analyzed their temporal dynamics. This study explores the potential of fully convolutional networks (FCNs) to analyze the temporal dynamics of small clusters of temporary slums using very high resolution (VHR) imagery in Bangalore, India. The study develops two approaches based on FCNs. The first uses post-classification change detection, and the second trains FCNs to directly classify the dynamics of slums. For both approaches, networks with 3 × 3 kernels were compared against networks with 5 × 5 kernels. While classification results for individual years exhibit a relatively high average F1-score (3 × 3 kernel) of 88.4%, the change accuracies are lower: the post-classification results obtained an F1-score of 53.8% and the change-detection networks obtained an F1-score of 53.7%. According to the trajectory error matrix (TEM), the post-classification results scored higher for overall accuracy but lower for the accuracy difference of change trajectories than the change-detection networks. Although the two methods did not differ significantly in accuracy, the change-detection network was less noisy. Within our study area, the areas of slums show a small overall decrease; the annual growth of slums (between 2012 and 2016) was 7173 m2, against an annual decline of 8390 m2. However, these numbers hide the spatial dynamics, which were much larger. Interestingly, areas where slums disappeared commonly changed into green areas, not into built-up areas. The proposed change-detection network provides a robust map of the locations of changes, with lower confidence about the exact boundaries. This shows the potential of FCNs for detecting the dynamics of slums in VHR imagery.
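The post-classification idea is simple to state in code: classify each date independently, then flag the pixels whose label flips, and score the flagged change map with an F1-score. A minimal sketch with toy maps (not the study's data or networks):

```python
import numpy as np

def post_classification_change(slum_t1, slum_t2):
    """Post-classification change detection: compare two per-date
    classifications and flag pixels whose slum label flips."""
    return slum_t1 != slum_t2

def f1(pred, ref):
    """F1-score of a predicted binary change map against a reference."""
    tp = np.sum(pred & ref)
    fp = np.sum(pred & ~ref)
    fn = np.sum(~pred & ref)
    p = tp / (tp + fp) if tp + fp else 0.0
    r = tp / (tp + fn) if tp + fn else 0.0
    return 2 * p * r / (p + r) if p + r else 0.0

# Toy slum maps for two dates (True = slum pixel).
t1 = np.array([True, True, False, False])
t2 = np.array([True, False, True, False])
change = post_classification_change(t1, t2)
```

A weakness visible even in this sketch is that per-date classification errors compound in the difference map, which is one reason the study also trains networks to classify change directly.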


Author(s):  
Ceyda Nur Ozturk ◽  
Songul Albayrak

Corner points in three-dimensional (3-D) volumetric images can be detected more effectively by extending the Harris corner detection algorithm, which operates on two-dimensional (2-D) images, into the third dimension. In this study, the standard 2-D Harris algorithm, applied slice by slice, and its 3-D extension were implemented in scale-space to determine the corner points of volumetric object images. The results obtained on sample object images with the 2-D and 3-D methods, which used different approaches for scale-space construction, were assessed qualitatively.
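The core of the 3-D extension is replacing the 2 × 2 Harris structure tensor with a 3 × 3 one built from all pairwise gradient products. The sketch below is illustrative, not the authors' implementation: it scores each voxel by the tensor's smallest eigenvalue (a Shi-Tomasi-style cornerness measure; the classic Harris response det(M) − k·trace(M)² generalizes along the same lines), which is large only where intensity varies in all three directions:

```python
import numpy as np
from scipy import ndimage

def cornerness_3d(volume, sigma=1.0):
    """3-D structure-tensor cornerness: smooth the volume, take
    gradients along all three axes, form the windowed 3x3 tensor of
    gradient products per voxel, and score by its smallest eigenvalue."""
    g = np.gradient(ndimage.gaussian_filter(volume, sigma))
    M = np.empty(volume.shape + (3, 3))
    for i in range(3):
        for j in range(3):
            # Gaussian windowing of the gradient products, as in Harris
            M[..., i, j] = ndimage.gaussian_filter(g[i] * g[j], sigma)
    return np.linalg.eigvalsh(M)[..., 0]  # eigenvalues ascend; take min

# A bright cube in an empty volume: its vertices are 3-D corner points.
vol = np.zeros((16, 16, 16))
vol[4:12, 4:12, 4:12] = 1.0
score = cornerness_3d(vol)
```

At a cube vertex the windowed gradients span all three axes, so the smallest eigenvalue is clearly positive; at a face center the gradient is one-directional and the score stays near zero, as in flat background.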


Sensors ◽  
2021 ◽  
Vol 21 (6) ◽  
pp. 1983
Author(s):  
Weipeng Shi ◽  
Wenhu Qin ◽  
Zhonghua Yun ◽  
Peng Ping ◽  
Kaiyang Wu ◽  
...  

It is essential for researchers to interpret remote sensing images (RSIs) properly and to label their component parts with precise semantics. Although FCN (Fully Convolutional Network)-like deep convolutional architectures have been widely applied in the perception of autonomous cars, two challenges remain in the semantic segmentation of RSIs. The first is to identify details in high-resolution images with complex scenes and to solve class-mismatch issues; the second is to capture object edges finely without being confused by the surroundings. HRNet maintains high-resolution representations by fusing feature information across parallel multi-resolution convolution branches. We adopt HRNet as a backbone and propose to incorporate the Class-Oriented Region Attention Module (CRAM) and Class-Oriented Context Fusion Module (CCFM) to analyze the relationships between classes and patch regions and between classes and local or global pixels, respectively. Thus, the model's perception of fine details in aerial images is enhanced. We leverage these modules to develop an end-to-end semantic segmentation model for aerial images and validate it on the ISPRS Potsdam and Vaihingen datasets. The experimental results show that our model improves on the baseline accuracy and outperforms several commonly used CNN architectures.
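The multi-resolution fusion idea behind the HRNet backbone can be sketched in a few lines of NumPy. This is a shape-level illustration only: real HRNet branches are learned feature maps, and its fusion uses learned 1 × 1 convolutions with summation rather than nearest-neighbor upsampling and concatenation:

```python
import numpy as np

def upsample_nearest(feat, factor):
    """(C, H, W) -> (C, H*factor, W*factor) by pixel repetition."""
    return feat.repeat(factor, axis=1).repeat(factor, axis=2)

def fuse_branches(branches):
    """HRNet-style fusion sketch: bring every parallel branch up to the
    finest resolution, then stack along the channel axis so a
    segmentation head sees fine detail and coarse context together."""
    top_h = max(b.shape[1] for b in branches)
    aligned = [upsample_nearest(b, top_h // b.shape[1]) for b in branches]
    return np.concatenate(aligned, axis=0)

# Illustrative parallel branches at 1x, 1/2x, and 1/4x resolution.
branches = [np.random.rand(8, 16, 16),
            np.random.rand(16, 8, 8),
            np.random.rand(32, 4, 4)]
fused = fuse_branches(branches)
```

Keeping a full-resolution branch alive throughout, instead of recovering resolution only at the end as in encoder-decoder designs, is what lets the backbone preserve the fine edges that the abstract identifies as a key challenge.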


2011 ◽  
Vol 22 (8) ◽  
pp. 1897-1910 ◽  
Author(s):  
Yun LIU ◽  
Zhi-Ping CAI ◽  
Ping ZHONG ◽  
Jian-Ping YIN ◽  
Jie-Ren CHENG
