Aerial Image Road Extraction Based on an Improved Generative Adversarial Network

Xiangrong Zhang; Xiao Han; Chen Li; Xu Tang; Huiyu Zhou; Licheng Jiao

doi:10.3390/rs11080930

Aerial Image Road Extraction Based on an Improved Generative Adversarial Network

Remote Sensing ◽

10.3390/rs11080930 ◽

2019 ◽

Vol 11 (8) ◽

pp. 930 ◽

Cited By ~ 8

Author(s):

Xiangrong Zhang ◽

Xiao Han ◽

Chen Li ◽

Xu Tang ◽

Huiyu Zhou ◽

...

Keyword(s):

Law Enforcement ◽

State Of The Art ◽

Automated Detection ◽

Aerial Images ◽

Aerial Photographs ◽

Aerial Image ◽

Detection Accuracy ◽

Road Extraction ◽

Generative Adversarial Network ◽

Adversarial Network

Aerial photographs and satellite images are one of the resources used for earth observation. In practice, automated detection of roads on aerial images is of significant values for the application such as car navigation, law enforcement, and fire services. In this paper, we present a novel road extraction method from aerial images based on an improved generative adversarial network, which is an end-to-end framework only requiring a few samples for training. Experimental results on the Massachusetts Roads Dataset show that the proposed method provides better performance than several state of the art techniques in terms of detection accuracy, recall, precision and F1-score.

Download Full-text

Disentangled generative adversarial network for low-dose CT

EURASIP Journal on Advances in Signal Processing ◽

10.1186/s13634-021-00749-z ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Wenchao Du ◽

Hu Chen ◽

Hongyu Yang ◽

Yi Zhang

Keyword(s):

Network Architecture ◽

Low Dose ◽

Noise Suppression ◽

State Of The Art ◽

Visual Quality ◽

Ct Images ◽

Generative Adversarial Network ◽

Low Dose Ct ◽

Adversarial Network ◽

Suppression Method

AbstractGenerative adversarial network (GAN) has been applied for low-dose CT images to predict normal-dose CT images. However, the undesired artifacts and details bring uncertainty to the clinical diagnosis. In order to improve the visual quality while suppressing the noise, in this paper, we mainly studied the two key components of deep learning based low-dose CT (LDCT) restoration models—network architecture and adversarial loss, and proposed a disentangled noise suppression method based on GAN (DNSGAN) for LDCT. Specifically, a generator network, which contains the noise suppression and structure recovery modules, is proposed. Furthermore, a multi-scaled relativistic adversarial loss is introduced to preserve the finer structures of generated images. Experiments on simulated and real LDCT datasets show that the proposed method can effectively remove noise while recovering finer details and provide better visual perception than other state-of-the-art methods.

Download Full-text

Road Detection Based on Edge Feature with GAC Model in Aerial Image

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s021800142154032x ◽

2021 ◽

Author(s):

Linying Zhou ◽

Zhou Zhou ◽

Hang Ning

Keyword(s):

Transformation Method ◽

Aerial Images ◽

Aerial Image ◽

Initial Contour ◽

Detection Accuracy ◽

Road Detection ◽

Canny Operator ◽

The Road ◽

Gradient Magnitude ◽

Edge Feature

Road detection from aerial images still is a challenging task since it is heavily influenced by spectral reflectance, shadows and occlusions. In order to increase the road detection accuracy, a proposed method for road detection by GAC model with edge feature extraction and segmentation is studied in this paper. First, edge feature can be extracted using the proposed gradient magnitude with Canny operator. Then, a reconstructed gradient map is applied in watershed transformation method, which is segmented for the next initial contour. Last, with the combination of edge feature and initial contour, the boundary stopping function is applied in the GAC model. The road boundary result can be accomplished finally. Experimental results show, by comparing with other methods in [Formula: see text]-measure system, that the proposed method can achieve satisfying results.

Download Full-text

Learning a Generative Model for Fusing Infrared and Visible Images via Conditional Generative Adversarial Network with Dual Discriminators

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/549 ◽

2019 ◽

Cited By ~ 12

Author(s):

Han Xu ◽

Pengwei Liang ◽

Wei Yu ◽

Junjun Jiang ◽

Jiayi Ma

Keyword(s):

Probability Distribution ◽

State Of The Art ◽

Infrared Image ◽

Infrared Images ◽

Generative Adversarial Network ◽

Visible Image ◽

Qualitative And Quantitative ◽

Adversarial Network ◽

Fused Image ◽

Visible Images

In this paper, we propose a new end-to-end model, called dual-discriminator conditional generative adversarial network (DDcGAN), for fusing infrared and visible images of different resolutions. Unlike the pixel-level methods and existing deep learning-based methods, the fusion task is accomplished through the adversarial process between a generator and two discriminators, in addition to the specially designed content loss. The generator is trained to generate real-like fused images to fool discriminators. The two discriminators are trained to calculate the JS divergence between the probability distribution of downsampled fused images and infrared images, and the JS divergence between the probability distribution of gradients of fused images and gradients of visible images, respectively. Thus, the fused images can compensate for the features that are not constrained by the single content loss. Consequently, the prominence of thermal targets in the infrared image and the texture details in the visible image can be preserved or even enhanced in the fused image simultaneously. Moreover, by constraining and distinguishing between the downsampled fused image and the low-resolution infrared image, DDcGAN can be preferably applied to the fusion of different resolution images. Qualitative and quantitative experiments on publicly available datasets demonstrate the superiority of our method over the state-of-the-art.

Download Full-text

Multiple-Oriented and Small Object Detection with Convolutional Neural Networks for Aerial Image

Remote Sensing ◽

10.3390/rs11182176 ◽

2019 ◽

Vol 11 (18) ◽

pp. 2176 ◽

Cited By ~ 3

Author(s):

Chen ◽

Zhong ◽

Tan

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Aerial Images ◽

Superior Performance ◽

Aerial Image ◽

Detection Accuracy ◽

Small Object ◽

Data Set ◽

Orientation Information

Detecting objects in aerial images is a challenging task due to multiple orientations and relatively small size of the objects. Although many traditional detection models have demonstrated an acceptable performance by using the imagery pyramid and multiple templates in a sliding-window manner, such techniques are inefficient and costly. Recently, convolutional neural networks (CNNs) have successfully been used for object detection, and they have demonstrated considerably superior performance than that of traditional detection methods; however, this success has not been expanded to aerial images. To overcome such problems, we propose a detection model based on two CNNs. One of the CNNs is designed to propose many object-like regions that are generated from the feature maps of multi scales and hierarchies with the orientation information. Based on such a design, the positioning of small size objects becomes more accurate, and the generated regions with orientation information are more suitable for the objects arranged with arbitrary orientations. Furthermore, another CNN is designed for object recognition; it first extracts the features of each generated region and subsequently makes the final decisions. The results of the extensive experiments performed on the vehicle detection in aerial imagery (VEDAI) and overhead imagery research data set (OIRDS) datasets indicate that the proposed model performs well in terms of not only the detection accuracy but also the detection speed.

Download Full-text

Insulator Segmentation for Power Line Inspection Based on Modified Conditional Generative Adversarial Network

Journal of Sensors ◽

10.1155/2019/4245329 ◽

2019 ◽

Vol 2019 ◽

pp. 1-8

Author(s):

Zishu Gao ◽

Guodong Yang ◽

En Li ◽

Tianyu Shen ◽

Zhe Wang ◽

...

Keyword(s):

Transmission Lines ◽

Power Supply ◽

Power Transmission ◽

State Of The Art ◽

Power Line ◽

Power Transmission Lines ◽

Generative Adversarial Network ◽

Convolutional Network ◽

Adversarial Network ◽

Feature Information

There are a large number of insulators on the transmission line, and insulator damage will have a major impact on power supply security. Image-based segmentation of the insulators in the power transmission lines is a premise and also a critical task for power line inspection. In this paper, a modified conditional generative adversarial network for insulator pixel-level segmentation is proposed. The generator is reconstructed by encoder-decoder layers with asymmetric convolution kernel which can simplify the network complexity and extract more kinds of feature information. The discriminator is composed of a fully convolutional network based on patchGAN and learns the loss to train the generator. It is verified in experiments that the proposed method has better performances on mIoU and computational efficiency than Pix2pix, SegNet, and other state-of-the-art networks.

Download Full-text

Joint Entity and Event Extraction with Generative Adversarial Imitation Learning

Data Intelligence ◽

10.1162/dint_a_00014 ◽

2019 ◽

Vol 1 (2) ◽

pp. 99-120 ◽

Cited By ~ 6

Author(s):

Tongtao Zhang ◽

Heng Ji ◽

Avirup Sil

Keyword(s):

State Of The Art ◽

Ground Truth ◽

Event Extraction ◽

Imitation Learning ◽

Learning Method ◽

Inverse Reinforcement Learning ◽

Generative Adversarial Network ◽

Adversarial Network ◽

The Difference ◽

New Framework

We propose a new framework for entity and event extraction based on generative adversarial imitation learning—an inverse reinforcement learning method using a generative adversarial network (GAN). We assume that instances and labels yield to various extents of difficulty and the gains and penalties (rewards) are expected to be diverse. We utilize discriminators to estimate proper rewards according to the difference between the labels committed by the ground-truth (expert) and the extractor (agent). Our experiments demonstrate that the proposed framework outperforms state-of-the-art methods.

Download Full-text

InsulatorGAN: A Transmission Line Insulator Detection Model Using Multi-Granularity Conditional Generative Adversarial Nets for UAV Inspection

Remote Sensing ◽

10.3390/rs13193971 ◽

2021 ◽

Vol 13 (19) ◽

pp. 3971

Author(s):

Wenxiang Chen ◽

Yingna Li ◽

Zhengang Zhao

Keyword(s):

Transmission Line ◽

Transmission Lines ◽

State Of The Art ◽

Generative Adversarial Network ◽

Detection Model ◽

Adversarial Network ◽

Monte Carlo Search ◽

Model Training ◽

Inspection Tasks

Insulator detection is one of the most significant issues in high-voltage transmission line inspection using unmanned aerial vehicles (UAVs) and has attracted attention from researchers all over the world. The state-of-the-art models in object detection perform well in insulator detection, but the precision is limited by the scale of the dataset and parameters. Recently, the Generative Adversarial Network (GAN) was found to offer excellent image generation. Therefore, we propose a novel model called InsulatorGAN based on using conditional GANs to detect insulators in transmission lines. However, due to the fixed categories in datasets such as ImageNet and Pascal VOC, the generated insulator images are of a low resolution and are not sufficiently realistic. To solve these problems, we established an insulator dataset called InsuGenSet for model training. InsulatorGAN can generate high-resolution, realistic-looking insulator-detection images that can be used for data expansion. Moreover, InsulatorGAN can be easily adapted to other power equipment inspection tasks and scenarios using one generator and multiple discriminators. To give the generated images richer details, we also introduced a penalty mechanism based on a Monte Carlo search in InsulatorGAN. In addition, we proposed a multi-scale discriminator structure based on a multi-task learning mechanism to improve the quality of the generated images. Finally, experiments on the InsuGenSet and CPLID datasets demonstrated that our model outperforms existing state-of-the-art models by advancing both the resolution and quality of the generated images as well as the position of the detection box in the images.

Download Full-text

Event Factuality Identification via Generative Adversarial Networks with Auxiliary Classification

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/597 ◽

2018 ◽

Cited By ~ 2

Author(s):

Zhong Qian ◽

Peifeng Li ◽

Yue Zhang ◽

Guodong Zhou ◽

Qiaoming Zhu

Keyword(s):

State Of The Art ◽

Experimental Results ◽

Generative Adversarial Networks ◽

Semantic Task ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Traditional Research ◽

Syntactic Information ◽

Event Factuality

Event factuality identification is an important semantic task in NLP. Traditional research heavily relies on annotated texts. This paper proposes a two-step framework, first extracting essential factors related with event factuality from raw texts as the input, and then identifying the factuality of events via a Generative Adversarial Network with Auxiliary Classification (AC-GAN). The use of AC-GAN allows the model to learn more syntactic information and address the imbalance among factuality values. Experimental results on FactBank show that our method significantly outperforms several state-of-the-art baselines, particularly on events with embedded sources, speculative and negative factuality values.

Download Full-text

An Adversarial and Densely Dilated Network for Connectomes Segmentation

Symmetry ◽

10.3390/sym10100467 ◽

2018 ◽

Vol 10 (10) ◽

pp. 467 ◽

Cited By ~ 2

Author(s):

Ke Chen ◽

Dandan Zhu ◽

Jianwei Lu ◽

Ye Luo

Keyword(s):

State Of The Art ◽

Piriform Cortex ◽

Receptive Fields ◽

Ground Truth ◽

Training Data ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Boundary Map ◽

Segmentation Task ◽

The Brain

Automatic reconstructing of neural circuits in the brain is one of the most crucial studies in neuroscience. Connectomes segmentation plays an important role in reconstruction from electron microscopy (EM) images; however, it is rather challenging due to highly anisotropic shapes with inferior quality and various thickness. In our paper, we propose a novel connectomes segmentation framework called adversarial and densely dilated network (ADDN) to address these issues. ADDN is based on the conditional Generative Adversarial Network (cGAN) structure which is the latest advance in machine learning with power to generate images similar to the ground truth especially when the training data is limited. Specifically, we design densely dilated network (DDN) as the segmentor to allow a deeper architecture and larger receptive fields for more accurate segmentation. Discriminator is trained to distinguish generated segmentation from manual segmentation. During training, such adversarial loss function is optimized together with dice loss. Extensive experimental results demonstrate that our ADDN is effective for such connectomes segmentation task, helping to retrieve more accurate segmentation and attenuate the blurry effects of generated boundary map. Our method obtains state-of-the-art performance while requiring less computation on ISBI 2012 EM dataset and mouse piriform cortex dataset.

Download Full-text

Multimodal Structure-Consistent Image-to-Image Translation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6814 ◽

2020 ◽

Vol 34 (07) ◽

pp. 11490-11498

Author(s):

Che-Tsung Lin ◽

Yen-Yi Wu ◽

Po-Hao Hsu ◽

Shang-Hong Lai

Keyword(s):

Data Augmentation ◽

Network Models ◽

Detection Accuracy ◽

Target Domain ◽

Generative Adversarial Network ◽

Image Objects ◽

Perceptual Distance ◽

Adversarial Network ◽

Image Translation ◽

Realistic Images

Unpaired image-to-image translation is proven quite effective in boosting a CNN-based object detector for a different domain by means of data augmentation that can well preserve the image-objects in the translated images. Recently, multimodal GAN (Generative Adversarial Network) models have been proposed and were expected to further boost the detector accuracy by generating a diverse collection of images in the target domain, given only a single/labelled image in the source domain. However, images generated by multimodal GANs would achieve even worse detection accuracy than the ones by a unimodal GAN with better object preservation. In this work, we introduce cycle-structure consistency for generating diverse and structure-preserved translated images across complex domains, such as between day and night, for object detector training. Qualitative results show that our model, Multimodal AugGAN, can generate diverse and realistic images for the target domain. For quantitative comparisons, we evaluate other competing methods and ours by using the generated images to train YOLO, Faster R-CNN and FCN models and prove that our model achieves significant improvement and outperforms other methods on the detection accuracies and the FCN scores. Also, we demonstrate that our model could provide more diverse object appearances in the target domain through comparison on the perceptual distance metric.

Download Full-text