Analysis on the Impact of Data Augmentation on Target Recognition for UAV-Based Transmission Line Inspection

Complexity ◽

10.1155/2020/3107450 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Chunhe Song ◽

Wenxiang Xu ◽

Zhongfeng Wang ◽

Shimao Yu ◽

Peng Zeng ◽

...

Keyword(s):

Deep Learning ◽

Transmission Line ◽

Data Augmentation ◽

Target Recognition ◽

Histogram Equalization ◽

Training Samples ◽

Aerial Vehicle ◽

Gaussian Blur ◽

Model Training ◽

The Impact

Target recognition is one of the core tasks of transmission line inspection based on Unmanned Aerial Vehicle (UAV), and at present plenty of deep learning-based methods have been developed for it. To enhance the generalization ability of the recognition models, a huge number of training samples are needed to cover most of all possible situations. However, due to the complexity of the environmental conditions and targets, and the limitations of images’ collection and annotation, the samples usually are insufficient when training a deep learning model for target recognition, which is one of the main factors reducing the performance of the model. To overcome this issue, some data augmentation methods have been developed to generate additional samples for model training. Although these methods have been widely used, currently there is no quantitative study on the impact of the data augmentation methods on target recognition. In this paper, taking insulator strings as the target, the impact of a series of widely used data augmentation methods on the accuracy of target recognition is studied, including histogram equalization, Gaussian blur, random translation, scaling, cutout, and rotation. Extensive tests are carried out to verify the impact of the augmented samples in the training set, the test set, or the both. Experimental results show that data augmentation plays an important role in improving the accuracy of recognition models, in which the impacts of the data augmentation methods such as Gaussian blur, scaling, and rotation are significant.

Download Full-text

Multi-Aspect SAR Target Recognition Based on Prototypical Network with a Small Number of Training Samples

Sensors ◽

10.3390/s21134333 ◽

2021 ◽

Vol 21 (13) ◽

pp. 4333

Author(s):

Pengfei Zhao ◽

Lijia Huang ◽

Yu Xin ◽

Jiayi Guo ◽

Zongxu Pan

Keyword(s):

Deep Learning ◽

Feature Fusion ◽

Recognition Accuracy ◽

Target Recognition ◽

Automatic Target Recognition ◽

Small Sample ◽

Recognition Method ◽

Training Samples ◽

Model Training ◽

Sar Target Recognition

At present, synthetic aperture radar (SAR) automatic target recognition (ATR) has been deeply researched and widely used in military and civilian fields. SAR images are very sensitive to the azimuth aspect of the imaging geomety; the same target at different aspects differs greatly. Thus, the multi-aspect SAR image sequence contains more information for classification and recognition, which requires the reliable and robust multi-aspect target recognition method. Nowadays, SAR target recognition methods are mostly based on deep learning. However, the SAR dataset is usually expensive to obtain, especially for a certain target. It is difficult to obtain enough samples for deep learning model training. This paper proposes a multi-aspect SAR target recognition method based on a prototypical network. Furthermore, methods such as multi-task learning and multi-level feature fusion are also introduced to enhance the recognition accuracy under the case of a small number of training samples. The experiments by using the MSTAR dataset have proven that the recognition accuracy of our method can be close to the accruacy level by all samples and our method can be applied to other feather extraction models to deal with small sample learning problems.

Download Full-text

Data augmentation for computed tomography angiography via synthetic image generation and neural domain adaptation

Current Directions in Biomedical Engineering ◽

10.1515/cdbme-2020-0015 ◽

2020 ◽

Vol 6 (1) ◽

Author(s):

Malte Seemann ◽

Lennart Bargsten ◽

Alexander Schlaefer

Keyword(s):

Computed Tomography ◽

Neural Networks ◽

Deep Learning ◽

Medical Imaging ◽

Computed Tomography Angiography ◽

Data Augmentation ◽

Domain Adaptation ◽

Synthetic Image ◽

Wide Range ◽

The Impact

AbstractDeep learning methods produce promising results when applied to a wide range of medical imaging tasks, including segmentation of artery lumen in computed tomography angiography (CTA) data. However, to perform sufficiently, neural networks have to be trained on large amounts of high quality annotated data. In the realm of medical imaging, annotations are not only quite scarce but also often not entirely reliable. To tackle both challenges, we developed a two-step approach for generating realistic synthetic CTA data for the purpose of data augmentation. In the first step moderately realistic images are generated in a purely numerical fashion. In the second step these images are improved by applying neural domain adaptation. We evaluated the impact of synthetic data on lumen segmentation via convolutional neural networks (CNNs) by comparing resulting performances. Improvements of up to 5% in terms of Dice coefficient and 20% for Hausdorff distance represent a proof of concept that the proposed augmentation procedure can be used to enhance deep learning-based segmentation for artery lumen in CTA images.

Download Full-text

Underwater Acoustic Target Recognition Based on Generative Adversarial Network Data Augmentation

INTER-NOISE and NOISE-CON Congress and Conference Proceedings ◽

10.3397/in-2021-2737 ◽

2021 ◽

Vol 263 (2) ◽

pp. 4558-4564

Author(s):

Minghong Zhang ◽

Xinwei Luo

Keyword(s):

Data Augmentation ◽

Target Recognition ◽

Training Data ◽

Small Samples ◽

Generative Adversarial Network ◽

Data Set ◽

Underwater Acoustic ◽

Adversarial Network ◽

Acoustic Target ◽

The Impact

Underwater acoustic target recognition is an important aspect of underwater acoustic research. In recent years, machine learning has been developed continuously, which is widely and effectively applied in underwater acoustic target recognition. In order to acquire good recognition results and reduce the problem of overfitting, Adequate data sets are essential. However, underwater acoustic samples are relatively rare, which has a certain impact on recognition accuracy. In this paper, in addition of the traditional audio data augmentation method, a new method of data augmentation using generative adversarial network is proposed, which uses generator and discriminator to learn the characteristics of underwater acoustic samples, so as to generate reliable underwater acoustic signals to expand the training data set. The expanded data set is input into the deep neural network, and the transfer learning method is applied to further reduce the impact caused by small samples by fixing part of the pre-trained parameters. The experimental results show that the recognition result of this method is better than the general underwater acoustic recognition method, and the effectiveness of this method is verified.

Download Full-text

A Survey on Deep Learning-Based Short/Zero-Calibration Approaches for EEG-Based Brain–Computer Interfaces

Frontiers in Human Neuroscience ◽

10.3389/fnhum.2021.643386 ◽

2021 ◽

Vol 15 ◽

Author(s):

Wonjun Ko ◽

Eunjin Jeon ◽

Seungwoo Jeong ◽

Jaeun Phyo ◽

Heung-Il Suk

Keyword(s):

Deep Learning ◽

Explicit Knowledge ◽

Data Augmentation ◽

Generative Model ◽

Machine Learning Techniques ◽

Brain Computer Interfaces ◽

Computer Interfaces ◽

Training Samples ◽

Calibration Methods ◽

Complex Patterns

Brain–computer interfaces (BCIs) utilizing machine learning techniques are an emerging technology that enables a communication pathway between a user and an external system, such as a computer. Owing to its practicality, electroencephalography (EEG) is one of the most widely used measurements for BCI. However, EEG has complex patterns and EEG-based BCIs mostly involve a cost/time-consuming calibration phase; thus, acquiring sufficient EEG data is rarely possible. Recently, deep learning (DL) has had a theoretical/practical impact on BCI research because of its use in learning representations of complex patterns inherent in EEG. Moreover, algorithmic advances in DL facilitate short/zero-calibration in BCI, thereby suppressing the data acquisition phase. Those advancements include data augmentation (DA), increasing the number of training samples without acquiring additional data, and transfer learning (TL), taking advantage of representative knowledge obtained from one dataset to address the so-called data insufficiency problem in other datasets. In this study, we review DL-based short/zero-calibration methods for BCI. Further, we elaborate methodological/algorithmic trends, highlight intriguing approaches in the literature, and discuss directions for further research. In particular, we search for generative model-based and geometric manipulation-based DA methods. Additionally, we categorize TL techniques in DL-based BCIs into explicit and implicit methods. Our systematization reveals advances in the DA and TL methods. Among the studies reviewed herein, ~45% of DA studies used generative model-based techniques, whereas ~45% of TL studies used explicit knowledge transferring strategy. Moreover, based on our literature review, we recommend an appropriate DA strategy for DL-based BCIs and discuss trends of TLs used in DL-based BCIs.

Download Full-text

Target Recognition and Evaluation of Typical Transmission Line Equipment Based on Deep Learning

Proceedings of PURPLE MOUNTAIN FORUM 2019-International Forum on Smart Grid Protection and Control - Lecture Notes in Electrical Engineering ◽

10.1007/978-981-13-9783-7_57 ◽

2019 ◽

pp. 701-709

Author(s):

Ziqiang Zhou ◽

Guangyu Yuan ◽

Wanxing Feng ◽

Shanqiang Gu ◽

Peng Fan

Keyword(s):

Deep Learning ◽

Transmission Line ◽

Target Recognition

Download Full-text

UAV-g 2019: Unmanned Aerial Vehicles in Geomatics

Drones ◽

10.3390/drones3030074 ◽

2019 ◽

Vol 3 (3) ◽

pp. 74 ◽

Cited By ~ 2

Author(s):

Nex

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Unmanned Aerial Vehicles ◽

Autonomous Navigation ◽

Aerial Vehicles ◽

Surrounding Environment ◽

Poster Sessions ◽

Aerial Vehicle ◽

The University ◽

The Impact

Unmanned aerial vehicle in geomatics (UAV-g) is a well-established scientific event dedicated to UAVs in geomatics and remote sensing. In the different editions of the journal, new scientific challenges have increased their synergy with adjacent domains, such as robotics and computer vision, thereby increasing the impact of this conference. The 2019 edition has been hosted by the University of Twente (The Netherlands) and has attracted about 300 participants for the full three-day program. Researchers from 36 different countries (from all continents) have presented 89 accepted papers in 17 oral and 2 poster sessions. The presented papers covered multi-disciplinary topics, such as photogrammetry, natural resources monitoring, autonomous navigation, and deep learning. All these contributions have in common the use of UAV platforms for the innovative acquisition and processing of the acquired data and information extracted from the surrounding environment.

Download Full-text

Improving multi-class Boosting-based object detection

Integrated Computer-Aided Engineering ◽

10.3233/ica-200636 ◽

2020 ◽

Vol 28 (1) ◽

pp. 81-96

Author(s):

José Miguel Buenaposada ◽

Luis Baumela

Keyword(s):

Deep Learning ◽

Object Detection ◽

Data Augmentation ◽

Detection Performance ◽

Significant Progress ◽

Training Techniques ◽

Multi Scale ◽

Bounding Box ◽

Open Issue ◽

The Impact

In recent years we have witnessed significant progress in the performance of object detection in images. This advance stems from the use of rich discriminative features produced by deep models and the adoption of new training techniques. Although these techniques have been extensively used in the mainstream deep learning-based models, it is still an open issue to analyze their impact in alternative, and computationally more efficient, ensemble-based approaches. In this paper we evaluate the impact of the adoption of data augmentation, bounding box refinement and multi-scale processing in the context of multi-class Boosting-based object detection. In our experiments we show that use of these training advancements significantly improves the object detection performance.

Download Full-text

A Full Stage Data Augmentation Method in Deep Convolutional Neural Network for Natural Image Classification

Discrete Dynamics in Nature and Society ◽

10.1155/2020/4706576 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11 ◽

Cited By ~ 4

Author(s):

Qinghe Zheng ◽

Mingqiang Yang ◽

Xinyu Tian ◽

Nan Jiang ◽

Deqiang Wang

Keyword(s):

Deep Learning ◽

Image Classification ◽

Network Architecture ◽

Data Augmentation ◽

Coarse Grained ◽

Natural Image ◽

Deep Convolutional Neural Networks ◽

Specific Domain ◽

Training Costs ◽

Model Training

Nowadays, deep learning has achieved remarkable results in many computer vision related tasks, among which the support of big data is essential. In this paper, we propose a full stage data augmentation framework to improve the accuracy of deep convolutional neural networks, which can also play the role of implicit model ensemble without introducing additional model training costs. Simultaneous data augmentation during training and testing stages can ensure network optimization and enhance its generalization ability. Augmentation in two stages needs to be consistent to ensure the accurate transfer of specific domain information. Furthermore, this framework is universal for any network architecture and data augmentation strategy and therefore can be applied to a variety of deep learning based tasks. Finally, experimental results about image classification on the coarse-grained dataset CIFAR-10 (93.41%) and fine-grained dataset CIFAR-100 (70.22%) demonstrate the effectiveness of the framework by comparing with state-of-the-art results.

Download Full-text

Deep Learning Applied to Phenotyping of Biomass in Forages with UAV-Based RGB Imagery

Sensors ◽

10.3390/s20174802 ◽

2020 ◽

Vol 20 (17) ◽

pp. 4802 ◽

Cited By ~ 2

Author(s):

Wellington Castro ◽

José Marcato Junior ◽

Caio Polidoro ◽

Lucas Prado Osco ◽

Wesley Gonçalves ◽

...

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Biomass Yield ◽

Absolute Error ◽

Grass Species ◽

Biomass Estimation ◽

Panicum Maximum ◽

Breeding Populations ◽

Aerial Vehicle ◽

High Throughput Phenotyping

Monitoring biomass of forages in experimental plots and livestock farms is a time-consuming, expensive, and biased task. Thus, non-destructive, accurate, precise, and quick phenotyping strategies for biomass yield are needed. To promote high-throughput phenotyping in forages, we propose and evaluate the use of deep learning-based methods and UAV (Unmanned Aerial Vehicle)-based RGB images to estimate the value of biomass yield by different genotypes of the forage grass species Panicum maximum Jacq. Experiments were conducted in the Brazilian Cerrado with 110 genotypes with three replications, totaling 330 plots. Two regression models based on Convolutional Neural Networks (CNNs) named AlexNet and ResNet18 were evaluated, and compared to VGGNet—adopted in previous work in the same thematic for other grass species. The predictions returned by the models reached a correlation of 0.88 and a mean absolute error of 12.98% using AlexNet considering pre-training and data augmentation. This proposal may contribute to forage biomass estimation in breeding populations and livestock areas, as well as to reduce the labor in the field.

Download Full-text

Deep Learning Network Intensification for Preventing Noisy-Labeled Samples for Remote Sensing Classification

Remote Sensing ◽

10.3390/rs13091689 ◽

2021 ◽

Vol 13 (9) ◽

pp. 1689

Author(s):

Chuang Lin ◽

Shanxin Guo ◽

Jinsong Chen ◽

Luyi Sun ◽

Xiaorou Zheng ◽

...

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Network Performance ◽

Aerial Image ◽

Label Noise ◽

Remote Sensing Classification ◽

Learning Network ◽

Training Samples ◽

Deep Learning Network ◽

The Impact

The deep-learning-network performance depends on the accuracy of the training samples. The training samples are commonly labeled by human visual investigation or inherited from historical land-cover or land-use maps, which usually contain label noise, depending on subjective knowledge and the time of the historical map. Helping the network to distinguish noisy labels during the training process is a prerequisite for applying the model for training across time and locations. This study proposes an antinoise framework, the Weight Loss Network (WLN), to achieve this goal. The WLN contains three main parts: (1) the segmentation subnetwork, which any state-of-the-art segmentation network can replace; (2) the attention subnetwork (λ); and (3) the class-balance coefficient (α). Four types of label noise (an insufficient label, redundant label, missing label and incorrect label) were simulated by dilate and erode processing to test the network’s antinoise ability. The segmentation task was set to extract buildings from the Inria Aerial Image Labeling Dataset, which includes Austin, Chicago, Kitsap County, Western Tyrol and Vienna. The network’s performance was evaluated by comparing it with the original U-Net model by adding noisy training samples with different noise rates and noise levels. The result shows that the proposed antinoise framework (WLN) can maintain high accuracy, while the accuracy of the U-Net model dropped. Specifically, after adding 50% of dilated-label samples at noise level 3, the U-Net model’s accuracy dropped by 12.7% for OA, 20.7% for the Mean Intersection over Union (MIOU) and 13.8% for Kappa scores. By contrast, the accuracy of the WLN dropped by 0.2% for OA, 0.3% for the MIOU and 0.8% for Kappa scores. For eroded-label samples at the same level, the accuracy of the U-Net model dropped by 8.4% for OA, 24.2% for the MIOU and 43.3% for Kappa scores, while the accuracy of the WLN dropped by 4.5% for OA, 4.7% for the MIOU and 0.5% for Kappa scores. This result shows that the antinoise framework proposed in this paper can help current segmentation models to avoid the impact of noisy training labels and has the potential to be trained by a larger remote sensing image set regardless of the inner label error.

Download Full-text