RADIO: Parameterized Generative Radar Data Augmentation for Small Datasets

Marcel Sheeny; Andrew Wallace; Sen Wang

doi:10.3390/app10113861

RADIO: Parameterized Generative Radar Data Augmentation for Small Datasets

Applied Sciences ◽

10.3390/app10113861 ◽

2020 ◽

Vol 10 (11) ◽

pp. 3861

Author(s):

Marcel Sheeny ◽

Andrew Wallace ◽

Sen Wang

Keyword(s):

Deep Learning ◽

Physical Properties ◽

Data Augmentation ◽

Speckle Noise ◽

Radar Data ◽

Beam Divergence ◽

Data Generation ◽

Learning Models ◽

Radar Signals ◽

Small Dataset

We present a novel, parameterised radar data augmentation (RADIO) technique to generate realistic radar samples from small datasets for the development of radar-related deep learning models. RADIO leverages the physical properties of radar signals, such as attenuation, azimuthal beam divergence and speckle noise, for data generation and augmentation. Exemplary applications on radar-based classification and detection demonstrate that RADIO can generate meaningful radar samples that effectively boost the accuracy of classification and generalisability of deep models trained with a small dataset.

Download Full-text

Application of Deep Learning on Millimeter-Wave Radar Signals: A Review

Sensors ◽

10.3390/s21061951 ◽

2021 ◽

Vol 21 (6) ◽

pp. 1951

Author(s):

Fahad Jibrin Abdu ◽

Yixiong Zhang ◽

Maozhong Fu ◽

Yuhan Li ◽

Zhenmiao Deng

Keyword(s):

Deep Learning ◽

Radar Data ◽

Radar Signal ◽

Future Research ◽

Learning Technology ◽

Learning Approaches ◽

Learning Models ◽

Wave Radar ◽

Adverse Weather ◽

Radar Signals

The progress brought by the deep learning technology over the last decade has inspired many research domains, such as radar signal processing, speech and audio recognition, etc., to apply it to their respective problems. Most of the prominent deep learning models exploit data representations acquired with either Lidar or camera sensors, leaving automotive radars rarely used. This is despite the vital potential of radars in adverse weather conditions, as well as their ability to simultaneously measure an object’s range and radial velocity seamlessly. As radar signals have not been exploited very much so far, there is a lack of available benchmark data. However, recently, there has been a lot of interest in applying radar data as input to various deep learning algorithms, as more datasets are being provided. To this end, this paper presents a survey of various deep learning approaches processing radar signals to accomplish some significant tasks in an autonomous driving application, such as detection and classification. We have itemized the review based on different radar signal representations, as it is one of the critical aspects while using radar data with deep learning models. Furthermore, we give an extensive review of the recent deep learning-based multi-sensor fusion models exploiting radar signals and camera images for object detection tasks. We then provide a summary of the available datasets containing radar data. Finally, we discuss the gaps and important innovations in the reviewed papers and highlight some possible future research prospects.

Download Full-text

Levenshtein Augmentation Improves Performance of SMILES Based Deep-Learning Synthesis Prediction

10.26434/chemrxiv.12562121 ◽

2020 ◽

Author(s):

Dean Sumner ◽

Jiazhen He ◽

Amol Thakkar ◽

Ola Engkvist ◽

Esben Jannik Bjerrum

Keyword(s):

Neural Networks ◽

Pattern Recognition ◽

Deep Learning ◽

Recurrent Neural Networks ◽

Data Augmentation ◽

State Of The Art ◽

Sequence Similarity ◽

Learning Models ◽

Underlying Network

<p>SMILES randomization, a form of data augmentation, has previously been shown to increase the performance of deep learning models compared to non-augmented baselines. Here, we propose a novel data augmentation method we call “Levenshtein augmentation” which considers local SMILES sub-sequence similarity between reactants and their respective products when creating training pairs. The performance of Levenshtein augmentation was tested using two state of the art models - transformer and sequence-to-sequence based recurrent neural networks with attention. Levenshtein augmentation demonstrated an increase performance over non-augmented, and conventionally SMILES randomization augmented data when used for training of baseline models. Furthermore, Levenshtein augmentation seemingly results in what we define as <i>attentional gain </i>– an enhancement in the pattern recognition capabilities of the underlying network to molecular motifs.</p>

Download Full-text

Confronting Deep-Learning and Biodiversity Challenges for Automatic Video-Monitoring of Marine Ecosystems

Sensors ◽

10.3390/s22020497 ◽

2022 ◽

Vol 22 (2) ◽

pp. 497

Author(s):

Sébastien Villon ◽

Corina Iovan ◽

Morgan Mangeas ◽

Laurent Vigliola

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Low Cost ◽

Video Recording ◽

Species Abundance ◽

Automatic Monitoring ◽

Training Dataset ◽

Data Generation ◽

Automatic Assessment ◽

Open Set

With the availability of low-cost and efficient digital cameras, ecologists can now survey the world’s biodiversity through image sensors, especially in the previously rather inaccessible marine realm. However, the data rapidly accumulates, and ecologists face a data processing bottleneck. While computer vision has long been used as a tool to speed up image processing, it is only since the breakthrough of deep learning (DL) algorithms that the revolution in the automatic assessment of biodiversity by video recording can be considered. However, current applications of DL models to biodiversity monitoring do not consider some universal rules of biodiversity, especially rules on the distribution of species abundance, species rarity and ecosystem openness. Yet, these rules imply three issues for deep learning applications: the imbalance of long-tail datasets biases the training of DL models; scarce data greatly lessens the performances of DL models for classes with few data. Finally, the open-world issue implies that objects that are absent from the training dataset are incorrectly classified in the application dataset. Promising solutions to these issues are discussed, including data augmentation, data generation, cross-entropy modification, few-shot learning and open set recognition. At a time when biodiversity faces the immense challenges of climate change and the Anthropocene defaunation, stronger collaboration between computer scientists and ecologists is urgently needed to unlock the automatic monitoring of biodiversity.

Download Full-text

Data Augmentation for Improving Deep Learning Models in Building Inspections or Postdisaster Evaluation

Journal of Performance of Constructed Facilities ◽

10.1061/(asce)cf.1943-5509.0001594 ◽

2021 ◽

Vol 35 (4) ◽

Author(s):

Samuel Leach ◽

Yunhe Xue ◽

Rahul Sridhar ◽

Stephanie Paal ◽

Zhangyang Wang ◽

...

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Learning Models

Download Full-text

Deploying Machine and Deep Learning Models for Efficient Data-Augmented Detection of COVID-19 Infections

Viruses ◽

10.3390/v12070769 ◽

2020 ◽

Vol 12 (7) ◽

pp. 769 ◽

Cited By ~ 12

Author(s):

Ahmed Sedik ◽

Abdullah M Iliyasu ◽

Basma Abd El-Rahiem ◽

Mohammed E. Abdel Samea ◽

Asmaa Abdel-Raheem ◽

...

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Short Term Memory ◽

Machine Learning Techniques ◽

Detection Accuracy ◽

Testing Time ◽

Learning Models ◽

Accurate Identification ◽

Infected People ◽

Corona Virus

This generation faces existential threats because of the global assault of the novel Corona virus 2019 (i.e., COVID-19). With more than thirteen million infected and nearly 600000 fatalities in 188 countries/regions, COVID-19 is the worst calamity since the World War II. These misfortunes are traced to various reasons, including late detection of latent or asymptomatic carriers, migration, and inadequate isolation of infected people. This makes detection, containment, and mitigation global priorities to contain exposure via quarantine, lockdowns, work/stay at home, and social distancing that are focused on “flattening the curve”. While medical and healthcare givers are at the frontline in the battle against COVID-19, it is a crusade for all of humanity. Meanwhile, machine and deep learning models have been revolutionary across numerous domains and applications whose potency have been exploited to birth numerous state-of-the-art technologies utilised in disease detection, diagnoses, and treatment. Despite these potentials, machine and, particularly, deep learning models are data sensitive, because their effectiveness depends on availability and reliability of data. The unavailability of such data hinders efforts of engineers and computer scientists to fully contribute to the ongoing assault against COVID-19. Faced with a calamity on one side and absence of reliable data on the other, this study presents two data-augmentation models to enhance learnability of the Convolutional Neural Network (CNN) and the Convolutional Long Short-Term Memory (ConvLSTM)-based deep learning models (DADLMs) and, by doing so, boost the accuracy of COVID-19 detection. Experimental results reveal improvement in terms of accuracy of detection, logarithmic loss, and testing time relative to DLMs devoid of such data augmentation. Furthermore, average increases of 4% to 11% in COVID-19 detection accuracy are reported in favour of the proposed data-augmented deep learning models relative to the machine learning techniques. Therefore, the proposed algorithm is effective in performing a rapid and consistent Corona virus diagnosis that is primarily aimed at assisting clinicians in making accurate identification of the virus.

Download Full-text

Ani-GIFs: A Benchmark Dataset for Domain Generalization of Action Recognition from GIFs

10.22541/au.162464907.76209032/v1 ◽

2021 ◽

Author(s):

Shoumik Majumdar ◽

Shubhangi Jain ◽

Isidora Chara Tourni ◽

Arsenii Mustafin ◽

Diala Lteif ◽

...

Keyword(s):

Deep Learning ◽

Data Acquisition ◽

Action Recognition ◽

Data Augmentation ◽

Learning Models ◽

Visual Content ◽

Lack Of Information ◽

Temporal Features ◽

Synthetic Video ◽

Multiple Domains

Deep learning models perform remarkably well for the same task under the assumption that data is always coming from the same distribution. However, this is generally violated in practice, mainly due to the differences in the data acquisition techniques and the lack of information about the underlying source of new data. Domain Generalization targets the ability to generalize to test data of an unseen domain; while this problem is well-studied for images, such studies are significantly lacking in spatiotemporal visual content – videos and GIFs. This is due to (1) the challenging nature of misalignment of temporal features and the varying appearance/motion of actors and actions in different domains, and (2) spatiotemporal datasets being laborious to collect and annotate for multiple domains. We collect and present the first synthetic video dataset of Animated GIFs for domain generalization, Ani-GIFs, that is used to study domain gap of videos vs. GIFs, and animated vs. real GIFs, for the task of action recognition. We provide a training and testing setting for Ani-GIFs, and extend two domain generalization baseline approaches, based on data augmentation and explainability, to the spatiotemporal domain to catalyze research in this direction.

Download Full-text

Efficient Pneumonia Detection in Chest Xray Images Using Deep Transfer Learning

Diagnostics ◽

10.3390/diagnostics10060417 ◽

2020 ◽

Vol 10 (6) ◽

pp. 417 ◽

Cited By ~ 5

Author(s):

Mohammad Farukh Hashmi ◽

Satyarth Katiyar ◽

Avinash G Keskar ◽

Neeraj Dhanraj Bokde ◽

Zong Woo Geem

Keyword(s):

Deep Learning ◽

Transfer Learning ◽

Data Augmentation ◽

Medical Center ◽

Training Dataset ◽

Test Accuracy ◽

X Rays ◽

Learning Models ◽

Novel Approach ◽

Unseen Data

Pneumonia causes the death of around 700,000 children every year and affects 7% of the global population. Chest X-rays are primarily used for the diagnosis of this disease. However, even for a trained radiologist, it is a challenging task to examine chest X-rays. There is a need to improve the diagnosis accuracy. In this work, an efficient model for the detection of pneumonia trained on digital chest X-ray images is proposed, which could aid the radiologists in their decision making process. A novel approach based on a weighted classifier is introduced, which combines the weighted predictions from the state-of-the-art deep learning models such as ResNet18, Xception, InceptionV3, DenseNet121, and MobileNetV3 in an optimal way. This approach is a supervised learning approach in which the network predicts the result based on the quality of the dataset used. Transfer learning is used to fine-tune the deep learning models to obtain higher training and validation accuracy. Partial data augmentation techniques are employed to increase the training dataset in a balanced way. The proposed weighted classifier is able to outperform all the individual models. Finally, the model is evaluated, not only in terms of test accuracy, but also in the AUC score. The final proposed weighted classifier model is able to achieve a test accuracy of 98.43% and an AUC score of 99.76 on the unseen data from the Guangzhou Women and Children’s Medical Center pneumonia dataset. Hence, the proposed model can be used for a quick diagnosis of pneumonia and can aid the radiologists in the diagnosis process.

Download Full-text

A Language Model for Misogyny Detection in Latin American Spanish Driven by Multisource Feature Extraction and Transformers

Applied Sciences ◽

10.3390/app112110467 ◽

2021 ◽

Vol 11 (21) ◽

pp. 10467

Author(s):

Edwin Aldana-Bobadilla ◽

Alejandro Molina-Villegas ◽

Yuridia Montelongo-Padilla ◽

Ivan Lopez-Arevalo ◽

Oscar S. Sordia

Keyword(s):

Deep Learning ◽

Latin American ◽

Data Augmentation ◽

Physical Violence ◽

Language Model ◽

Learning Models ◽

Exact Figure ◽

Latin American Spanish ◽

American Spanish ◽

Statistical Systems

Creating effective mechanisms to detect misogyny online automatically represents significant scientific and technological challenges. The complexity of recognizing misogyny through computer models lies in the fact that it is a subtle type of violence, it is not always explicitly aggressive, and it can even hide behind seemingly flattering words, jokes, parodies, and other expressions. Currently, it is even difficult to have an exact figure for the rate of misogynistic comments online because, unlike other types of violence, such as physical violence, these events are not registered by any statistical systems. This research contributes to the development of models for the automatic detection of misogynistic texts in Latin American Spanish and contributes to the design of data augmentation methodologies since the amount of data required for deep learning models is considerable.

Download Full-text

Deep learning model for Pathogen Classification using Feature fusion and Data augmentation

Current Bioinformatics ◽

10.2174/1574893615999200707143535 ◽

2020 ◽

Vol 15 ◽

Author(s):

Fareed Ahmad ◽

Amjad Farooq ◽

Muhammad Usman Ghani Khan

Keyword(s):

Deep Learning ◽

Cross Validation ◽

Data Augmentation ◽

Feature Vector ◽

Feature Fusion ◽

High Capacity ◽

Automatic Classification ◽

Fine Tuning ◽

Correct Identification ◽

Learning Models

Background: Bacterial pathogens are deadly for animals and humans. The ease of their dissemination, coupled with their high capacity for ailment and death in infected individuals, makes them a threat to society. Objective: Due to high similarity among genera and species of pathogens, it is sometimes difficult for microbiologists to differentiate between them. Their automatic classification using deep-learning models can help in reliable, and accurate outcomes. Method: Deep-learning models, namely; AlexNet, GoogleNet, ResNet101, and InceptionV3 are used with numerous variations including training model from scratch, fine-tuning without pre-trained weights, fine-tuning along with freezing weights of initial layers, fine-tuning along with adjusting weights of all layers and augmenting the dataset by random translation and reflection. Moreover, as the dataset is small, fine-tuning and data augmentation strategies are applied to avoid overfitting and produce a generalized model. A merged feature vector is produced using two best-performing models and accuracy is calculated by xgboost algorithm on the feature vector by applying cross-validation. Results: Fine-tuned models where augmentation is applied produces the best results. Out of these, two-best-performing deep models i.e. (ResNet101, and InceptionV3) selected for feature fusion, produced a similar validation accuracy of 95.83 with a loss of 0.0213 and 0.1066, and a testing accuracy of 97.92 and 93.75, respectively. The proposed model used xgboost to attained a classification accuracy of 98.17% by using 35-folds cross-validation. Conclusion: The automatic classification using these models can help experts in the correct identification of pathogens. Consequently, they can help in controlling epidemics and thereby minimizing the socio-economic impact on the community.

Download Full-text

Automatic Pancreas Segmentation Using Coarse-Scaled 2D Model of Deep Learning: Usefulness of Data Augmentation and Deep U-Net

Applied Sciences ◽

10.3390/app10103360 ◽

2020 ◽

Vol 10 (10) ◽

pp. 3360

Author(s):

Mizuho Nishio ◽

Shunjiro Noguchi ◽

Koji Fujimoto

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Automatic Segmentation ◽

Dice Similarity Coefficient ◽

Learning Models ◽

Segmentation Methods ◽

Learning Architectures ◽

Pancreas Segmentation ◽

Fold Cross Validation ◽

Better Than

Combinations of data augmentation methods and deep learning architectures for automatic pancreas segmentation on CT images are proposed and evaluated. Images from a public CT dataset of pancreas segmentation were used to evaluate the models. Baseline U-net and deep U-net were chosen for the deep learning models of pancreas segmentation. Methods of data augmentation included conventional methods, mixup, and random image cropping and patching (RICAP). Ten combinations of the deep learning models and the data augmentation methods were evaluated. Four-fold cross validation was performed to train and evaluate these models with data augmentation methods. The dice similarity coefficient (DSC) was calculated between automatic segmentation results and manually annotated labels and these were visually assessed by two radiologists. The performance of the deep U-net was better than that of the baseline U-net with mean DSC of 0.703–0.789 and 0.686–0.748, respectively. In both baseline U-net and deep U-net, the methods with data augmentation performed better than methods with no data augmentation, and mixup and RICAP were more useful than the conventional method. The best mean DSC was obtained using a combination of deep U-net, mixup, and RICAP, and the two radiologists scored the results from this model as good or perfect in 76 and 74 of the 82 cases.

Download Full-text