Deep learning for camera data acquisition, control, and image estimation

David J. Brady; Lu Fang; Zhan Ma

doi:10.1364/aop.398263

A Citizen Science Unmanned Aerial System Data Acquisition Protocol and Deep Learning Techniques for the Automatic Detection and Mapping of Marine Litter Concentrations in the Coastal Zone

Drones ◽

10.3390/drones5010006 ◽

2021 ◽

Vol 5 (1) ◽

pp. 6

Author(s):

Apostolos Papakonstantinou ◽

Marios Batsaris ◽

Spyros Spondylidis ◽

Konstantinos Topouzelis

Keyword(s):

Deep Learning ◽

Data Acquisition ◽

Citizen Science ◽

Coastal Zone ◽

Automatic Detection ◽

Monitoring Methods ◽

Monitoring Method ◽

Marine Litter ◽

Density Maps ◽

Learning Techniques

Marine litter (ML) accumulation in the coastal zone has been recognized as a major problem in our time, as it can dramatically affect the environment, marine ecosystems, and coastal communities. Existing monitoring methods fail to respond to the spatiotemporal changes and dynamics of ML concentrations. Recent works showed that unmanned aerial systems (UAS), along with computer vision methods, provide a feasible alternative for ML monitoring. In this context, we proposed a citizen science UAS data acquisition and annotation protocol combined with deep learning techniques for the automatic detection and mapping of ML concentrations in the coastal zone. Five convolutional neural networks (CNNs) were trained to classify UAS image tiles into two classes: (a) litter and (b) no litter. Testing the CCNs’ generalization ability to an unseen dataset, we found that the VVG19 CNN returned an overall accuracy of 77.6% and an f-score of 77.42%. ML density maps were created using the automated classification results. They were compared with those produced by a manual screening classification proving our approach’s geographical transferability to new and unknown beaches. Although ML recognition is still a challenging task, this study provides evidence about the feasibility of using a citizen science UAS-based monitoring method in combination with deep learning techniques for the quantification of the ML load in the coastal zone using density maps.

Download Full-text

A COVID-19 pandemic AI-based system with deep learning forecasting and automatic statistical data acquisition: Development and Implementation Study (Preprint)

Journal of Medical Internet Research ◽

10.2196/27806 ◽

2021 ◽

Author(s):

Cheng-Sheng Yu ◽

Shy-Shin Chang ◽

Tzu-Hao Chang ◽

Jenny L Wu ◽

Yu-Jiun Lin ◽

...

Keyword(s):

Deep Learning ◽

Data Acquisition ◽

Statistical Data ◽

Implementation Study

Download Full-text

IoT Data Acquisition Node For Deep Learning Time Series Prediction

10.1109/ibdap52511.2021.9552096 ◽

2021 ◽

Author(s):

Li Xinyun ◽

Liu Huidan ◽

Yin Hang ◽

Cao Zilan ◽

Chen Bangdi ◽

...

Keyword(s):

Time Series ◽

Deep Learning ◽

Data Acquisition ◽

Time Series Prediction ◽

Learning Time

Download Full-text

Ani-GIFs: A Benchmark Dataset for Domain Generalization of Action Recognition from GIFs

10.22541/au.162464907.76209032/v1 ◽

2021 ◽

Author(s):

Shoumik Majumdar ◽

Shubhangi Jain ◽

Isidora Chara Tourni ◽

Arsenii Mustafin ◽

Diala Lteif ◽

...

Keyword(s):

Deep Learning ◽

Data Acquisition ◽

Action Recognition ◽

Data Augmentation ◽

Learning Models ◽

Visual Content ◽

Lack Of Information ◽

Temporal Features ◽

Synthetic Video ◽

Multiple Domains

Deep learning models perform remarkably well for the same task under the assumption that data is always coming from the same distribution. However, this is generally violated in practice, mainly due to the differences in the data acquisition techniques and the lack of information about the underlying source of new data. Domain Generalization targets the ability to generalize to test data of an unseen domain; while this problem is well-studied for images, such studies are significantly lacking in spatiotemporal visual content – videos and GIFs. This is due to (1) the challenging nature of misalignment of temporal features and the varying appearance/motion of actors and actions in different domains, and (2) spatiotemporal datasets being laborious to collect and annotate for multiple domains. We collect and present the first synthetic video dataset of Animated GIFs for domain generalization, Ani-GIFs, that is used to study domain gap of videos vs. GIFs, and animated vs. real GIFs, for the task of action recognition. We provide a training and testing setting for Ani-GIFs, and extend two domain generalization baseline approaches, based on data augmentation and explainability, to the spatiotemporal domain to catalyze research in this direction.

Download Full-text

A Data-Centric Approach to Design and Analysis of a Surface-Inspection System Based on Deep Learning in the Plastic Injection Molding Industry

Processes ◽

10.3390/pr9111895 ◽

2021 ◽

Vol 9 (11) ◽

pp. 1895

Author(s):

Donggyun Im ◽

Sangkyu Lee ◽

Homin Lee ◽

Byungguan Yoon ◽

Fayoung So ◽

...

Keyword(s):

Feature Extraction ◽

Deep Learning ◽

Data Acquisition ◽

Type I Error ◽

Vision System ◽

Structural Characteristics ◽

Inspection System ◽

Type I ◽

Data Annotation ◽

Background Removal

Manufacturers are eager to replace the human inspector with automatic inspection systems to improve the competitive advantage by means of quality. However, some manufacturers have failed to apply the traditional vision system because of constraints in data acquisition and feature extraction. In this paper, we propose an inspection system based on deep learning for a tampon applicator producer that uses the applicator’s structural characteristics for data acquisition and uses state-of-the-art models for object detection and instance segmentation, YOLOv4 and YOLACT for feature extraction, respectively. During the on-site trial test, we experienced some False-Positive (FP) cases and found a possible Type I error. We used a data-centric approach to solve the problem by using two different data pre-processing methods, the Background Removal (BR) and Contrast Limited Adaptive Histogram Equalization (CLAHE). We have experimented with analyzing the effect of the methods on the inspection with the self-created dataset. We found that CLAHE increased Recall by 0.1 at the image level, and both CLAHE and BR improved Precision by 0.04–0.06 at the bounding box level. These results support that the data-centric approach might improve the detection rate. However, the data pre-processing techniques deteriorated the metrics used to measure the overall performance, such as F1-score and Average Precision (AP), even though we empirically confirmed that the malfunctions improved. With the detailed analysis of the result, we have found some cases that revealed the ambiguity of the decisions caused by the inconsistency in data annotation. Our research alerts AI practitioners that validating the model based only on the metrics may lead to a wrong conclusion.

Download Full-text

Sensor Data Acquisition and Multimodal Sensor Fusion for Human Activity Recognition Using Deep Learning

Sensors ◽

10.3390/s19071716 ◽

2019 ◽

Vol 19 (7) ◽

pp. 1716 ◽

Cited By ~ 20

Author(s):

Seungeun Chung ◽

Jiyoun Lim ◽

Kyoung Ju Noh ◽

Gague Kim ◽

Hyuntae Jeong

Keyword(s):

Deep Learning ◽

Data Acquisition ◽

Sensor Fusion ◽

Activity Recognition ◽

Human Activity ◽

Short Term Memory ◽

Sampling Rate ◽

Human Activity Recognition ◽

Sensor Data ◽

Activity Data

In this paper, we perform a systematic study about the on-body sensor positioning and data acquisition details for Human Activity Recognition (HAR) systems. We build a testbed that consists of eight body-worn Inertial Measurement Units (IMU) sensors and an Android mobile device for activity data collection. We develop a Long Short-Term Memory (LSTM) network framework to support training of a deep learning model on human activity data, which is acquired in both real-world and controlled environments. From the experiment results, we identify that activity data with sampling rate as low as 10 Hz from four sensors at both sides of wrists, right ankle, and waist is sufficient in recognizing Activities of Daily Living (ADLs) including eating and driving activity. We adopt a two-level ensemble model to combine class-probabilities of multiple sensor modalities, and demonstrate that a classifier-level sensor fusion technique can improve the classification performance. By analyzing the accuracy of each sensor on different types of activity, we elaborate custom weights for multimodal sensor fusion that reflect the characteristic of individual activities.

Download Full-text

Intelligent Aperture Identification Combining Compressed Data Acquisition with Sparse Filtering-based Deep Learning Towards Natural Gas Pipeline Leak

Structural Health Monitoring 2017 ◽

10.12783/shm2017/14172 ◽

2017 ◽

Cited By ~ 1

Author(s):

JIEDI SUN ◽

YANLEI QIAO ◽

JIANGTAO WEN

Keyword(s):

Deep Learning ◽

Natural Gas ◽

Data Acquisition ◽

Gas Pipeline ◽

Natural Gas Pipeline ◽

Sparse Filtering ◽

Compressed Data

Download Full-text

Automatic large-scale data acquisition via crowdsourcing for crosswalk classification: A deep learning approach

Computers & Graphics ◽

10.1016/j.cag.2017.08.004 ◽

2017 ◽

Vol 68 ◽

pp. 32-42 ◽

Cited By ~ 18

Author(s):

Rodrigo F. Berriel ◽

Franco Schmidt Rossi ◽

Alberto F. de Souza ◽

Thiago Oliveira-Santos

Keyword(s):

Deep Learning ◽

Data Acquisition ◽

Large Scale ◽

Learning Approach ◽

Large Scale Data ◽

Scale Data

Download Full-text

Intelligent wind turbine blade icing detection using supervisory control and data acquisition data and ensemble deep learning

Energy Science & Engineering ◽

10.1002/ese3.449 ◽

2019 ◽

Vol 7 (6) ◽

pp. 2633-2645 ◽

Cited By ~ 4

Author(s):

Yao Liu ◽

Han Cheng ◽

Xianguang Kong ◽

Qibin Wang ◽

Huan Cui

Keyword(s):

Deep Learning ◽

Data Acquisition ◽

Wind Turbine ◽

Turbine Blade ◽

Supervisory Control ◽

Wind Turbine Blade

Download Full-text

High Throughput Data Acquisition and Deep Learning for Insect Ecoinformatics

Frontiers in Ecology and Evolution ◽

10.3389/fevo.2021.600931 ◽

2021 ◽

Vol 9 ◽

Author(s):

Alexander Gerovichev ◽

Achiad Sadeh ◽

Vlad Winter ◽

Avi Bar-Massada ◽

Tamar Keasar ◽

...

Keyword(s):

Deep Learning ◽

Data Acquisition ◽

High Throughput ◽

Well Being ◽

Ecosystem Functions ◽

High Tech ◽

Pest Species ◽

Agricultural Pests ◽

Sticky Traps ◽

Flying Insects

Ecology documents and interprets the abundance and distribution of organisms. Ecoinformatics addresses this challenge by analyzing databases of observational data. Ecoinformatics of insects has high scientific and applied importance, as insects are abundant, speciose, and involved in many ecosystem functions. They also crucially impact human well-being, and human activities dramatically affect insect demography and phenology. Hazards, such as pollinator declines, outbreaks of agricultural pests and the spread insect-borne diseases, raise an urgent need to develop ecoinformatics strategies for their study. Yet, insect databases are mostly focused on a small number of pest species, as data acquisition is labor-intensive and requires taxonomical expertise. Thus, despite decades of research, we have only a qualitative notion regarding fundamental questions of insect ecology, and only limited knowledge about the spatio-temporal distribution of insects. We describe a novel high throughput cost-effective approach for monitoring flying insects as an enabling step toward “big data” entomology. The proposed approach combines “high tech” deep learning with “low tech” sticky traps that sample flying insects in diverse locations. As a proof of concept we considered three recent insect invaders of Israel’s forest ecosystem: two hemipteran pests of eucalypts and a parasitoid wasp that attacks one of them. We developed software, based on deep learning, to identify the three species in images of sticky traps from Eucalyptus forests. These image processing tasks are quite difficult as the insects are small (<5 mm) and stick to the traps in random poses. The resulting deep learning model discriminated the three focal organisms from one another, as well as from other elements such as leaves and other insects, with high precision. We used the model to compare the abundances of these species among six sites, and validated the results by manually counting insects on the traps. Having demonstrated the power of the proposed approach, we started a more ambitious study that monitors these insects at larger spatial and temporal scales. We aim at building an ecoinformatics repository for trap images and generating data-driven models of the populations’ dynamics and morphological traits.

Download Full-text