Deep Learning on Construction Sites: A Case Study of Sparse Data Learning Techniques for Rebar Segmentation

Suzanna Cuypers; Maarten Bassier; Maarten Vergauwen

doi:10.3390/s21165428

Deep Learning on Construction Sites: A Case Study of Sparse Data Learning Techniques for Rebar Segmentation

Sensors ◽

10.3390/s21165428 ◽

2021 ◽

Vol 21 (16) ◽

pp. 5428

Author(s):

Suzanna Cuypers ◽

Maarten Bassier ◽

Maarten Vergauwen

Keyword(s):

Deep Learning ◽

Image Interpretation ◽

Semantic Segmentation ◽

Training Model ◽

Training Data ◽

Construction Site ◽

Major Drawback ◽

Automate Monitoring ◽

Site Monitoring

With recent advancements in deep learning models for image interpretation, it has finally become possible to automate construction site monitoring processes that rely on remote sensing. However, the major drawback of these models is their dependency on large datasets of training images labeled at pixel level, which have to be produced manually by skilled personnel. To alleviate the need for training data, this study evaluates weakly- and semi-supervised semantic segmentation models for construction site imagery to efficiently automate monitoring tasks. As a case study, we compare fully-, weakly- and semi-supervised methods for the detection of rebar covers, which are useful for quality control. In the experiments, recent models, i.e., IRNet, DeepLabv3+ and the cross-consistency training model, are compared for their ability to segment rebar covers from construction site imagery with minimal manual input. The results show that weakly- and semi-supervised models can indeed approach the performance of fully-supervised models, with the majority of the target objects being properly found. Through this study, construction site stakeholders are provided with detailed information on how to leverage deep learning for efficient construction site monitoring and weigh preprocessing, training and testing efforts against each other in order to decide between fully-, weakly- and semi-supervised training.
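
Comparing fully-, weakly- and semi-supervised segmentation models requires a per-image overlap score. The abstract does not say which metric the authors used; the sketch below assumes intersection over union (IoU), a common choice for semantic segmentation. The toy masks and the names `truth`, `pred_full` and `pred_weak` are hypothetical and only illustrate the calculation.

```python
def iou(pred, truth):
    """Intersection over union of two binary masks (nested lists of 0/1)."""
    inter = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                inter += 1  # pixel labeled as target in both masks
            if p or t:
                union += 1  # pixel labeled as target in at least one mask
    # Two empty masks agree perfectly, so treat that case as IoU = 1.
    return inter / union if union else 1.0


# Hypothetical 3x3 masks: 1 = rebar cover, 0 = background.
truth = [[0, 1, 1],
         [0, 1, 1],
         [0, 0, 0]]
pred_full = [[0, 1, 1],
             [0, 1, 1],
             [0, 0, 0]]
pred_weak = [[0, 1, 1],
             [0, 1, 0],
             [0, 1, 0]]

print(iou(pred_full, truth))  # perfect overlap
print(iou(pred_weak, truth))  # 3 shared pixels out of 5 in the union
```

In this toy case the second prediction finds most of the target region but misplaces one pixel, which lowers the score without making the object undetected.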

3DLEB-Net: Label-Efficient Deep Learning-Based Semantic Segmentation of Building Point Clouds at LoD3 Level

Applied Sciences ◽

10.3390/app11198996 ◽

2021 ◽

Vol 11 (19) ◽

pp. 8996

Author(s):

Yuwei Cao ◽

Marco Scaioni

Keyword(s):

Deep Learning ◽

Semantic Segmentation ◽

Point Clouds ◽

Training Data ◽

Second Step ◽

Point Cloud Data ◽

Dynamic Graph ◽

Cloud Data ◽

Supervised Methods ◽

Global And Local

In current research, fully supervised Deep Learning (DL) techniques are employed to train a segmentation network to be applied to point clouds of buildings. However, training such networks requires large amounts of fine-labeled buildings’ point-cloud data, presenting a major challenge in practice because they are difficult to obtain. Consequently, the application of fully supervised DL for semantic segmentation of buildings’ point clouds at LoD3 level is severely limited. In order to reduce the number of required annotated labels, we proposed a novel label-efficient DL network that obtains per-point semantic labels of LoD3 buildings’ point clouds with limited supervision, named 3DLEB-Net. In general, it consists of two steps. The first step (Autoencoder, AE) is composed of a Dynamic Graph Convolutional Neural Network (DGCNN) encoder and a folding-based decoder. It is designed to extract discriminative global and local features from input point clouds by faithfully reconstructing them without any label. The second step is the semantic segmentation network. By supplying a small amount of task-specific supervision, a segmentation network is proposed for semantically segmenting the encoded features acquired from the pre-trained AE. Experimentally, we evaluated our approach based on the Architectural Cultural Heritage (ArCH) dataset. Compared to the fully supervised DL methods, we found that our model achieved state-of-the-art results on the unseen scenes, with only 10% of labeled training data from fully supervised methods as input. Moreover, we conducted a series of ablation studies to show the effectiveness of the design choices of our model.
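
The encoder named in the abstract is a Dynamic Graph CNN, whose basic operation builds a k-nearest-neighbour graph over the points and forms an edge feature from each point and the offset to each neighbour. The following is a minimal sketch of that graph construction only, not the authors' network: it uses plain Python, builds the graph once on raw coordinates (a DGCNN recomputes it in feature space at each layer), and omits the learned layers and the decoder. The function names and sample points are hypothetical.

```python
import math


def knn_indices(points, k):
    """For each point, return the indices of its k nearest other points."""
    result = []
    for i, p in enumerate(points):
        # Sort all other points by Euclidean distance to p.
        dists = sorted(
            (math.dist(p, q), j) for j, q in enumerate(points) if j != i
        )
        result.append([j for _, j in dists[:k]])
    return result


def edge_features(points, k):
    """Per point, a list of (x_i, x_j - x_i) pairs over its k neighbours."""
    neighbours = knn_indices(points, k)
    feats = []
    for i, nbrs in enumerate(neighbours):
        xi = points[i]
        feats.append([
            (tuple(xi), tuple(b - a for a, b in zip(xi, points[j])))
            for j in nbrs
        ])
    return feats


# Hypothetical 3D points standing in for a building point cloud.
points = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (5.0, 5.0, 5.0)]

print(knn_indices(points, 2))
print(edge_features(points, 2)[0])
```

Pairing the point itself with the neighbour offset lets later layers combine global position with local shape, which matches the abstract's goal of extracting both global and local features.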

A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning

Database ◽

10.1093/database/baz116 ◽

2019 ◽

Vol 2019 ◽

Cited By ~ 2

Author(s):

Tao Chen ◽

Mingfen Wu ◽

Hexi Li

Keyword(s):

Deep Learning ◽

Large Scale ◽

Relation Extraction ◽

Training Model ◽

Biomedical Literature ◽

Training Data ◽

Fine Tuning ◽

Learning Approaches ◽

Additional Time ◽

Clinical Records

The automatic extraction of meaningful relations from biomedical literature or clinical records is crucial in various biomedical applications. Most of the current deep learning approaches for medical relation extraction require large-scale training data to prevent overfitting of the training model. We propose using a pre-trained model and a fine-tuning technique to improve these approaches without additional time-consuming human labeling. Firstly, we show the architecture of Bidirectional Encoder Representations from Transformers (BERT), an approach for pre-training a model on large-scale unstructured text. We then combine BERT with a one-dimensional convolutional neural network (1d-CNN) to fine-tune the pre-trained model for relation extraction. Extensive experiments on three datasets, namely the BioCreative V chemical disease relation corpus, traditional Chinese medicine literature corpus and i2b2 2012 temporal relation challenge corpus, show that the proposed approach achieves state-of-the-art results (giving a relative improvement of 22.2, 7.77, and 38.5% in F1 score, respectively, compared with a traditional 1d-CNN classifier). The source code is available at https://github.com/chentao1999/MedicalRelationExtraction.
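
The reported gains are relative improvements in F1 score over a baseline classifier. As a worked illustration, the sketch below computes F1 from true positives, false positives and false negatives, then the relative improvement, assuming the usual definition (improved minus baseline, divided by baseline). The counts are hypothetical and are not taken from the paper.

```python
def f1_score(tp, fp, fn):
    """F1 = harmonic mean of precision and recall, from raw counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def relative_improvement(baseline, improved):
    """Percentage gain of `improved` over `baseline` (assumed definition)."""
    return (improved - baseline) / baseline * 100.0


# Hypothetical counts for a baseline and an improved relation extractor.
baseline_f1 = f1_score(tp=60, fp=40, fn=40)
improved_f1 = f1_score(tp=75, fp=25, fn=25)

print(round(baseline_f1, 3))
print(round(improved_f1, 3))
print(round(relative_improvement(baseline_f1, improved_f1), 1))
```

A relative figure depends on the baseline: the same absolute gain in F1 yields a larger percentage when the baseline score is low, so the three reported percentages are not directly comparable across datasets.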

A Case Study of the Augmentation and Evaluation of Training Data for Deep Learning

Journal of Data and Information Quality ◽

10.1145/3317573 ◽

2019 ◽

Vol 11 (4) ◽

pp. 1-22 ◽

Cited By ~ 1

Author(s):

Junhua Ding ◽

Xinchuan Li ◽

Xiaojun Kang ◽

Venkat N. Gudivada

Keyword(s):

Deep Learning ◽

Training Data ◽

Evaluation Of Training

LABEL-EFFICIENT DEEP LEARNING-BASED SEMANTIC SEGMENTATION OF BUILDING POINT CLOUDS AT LOD3 LEVEL

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b2-2021-449-2021 ◽

2021 ◽

Vol XLIII-B2-2021 ◽

pp. 449-456

Author(s):

Y. Cao ◽

M. Scaioni

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Semantic Segmentation ◽

Point Clouds ◽

Training Data ◽

Second Step ◽

Dynamic Graph ◽

Input Point ◽

Supervised Methods ◽

Global And Local

In recent research, fully supervised Deep Learning (DL) techniques and large amounts of pointwise labels are employed to train a segmentation network to be applied to buildings’ point clouds. However, fine-labelled buildings’ point clouds are hard to find and manually annotating pointwise labels is time-consuming and expensive. Consequently, the application of fully supervised DL for semantic segmentation of buildings’ point clouds at LoD3 level is severely limited. To address this issue, we propose a novel label-efficient DL network that obtains per-point semantic labels of LoD3 buildings’ point clouds with limited supervision. In general, it consists of two steps. The first step (Autoencoder – AE) is composed of a Dynamic Graph Convolutional Neural Network-based encoder and a folding-based decoder, designed to extract discriminative global and local features from input point clouds by reconstructing them without any label. The second step is semantic segmentation. By supplying a small amount of task-specific supervision, a segmentation network is proposed for semantically segmenting the encoded features acquired from the pre-trained AE. Experimentally, we evaluate our approach based on the ArCH dataset. Compared to the fully supervised DL methods, we find that our model achieved state-of-the-art results on the unseen scenes, with only 10% of labelled training data from fully supervised methods as input.

Text Separation From Document Images

Machine Learning and Deep Learning in Real-Time Applications - Advances in Computer and Electrical Engineering ◽

10.4018/978-1-7998-3095-5.ch013 ◽

2020 ◽

pp. 283-313

Author(s):

Priti P. Rege ◽

Shaheera Akhter

Keyword(s):

Deep Learning ◽

Character Recognition ◽

Optical Character Recognition ◽

Semantic Segmentation ◽

Document Image ◽

Training Data ◽

Document Images ◽

Learning Techniques ◽

Extraction Processes ◽

Segmentation Image

Text separation in document image analysis is an important preprocessing step before executing an optical character recognition (OCR) task. It is necessary to improve the accuracy of an OCR system. Traditionally, for separating text from a document, different feature extraction processes have been used that require handcrafting of the features. However, deep learning-based methods are excellent feature extractors that learn features from the training data automatically. Deep learning gives state-of-the-art results on various computer vision, image classification, segmentation, image captioning, object detection, and recognition tasks. This chapter compares various traditional as well as deep-learning techniques and uses a semantic segmentation method for separating text from Devanagari document images using U-Net and ResU-Net models. These models are further fine-tuned for transfer learning to get more precise results. The final results show that deep learning methods give more accurate results compared with conventional methods of image processing for Devanagari text extraction.

Deep Learning Case Study on Imbalanced Training Data for Automatic Bird Identification

Deep Learning: Algorithms and Applications - Studies in Computational Intelligence ◽

10.1007/978-3-030-31760-7_8 ◽

2019 ◽

pp. 231-262

Author(s):

Juha Niemi ◽

Juha T. Tanttu

Keyword(s):

Deep Learning ◽

Training Data ◽

Imbalanced Training Data

Semantic Segmentation of Building Point Clouds Using Deep Learning: A Method for Creating Training Data Using BIM to Point Cloud Label Transfer

Computing in Civil Engineering 2019 ◽

10.1061/9780784482421.052 ◽

2019 ◽

Cited By ~ 2

Author(s):

Thomas Czerniawski ◽

Fernanda Leite

Keyword(s):

Deep Learning ◽

Point Cloud ◽

Semantic Segmentation ◽

Point Clouds ◽

Training Data ◽

Label Transfer

Integrated application of semantic segmentation-assisted deep learning to quantitative multi-phased microstructural analysis in composite materials: Case study of cathode composite materials of solid oxide fuel cells

Journal of Power Sources ◽

10.1016/j.jpowsour.2020.228458 ◽

2020 ◽

Vol 471 ◽

pp. 228458

Author(s):

Heesu Hwang ◽

Sung Min Choi ◽

Jiwon Oh ◽

Seung-Muk Bae ◽

Jong-Ho Lee ◽

...

Keyword(s):

Deep Learning ◽

Fuel Cells ◽

Composite Materials ◽

Solid Oxide Fuel Cells ◽

Microstructural Analysis ◽

Semantic Segmentation ◽

Solid Oxide ◽

Oxide Fuel

Ubiquitous Sensor Network for Construction Site Monitoring

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.919-921.388 ◽

2014 ◽

Vol 919-921 ◽

pp. 388-391 ◽

Cited By ~ 1

Author(s):

Jae Min Shin ◽

Sang Yong Kim ◽

Gwang Hee Kim ◽

Min Gu Jung ◽

Dae Woong Shin

Keyword(s):

Health And Safety ◽

Construction Site ◽

Practical Application ◽

Real Situation ◽

Construction Monitoring ◽

Site Monitoring ◽

Effective Operation ◽

Automated Monitoring System ◽

Communication Methods

Construction monitoring increasingly requires a rational method for managing health and safety and for effective maintenance control in the face of uncertainty and associated risks. Timely field monitoring can close the gap between prediction and the real situation by analyzing the validity of the construction. This study proposes an automated monitoring system with three kinds of communication methods to achieve effective operation of the system. A case study using mobile phones illustrates its practical application.

Semantic Segmentation of Cabbage in the South Korea Highlands with Images by Unmanned Aerial Vehicles

Applied Sciences ◽

10.3390/app11104493 ◽

2021 ◽

Vol 11 (10) ◽

pp. 4493

Author(s):

Yongwon Jo ◽

Soobin Lee ◽

Youngjae Lee ◽

Hyungu Kahng ◽

Seonghun Park ◽

...

Keyword(s):

Deep Learning ◽

South Korea ◽

Unmanned Aerial Vehicles ◽

Rapid Development ◽

Significant Proportion ◽

Semantic Segmentation ◽

Training Data ◽

Field Surveys ◽

Aerial Vehicles ◽

Segmentation Models

Identifying agricultural fields that grow cabbage in the highlands of South Korea is critical for accurate crop yield estimation. Grown only for a limited time during the summer, highland cabbage accounts for a significant proportion of South Korea’s annual cabbage production. Thus, it has a profound effect on the formation of cabbage prices. Traditionally, labor-intensive and time-consuming field surveys are carried out manually to derive agricultural field maps of the highlands. Recently, high-resolution overhead images of the highlands have become readily available with the rapid development of unmanned aerial vehicles (UAVs) and remote sensing technology. In addition, deep learning-based semantic segmentation models have advanced quickly thanks to recent improvements in algorithms and computational resources. In this study, we propose a semantic segmentation framework based on state-of-the-art deep learning techniques to automate the process of identifying cabbage cultivation fields. We operated UAVs and collected 2010 multispectral images under different spatiotemporal conditions to measure how well semantic segmentation models generalize. Next, we manually labeled these images at the pixel level to obtain ground truth labels for training. Our results demonstrate that our framework performs well in detecting cabbage fields not only in areas included in the training data but also in unseen areas not included in the training data. Moreover, we analyzed the effects of infrared wavelengths on the performance of identifying cabbage fields. Based on the results of our framework, we expect agricultural officials to save time and manpower when gathering information about highland cabbage fields by replacing field surveys.
