3D Façade Labeling over Complex Scenarios: A Case Study Using Convolutional Neural Network and Structure-From-Motion

Rodolfo Lotte; Norbert Haala; Mateusz Karpina; Luiz Aragão; Yosio Shimabukuro

doi:10.3390/rs10091435

Remote Sensing ◽

10.3390/rs10091435 ◽

2018 ◽

Vol 10 (9) ◽

pp. 1435 ◽

Cited By ~ 5

Author(s):

Rodolfo Lotte ◽

Norbert Haala ◽

Mateusz Karpina ◽

Luiz Aragão ◽

Yosio Shimabukuro

Keyword(s):

Neural Network ◽

Image Classification ◽

Structure From Motion ◽

Intelligent Systems ◽

Data Extraction ◽

Remote Sensing Data ◽

Urban Environments ◽

Architectural Styles ◽

Optical Images ◽

Segmented Images

Urban environments exhibit extremely high spectral and spatial variability, with a wide range of shapes and sizes, and their study demands high-resolution images. Because these environments keep growing over time, monitoring applications tend to turn to autonomous intelligent systems, which together with remote sensing data could help address or even predict daily life situations. City mapping by autonomous operators has usually relied on aerial optical images because of their scale and resolution; however, new scientific questions have arisen, leading research into a new era of highly detailed data extraction. For many years, artificial neural models were commonly used to solve complex problems such as automatic image classification, owing much of their popularity to their ability to adapt to complex situations without human intervention. Their popularity nevertheless declined in the mid-2000s, mostly because of their complex and time-consuming methods and workflows. Newer neural network architectures have since revived interest in autonomous classifiers, especially for image classification. Convolutional Neural Networks (CNNs) have become a trend for pixel-wise image segmentation, showing flexibility in detecting and classifying any kind of object, even in situations where humans fail to perceive differences, such as in city scenarios. In this paper, we explore and experiment with state-of-the-art technologies to semantically label 3D urban models over complex scenarios.
To achieve these goals, we split the problem into two main processing lines. First, a supervised CNN segments ground-based façade images in the 2D domain into six feature classes: roof, window, wall, door, balcony and shop. Second, a Structure-from-Motion (SfM) and Multi-View-Stereo (MVS) workflow extracts the geometry of the façade, and the images segmented in the previous stage are then used to label the generated mesh by a “reverse” ray-tracing technique. This paper demonstrates that the proposed methodology is robust in complex scenarios: façade feature inference reached up to 93% accuracy over most of the datasets used. Although the method still shows deficiencies on unknown architectural styles and its 3D labeling needs improvement, we present a consistent and simple methodology to handle the problem.
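
The abstract does not spell out how the segmented images are transferred onto the mesh, so the following is only a minimal sketch of the general idea, under stated assumptions: each mesh face is reduced to its centroid, the centroid is projected into every segmented view with a simple pinhole model, and the face takes the majority label. The function names, the `views` dictionary layout, and the lack of any occlusion test are all hypothetical simplifications, not the authors' implementation.

```python
from collections import Counter

# The six façade classes named in the abstract; the index order is an assumption.
CLASSES = ["roof", "window", "wall", "door", "balcony", "shop"]

def project(point, focal, cx, cy):
    """Project a 3D point in camera coordinates to a pixel (pinhole model)."""
    x, y, z = point
    if z <= 0:  # point lies behind the camera, so it is not visible
        return None
    return (int(round(focal * x / z + cx)), int(round(focal * y / z + cy)))

def label_faces(faces, views):
    """Give each triangle the majority class seen at its centroid across views.

    faces: list of triangles, each a tuple of three (x, y, z) vertices
    views: list of dicts with 'to_camera' (world -> camera coordinates),
           'focal', 'cx', 'cy', and 'labels' (2D list of class indices)
    """
    result = []
    for tri in faces:
        centroid = tuple(sum(v[i] for v in tri) / 3.0 for i in range(3))
        votes = Counter()
        for view in views:
            pix = project(view["to_camera"](centroid),
                          view["focal"], view["cx"], view["cy"])
            if pix is None:
                continue
            u, v = pix
            labels = view["labels"]
            if 0 <= v < len(labels) and 0 <= u < len(labels[0]):
                votes[labels[v][u]] += 1
        result.append(CLASSES[votes.most_common(1)[0][0]] if votes else None)
    return result

# Toy scene: a 4x4 segmentation whose left half is wall (2), right half window (1).
view = {
    "to_camera": lambda p: p,  # camera frame coincides with the world frame
    "focal": 1.0, "cx": 2.0, "cy": 2.0,
    "labels": [[2, 2, 1, 1] for _ in range(4)],
}
faces = [
    ((-2, 0, 1), (-1, -1, 1), (0, 1, 1)),    # centroid projects to the left half
    ((0, 0, 1), (1, -1, 1), (2, 1, 1)),      # centroid projects to the right half
    ((0, 0, -1), (1, 0, -1), (0, 1, -1)),    # behind the camera, stays unlabeled
]
demo_labels = label_faces(faces, [view])
```

A real pipeline would also need a visibility test (for example, intersecting the ray from the camera with the mesh) so that faces hidden behind other geometry do not collect votes.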

FLOOD DETECTION IN TIME SERIES OF OPTICAL AND SAR IMAGES

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b2-2020-1343-2020 ◽

2020 ◽

Vol XLIII-B2-2020 ◽

pp. 1343-1346

Author(s):

C. Rambour ◽

N. Audebert ◽

E. Koeniguer ◽

B. Le Saux ◽

M. Crucianu ◽

...

Keyword(s):

Neural Network ◽

Time Series ◽

Remote Sensing Data ◽

Earth Observation ◽

Synthetic Aperture ◽

Flood Events ◽

Sar Images ◽

Optical Images ◽

Flood Detection ◽

Network Approaches

Abstract. Over the last decades, Earth Observation has brought a number of new perspectives, from geosciences to human activity monitoring. As more data became available, Artificial Intelligence (AI) techniques led to very successful results in understanding remote sensing data. Moreover, acquisition techniques such as Synthetic Aperture Radar (SAR) can be used for problems that cannot be tackled through optical images alone. This is the case for weather-related disasters such as floods or hurricanes, which are generally associated with extensive cloud cover. Yet machine learning on SAR data is still considered challenging due to the lack of available labeled data. To help the community move forward, we introduce a new dataset composed of co-registered optical and SAR image time series for the detection of flood events, together with new neural network approaches that leverage these two modalities.

Fused Random Pooling in Convolutional Neural Network for Herbal Plants Image Classification

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2019/87862019 ◽

2019 ◽

Vol 8 (6) ◽

pp. 3208-3214

Author(s):

Ian Val P. Delos Reyes ◽

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Herbal Plants

Improving Convolutional Neural Network (CNN) Architecture (miniVGGNet) with Batch Normalization and Learning Rate Decay Factor for Image Classification

International Journal of Integrated Engineering ◽

10.30880/ijie.2019.11.04.006 ◽

2019 ◽

Vol 11 (4) ◽

Author(s):

Asmida Ismail ◽

Siti Anom Ahmad ◽

Azura Che Soh ◽

Khair Hassan ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Learning Rate ◽

Decay Factor ◽

Batch Normalization ◽

Rate Decay

Image Classification Method Based on Supplement Convolutional Neural Network

Journal of Computer-Aided Design & Computer Graphics ◽

10.3724/sp.j.1089.2018.16322 ◽

2018 ◽

Vol 30 (3) ◽

pp. 385 ◽

Cited By ~ 3

Author(s):

Qiang Wang ◽

Xiaojie Li ◽

Jun Chen

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Classification Method

Accelerating Convolutional Neural Network-Based Hyperspectral Image Classification by Step Activation Quantization

IEEE Transactions on Geoscience and Remote Sensing ◽

10.1109/tgrs.2021.3058321 ◽

2021 ◽

pp. 1-12

Author(s):

Shaohui Mei ◽

Xiaofeng Chen ◽

Yifan Zhang ◽

Jun Li ◽

Antonio Plaza

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Hyperspectral Image ◽

Hyperspectral Image Classification

Deep Manifold Reconstruction Neural Network for Hyperspectral Image Classification

IEEE Geoscience and Remote Sensing Letters ◽

10.1109/lgrs.2020.3042999 ◽

2020 ◽

pp. 1-5 ◽

Cited By ~ 1

Author(s):

Zhengying Li ◽

Hong Huang ◽

Zhen Zhang

Keyword(s):

Neural Network ◽

Image Classification ◽

Hyperspectral Image ◽

Hyperspectral Image Classification ◽

Manifold Reconstruction

Performance Comparison for Different Neural Network Architectures for chest X-Ray Image Classification

2021 7th International Conference on Engineering, Applied Sciences and Technology (ICEAST) ◽

10.1109/iceast52143.2021.9426289 ◽

2021 ◽

Author(s):

Pinyada Rajadanuraks ◽

Sarapom Suranuntchai ◽

Suejit Pechprasam ◽

Treesukon Treebupachatsakul

Keyword(s):

Neural Network ◽

Image Classification ◽

Performance Comparison ◽

Network Architectures ◽

X Ray ◽

Chest X Ray ◽

Neural Network Architectures

An Xception Based Convolutional Neural Network for Scene Image Classification with Transfer Learning

2020 2nd International Conference on Information Technology and Computer Application (ITCA) ◽

10.1109/itca52113.2020.00063 ◽

2020 ◽

Author(s):

Xizhi Wu ◽

Rongzhe Liu ◽

Hanqing Yang ◽

Zizhao Chen

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Transfer Learning ◽

Scene Image

An Automated Model using Deep Convolutional Neural Network for Retinal Image Classification to Detect Diabetic Retinopathy

Proceedings of the International Conference on Computing Advancements ◽

10.1145/3377049.3377067 ◽

2020 ◽

Author(s):

Md Sazzad Hossen ◽

Alim Ahmed Reza ◽

Mahbub C. Mishu

Keyword(s):

Neural Network ◽

Diabetic Retinopathy ◽

Convolutional Neural Network ◽

Image Classification ◽

Retinal Image ◽

Deep Convolutional Neural Network

Binary Precision Neural Network Manycore Accelerator

ACM Journal on Emerging Technologies in Computing Systems ◽

10.1145/3423136 ◽

2021 ◽

Vol 17 (2) ◽

pp. 1-27

Author(s):

Morteza Hosseini ◽

Tinoosh Mohsenin

Keyword(s):

Neural Network ◽

Low Power ◽

Image Classification ◽

Case Studies ◽

Average Power ◽

Total Power ◽

Fabrication Technology ◽

Population Count ◽

Cluster Architecture ◽

Domain Specific

This article presents a low-power, programmable, domain-specific manycore accelerator, the Binarized neural Network Manycore Accelerator (BiNMAC), which adopts and efficiently executes binary precision weight/activation neural network models. Such networks have compact models in which weights are constrained to only 1 bit, so several weights can be packed into one memory entry, minimizing the memory footprint. Packing weights also facilitates single-instruction, multiple-data execution with simple circuitry, which maximizes performance and efficiency. The proposed BiNMAC has lightweight cores that support domain-specific instructions and a router-based memory access architecture that helps with efficient implementation of layers in binary precision weight/activation neural networks of proper size. With only 3.73% and 1.98% area and average power overhead, respectively, novel instructions such as Combined Population-Count-XNOR, Patch-Select, and Bit-based Accumulation are added to the instruction set architecture of the BiNMAC; each replaces a frequently used function that would otherwise have taken 54, 4, and 3 clock cycles, respectively, with a single clock cycle. Additionally, customized logic is added to every core to transpose 16×16-bit blocks of memory on a bit-level basis, which expedites reshaping intermediate data so that it is well aligned for bitwise operations. A 64-cluster architecture of the BiNMAC is fully placed and routed in 65-nm TSMC CMOS technology, where a single cluster occupies an area of 0.53 mm² with an average power of 232 mW at 1-GHz clock frequency and 1.1 V. The 64-cluster architecture takes 36.5 mm² of area and, if fully exploited, consumes a total power of 16.4 W and can perform 1,360 Giga Operations Per Second (GOPS) while providing full programmability.
To demonstrate its scalability, four binarized case studies, including ResNet-20 and LeNet-5 for high-performance image classification as well as a ConvNet and a multilayer perceptron for low-power physiological applications, were implemented on BiNMAC. The implementation results indicate that the population-count instruction alone can improve performance by approximately 5×. When the other new instructions are added to a RISC machine with an existing population-count instruction, performance increases by 58% on average. To compare the BiNMAC with commercial off-the-shelf platforms, the case studies with their double-precision floating-point models are also implemented on the NVIDIA Jetson TX2 SoC (CPU+GPU). The results indicate that, within a margin of ∼2.1%–9.5% accuracy loss, BiNMAC on average outperforms the TX2 GPU by approximately 1.9× (or 7.5× with fabrication technology scaled) in energy consumption for image classification applications. In low-power settings and within a margin of ∼3.7%–5.5% accuracy loss compared to the ARM Cortex-A57 CPU implementation, BiNMAC is ∼9.7×–17.2× (or 38.8×–68.8× with fabrication technology scaled) more energy efficient for physiological applications while meeting the application deadline.
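
The arithmetic behind a combined population-count/XNOR operation can be illustrated in software. The sketch below is not BiNMAC's instruction set or data layout; it only shows the standard identity used by binarized networks, assuming weights and activations take values in {+1, -1} and are packed one bit per value (+1 as 1, -1 as 0). The helper names `pack_bits` and `xnor_popcount_dot` are hypothetical.

```python
def pack_bits(values):
    """Pack a list of +1/-1 values into an int, one bit each (+1 -> 1, -1 -> 0)."""
    word = 0
    for i, v in enumerate(values):
        if v == 1:
            word |= 1 << i
    return word

def xnor_popcount_dot(a_word, w_word, n):
    """Dot product of two n-element +1/-1 vectors stored as packed bits."""
    mask = (1 << n) - 1
    agree = ~(a_word ^ w_word) & mask  # XNOR: bit is 1 where the signs match
    matches = bin(agree).count("1")    # population count of matching positions
    return 2 * matches - n             # matches minus mismatches

activations = [1, -1, 1, 1, -1, -1, 1, -1]
weights = [1, 1, -1, 1, -1, 1, 1, -1]
packed_result = xnor_popcount_dot(pack_bits(activations), pack_bits(weights), 8)
reference = sum(a * w for a, w in zip(activations, weights))
```

Here five of the eight positions agree, so both `packed_result` and `reference` evaluate to 2. The packed form replaces eight multiply-accumulate steps with one XNOR and one population count, which is the kind of saving a dedicated instruction targets.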
