Non-Uniform Discretization-based Ordinal Regression for Monocular Depth Estimation of an Indoor Drone

Xiangzhu Zhang; Lijia Zhang; Frank L. Lewis; Hailong Pei

doi:10.3390/electronics9111767

Non-Uniform Discretization-based Ordinal Regression for Monocular Depth Estimation of an Indoor Drone

Electronics ◽

10.3390/electronics9111767 ◽

2020 ◽

Vol 9 (11) ◽

pp. 1767

Author(s):

Xiangzhu Zhang ◽

Lijia Zhang ◽

Frank L. Lewis ◽

Hailong Pei

Keyword(s):

Deep Learning ◽

Binary Classification ◽

Depth Estimation ◽

Ordinal Regression ◽

Classification Model ◽

Security Requirements ◽

Data Set ◽

Decision Algorithm ◽

Decision Area ◽

Monocular Depth

At present, the main methods of solving the monocular depth estimation for indoor drones are the simultaneous localization and mapping (SLAM) algorithm and the deep learning algorithm. SLAM requires the construction of a depth map of the unknown environment, which is slow to calculate and generally requires expensive sensors, whereas current deep learning algorithms are mostly based on binary classification or regression. The output of the binary classification model gives the decision algorithm relatively rough control over the unmanned aerial vehicle. The regression model solves the problem of the binary classification, but it carries out the same processing for long and short distances, resulting in a decline in short-range prediction performance. In order to solve the above problems, according to the characteristics of the strong order correlation of the distance value, we propose a non-uniform spacing-increasing discretization-based ordinal regression algorithm (NSIDORA) to solve the monocular depth estimation for indoor drone tasks. According to the security requirements of this task, the distance label of the data set is discretized into three major areas—the dangerous area, decision area, and safety area—and the decision area is discretized based on spacing-increasing discretization. Considering the inconsistency of ordinal regression, a new distance decoder is produced. Experimental evaluation shows that the root-mean-square error (RMSE) of NSIDORA in the decision area is 33.5% lower than that of non-uniform discretization (NUD)-based ordinal regression methods. Although it is higher overall than that of the state-of-the-art two-stream regression algorithm, the RMSE of the NSIDORA in the top 10 categories of the decision area is 21.8% lower than that of the two-stream regression algorithm. The inference speed of NSIDORA is 3.4 times faster than that of two-stream ordinal regression. Furthermore, the effectiveness of the decoder has been proved through ablation experiments.

Download Full-text

DEEP LEARNING FOR MONOCULAR DEPTH ESTIMATION FROM UAV IMAGES

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-v-2-2020-451-2020 ◽

2020 ◽

Vol V-2-2020 ◽

pp. 451-458

Author(s):

L. Madhuanand ◽

F. Nex ◽

M. Y. Yang

Keyword(s):

Deep Learning ◽

Ground Level ◽

Depth Estimation ◽

Aerial Images ◽

Aerial Image ◽

Depth Information ◽

Single Image ◽

Monocular Depth ◽

Uav Images ◽

Image Depth

Abstract. Depth is an essential component for various scene understanding tasks and for reconstructing the 3D geometry of the scene. Estimating depth from stereo images requires multiple views of the same scene to be captured which is often not possible when exploring new environments with a UAV. To overcome this monocular depth estimation has been a topic of interest with the recent advancements in computer vision and deep learning techniques. This research has been widely focused on indoor scenes or outdoor scenes captured at ground level. Single image depth estimation from aerial images has been limited due to additional complexities arising from increased camera distance, wider area coverage with lots of occlusions. A new aerial image dataset is prepared specifically for this purpose combining Unmanned Aerial Vehicles (UAV) images covering different regions, features and point of views. The single image depth estimation is based on image reconstruction techniques which uses stereo images for learning to estimate depth from single images. Among the various available models for ground-level single image depth estimation, two models, 1) a Convolutional Neural Network (CNN) and 2) a Generative Adversarial model (GAN) are used to learn depth from aerial images from UAVs. These models generate pixel-wise disparity images which could be converted into depth information. The generated disparity maps from these models are evaluated for its internal quality using various error metrics. The results show higher disparity ranges with smoother images generated by CNN model and sharper images with lesser disparity range generated by GAN model. The produced disparity images are converted to depth information and compared with point clouds obtained using Pix4D. It is found that the CNN model performs better than GAN and produces depth similar to that of Pix4D. This comparison helps in streamlining the efforts to produce depth from a single aerial image.

Download Full-text

Deep Learning-Based Monocular Depth Estimation Methods—A State-of-the-Art Review

Sensors ◽

10.3390/s20082272 ◽

2020 ◽

Vol 20 (8) ◽

pp. 2272 ◽

Cited By ~ 5

Author(s):

Faisal Khan ◽

Saqib Salahuddin ◽

Hossein Javidnia

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Research Work ◽

Depth Estimation ◽

Autonomous Driving ◽

Estimation Methods ◽

Future Research ◽

Comprehensive Overview ◽

Ill Posed ◽

Monocular Depth

Monocular depth estimation from Red-Green-Blue (RGB) images is a well-studied ill-posed problem in computer vision which has been investigated intensively over the past decade using Deep Learning (DL) approaches. The recent approaches for monocular depth estimation mostly rely on Convolutional Neural Networks (CNN). Estimating depth from two-dimensional images plays an important role in various applications including scene reconstruction, 3D object-detection, robotics and autonomous driving. This survey provides a comprehensive overview of this research topic including the problem representation and a short description of traditional methods for depth estimation. Relevant datasets and 13 state-of-the-art deep learning-based approaches for monocular depth estimation are reviewed, evaluated and discussed. We conclude this paper with a perspective towards future research work requiring further investigation in monocular depth estimation challenges.

Download Full-text

SUW-Learn: Joint Supervised, Unsupervised, Weakly Supervised Deep Learning for Monocular Depth Estimation

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) ◽

10.1109/cvprw50498.2020.00383 ◽

2020 ◽

Author(s):

Haoyu Ren ◽

Aman Raj ◽

Mostafa El-Khamy ◽

Jungwon Lee

Keyword(s):

Deep Learning ◽

Depth Estimation ◽

Monocular Depth ◽

Weakly Supervised

Download Full-text

Deep Ordinal Regression Network for Monocular Depth Estimation

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition ◽

10.1109/cvpr.2018.00214 ◽

2018 ◽

Cited By ~ 225

Author(s):

Huan Fu ◽

Mingming Gong ◽

Chaohui Wang ◽

Kayhan Batmanghelich ◽

Dacheng Tao

Keyword(s):

Depth Estimation ◽

Ordinal Regression ◽

Monocular Depth

Download Full-text

Benchmark for Deep Learning based Visual Odometry and Monocular Depth Estimation

The Journal of Korea Robotics Society ◽

10.7746/jkros.2019.14.2.114 ◽

2019 ◽

Vol 14 (2) ◽

pp. 114-121

Author(s):

Hyukdoo Choi

Keyword(s):

Deep Learning ◽

Depth Estimation ◽

Visual Odometry ◽

Monocular Depth

Download Full-text

Negative emotion diffusion and intervention countermeasures of social networks based on deep learning

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-179979 ◽

2020 ◽

Vol 39 (4) ◽

pp. 4935-4945

Author(s):

Qiuyun Cheng ◽

Yun Ke ◽

Ahmed Abdelmouty

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Negative Emotion ◽

Sentiment Classification ◽

Classification Model ◽

Learning Models ◽

Data Set ◽

Position Information ◽

Processing Technologies ◽

And Training

Aiming at the limitation of using only word features in traditional deep learning sentiment classification, this paper combines topic features with deep learning models to build a topic-fused deep learning sentiment classification model. The model can fuse topic features to obtain high-quality high-level text features. Experiments show that in binary sentiment classification, the highest classification accuracy of the model can reach more than 90%, which is higher than that of commonly used deep learning models. This paper focuses on the combination of deep neural networks and emerging text processing technologies, and improves and perfects them from two aspects of model architecture and training methods, and designs an efficient deep network sentiment analysis model. A CNN (Convolutional Neural Network) model based on polymorphism is proposed. The model constructs the CNN input matrix by combining the word vector information of the text, the emotion information of the words, and the position information of the words, and adjusts the importance of different feature information in the training process by means of weight control. The multi-objective sample data set is used to verify the effectiveness of the proposed model in the sentiment analysis task of related objects from the classification effect and training performance.

Download Full-text

Hierarchical Binary Classification for Monocular Depth Estimation

2019 IEEE International Conference on Robotics and Biomimetics (ROBIO) ◽

10.1109/robio49542.2019.8961430 ◽

2019 ◽

Author(s):

Hualie Jiang ◽

Rui Huang

Keyword(s):

Binary Classification ◽

Depth Estimation ◽

Monocular Depth

Download Full-text

Explaining the Attributes of a Deep Learning Based Intrusion Detection System for Industrial Control Networks

Sensors ◽

10.3390/s20143817 ◽

2020 ◽

Vol 20 (14) ◽

pp. 3817 ◽

Cited By ~ 1

Author(s):

Zhidong Wang ◽

Yingxu Lai ◽

Zenghui Liu ◽

Jing Liu

Keyword(s):

Deep Learning ◽

Control System ◽

Intrusion Detection ◽

Security System ◽

Classification Model ◽

Industrial Control System ◽

Industrial Control ◽

Data Set ◽

Network Intrusion ◽

Calculation Process

Intrusion detection is only the initial part of the security system for an industrial control system. Because of the criticality of the industrial control system, professionals still make the most important security decisions. Therefore, a simple intrusion alarm has a very limited role in the security system, and intrusion detection models based on deep learning struggle to provide more information because of the lack of explanation. This limits the application of deep learning methods to industrial control network intrusion detection. We analyzed the deep neural network (DNN) model and the interpretable classification model from the perspective of information, and clarified the correlation between the calculation process of the DNN model and the classification process. By comparing the normal samples with the abnormal samples, the abnormalities that occur during the calculation of the DNN model compared to the normal samples could be found. Based on this, a layer-wise relevance propagation method was designed to map the abnormalities in the calculation process to the abnormalities of attributes. At the same time, considering that the data set may already contain some useful information, we designed filtering rules for a kind of data set that can be obtained at a low cost, so that the calculation result is presented in a more accurate manner, which should help professionals lock and address intrusion threats more quickly.

Download Full-text

Transfer learning to detect COVID-19 automatically from X-ray images, using convolutional neural networks

10.1101/2020.08.25.20182170 ◽

2020 ◽

Author(s):

Mundher Taresh ◽

Ningbo Zhu ◽

Talal Ahmed Ali Ali

Keyword(s):

Deep Learning ◽

Confusion Matrix ◽

High Accuracy ◽

Classification Model ◽

Statistical Parameters ◽

Data Set ◽

X Ray ◽

X Ray Imaging ◽

Chest X Ray ◽

Novel Coronavirus

AbstractNovel coronavirus pneumonia (COVID-19) is a contagious disease that has already caused thousands of deaths and infected millions of people worldwide. Thus, all technological gadgets that allow the fast detection of COVID-19 infection with high accuracy can offer help to healthcare professionals. This study is purposed to explore the effectiveness of artificial intelligence (AI) in the rapid and reliable detection of COVID-19 based on chest X-ray imaging. In this study, reliable pre-trained deep learning algorithms were applied to achieve the automatic detection of COVID-19-induced pneumonia from digital chest X-ray images.Moreover, the study aims to evaluate the performance of advanced neural architectures proposed for the classification of medical images over recent years. The data set used in the experiments involves 274 COVID-19 cases, 380 viral pneumonia, and 380 healthy cases, which was collected from the available X-ray images on public medical repositories. The confusion matrix provided a basis for testing the post-classification model. Furthermore, an open-source library PyCM* was used to support the statistical parameters. The study revealed the superiority of Model VGG16 over other models applied to conduct this research where the model performed best in terms of overall scores and based-class scores. According to the research results, deep learning with X-ray imaging is useful in the collection of critical biological markers associated with COVID-19 infection. The technique is conducive for the physicians to make a diagnosis of COVID-19 infection. Meanwhile, the high accuracy of this computer-aided diagnostic tool can significantly improve the speed and accuracy of COVID-19 diagnosis.

Download Full-text

CORNet: Context-based Ordinal Regression Network for Monocular Depth Estimation

IEEE Transactions on Circuits and Systems for Video Technology ◽

10.1109/tcsvt.2021.3128505 ◽

2021 ◽

pp. 1-1

Author(s):

Xuyang Meng ◽

Chunxiao Fan ◽

Yue Ming ◽

Hui Yu

Keyword(s):

Depth Estimation ◽

Ordinal Regression ◽

Monocular Depth

Download Full-text