Using Unsupervised Deep Learning Technique for Monocular Visual Odometry

Unsupervised Deep Learning-Based RGB-D Visual Odometry

Applied Sciences ◽

10.3390/app10165426 ◽

2020 ◽

Vol 10 (16) ◽

pp. 5426 ◽

Cited By ~ 1

Author(s):

Qiang Liu ◽

Haidong Zhang ◽

Yiming Xu ◽

Li Wang

Keyword(s):

Deep Learning ◽

Feature Matching ◽

Ground Truth ◽

Visual Odometry ◽

Depth Images ◽

Network Training ◽

Stream Structure ◽

Unsupervised Deep Learning ◽

Rgb Images ◽

Learning Frameworks

Recently, deep learning frameworks have been deployed in visual odometry systems and achieved comparable results to traditional feature matching based systems. However, most deep learning-based frameworks inevitably need labeled data as ground truth for training. On the other hand, monocular odometry systems are incapable of restoring absolute scale. External or prior information has to be introduced for scale recovery. To solve these problems, we present a novel deep learning-based RGB-D visual odometry system. Our two main contributions are: (i) during network training and pose estimation, the depth images are fed into the network to form a dual-stream structure with the RGB images, and a dual-stream deep neural network is proposed. (ii) the system adopts an unsupervised end-to-end training method, thus the labor-intensive data labeling task is not required. We have tested our system on the KITTI dataset, and results show that the proposed RGB-D Visual Odometry (VO) system has obvious advantages over other state-of-the-art systems in terms of both translation and rotation errors.

Download Full-text

UnDeepVO: Monocular Visual Odometry Through Unsupervised Deep Learning

2018 IEEE International Conference on Robotics and Automation (ICRA) ◽

10.1109/icra.2018.8461251 ◽

2018 ◽

Cited By ~ 91

Author(s):

Ruihao Li ◽

Sen Wang ◽

Zhiqiang Long ◽

Dongbing Gu

Keyword(s):

Deep Learning ◽

Visual Odometry ◽

Unsupervised Deep Learning

Download Full-text

An unsupervised deep learning technique for susceptibility artifact correction in reversed phase-encoding EPI images

Magnetic Resonance Imaging ◽

10.1016/j.mri.2020.04.004 ◽

2020 ◽

Vol 71 ◽

pp. 1-10

Author(s):

Soan T.M. Duong ◽

Son L. Phung ◽

Abdesselam Bouzerdoum ◽

Mark M. Schira

Keyword(s):

Deep Learning ◽

Reversed Phase ◽

Phase Encoding ◽

Artifact Correction ◽

Susceptibility Artifact ◽

Learning Technique ◽

Unsupervised Deep Learning

Download Full-text

Monocular Visual Odometry Using Unsupervised Deep Learning

2019 Chinese Automation Congress (CAC) ◽

10.1109/cac48633.2019.8996257 ◽

2019 ◽

Author(s):

Fanning Liu ◽

Zhenghua Liu ◽

Qian Wu

Keyword(s):

Deep Learning ◽

Visual Odometry ◽

Unsupervised Deep Learning

Download Full-text

Stereo Visual Odometry Pose Correction through Unsupervised Deep Learning

Sensors ◽

10.3390/s21144735 ◽

2021 ◽

Vol 21 (14) ◽

pp. 4735

Author(s):

Sumin Zhang ◽

Shouyi Lu ◽

Rui He ◽

Zhipeng Bao

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Depth Map ◽

Ground Truth ◽

Visual Odometry ◽

Vital Role ◽

Positioning Accuracy ◽

Multiview Geometry ◽

Localization And Mapping ◽

Unsupervised Deep Learning

Visual simultaneous localization and mapping (VSLAM) plays a vital role in the field of positioning and navigation. At the heart of VSLAM is visual odometry (VO), which uses continuous images to estimate the camera’s ego-motion. However, due to many assumptions of the classical VO system, robots can hardly operate in challenging environments. To solve this challenge, we combine the multiview geometry constraints of the classical stereo VO system with the robustness of deep learning to present an unsupervised pose correction network for the classical stereo VO system. The pose correction network regresses a pose correction that results in positioning error due to violation of modeling assumptions to make the classical stereo VO positioning more accurate. The pose correction network does not rely on the dataset with ground truth poses for training. The pose correction network also simultaneously generates a depth map and an explainability mask. Extensive experiments on the KITTI dataset show the pose correction network can significantly improve the positioning accuracy of the classical stereo VO system. Notably, the corrected classical stereo VO system’s average absolute trajectory error, average translational relative pose error, and average translational root-mean-square drift on a length of 100–800 m in the KITTI dataset is 13.77 cm, 0.038 m, and 1.08%, respectively. Therefore, the improved stereo VO system has almost reached the state of the art.

Download Full-text

Deep Learning Technique for Real-time Traffic Light Detection by Automated Vehicles

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i7.387392 ◽

2018 ◽

Vol 6 (7) ◽

pp. 387-392

Author(s):

Priyanka S.N. ◽

Shashidhara H.S.

Keyword(s):

Deep Learning ◽

Real Time ◽

Automated Vehicles ◽

Traffic Light ◽

Light Detection ◽

Real Time Traffic ◽

Learning Technique

Download Full-text

Cryptocurrency Trading-Pair Forecasting, Using Machine Learning and Deep Learning Technique

SSRN Electronic Journal ◽

10.2139/ssrn.3610340 ◽

2020 ◽

Author(s):

Ernest Osifo ◽

Ritabrata Bhattacharyya

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Learning Technique

Download Full-text

Deep Learning Technique Based Visually Impaired People Using YOLO V3 Framework Mechanism

2021 3rd International Conference on Signal Processing and Communication (ICPSC) ◽

10.1109/icspc51351.2021.9451710 ◽

2021 ◽

Author(s):

A. Balachandar ◽

E. Santhosh ◽

A. Suriyakrishnan ◽

N. Vigensh ◽

S. Usharani ◽

...

Keyword(s):

Deep Learning ◽

Visually Impaired ◽

Visually Impaired People ◽

Impaired People ◽

Learning Technique

Download Full-text

Unsupervised Deep Learning For Accelerated High Quality Echocardiography

2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) ◽

10.1109/isbi48211.2021.9433770 ◽

2021 ◽

Author(s):

Shujaat Khan ◽

Jaeyoung Huh ◽

Jong Chul Ye

Keyword(s):

Deep Learning ◽

High Quality ◽

Unsupervised Deep Learning

Download Full-text

ExypnoSteganos - A smarter approach to steganography

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189879 ◽

2021 ◽

pp. 1-12

Author(s):

Gaurav Sarraf ◽

Anirudh Ramesh Srivatsa ◽

MS Swetha

Keyword(s):

Neural Network ◽

Deep Learning ◽

Data Compression ◽

Building Block ◽

Communication Techniques ◽

Systems Security ◽

Learning Technique ◽

Critical Problems ◽

Sporadic Cases ◽

Compressed Data

With the ever-rising threat to security, multiple industries are always in search of safer communication techniques both in rest and transit. Multiple security institutions agree that any systems security can be modeled around three major concepts: Confidentiality, Availability, and Integrity. We try to reduce the holes in these concepts by developing a Deep Learning based Steganography technique. In our study, we have seen, data compression has to be at the heart of any sound steganography system. In this paper, we have shown that it is possible to compress and encode data efficiently to solve critical problems of steganography. The deep learning technique, which comprises an auto-encoder with Convolutional Neural Network as its building block, not only compresses the secret file but also learns how to hide the compressed data in the cover file efficiently. The proposed techniques can encode secret files of the same size as of cover, or in some sporadic cases, even larger files can be encoded. We have also shown that the same model architecture can theoretically be applied to any file type. Finally, we show that our proposed technique surreptitiously evades all popular steganalysis techniques.

Download Full-text