A Comparative Study of VoxelNet and PointNet for 3D Object Detection in Car by Using KITTI Benchmark

Author(s):  
Harish S Gujjar

In today's research landscape, 2D object recognition is a well-established topic, while 3D object recognition is increasingly in demand. 3D object recognition has gained importance in areas such as vehicle navigation, robotic vision, HoME, virtual reality, etc. This work examines two important methods for 3D object recognition, VoxelNet and PointNet. PointNet recognizes objects well when used for segmentation of small-scale point clouds, whereas VoxelNet operates directly on the raw point cloud and learns patterns from it. These conclusions are drawn from KITTI car detection, which evaluates detection in the bird's-eye view. Within this KITTI setting we compare two sensing modalities, LiDAR and RGB-D. We conclude that PointNet performs best in small scenarios, while VoxelNet performs best in large scenarios.
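The key structural difference between the two methods is VoxelNet's first step: partitioning the raw point cloud into a 3D voxel grid before feature learning. A minimal sketch of that partition step is shown below; the voxel size and per-voxel point cap are illustrative values, not the paper's exact configuration.

```python
import numpy as np

def voxelize(points, voxel_size=(0.2, 0.2, 0.4), max_points_per_voxel=35):
    """Group raw LiDAR points into voxels, as in VoxelNet's partition step.

    points: (N, 3) array of x, y, z coordinates. Voxel size and the point
    cap are illustrative, not the published configuration.
    """
    # Integer voxel index for each point.
    indices = np.floor(points / np.asarray(voxel_size)).astype(np.int64)
    voxels = {}
    for idx, pt in zip(map(tuple, indices), points):
        bucket = voxels.setdefault(idx, [])
        # VoxelNet caps points per voxel (by random sampling); here we
        # simply truncate for clarity.
        if len(bucket) < max_points_per_voxel:
            bucket.append(pt)
    return voxels

# Example: four points, the first two fall into the same voxel.
pts = np.array([[0.10, 0.10, 0.1],
                [0.15, 0.05, 0.2],
                [1.00, 1.00, 1.0],
                [-0.30, 0.00, 0.0]])
grid = voxelize(pts)
print(len(grid))  # → 3 non-empty voxels
```

Each non-empty voxel's points would then be fed to a per-voxel feature encoder; empty voxels are simply absent from the dictionary, which mirrors the sparsity VoxelNet exploits.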

2020
Vol 13 (1)
pp. 66
Author(s):
Yifei Tian
Long Chen
Wei Song
Yunsick Sung
Sangchul Woo

3D (three-dimensional) object recognition is a hot research topic that benefits environment perception, disease diagnosis, and the mobile robot industry. Point clouds collected by range sensors are a popular data structure for representing a 3D object model. This paper proposes a 3D object recognition method named Dynamic Graph Convolutional Broad Network (DGCB-Net) to perform feature extraction and 3D object recognition on point clouds. DGCB-Net adopts edge convolutional layers constructed from weight-shared multi-layer perceptrons (MLPs) to automatically extract local features from the point-cloud graph structure. Features obtained from all edge convolutional layers are concatenated to form a feature aggregation. Rather than stacking many layers in depth, DGCB-Net employs a broad architecture that extends the point-cloud feature aggregation flatly: a flat combining architecture with multiple feature layers and enhancement layers, whose outputs are concatenated to further enrich the point cloud's feature information. All features contribute to the recognition result, so DGCB-Net shows better recognition performance than other 3D object recognition algorithms on ModelNet10/40 and on our scanned point-cloud dataset.
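The core operation the abstract describes, an edge convolutional layer over a point-cloud graph, can be sketched as follows. For each point, features of its k nearest neighbours are combined with the point itself, passed through a weight-shared MLP, and max-pooled. This is only an illustrative NumPy sketch: the single random linear layer stands in for a learned MLP, and the dimensions are arbitrary, not DGCB-Net's actual configuration.

```python
import numpy as np

def edge_conv(points, k=3, out_dim=8, rng=None):
    """Sketch of one edge convolutional layer on a point-cloud graph.

    For each point x_i, edge features (x_i, x_j - x_i) are built from its
    k nearest neighbours x_j, passed through a weight-shared MLP (here a
    single random linear layer + ReLU as a stand-in for learned weights),
    and max-aggregated over the neighbours.
    """
    rng = np.random.default_rng(0) if rng is None else rng
    n, d = points.shape
    w = rng.standard_normal((2 * d, out_dim))  # shared MLP weights
    # Pairwise distances; exclude self before picking neighbours.
    dists = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)
    np.fill_diagonal(dists, np.inf)
    nbrs = np.argsort(dists, axis=1)[:, :k]
    out = np.empty((n, out_dim))
    for i in range(n):
        edges = np.concatenate(
            [np.repeat(points[i:i + 1], k, axis=0),   # x_i, repeated
             points[nbrs[i]] - points[i]],            # x_j - x_i
            axis=1)                                   # (k, 2d) edge features
        h = np.maximum(edges @ w, 0.0)                # shared MLP + ReLU
        out[i] = h.max(axis=0)                        # max over neighbours
    return out

feats = edge_conv(np.random.default_rng(1).standard_normal((10, 3)))
print(feats.shape)  # → (10, 8)
```

In the broad architecture, the outputs of several such layers would be concatenated flatly (together with enhancement-layer features) rather than stacked ever deeper.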


2019
Vol 75 (8)
pp. 4430-4442
Author(s):
Yifei Tian
Wei Song
Su Sun
Simon Fong
Shuanghui Zou

2021
Author(s):
Ilyas Ashkir
Ben Roullier
Frank McQuade
Ashiq Anjum

2013
Vol 37 (5)
pp. 496-508
Author(s):
Rafael Beserra Gomes
Bruno Marques Ferreira da Silva
Lourena Karin de Medeiros Rocha
Rafael Vidal Aroca
Luiz Carlos Pacheco Rodrigues Velho
...

Electronics
2021
Vol 10 (10)
pp. 1205
Author(s):
Zhiyu Wang
Li Wang
Bin Dai

Object detection in 3D point clouds remains a challenging task in autonomous driving. Because of inherent occlusion and density changes in the point cloud, the data distribution of the same object can change dramatically; in particular, incomplete data that is sparse or occluded cannot represent the complete characteristics of the object. In this paper, we propose a novel strong–weak feature alignment algorithm between complete and incomplete objects for 3D object detection, which explores the correlations within the data. It is an end-to-end adaptive network that requires no additional data and can be easily applied to other object detection networks. Through a complete-object feature extractor, we obtain a robust feature representation of the object, which serves as a guiding feature that helps the incomplete-object feature generator produce effective features. The strong–weak feature alignment algorithm reduces the gap between different states of the same object and enhances the ability to represent incomplete objects. The proposed adaptation framework is validated on the KITTI object benchmark and achieves about a 6% improvement in detection average precision at the 3D moderate difficulty level compared to the base model. The results show that our adaptation method improves the detection performance on incomplete 3D objects.
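The central idea, pulling the features generated for an incomplete object toward the features of its complete counterpart, can be sketched as a simple alignment objective. This is only a toy mean-squared-error stand-in for the paper's learned strong–weak alignment; the feature shapes are arbitrary.

```python
import numpy as np

def alignment_loss(f_complete, f_incomplete):
    """Toy feature-alignment objective between complete and incomplete objects.

    The complete-object feature acts as the guide; minimising this loss
    pushes the incomplete-object feature generator to mimic the complete
    representation. An MSE stand-in, not the paper's exact objective.
    """
    return float(np.mean((f_complete - f_incomplete) ** 2))

# Batch of 4 objects with 16-dim features (illustrative shapes).
strong = np.ones((4, 16))   # features from the complete-object extractor
weak = np.zeros((4, 16))    # features from the incomplete-object generator
print(alignment_loss(strong, weak))  # → 1.0 before any training
```

During training this term would be added to the detector's usual classification and box-regression losses, so that alignment and detection are optimised end to end.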

