Orientation-Encoding CNN for Point Cloud Classification and Segmentation

Hongbin Lin; Wu Zheng; Xiuping Peng

doi:10.3390/make3030031

Orientation-Encoding CNN for Point Cloud Classification and Segmentation

Machine Learning and Knowledge Extraction ◽

10.3390/make3030031 ◽

2021 ◽

Vol 3 (3) ◽

pp. 601-614

Author(s):

Hongbin Lin ◽

Wu Zheng ◽

Xiuping Peng

Keyword(s):

Deep Learning ◽

Point Cloud ◽

Feature Learning ◽

Point Clouds ◽

Point Sets ◽

Learning Network ◽

Rule Structure ◽

Visual Tasks ◽

Deep Learning Network ◽

Point Cloud Classification

With the introduction of effective and general deep learning network frameworks, deep learning based methods have achieved remarkable success in various visual tasks. However, there are still tough challenges in applying them to convolutional neural networks due to the lack of a potential rule structure of point clouds. Therefore, by taking the original point clouds as the input data, this paper proposes an orientation-encoding (OE) convolutional module and designs a convolutional neural network for effectively extracting local geometric features of point sets. By searching for the same number of points in 8 directions and arranging them in order in 8 directions, the OE convolution is then carried out according to the number of points in the direction, which realizes the effective feature learning of the local structure of the point sets. Further experiments on diverse datasets show that the proposed method has competitive performance on classification and segmentation tasks of point sets.

Download Full-text

Deep learning network for point cloud classification based on k-dimensional tree neighbor query

JOURNAL OF SHENZHEN UNIVERSITY SCIENCE AND ENGINEERING ◽

10.3724/sp.j.1249.2020.01079 ◽

2020 ◽

Vol 37 (1) ◽

pp. 79-83 ◽

Cited By ~ 1

Author(s):

Jie MA ◽

Xujiao WANG ◽

Pengfei MA ◽

Lichuang YANG ◽

Nannan WANG

Keyword(s):

Deep Learning ◽

Point Cloud ◽

Learning Network ◽

Cloud Classification ◽

Deep Learning Network ◽

Point Cloud Classification

Download Full-text

Classification of Point Clouds for Indoor Components Using Few Labeled Samples

Remote Sensing ◽

10.3390/rs12142181 ◽

2020 ◽

Vol 12 (14) ◽

pp. 2181

Author(s):

Hangbin Wu ◽

Huimin Yang ◽

Shengyu Huang ◽

Doudou Zeng ◽

Chun Liu ◽

...

Keyword(s):

Deep Learning ◽

Point Cloud ◽

Point Clouds ◽

Neighborhood Search ◽

Learning Methods ◽

Semantic Classification ◽

Cloud Classification ◽

Mixed Features ◽

Indoor Scenarios ◽

Point Cloud Classification

The existing deep learning methods for point cloud classification are trained using abundant labeled samples and used to test only a few samples. However, classification tasks are diverse, and not all tasks have enough labeled samples for training. In this paper, a novel point cloud classification method for indoor components using few labeled samples is proposed to solve the problem of the requirement for abundant labeled samples for training with deep learning classification methods. This method is composed of four parts: mixing samples, feature extraction, dimensionality reduction, and semantic classification. First, the few labeled point clouds are mixed with unlabeled point clouds. Next, the mixed high-dimensional features are extracted using a deep learning framework. Subsequently, a nonlinear manifold learning method is used to embed the mixed features into a low-dimensional space. Finally, the few labeled point clouds in each cluster are identified, and semantic labels are provided for unlabeled point clouds in the same cluster by a neighborhood search strategy. The validity and versatility of the proposed method were validated by different experiments and compared with three state-of-the-art deep learning methods. Our method uses fewer than 30 labeled point clouds to achieve an accuracy that is 1.89–19.67% greater than existing methods. More importantly, the experimental results suggest that this method is not only suitable for single-attribute indoor scenarios but also for comprehensive complex indoor scenarios.

Download Full-text

A Novel Point Cloud Encoding Method Based on Local Information for 3D Classification and Segmentation

Sensors ◽

10.3390/s20092501 ◽

2020 ◽

Vol 20 (9) ◽

pp. 2501 ◽

Cited By ~ 2

Author(s):

Yanan Song ◽

Liang Gao ◽

Xinyu Li ◽

Weiming Shen

Keyword(s):

Deep Learning ◽

Point Cloud ◽

Semantic Segmentation ◽

Local Information ◽

Local Region ◽

Feature Representation ◽

Learning Network ◽

Feature Representations ◽

Encoding Method ◽

Deep Learning Network

Deep learning is robust to the perturbation of a point cloud, which is an important data form in the Internet of Things. However, it cannot effectively capture the local information of the point cloud and recognize the fine-grained features of an object. Different levels of features in the deep learning network are integrated to obtain local information, but this strategy increases network complexity. This paper proposes an effective point cloud encoding method that facilitates the deep learning network to utilize the local information. An axis-aligned cube is used to search for a local region that represents the local information. All of the points in the local region are available to construct the feature representation of each point. These feature representations are then input to a deep learning network. Two well-known datasets, ModelNet40 shape classification benchmark and Stanford 3D Indoor Semantics Dataset, are used to test the performance of the proposed method. Compared with other methods with complicated structures, the proposed method with only a simple deep learning network, can achieve a higher accuracy in 3D object classification and semantic segmentation.

Download Full-text

Accuracy Assessment of Deep Learning Based Classification of LiDAR and UAV Points Clouds for DTM Creation and Flood Risk Mapping

Geosciences ◽

10.3390/geosciences9070323 ◽

2019 ◽

Vol 9 (7) ◽

pp. 323 ◽

Cited By ~ 2

Author(s):

Gordana Jakovljevic ◽

Miro Govedarica ◽

Flor Alvarez-Taboada ◽

Vladimir Pajic

Keyword(s):

Deep Learning ◽

Flood Risk ◽

Point Cloud ◽

Accuracy Assessment ◽

Point Clouds ◽

Water Levels ◽

Cloud Classification ◽

Ground Point ◽

Flood Risk Mapping ◽

Point Cloud Classification

Digital elevation model (DEM) has been frequently used for the reduction and management of flood risk. Various classification methods have been developed to extract DEM from point clouds. However, the accuracy and computational efficiency need to be improved. The objectives of this study were as follows: (1) to determine the suitability of a new method to produce DEM from unmanned aerial vehicle (UAV) and light detection and ranging (LiDAR) data, using a raw point cloud classification and ground point filtering based on deep learning and neural networks (NN); (2) to test the convenience of rebalancing datasets for point cloud classification; (3) to evaluate the effect of the land cover class on the algorithm performance and the elevation accuracy; and (4) to assess the usability of the LiDAR and UAV structure from motion (SfM) DEM in flood risk mapping. In this paper, a new method of raw point cloud classification and ground point filtering based on deep learning using NN is proposed and tested on LiDAR and UAV data. The NN was trained on approximately 6 million points from which local and global geometric features and intensity data were extracted. Pixel-by-pixel accuracy assessment and visual inspection confirmed that filtering point clouds based on deep learning using NN is an appropriate technique for ground classification and producing DEM, as for the test and validation areas, both ground and non-ground classes achieved high recall (>0.70) and high precision values (>0.85), which showed that the two classes were well handled by the model. The type of method used for balancing the original dataset did not have a significant influence in the algorithm accuracy, and it was suggested not to use any of them unless the distribution of the generated and real data set will remain the same. Furthermore, the comparisons between true data and LiDAR and a UAV structure from motion (UAV SfM) point clouds were analyzed, as well as the derived DEM. The root mean square error (RMSE) and the mean average error (MAE) of the DEM were 0.25 m and 0.05 m, respectively, for LiDAR data, and 0.59 m and –0.28 m, respectively, for UAV data. For all land cover classes, the UAV DEM overestimated the elevation, whereas the LIDAR DEM underestimated it. The accuracy was not significantly different in the LiDAR DEM for the different vegetation classes, while for the UAV DEM, the RMSE increased with the height of the vegetation class. The comparison of the inundation areas derived from true LiDAR and UAV data for different water levels showed that in all cases, the largest differences were obtained for the lowest water level tested, while they performed best for very high water levels. Overall, the approach presented in this work produced DEM from LiDAR and UAV data with the required accuracy for flood mapping according to European Flood Directive standards. Although LiDAR is the recommended technology for point cloud acquisition, a suitable alternative is also UAV SfM in hilly areas.

Download Full-text

A Deep Learning Network for Point Cloud of Medicine Structure

2018 9th International Conference on Information Technology in Medicine and Education (ITME) ◽

10.1109/itme.2018.00157 ◽

2018 ◽

Cited By ~ 1

Author(s):

Jia Guo ◽

Xuanxia Yao ◽

Mengyu Shen ◽

Jiafei Wang ◽

Wanyou Liao

Keyword(s):

Deep Learning ◽

Point Cloud ◽

Learning Network ◽

Deep Learning Network

Download Full-text

SEMANTIC3D.NET: A NEW LARGE-SCALE POINT CLOUD CLASSIFICATION BENCHMARK

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-1-w1-91-2017 ◽

2017 ◽

Vol IV-1/W1 ◽

pp. 91-98 ◽

Cited By ~ 88

Author(s):

T. Hackel ◽

N. Savinov ◽

L. Ladicky ◽

J. D. Wegner ◽

K. Schindler ◽

...

Keyword(s):

Deep Learning ◽

Point Cloud ◽

Point Clouds ◽

Full Potential ◽

3D Point Cloud ◽

Learning Methods ◽

Data Set ◽

Cloud Classification ◽

Wide Range ◽

Point Cloud Classification

This paper presents a new 3D point cloud classification benchmark data set with over four billion manually labelled points, meant as input for data-hungry (deep) learning methods. We also discuss first submissions to the benchmark that use deep convolutional neural networks (CNNs) as a work horse, which already show remarkable performance improvements over state-of-the-art. CNNs have become the de-facto standard for many tasks in computer vision and machine learning like semantic segmentation or object detection in images, but have no yet led to a true breakthrough for 3D point cloud labelling tasks due to lack of training data. With the massive data set presented in this paper, we aim at closing this data gap to help unleash the full potential of deep learning methods for 3D labelling tasks. Our semantic3D.net data set consists of dense point clouds acquired with static terrestrial laser scanners. It contains 8 semantic classes and covers a wide range of urban outdoor scenes: churches, streets, railroad tracks, squares, villages, soccer fields and castles. We describe our labelling interface and show that our data set provides more dense and complete point clouds with much higher overall number of labelled points compared to those already available to the research community. We further provide baseline method descriptions and comparison between methods submitted to our online system. We hope semantic3D.net will pave the way for deep learning methods in 3D point cloud labelling to learn richer, more general 3D representations, and first submissions after only a few months indicate that this might indeed be the case.

Download Full-text

A 3D Shape Recognition Method Using Hybrid Deep Learning Network CNN–SVM

Electronics ◽

10.3390/electronics9040649 ◽

2020 ◽

Vol 9 (4) ◽

pp. 649

Author(s):

Long Hoang ◽

Suk-Hwan Lee ◽

Ki-Ryong Kwon

Keyword(s):

Feature Extraction ◽

Deep Learning ◽

3D Model ◽

Shape Recognition ◽

Point Clouds ◽

Support Vector ◽

Learning Network ◽

3D Shape ◽

3D Data ◽

Deep Learning Network

3D shape recognition becomes necessary due to the popularity of 3D data resources. This paper aims to introduce the new method, hybrid deep learning network convolution neural network–support vector machine (CNN–SVM), for 3D recognition. The vertices of the 3D mesh are interpolated to be converted into Point Clouds; those Point Clouds are rotated for 3D data augmentation. We obtain and store the 2D projection of this 3D augmentation data in a 32 × 32 × 12 matrix, the input data of CNN–SVM. An eight-layer CNN is used as the algorithm for feature extraction, then SVM is applied for classifying feature extraction. Two big datasets, ModelNet40 and ModelNet10, of the 3D model are used for model validation. Based on our numerical experimental results, CNN–SVM is more accurate and efficient than other methods. The proposed method is 13.48% more accurate than the PointNet method in ModelNet10 and 8.5% more precise than 3D ShapeNets for ModelNet40. The proposed method works with both the 3D model in the augmented/virtual reality system and in the 3D Point Clouds, an output of the LIDAR sensor in autonomously driving cars.

Download Full-text

Structure-Aware Convolution for 3D Point Cloud Classification and Segmentation

Remote Sensing ◽

10.3390/rs12040634 ◽

2020 ◽

Vol 12 (4) ◽

pp. 634 ◽

Cited By ~ 2

Author(s):

Lei Wang ◽

Yuxuan Liu ◽

Shenman Zhang ◽

Jixing Yan ◽

Pengjie Tao

Keyword(s):

Deep Learning ◽

Template Matching ◽

Point Cloud ◽

Structure Learning ◽

Feature Learning ◽

Point Clouds ◽

Learning Networks ◽

Geometric Structures ◽

Learning Capability ◽

3D Point Clouds

Semantic feature learning on 3D point clouds is quite challenging because of their irregular and unordered data structure. In this paper, we propose a novel structure-aware convolution (SAC) to generalize deep learning on regular grids to irregular 3D point clouds. Similar to the template-matching process of convolution on 2D images, the key of our SAC is to match the point clouds’ neighborhoods with a series of 3D kernels, where each kernel can be regarded as a “geometric template” formed by a set of learnable 3D points. Thus, the interested geometric structures of the input point clouds can be activated by the corresponding kernels. To verify the effectiveness of the proposed SAC, we embedded it into three recently developed point cloud deep learning networks (PointNet, PointNet++, and KCNet) as a lightweight module, and evaluated its performance on both classification and segmentation tasks. Experimental results show that, benefiting from the geometric structure learning capability of our SAC, all these back-end networks achieved better classification and segmentation performance (e.g., +2.77% mean accuracy for classification and +4.99% mean intersection over union (IoU) for segmentation) with few additional parameters. Furthermore, results also demonstrate that the proposed SAC is helpful in improving the robustness of networks with the constraints of geometric structures.

Download Full-text

A point-based deep learning network for semantic segmentation of MLS point clouds

ISPRS Journal of Photogrammetry and Remote Sensing ◽

10.1016/j.isprsjprs.2021.03.001 ◽

2021 ◽

Vol 175 ◽

pp. 199-214

Author(s):

Xu Han ◽

Zhen Dong ◽

Bisheng Yang

Keyword(s):

Deep Learning ◽

Semantic Segmentation ◽

Point Clouds ◽

Learning Network ◽

Deep Learning Network

Download Full-text

LiftingNet: A Novel Deep Learning Network With Layerwise Feature Learning From Noisy Mechanical Data for Fault Classification

IEEE Transactions on Industrial Electronics ◽

10.1109/tie.2017.2767540 ◽

2018 ◽

Vol 65 (6) ◽

pp. 4973-4982 ◽

Cited By ~ 75

Author(s):

Jun Pan ◽

Yanyang Zi ◽

Jinglong Chen ◽

Zitong Zhou ◽

Biao Wang

Keyword(s):

Deep Learning ◽

Feature Learning ◽

Fault Classification ◽

Learning Network ◽

Deep Learning Network

Download Full-text