Research and Application of Clustering Algorithm Based on Shared Nearest Neighbor

A Multi-Relational Hierarchical Clustering Algorithm Based on Shared Nearest Neighbor Similarity

2007 International Conference on Machine Learning and Cybernetics ◽

10.1109/icmlc.2007.4370836 ◽

2007 ◽

Cited By ~ 1

Author(s):

Jing-Feng Guo ◽

Yu-Yan Zhao ◽

Jing Li

Keyword(s):

Hierarchical Clustering ◽

Clustering Algorithm ◽

Nearest Neighbor ◽

Hierarchical Clustering Algorithm ◽

Shared Nearest Neighbor


MR-SNN: Design of parallel Shared Nearest Neighbor clustering algorithm using MapReduce

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA) ◽

10.1109/icbda.2017.8078831 ◽

2017 ◽

Cited By ~ 1

Author(s):

Sujing Wang ◽

Christoph F. Eick

Keyword(s):

Clustering Algorithm ◽

Nearest Neighbor ◽

Shared Nearest Neighbor


Unsupervised Building Instance Segmentation of Airborne LiDAR Point Clouds for Parallel Reconstruction Analysis

Remote Sensing ◽

10.3390/rs13061136 ◽

2021 ◽

Vol 13 (6) ◽

pp. 1136

Author(s):

Yongjun Zhang ◽

Wangshan Yang ◽

Xinyi Liu ◽

Yi Wan ◽

Xianzhang Zhu ◽

...

Keyword(s):

Clustering Algorithm ◽

Evaluation Method ◽

Nearest Neighbor ◽

Point Clouds ◽

Airborne Lidar ◽

Model Consistency ◽

Parallel Reconstruction ◽

Shared Nearest Neighbor ◽

Consistency Evaluation ◽

Instance Segmentation

Efficient building instance segmentation is necessary for many applications such as parallel reconstruction, management and analysis. However, most existing instance segmentation methods still suffer from low completeness, low correctness and low quality when segmenting building instances, which is especially obvious in complex building scenes. This paper proposes a novel unsupervised building instance segmentation (UBIS) method for airborne Light Detection and Ranging (LiDAR) point clouds for parallel reconstruction analysis, which combines a clustering algorithm with a novel model consistency evaluation method. The proposed method first divides building point clouds into building instances using the improved kd tree 2D shared nearest neighbor clustering algorithm (Ikd-2DSNN). Then, the geometric feature of each building instance is obtained using the model consistency evaluation method, which is used to determine whether the building instance is a single building instance or a multi-building instance. Finally, multi-building instances are divided again using the improved kd tree 3D shared nearest neighbor clustering algorithm (Ikd-3DSNN) to improve the accuracy of building instance segmentation. Our experimental results demonstrate that the proposed UBIS method performed well for various buildings in different scenes, such as high-rise buildings, podium buildings and a residential area with detached houses. A comparative analysis confirms that the proposed UBIS method performed better than state-of-the-art methods.


Fast Searching Density Peak Clustering Algorithm Based on Shared Nearest Neighbor and Adaptive Clustering Center

Symmetry ◽

10.3390/sym12122014 ◽

2020 ◽

Vol 12 (12) ◽

pp. 2014

Author(s):

Yi Lv ◽

Mandan Liu ◽

Yue Xiang

Keyword(s):

Prior Knowledge ◽

Clustering Algorithm ◽

Nearest Neighbor ◽

Local Density ◽

Density Peak ◽

Adaptive Clustering ◽

Clustering Center ◽

Density Peak Clustering ◽

Shared Nearest Neighbor ◽

Fast Searching

Clustering analysis is used to reveal the internal relationships among data without prior knowledge and to gather data with common attributes into groups. To solve the problem that existing algorithms always need prior knowledge, we propose a fast searching density peak clustering algorithm based on the shared nearest neighbor and adaptive clustering center (DPC-SNNACC). It can automatically ascertain the number of knee points in the decision graph according to the characteristics of different datasets, and further determine the number of clustering centers without human intervention. First, an improved calculation method of local density based on the symmetric distance matrix is proposed. Then, the position of the knee point is obtained by calculating the change in the difference between decision values. Finally, experimental and comparative evaluation on several datasets from diverse domains establishes the viability of the DPC-SNNACC algorithm.
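The knee-point step mentioned in this abstract can be illustrated with a small sketch. The abstract does not give the exact rule, so the following is only one plausible reading: sort the decision values in descending order, take the differences between consecutive values, and place the knee where that difference changes the most. The function name `estimate_center_count` and the sample values are hypothetical, not taken from the paper.

```python
import numpy as np

def estimate_center_count(decision_values):
    """Guess the number of cluster centers from decision values.

    Assumption (not from the paper): the knee sits where the drop between
    consecutive sorted decision values shrinks the most.
    """
    gamma = np.sort(np.asarray(decision_values, dtype=float))[::-1]
    # Drop between each pair of consecutive sorted values.
    first_diff = gamma[:-1] - gamma[1:]
    # Change in that drop: large when a big drop is followed by a small one.
    change = first_diff[:-1] - first_diff[1:]
    # Values before the largest change are treated as cluster centers.
    return int(np.argmax(change)) + 1

# Hypothetical decision values: three clearly larger than the rest.
sample = [10.0, 9.5, 9.0, 1.0, 0.9, 0.8, 0.7]
count = estimate_center_count(sample)
```

With these sample values the largest change follows the third value, so the sketch treats the first three points as candidate centers.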


A Hybrid Clustering Algorithm Based on Rough Set and Shared Nearest Neighbors

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.145.189 ◽

2011 ◽

Vol 145 ◽

pp. 189-193 ◽

Cited By ~ 3

Author(s):

Horng Lin Shieh

Keyword(s):

Rough Set ◽

Hybrid Method ◽

Data Clustering ◽

Clustering Algorithm ◽

Nearest Neighbor ◽

Nearest Neighbors ◽

Nearest Neighbor Algorithm ◽

Data Set ◽

Lower And Upper Approximations ◽

Shared Nearest Neighbor

In this paper, a hybrid method combining rough set and shared nearest neighbor algorithms is proposed for clustering data with non-globular shapes. The rough k-means algorithm is based on the distances between data and cluster centers. It partitions a data set with globular shapes well, but when the data have non-globular shapes, the results obtained by a rough k-means algorithm are not very satisfactory. To resolve this problem, a combined rough set and shared nearest neighbor algorithm is proposed. The proposed algorithm first adopts a shared nearest neighbor algorithm to evaluate the similarity among data, then uses the lower and upper approximations of a rough set algorithm to partition the data set into clusters.
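The shared nearest neighbor similarity used in the first step of this abstract can be sketched as follows. This is a minimal illustration that assumes the simplest common definition (the similarity of two points is the number of points their k-nearest-neighbor lists have in common) and Euclidean distance; the paper may use a different variant. The names `knn_indices` and `snn_similarity` are hypothetical.

```python
import numpy as np

def knn_indices(points, k):
    """Return, for each point, the indices of its k nearest other points."""
    diffs = points[:, None, :] - points[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=2))
    np.fill_diagonal(dists, np.inf)  # a point is not its own neighbor
    return np.argsort(dists, axis=1)[:, :k]

def snn_similarity(points, k):
    """Count the neighbors shared by the k-NN lists of every pair of points."""
    neighbors = knn_indices(points, k)
    n = len(points)
    # member[i, j] == 1 when j is one of the k nearest neighbors of i.
    member = np.zeros((n, n), dtype=int)
    member[np.repeat(np.arange(n), k), neighbors.ravel()] = 1
    # Entry (i, j) of the product counts the indices present in both lists.
    return member @ member.T

# Two small, well-separated groups of points (made-up data).
points = np.array([[0, 0], [0, 1], [1, 0],
                   [10, 10], [10, 11], [11, 10]], dtype=float)
sim = snn_similarity(points, k=2)
```

Points in the same group share neighbors, while points from different groups share none, which is why this measure adapts to cluster shape better than raw distance to a center.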


An improved clustering algorithm based on density and shared nearest neighbor

2016 IEEE Information Technology, Networking, Electronic and Automation Control Conference ◽

10.1109/itnec.2016.7560314 ◽

2016 ◽

Author(s):

Hanmin Ye ◽

Hao Lv ◽

Qianting Sun

Keyword(s):

Clustering Algorithm ◽

Nearest Neighbor ◽

Shared Nearest Neighbor


Batch Incremental Shared Nearest Neighbor Density Based Clustering Algorithm for Dynamic Datasets

Lecture Notes in Computer Science - Advances in Information Retrieval ◽

10.1007/978-3-319-56608-5_50 ◽

2017 ◽

pp. 568-574

Author(s):

Panthadeep Bhattacharjee ◽

Amit Awekar

Keyword(s):

Clustering Algorithm ◽

Nearest Neighbor ◽

Density Based Clustering ◽

Shared Nearest Neighbor


Understanding the SNN Input Parameters and How They Affect the Clustering Results

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2015070102 ◽

2015 ◽

Vol 11 (3) ◽

pp. 26-48 ◽

Cited By ~ 3

Author(s):

Guilherme Moreira ◽

Maribel Yasmina Santos ◽

João Moura Pires ◽

João Galvão

Keyword(s):

Clustering Algorithm ◽

Nearest Neighbor ◽

Clustering Algorithms ◽

Data Sets ◽

Comprehensive Understanding ◽

Analysis Process ◽

Arduous Task ◽

Input Parameters ◽

Definition Of ◽

Shared Nearest Neighbor

Huge amounts of data are available for analysis in today's organizations, which face several challenges when trying to analyze the generated data with the aim of extracting useful information. This analytical capability needs to be enhanced with tools capable of dealing with big data sets without making the analytical process an arduous task. Clustering is usually used in the data analysis process, as this technique does not require any prior knowledge about the data. However, clustering algorithms usually require one or more input parameters that influence the clustering process and the results that can be obtained. This work analyses the relation between the three input parameters of the SNN (Shared Nearest Neighbor) clustering algorithm, providing a comprehensive understanding of the relationships that were identified between k, Eps and MinPts, the algorithm's input parameters. Moreover, this work also proposes specific guidelines for the definition of the appropriate input parameters, optimizing the processing time, as the number of trials needed to achieve appropriate results can be substantially reduced.
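To make the roles of k, Eps and MinPts concrete, here is a minimal sketch of a density-based SNN clustering. It assumes a simplified formulation: k sets the neighbor list size, Eps is the minimum number of shared neighbors for two points to count as similar, and MinPts is the minimum number of similar points for a point to be a core point. The function `snn_cluster` and the sample data are hypothetical, and the algorithm studied in the paper may differ in its details.

```python
import numpy as np

def snn_cluster(points, k, eps, min_pts):
    """Simplified SNN density clustering; returns labels, with -1 for noise."""
    n = len(points)
    diffs = points[:, None, :] - points[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=2))
    np.fill_diagonal(dists, np.inf)
    neighbors = np.argsort(dists, axis=1)[:, :k]  # k controls list size

    member = np.zeros((n, n), dtype=int)
    member[np.repeat(np.arange(n), k), neighbors.ravel()] = 1
    sim = member @ member.T            # shared-neighbor counts
    np.fill_diagonal(sim, 0)

    density = (sim >= eps).sum(axis=1)  # Eps thresholds the similarity
    core = density >= min_pts           # MinPts thresholds the density

    labels = np.full(n, -1)
    next_label = 0
    for start in range(n):
        if not core[start] or labels[start] != -1:
            continue
        # Grow a cluster through core points that are similar to each other.
        labels[start] = next_label
        stack = [start]
        while stack:
            i = stack.pop()
            for j in np.flatnonzero(core & (sim[i] >= eps)):
                if labels[j] == -1:
                    labels[j] = next_label
                    stack.append(j)
        next_label += 1

    # Attach each non-core point to its most similar core point, if any.
    for i in np.flatnonzero(~core):
        candidates = np.flatnonzero(core & (sim[i] >= eps))
        if candidates.size:
            labels[i] = labels[candidates[np.argmax(sim[i, candidates])]]
    return labels

# Made-up data: two well-separated groups of three points each.
points = np.array([[0, 0], [0, 1], [1, 0],
                   [10, 10], [10, 11], [11, 10]], dtype=float)
labels = snn_cluster(points, k=2, eps=1, min_pts=2)
```

Raising Eps or MinPts in this sketch makes fewer points qualify as core points, so more points end up as noise, which hints at why the three parameters have to be tuned together.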


High-Dimensional Shared Nearest Neighbor Clustering Algorithm

Fuzzy Systems and Knowledge Discovery - Lecture Notes in Computer Science ◽

10.1007/11540007_60 ◽

2005 ◽

pp. 494-502 ◽

Cited By ~ 6

Author(s):

Jian Yin ◽

Xianli Fan ◽

Yiqun Chen ◽

Jiangtao Ren

Keyword(s):

Clustering Algorithm ◽

Nearest Neighbor ◽

High Dimensional ◽

Shared Nearest Neighbor


An Improved Random Seed Searching Clustering Algorithm Based on Shared Nearest Neighbor

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.719-720.1160 ◽

2015 ◽

Vol 719-720 ◽

pp. 1160-1165 ◽

Cited By ~ 1

Author(s):

Ya Ran Su ◽

Xi Xian Niu

Keyword(s):

Clustering Algorithm ◽

Nearest Neighbor ◽

Local Maximum ◽

Distribution Data ◽

Data Sets ◽

Cluster Number ◽

Data Set ◽

Maximum Cluster ◽

Data Objects ◽

Shared Nearest Neighbor

Clustering analysis continues to be a hot field in data mining. For different types of data sets and application purposes, researchers focus on various aspects, such as adaptability to density and shape, noise detection, outlier identification, cluster number determination, accuracy and optimization. Much related work focuses on the Shared Nearest Neighbor measure because of its wide adaptability to data sets with complex distributions. Based on Shared Nearest Neighbor, this paper proposes an improved algorithm that mainly targets the problems of naturally distributed density, arbitrary shape and cluster number determination. The new algorithm starts with a randomly selected seed, follows the direction of its nearest neighbors, searches for the neighbors with the most similar features, forms the local maximum cluster while dynamically adjusting the data objects' affiliation to realize local optimization, and ends the clustering procedure once all data objects have been identified. Experiments verify that the new algorithm copes well with different densities, shapes, noise, cluster numbers and so on, and can realize fast optimization searching.
