An Automatic Data Clustering Algorithm Based on Differential Evolution

Author(s):  
Chun-Wei Tsai ◽  
Chiech-An Tai ◽  
Ming-Chao Chiang
2019 ◽  
pp. 1-14
Author(s):  
Jun-Xian Chen ◽  
Yue-Jiao Gong ◽  
Wei-Neng Chen ◽  
Mengting Li ◽  
Jun Zhang

2013 ◽  
Vol 380-384 ◽  
pp. 1589-1592
Author(s):  
Xiang Ping Hu

An improved data clustering algorithm was proposed based on the Fuzzy C-Means (FCM) algorithm for the purpose of clustering the data precisely and effectively, through progressing the performance of the data clustering to afford the element work for the application of fault diagnosis and target recognition and so on. There was fatal weakness for the traditional FCM algorithm that the algorithm is sensitive to initial value and noise. The chaotic differential evolution FCM algorithm was proposed according to the efficient global search capability of differential evolution algorithm and the traversal characteristic of chaotic time series. The improved algorithm used the Logistics chaotic mapping to search for the optimal solution, and the chaos disturbance was introduced into the evolutionary population to make up for the defects of FCM algorithm. The new method can overcome the problems of initial value sensitiveness with FCM and local convergence with genetic algorithm. Because the new method. Three types of typical vibration data of faults engines was taken as the example for the research and application. The simulation and application result shows that the data clustering performance of the improved FCM algorithm is much better than the traditional FCM algorithm, and the accuracy rates of fault diagnosis in the application was increased by more than twenty percent, it shows good application prospect.


2021 ◽  
Vol 11 (23) ◽  
pp. 11246
Author(s):  
Abiodun M. Ikotun ◽  
Mubarak S. Almutari ◽  
Absalom E. Ezugwu

K-means clustering algorithm is a partitional clustering algorithm that has been used widely in many applications for traditional clustering due to its simplicity and low computational complexity. This clustering technique depends on the user specification of the number of clusters generated from the dataset, which affects the clustering results. Moreover, random initialization of cluster centers results in its local minimal convergence. Automatic clustering is a recent approach to clustering where the specification of cluster number is not required. In automatic clustering, natural clusters existing in datasets are identified without any background information of the data objects. Nature-inspired metaheuristic optimization algorithms have been deployed in recent times to overcome the challenges of the traditional clustering algorithm in handling automatic data clustering. Some nature-inspired metaheuristics algorithms have been hybridized with the traditional K-means algorithm to boost its performance and capability to handle automatic data clustering problems. This study aims to identify, retrieve, summarize, and analyze recently proposed studies related to the improvements of the K-means clustering algorithm with nature-inspired optimization techniques. A quest approach for article selection was adopted, which led to the identification and selection of 147 related studies from different reputable academic avenues and databases. More so, the analysis revealed that although the K-means algorithm has been well researched in the literature, its superiority over several well-established state-of-the-art clustering algorithms in terms of speed, accessibility, simplicity of use, and applicability to solve clustering problems with unlabeled and nonlinearly separable datasets has been clearly observed in the study. The current study also evaluated and discussed some of the well-known weaknesses of the K-means clustering algorithm, for which the existing improvement methods were conceptualized. It is noteworthy to mention that the current systematic review and analysis of existing literature on K-means enhancement approaches presents possible perspectives in the clustering analysis research domain and serves as a comprehensive source of information regarding the K-means algorithm and its variants for the research community.


2018 ◽  
Vol 6 (2) ◽  
pp. 176-183
Author(s):  
Purnendu Das ◽  
◽  
Bishwa Ranjan Roy ◽  
Saptarshi Paul ◽  
◽  
...  

2014 ◽  
Vol 543-547 ◽  
pp. 1934-1938
Author(s):  
Ming Xiao

For a clustering algorithm in two-dimension spatial data, the Adaptive Resonance Theory exists not only the shortcomings of pattern drift and vector module of information missing, but also difficultly adapts to spatial data clustering which is irregular distribution. A Tree-ART2 network model was proposed based on the above situation. It retains the memory of old model which maintains the constraint of spatial distance by learning and adjusting LTM pattern and amplitude information of vector. Meanwhile, introducing tree structure to the model can reduce the subjective requirement of vigilance parameter and decrease the occurrence of pattern mixing. It is showed that TART2 network has higher plasticity and adaptability through compared experiments.


Energies ◽  
2018 ◽  
Vol 11 (9) ◽  
pp. 2344 ◽  
Author(s):  
Enwen Li ◽  
Linong Wang ◽  
Bin Song ◽  
Siliang Jian

Dissolved gas analysis (DGA) of the oil allows transformer fault diagnosis and status monitoring. Fuzzy c-means (FCM) clustering is an effective pattern recognition method, but exhibits poor clustering accuracy for dissolved gas data and usually fails to subsequently correctly classify transformer faults. The existing feasible approach involves combination of the FCM clustering algorithm with other intelligent algorithms, such as neural networks and support vector machines. This method enables good classification; however, the algorithm complexity is greatly increased. In this paper, the FCM clustering algorithm itself is improved and clustering analysis of DGA data is realized. First, the non-monotonicity of the traditional clustering membership function with respect to the sample distance and its several local extrema are discussed, which mainly explain the poor classification accuracy of DGA data clustering. Then, an exponential form of the membership function is proposed to obtain monotony with respect to distance, thereby improving the dissolved gas data clustering. Likewise, a similarity function to determine the degree of membership is derived. Test results for large datasets show that the improved clustering algorithm can be successfully applied for DGA-data-based transformer fault detection.


Sign in / Sign up

Export Citation Format

Share Document