Automatic Diagnosis of Microgrid Networks’ Power Device Faults Based on Stacked Denoising Autoencoders and Adaptive Affinity Propagation Clustering

Complexity ◽

10.1155/2020/8509142 ◽

2020 ◽

Vol 2020 ◽

pp. 1-24 ◽

Cited By ~ 1

Author(s):

Fan Xu ◽

Xin Shu ◽

Xiaodi Zhang ◽

Bo Fan

Keyword(s):

Clustering Algorithm ◽

Fitness Function ◽

Clustering Algorithms ◽

Principal Component ◽

Affinity Propagation ◽

Cluster Center ◽

Classification Error ◽

Damping Factor ◽

Classification Error Rate ◽

Better Than

This paper presents a model based on stacked denoising autoencoders (SDAEs) in deep learning and adaptive affinity propagation (adAP) for bearing fault diagnosis automatically. First, SDAEs are used to extract potential fault features and directly reduce their high dimension to 3. To prove that the feature extraction capability of SDAEs is better than stacked autoencoders (SAEs), principal component analysis (PCA) is employed to compare and reduce their dimension to 3, except for the final hidden layer. Hence, the extracted 3-dimensional features are chosen as the input for adAP cluster models. Compared with other traditional cluster methods, such as the Fuzzy C-mean (FCM), Gustafson–Kessel (GK), Gath–Geva (GG), and affinity propagation (AP), clustering algorithms can identify fault samples without cluster center number selection. However, AP needs to set two key parameters depending on manual experience—the damping factor and the bias parameter—before its calculation. To overcome this drawback, adAP is introduced in this paper. The adAP clustering algorithm can find the available parameters according to the fitness function automatic. Finally, the experimental results prove that SDAEs with adAP are better than other models, including SDAE-FCM/GK/GG according to the cluster assess index (Silhouette) and the classification error rate.

Download Full-text

Service Partition Method Based on Particle Swarm Fuzzy Clustering

Wireless Communications and Mobile Computing ◽

10.1155/2021/7225552 ◽

2021 ◽

Vol 2021 ◽

pp. 1-12

Author(s):

Hong Xia ◽

Qingyi Dong ◽

Hui Gao ◽

Yanping Chen ◽

ZhongMin Wang

Keyword(s):

Fuzzy Clustering ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Particle Swarm ◽

Cluster Center ◽

Fuzzy Clustering Algorithm ◽

Partition Method ◽

Service Data ◽

Optimal Cluster ◽

Better Than

It is difficult to accurately classify a service into specific service clusters for the multirelationships between services. To solve this problem, this paper proposes a service partition method based on particle swarm fuzzy clustering, which can effectively consider multirelationships between services by using a fuzzy clustering algorithm. Firstly, the algorithm for automatically determining the number of clusters is to determine the number of service clusters based on the density of the service core point. Secondly, the fuzzy c -means combined with particle swarm optimization algorithm to find the optimal cluster center of the service. Finally, the fuzzy clustering algorithm uses the improved Gram-cosine similarity to obtain the final results. Extensive experiments on real web service data show that our method is better than mainstream clustering algorithms in accuracy.

Download Full-text

A Hard C-Means Clustering Algorithm Incorporating Membership KL Divergence and Local Data Information for Noisy Image Segmentation

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s021800141850012x ◽

2017 ◽

Vol 32 (04) ◽

pp. 1850012 ◽

Cited By ~ 5

Author(s):

R. R. Gharieb ◽

G. Gendy ◽

H. Selim

Keyword(s):

Image Segmentation ◽

Membership Function ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Cluster Center ◽

Local Data ◽

Cluster Membership ◽

Kl Divergence ◽

Clustering Approach ◽

Center Distance

In this paper, the standard hard C-means (HCM) clustering approach to image segmentation is modified by incorporating weighted membership Kullback–Leibler (KL) divergence and local data information into the HCM objective function. The membership KL divergence, used for fuzzification, measures the proximity between each cluster membership function of a pixel and the locally-smoothed value of the membership in the pixel vicinity. The fuzzification weight is a function of the pixel to cluster-centers distances. The used pixel to a cluster-center distance is composed of the original pixel data distance plus a fraction of the distance generated from the locally-smoothed pixel data. It is shown that the obtained membership function of a pixel is proportional to the locally-smoothed membership function of this pixel multiplied by an exponentially distributed function of the minus pixel distance relative to the minimum distance provided by the nearest cluster-center to the pixel. Therefore, since incorporating the locally-smoothed membership and data information in addition to the relative distance, which is more tolerant to additive noise than the absolute distance, the proposed algorithm has a threefold noise-handling process. The presented algorithm, named local data and membership KL divergence based fuzzy C-means (LDMKLFCM), is tested by synthetic and real-world noisy images and its results are compared with those of several FCM-based clustering algorithms.

Download Full-text

Comparison of dimensionality reduction and clustering methods for SARS-CoV-2 genome

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v10i4.2803 ◽

2021 ◽

Vol 10 (4) ◽

pp. 2170-2180

Author(s):

Untari N. Wisesty ◽

Tati Rajab Mengko

Keyword(s):

Dimensionality Reduction ◽

Dimensional Reduction ◽

Clustering Algorithm ◽

Sequence Data ◽

Clustering Algorithms ◽

Gaussian Mixture Models ◽

Reduction Process ◽

Principal Component ◽

Gaussian Mixture ◽

Clustering Methods

This paper aims to conduct an analysis of the SARS-CoV-2 genome variation was carried out by comparing the results of genome clustering using several clustering algorithms and distribution of sequence in each cluster. The clustering algorithms used are K-means, Gaussian mixture models, agglomerative hierarchical clustering, mean-shift clustering, and DBSCAN. However, the clustering algorithm has a weakness in grouping data that has very high dimensions such as genome data, so that a dimensional reduction process is needed. In this research, dimensionality reduction was carried out using principal component analysis (PCA) and autoencoder method with three models that produce 2, 10, and 50 features. The main contributions achieved were the dimensional reduction and clustering scheme of SARS-CoV-2 sequence data and the performance analysis of each experiment on each scheme and hyper parameters for each method. Based on the results of experiments conducted, PCA and DBSCAN algorithm achieve the highest silhouette score of 0.8770 with three clusters when using two features. However, dimensionality reduction using autoencoder need more iterations to converge. On the testing process with Indonesian sequence data, more than half of them enter one cluster and the rest are distributed in the other two clusters.

Download Full-text

Summary of Affinity Propagation

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.268-270.811 ◽

2011 ◽

Vol 268-270 ◽

pp. 811-816

Author(s):

Yong Zhou ◽

Yan Xing

Keyword(s):

Clustering Algorithm ◽

Large Data ◽

Large Data Sets ◽

Affinity Propagation ◽

Damping Factor ◽

Data Sets ◽

Similarity Matrix ◽

Data Points

Affinity Propagation(AP)is a new clustering algorithm, which is based on the similarity matrix between pairs of data points and messages are exchanged between data points until clustering result emerges. It is efficient and fast , and it can solve the clustering on large data sets. But the traditional Affinity Propagation has many limitations, this paper introduces the Affinity Propagation, and analyzes in depth the advantages and limitations of it, focuses on the improvements of the algorithm — improve the similarity matrix, adjust the preference and the damping-factor, combine with other algorithms. Finally, discusses the development of Affinity Propagation.

Download Full-text

An Improved Affinity Propagation Clustering Algorithm Based on Entropy Weight Method and Principal Component Analysis

International Journal of Database Theory and Application ◽

10.14257/ijdta.2016.9.6.23 ◽

2016 ◽

Vol 9 (6) ◽

pp. 227-238 ◽

Cited By ~ 1

Author(s):

Wang Limin ◽

Zhang Li ◽

Han Xuming ◽

Ji Qiang ◽

Mu Guangyu ◽

...

Keyword(s):

Principal Component Analysis ◽

Clustering Algorithm ◽

Principal Component ◽

Component Analysis ◽

Affinity Propagation ◽

Entropy Weight ◽

Affinity Propagation Clustering ◽

Entropy Weight Method ◽

Weight Method

Download Full-text

Texture Image Segmentation Using Affinity Propagation and Spectral Clustering

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001415550095 ◽

2015 ◽

Vol 29 (05) ◽

pp. 1555009 ◽

Cited By ~ 6

Author(s):

Hui Du ◽

Yuping Wang ◽

Xiaopan Dong

Keyword(s):

Image Segmentation ◽

Spectral Clustering ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Computational Cost ◽

Affinity Propagation ◽

Texture Image ◽

Texture Image Segmentation ◽

Two Phases ◽

Representative Points

Clustering is a popular and effective method for image segmentation. However, existing cluster methods often suffer the following problems: (1) Need a huge space and a lot of computation when the input data are large. (2) Need to assign some parameters (e.g. number of clusters) in advance which will affect the clustering results greatly. To save the space and computation, reduce the sensitivity of the parameters, and improve the effectiveness and efficiency of the clustering algorithms, we construct a new clustering algorithm for image segmentation. The new algorithm consists of two phases: coarsening clustering and exact clustering. First, we use Affinity Propagation (AP) algorithm for coarsening. Specifically, in order to save the space and computational cost, we only compute the similarity between each point and its t nearest neighbors, and get a condensed similarity matrix (with only t columns, where t << N and N is the number of data points). Second, to further improve the efficiency and effectiveness of the proposed algorithm, the Self-tuning Spectral Clustering (SSC) is used to the resulted points (the representative points gotten in the first phase) to do the exact clustering. As a result, the proposed algorithm can quickly and precisely realize the clustering for texture image segmentation. The experimental results show that the proposed algorithm is more efficient than the compared algorithms FCM, K-means and SOM.

Download Full-text

A Quantitative Discriminant Method of Elbow Point for the Optimal Number of Clusters in Clustering Algorithm

10.21203/rs.3.rs-58011/v3 ◽

2021 ◽

Author(s):

Congming Shi ◽

Bingtao Wei ◽

Shoulin Wei ◽

Wen Wang ◽

Hai Liu ◽

...

Keyword(s):

Clustering Algorithm ◽

Clustering Algorithms ◽

Optimal Number ◽

Machine Learning Method ◽

Cluster Number ◽

Number Of Clusters ◽

Public Dataset ◽

Optimal Cluster ◽

Better Than ◽

Optimal Number Of Clusters

Abstract Clustering, a traditional machine learning method, plays a significant role in data analysis. Most clustering algorithms depend on a predetermined exact number of clusters, whereas, in practice, clusters are usually unpredictable. Although the Elbow method is one of the most commonly used methods to discriminate the optimal cluster number, the discriminant of the number of clusters depends on the manual identification of the elbow points on the visualization curve. Thus, experienced analysts cannot clearly identify the elbow point from the plotted curve when the plotted curve is fairly smooth. To solve this problem, a new elbow point discriminant method is proposed to yield a statistical metric that estimates an optimal cluster number when clustering on a dataset. First, the average degree of distortion obtained by the Elbow method is normalized to the range of 0 to 10. Second, the normalized results are used to calculate the cosine of intersection angles between elbow points. Third, this calculated cosine of intersection angles and the arccosine theorem are used to compute the intersection angles between elbow points. Finally, the index of the above computed minimal intersection angles between elbow points is used as the estimated potential optimal cluster number. The experimental results based on simulated datasets and a well-known public dataset (Iris Dataset) demonstrated that the estimated optimal cluster number obtained by our newly proposed method is better than the widely used Silhouette method.

Download Full-text

On Expanded and Improved Affinity Propagation Clustering Algorithm

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.48-49.753 ◽

2011 ◽

Vol 48-49 ◽

pp. 753-756

Author(s):

Xin Quan Chen

Keyword(s):

Clustering Algorithm ◽

Clustering Algorithms ◽

Grid Cell ◽

Space Complexity ◽

Affinity Propagation ◽

Data Sets ◽

Time And Space ◽

Affinity Propagation Clustering ◽

Clustering Quality ◽

Time And Space Complexity

Facing to the shortcoming of Affinity Propagation algorithm (AP), we present two expanded and improved AP algorithms. In the two algorithms, the AP algorithm based on Grid Cell (APGC) is an effective extension of AP algorithm on the level of grid cells, and the AP clustering algorithm based on Near neighbour Sampling (APNS) is trying to make some improving in time and space complexity. From some simulated comparison experiments of three algorithms, we know that APGC and APNS algorithms have evident improving than AP algorithm in time and space complexity. They can not only get a good clustering quality for massive data sets, but also filtrate noises and isolates well. So we can say they are two effective clustering algorithms with much applied prospect. At last, several research directions are presented.

Download Full-text

Spectral Clustering Based on Sparse Representation

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.556-562.3822 ◽

2014 ◽

Vol 556-562 ◽

pp. 3822-3826

Author(s):

Chen Xiao Hu ◽

Xian Chun Zou

Keyword(s):

Sparse Representation ◽

Spectral Clustering ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Similarity Metrics ◽

Distance Metrics ◽

Information Propagation ◽

Discriminative Ability ◽

Two Samples ◽

Better Than

Spectral clustering is an efficient clustering algorithm based the information propagation between neighborhood nodes. Its performance is largely dependent on the distance metrics, thus it is possible to boost its performance by adapting more reliable distance metric. Given the advantages of sparse representation in discriminative ability, robust to noisy and more faithfully to measure the similarity between two samples, we propose an sparse representation algorithm based on sparse representation. The experimental study on several datasets shows that, the proposed algorithm performs better than the sparse clustering algorithms based on other similarity metrics.

Download Full-text

An improved affinity propagation clustering algorithm based on principal component analysis and variation coefficient

International Journal of Wireless and Mobile Computing ◽

10.1504/ijwmc.2014.065602 ◽

2014 ◽

Vol 7 (6) ◽

pp. 549

Author(s):

Limin Wang ◽

Li Zhang ◽

Xuming Han ◽

Na Huang ◽

Xintong Guo

Keyword(s):

Principal Component Analysis ◽

Clustering Algorithm ◽

Principal Component ◽

Component Analysis ◽

Variation Coefficient ◽

Affinity Propagation ◽

Affinity Propagation Clustering

Download Full-text