Assessment of some combinations of hard and fuzzy clustering techniques for regionalisation of catchments in Sefidroud basin

Ali Ahani; S. Saeid Mousavi Nadoushani

doi:10.2166/hydro.2016.239

Journal of Hydroinformatics ◽

10.2166/hydro.2016.239 ◽

2016 ◽

Vol 18 (6) ◽

pp. 1033-1054 ◽

Cited By ~ 7

Author(s):

Ali Ahani ◽

S. Saeid Mousavi Nadoushani

Keyword(s):

Objective Function ◽

Clustering Algorithms ◽

Hybrid Approach ◽

Hybrid Algorithms ◽

Flood Frequency Analysis ◽

Cluster Validity ◽

FCM Algorithm ◽

Cluster Validity Indices ◽

Validity Indices ◽

Regional Flood Frequency Analysis

Cluster analysis is a well-known family of techniques for regionalising catchments in regional flood frequency analysis. In this study, a fuzzy extension of hybrid clustering algorithms is evaluated. Self-organizing feature maps and four hierarchical clustering algorithms were used to provide the initial cluster centres for the fuzzy c-means (FCM) algorithm. The hybrid approach was used to regionalise catchments in the Sefidroud basin based on feature vectors comprising five catchment attributes: longitude, latitude, drainage area, runoff coefficient and mean annual precipitation. The results showed that, according to both the objective function values and the cluster validity indices, the performance of the FCM algorithm was often improved by the proposed hybrid approach. In terms of minimizing the objective function, the combination of Ward's algorithm and FCM provided the best results, whereas according to the cluster validity indices, other hybrid algorithms, such as the combinations of single linkage or complete linkage with FCM, gave the most desirable results. In addition, all the examined hybrid algorithms identified two well-defined homogeneous regions in the Sefidroud basin.
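
The hybrid idea in this abstract (a hierarchical method supplies the initial centres, then FCM refines them) can be sketched in a few lines. The following is a minimal illustration and not the authors' implementation: it covers only the Ward variant, uses a naive cubic-time agglomeration, assumes a fuzzifier m = 2, and runs on made-up 2-D points. The function names `ward_centres`, `memberships` and `fcm` are hypothetical, and real catchment attributes would presumably need rescaling before distances are computed.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def centroid(points):
    return [sum(col) / len(points) for col in zip(*points)]

def ward_centres(data, c):
    """Naive Ward agglomeration; returns the centroids of the final c clusters."""
    clusters = [[p] for p in data]
    while len(clusters) > c:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                a, b = clusters[i], clusters[j]
                # Ward criterion: increase in within-cluster sum of squares
                cost = (len(a) * len(b) / (len(a) + len(b))
                        * sq_dist(centroid(a), centroid(b)))
                if best is None or cost < best[0]:
                    best = (cost, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [centroid(cl) for cl in clusters]

def memberships(data, centres, m):
    """FCM membership update; each row sums to 1."""
    u = []
    for x in data:
        d = [sq_dist(x, v) for v in centres]
        if min(d) == 0.0:
            # Point coincides with a centre: assign crisp membership
            hits = [1.0 if di == 0.0 else 0.0 for di in d]
            u.append([h / sum(hits) for h in hits])
        else:
            # d holds squared distances, so the exponent is -1/(m-1)
            inv = [di ** (-1.0 / (m - 1.0)) for di in d]
            u.append([v / sum(inv) for v in inv])
    return u

def fcm(data, centres, m=2.0, iters=100, tol=1e-9):
    """Standard FCM iterations started from the supplied centres."""
    centres = [list(v) for v in centres]
    for _ in range(iters):
        u = memberships(data, centres, m)
        new = []
        for i in range(len(centres)):
            w = [row[i] ** m for row in u]
            new.append([sum(wk * x[d] for wk, x in zip(w, data)) / sum(w)
                        for d in range(len(data[0]))])
        shift = max(sq_dist(a, b) for a, b in zip(centres, new))
        centres = new
        if shift < tol:
            break
    u = memberships(data, centres, m)
    objective = sum(u[k][i] ** m * sq_dist(data[k], centres[i])
                    for k in range(len(data)) for i in range(len(centres)))
    return centres, u, objective

# Made-up data: two well-separated groups of four points each
points = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
init = ward_centres(points, 2)
centres, u, objective = fcm(points, init)
```

Replacing `ward_centres` with another initialiser (single linkage, complete linkage, or a self-organizing map) while keeping `fcm` unchanged gives the other hybrid combinations, and the returned `objective` is the quantity one would compare across them.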

Role of Cluster Validity Indices in Delineation of Precipitation Regions

Water ◽

10.3390/w12051372 ◽

2020 ◽

Vol 12 (5) ◽

pp. 1372

Author(s):

Nikhil Bhatia ◽

Jency M. Sojan ◽

Slobodan Simonovic ◽

Roshan Srivastav

Keyword(s):

Clustering Algorithm ◽

Clustering Algorithms ◽

Optimal Number ◽

Ratio Test ◽

Cluster Validity ◽

Number Of Clusters ◽

Cluster Validity Indices ◽

Validity Indices ◽

Point Data ◽

Optimal Number Of Clusters

The delineation of precipitation regions aims to identify homogeneous zones in which the characteristics of the process are statistically similar. The regionalization process has three main components: (i) delineation of regions using clustering algorithms, (ii) determination of the optimal number of regions using cluster validity indices (CVIs), and (iii) validation of the regions for homogeneity using the L-moments ratio test. The identification of the optimal number of clusters significantly affects the homogeneity of the regions. The objective of this study is to investigate the performance of various CVIs in identifying the optimal number of clusters, that is, the number that maximizes the homogeneity of the precipitation regions. The k-means clustering algorithm is adopted to delineate the regions using location-based attributes for two large areas of Canada, namely the Prairies and the Great Lakes-St Lawrence lowlands (GL-SL) region. The seasonal precipitation data for 55 years (1951–2005) are derived from high-resolution ANUSPLIN gridded point data for Canada. The results indicate that the optimal number of clusters and the regional homogeneity depend on the CVI adopted. Among the 42 cluster indices considered, 15 outperform the others in identifying homogeneous precipitation regions. The Dunn, $\mathrm{Det\_ratio}$ and $\mathrm{Trace}(W^{-1}B)$ indices were found to be the best for all seasons in both regions.
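
Component (ii) of the process described above, choosing the number of clusters with a CVI, can be illustrated with the Dunn index, one of the indices the abstract singles out. This is a minimal sketch under stated assumptions rather than the study's procedure: the k-means routine uses deterministic farthest-point seeding (an assumption made here for reproducibility), the data are made-up 2-D points, and `kmeans`, `dunn_index` and `best_k` are hypothetical names.

```python
import math

def dist(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def kmeans(data, k, iters=100):
    """Plain k-means with farthest-point seeding; returns one label per point."""
    centres = [data[0]]
    while len(centres) < k:
        # Next seed: the point farthest from its nearest existing seed
        centres.append(max(data, key=lambda p: min(dist(p, c) for c in centres)))
    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda i: dist(p, centres[i])) for p in data]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for i in range(k):
            members = [p for p, l in zip(data, labels) if l == i]
            if members:  # an empty cluster keeps its previous centre
                centres[i] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels

def dunn_index(data, labels):
    """Smallest between-cluster distance divided by largest cluster diameter."""
    groups = [[p for p, l in zip(data, labels) if l == i] for i in sorted(set(labels))]
    diameter = max(dist(a, b) for g in groups for a in g for b in g)
    separation = min(dist(a, b)
                     for i, g in enumerate(groups) for h in groups[i + 1:]
                     for a in g for b in h)
    return separation / diameter if diameter > 0 else float("inf")

# Made-up data: three compact, well-separated groups
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 0), (10, 1), (11, 0), (11, 1),
          (5, 9), (5, 10), (6, 9), (6, 10)]

# Sweep candidate cluster counts and keep the one with the highest Dunn index
scores = {k: dunn_index(points, kmeans(points, k)) for k in range(2, 6)}
best_k = max(scores, key=scores.get)
```

Higher Dunn values indicate compact, well-separated clusters, so the sweep keeps the k with the largest score. Swapping `dunn_index` for a different index is the only change needed to compare CVIs in this setup, which is the kind of comparison the abstract reports; the homogeneity check with L-moments is not covered by this sketch.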

Performance Evaluation of Line Symmetry-Based Validity Indices on Clustering Algorithms

Journal of Intelligent Systems ◽

10.1515/jisys-2016-0010 ◽

2017 ◽

Vol 26 (3) ◽

pp. 483-503 ◽

Cited By ~ 1

Author(s):

Vijay Kumar ◽

Jitender Kumar Chhabra ◽

Dinesh Kumar

Keyword(s):

Clustering Algorithms ◽

Harmony Search ◽

Real Life ◽

Optimal Number ◽

Distance Measures ◽

Cluster Validity ◽

Number Of Clusters ◽

Cluster Validity Indices ◽

Validity Indices ◽

On Line

Finding the optimal number of clusters and the appropriate partitioning of a given dataset are the two major challenges in clustering. Cluster validity indices are used to address both. In this paper, line symmetry-based versions of seven widely used cluster validity indices, namely the DB, PS, I, XB, FS, K, and SV indices, are developed using line symmetry distance measures. These indices measure the line symmetry present in a partitioning of the dataset. They are able to detect clusters of any shape or size, as long as the clusters possess the property of line symmetry. The performance of these indices is evaluated on three clustering algorithms: K-means, fuzzy C-means, and modified harmony search-based clustering (MHSC). The efficacy of the symmetry-based validity indices is demonstrated on six artificial and six real-life datasets, with the number of clusters varying from 2 to $\sqrt{n}$, where $n$ is the total number of data points in the dataset. The experimental results reveal that incorporating the line symmetry-based distance improves the ability of the existing validity indices to find the appropriate number of clusters. The indices are compared with the point symmetry-based and original versions of the same seven validity indices. The results also demonstrate that the MHSC technique performs better than other well-known clustering techniques. For the real-life datasets, an analysis of variance is also performed.

AutoClust: A Framework for Automated Clustering Based on Cluster Validity Indices

2020 IEEE International Conference on Data Mining (ICDM) ◽

10.1109/icdm50108.2020.00153 ◽

2020 ◽

Author(s):

Yannis Poulakis ◽

Christos Doulkeridis ◽

Dimosthenis Kyriazis

Keyword(s):

Cluster Validity ◽

Cluster Validity Indices ◽

Validity Indices

On fuzzy cluster validity indices for the objects of mixed features

2009 IEEE International Conference on Fuzzy Systems ◽

10.1109/fuzzy.2009.5277190 ◽

2009 ◽

Cited By ~ 2

Author(s):

Mahnhoon Lee

Keyword(s):

Fuzzy Cluster ◽

Cluster Validity ◽

Cluster Validity Indices ◽

Validity Indices ◽

Mixed Features

Number of Clusters and the Quality of Hybrid Predictive Models in Analytical CRM

Studies in Logic, Grammar and Rhetoric ◽

10.2478/slgr-2014-0022 ◽

2014 ◽

Vol 37 (1) ◽

pp. 141-157 ◽

Cited By ~ 1

Author(s):

Mariusz Łapczyński ◽

Bartłomiej Jefmański

Keyword(s):

Predictive Models ◽

Cluster Validity ◽

Number Of Clusters ◽

Model Combining ◽

Cluster Validity Indices ◽

Validity Indices ◽

And Cluster Analysis ◽

Analytical Tools ◽

F Measure

Making more accurate marketing decisions requires managers to build effective predictive models. Typically, these models specify the probability of a customer belonging to a particular category, group or segment. The analytical CRM categories refer to customers interested in starting cooperation with the company (acquisition models), customers who purchase additional products (cross- and up-sell models) or customers intending to end the cooperation (churn models). When building predictive models, researchers use analytical tools from various disciplines, with an emphasis on achieving the best performance. This article attempts to build a hybrid predictive model combining decision trees (C&RT algorithm) and cluster analysis (k-means). Five different cluster validity indices and eight datasets were used in the experiments. The performance of the models was evaluated using popular measures: accuracy, precision, recall, G-mean, F-measure, and lift in the first and second deciles. The authors tried to find a connection between the number of clusters and the quality of the models.

Assessment of Twitter Data Clusters with Cosine-Based Validation Metrics Using Hybrid Topic Models

Ingénierie des systèmes d'information ◽

10.18280/isi.250606 ◽

2020 ◽

Vol 25 (6) ◽

pp. 755-769

Author(s):

Noorullah R. Mohammed ◽

Moulana Mohammed

Keyword(s):

Data Clustering ◽

Topic Models ◽

Cluster Validity ◽

Text Documents ◽

Text Data ◽

Validity Assessment ◽

Text Document ◽

Cluster Validity Indices ◽

Validity Indices ◽

Data Clusters

Text data clustering organizes a set of text documents into a desired number of coherent and meaningful sub-clusters. Modeling the text documents in terms of derived topics is a vital task in text data clustering. Each tweet is treated as a text document, and various topic models are used to model the tweets. In existing topic models, the clustering tendency of tweets is initially assessed using Euclidean dissimilarity features. The cosine metric is better suited to an informative assessment, especially for text clustering. Thus, this paper develops a novel cosine-based external and internal validity assessment of cluster tendency to improve the computational efficiency of tweet data clustering. In the experiments, the tweet clustering results are evaluated using cluster validity indices. The experiments show that the cosine-based internal and external validity metrics outperform the others on benchmark and Twitter-based datasets.
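
The claim that a cosine metric suits text better than Euclidean dissimilarity can be illustrated on a toy example. The sketch below is not the paper's validity metric: it uses the standard silhouette coefficient as a stand-in internal index, evaluated once with Euclidean distance and once with cosine distance, on made-up term-count vectors. The vocabulary, the documents and the function names are all hypothetical.

```python
import math

def euclidean_distance(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cosine_distance(a, b):
    """1 - cosine similarity; assumes neither vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)

def silhouette(vectors, labels, distance):
    """Mean silhouette coefficient for a given labelling and distance function."""
    scores = []
    for i, v in enumerate(vectors):
        own = [distance(v, w) for j, w in enumerate(vectors)
               if j != i and labels[j] == labels[i]]
        if not own:
            scores.append(0.0)  # convention for singleton clusters
            continue
        a = sum(own) / len(own)  # mean distance within the point's own cluster
        b = min(                 # mean distance to the nearest other cluster
            sum(distance(v, w) for w, l in zip(vectors, labels) if l == other)
            / labels.count(other)
            for other in set(labels) if other != labels[i]
        )
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / len(scores)

# Hypothetical term counts over the vocabulary [goal, match, vote, election].
# Documents 0-1 are about sport, 2-3 about politics; 1 and 3 are much longer.
docs = [(1, 1, 0, 0), (6, 5, 0, 0), (0, 0, 1, 1), (0, 0, 5, 6)]
topics = [0, 0, 1, 1]

euclidean_score = silhouette(docs, topics, euclidean_distance)
cosine_score = silhouette(docs, topics, cosine_distance)
```

With these vectors the Euclidean silhouette is close to zero, because document length dominates the distance and a short sports document sits nearer to a short politics document than to a long sports one. The cosine silhouette is close to one, because cosine distance depends only on the direction of the term vector.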

Incremental Cluster Validity Indices for Online Learning of Hard Partitions: Extensions and Comparative Study

IEEE Access ◽

10.1109/access.2020.2969849 ◽

2020 ◽

Vol 8 ◽

pp. 22025-22047 ◽

Cited By ~ 1

Author(s):

Leonardo Enzo Brito Da Silva ◽

Niklas Max Melton ◽

Donald C. Wunsch

Keyword(s):

Online Learning ◽

Comparative Study ◽

Cluster Validity ◽

Cluster Validity Indices ◽

Validity Indices

Generalized Possibilistic Fuzzy C-Means with novel cluster validity indices for clustering noisy data

Applied Soft Computing ◽

10.1016/j.asoc.2016.12.049 ◽

2017 ◽

Vol 53 ◽

pp. 262-283 ◽

Cited By ~ 28

Author(s):

S. Askari ◽

N. Montazerin ◽

M.H. Fazel Zarandi

Keyword(s):

Noisy Data ◽

Cluster Validity ◽

Fuzzy C Means ◽

Cluster Validity Indices ◽

Validity Indices

Towards a standard methodology to evaluate internal cluster validity indices

Pattern Recognition Letters ◽

10.1016/j.patrec.2010.11.006 ◽

2011 ◽

Vol 32 (3) ◽

pp. 505-515 ◽

Cited By ~ 32

Author(s):

Ibai Gurrutxaga ◽

Javier Muguerza ◽

Olatz Arbelaitz ◽

Jesús M. Pérez ◽

José I. Martín

Keyword(s):

Cluster Validity ◽

Standard Methodology ◽

Cluster Validity Indices ◽

Validity Indices

A New Clustering Algorithm Based On Cluster Validity Indices

Discovery Science - Lecture Notes in Computer Science ◽

10.1007/978-3-540-30214-8_27 ◽

2004 ◽

pp. 322-329

Author(s):

Minho Kim ◽

R. S. Ramakrishna

Keyword(s):

Clustering Algorithm ◽

Cluster Validity ◽

Cluster Validity Indices ◽

Validity Indices
