Hyperspectral Image Clustering with Spatially-Regularized Ultrametrics

Shukun Zhang; James M. Murphy

doi:10.3390/rs13050955

Hyperspectral Image Clustering with Spatially-Regularized Ultrametrics

Remote Sensing ◽

10.3390/rs13050955 ◽

2021 ◽

Vol 13 (5) ◽

pp. 955

Author(s):

Shukun Zhang ◽

James M. Murphy

Keyword(s):

Spectral Clustering ◽

Hyperspectral Image ◽

State Of The Art ◽

Image Clustering ◽

Clustering Methods ◽

Performance Guarantees ◽

Data Density ◽

Spatial Geometry ◽

Data Points ◽

Almost All

We propose a method for the unsupervised clustering of hyperspectral images based on spatially regularized spectral clustering with ultrametric path distances. The proposed method efficiently combines data density and spectral-spatial geometry to distinguish between material classes in the data, without the need for training labels. The proposed method is efficient, with quasilinear scaling in the number of data points, and enjoys robust theoretical performance guarantees. Extensive experiments on synthetic and real HSI data demonstrate its strong performance compared to benchmark and state-of-the-art methods. Indeed, the proposed method not only achieves excellent labeling accuracy, but also efficiently estimates the number of clusters. Thus, unlike almost all existing hyperspectral clustering methods, the proposed algorithm is essentially parameter-free.

Download Full-text

ClusterEnG: An interactive educational web resource for clustering big data

10.1101/120915 ◽

2017 ◽

Author(s):

Mohith Manjunath ◽

Yi Zhang ◽

Steve H. Yeo ◽

Omar Sobh ◽

Nathan Russell ◽

...

Keyword(s):

Big Data ◽

State Of The Art ◽

Clustering Algorithms ◽

Clustering Methods ◽

Web Resource ◽

Interactive Visualizations ◽

Data Points ◽

Similarities And Differences ◽

Intuitive Manner ◽

The Web

AbstractSummaryClustering is one of the most common techniques used in data analysis to discover hidden structures by grouping together data points that are similar in some measure into clusters. Although there are many programs available for performing clustering, a single web resource that provides both state-of-the-art clustering methods and interactive visualizations is lacking. ClusterEnG (acronym for Clustering Engine for Genomics) provides an interface for clustering big data and interactive visualizations including 3D views, cluster selection and zoom features. ClusterEnG also aims at educating the user about the similarities and differences between various clustering algorithms and provides clustering tutorials that demonstrate potential pitfalls of each algorithm. The web resource will be particularly useful to scientists who are not conversant with computing but want to understand the structure of their data in an intuitive manner.AvailabilityClusterEnG is part of a bigger project called KnowEnG (Knowledge Engine for Genomics) and is available at http://education.knoweng.org/[email protected]

Download Full-text

Learning with Adaptive Neighbors for Image Clustering

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/344 ◽

2018 ◽

Cited By ~ 2

Author(s):

Yang Liu ◽

Quanxue Gao ◽

Zhaohua Yang ◽

Shujian Wang

Keyword(s):

State Of The Art ◽

Clustering Algorithms ◽

Original Data ◽

Image Clustering ◽

Complex Structures ◽

Clustering Methods ◽

Proposed Model ◽

Data Graph ◽

The Given ◽

Optimal Graph

Due to the importance and efficiency of learning complex structures hidden in data, graph-based methods have been widely studied and get successful in unsupervised learning. Generally, most existing graph-based clustering methods require post-processing on the original data graph to extract the clustering indicators. However, there are two drawbacks with these methods: (1) the cluster structures are not explicit in the clustering results; (2) the final clustering performance is sensitive to the construction of the original data graph. To solve these problems, in this paper, a novel learning model is proposed to learn a graph based on the given data graph such that the new obtained optimal graph is more suitable for the clustering task. We also propose an efficient algorithm to solve the model. Extensive experimental results illustrate that the proposed model outperforms other state-of-the-art clustering algorithms.

Download Full-text

Anchors Bring Ease: An Embarrassingly Simple Approach to Partial Multi-View Clustering

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.3301118 ◽

2019 ◽

Vol 33 ◽

pp. 118-125 ◽

Cited By ~ 2

Author(s):

Jun Guo ◽

Jiahui Ye

Keyword(s):

Spectral Clustering ◽

Time Complexity ◽

State Of The Art ◽

Research Problem ◽

Simple Approach ◽

Clustering Methods ◽

Real World Data ◽

The Past ◽

Benchmark Datasets ◽

In Virtue Of

Clustering on multi-view data has attracted much more attention in the past decades. Most previous studies assume that each instance appears in all views, or there is at least one view containing all instances. However, real world data often suffers from missing some instances in each view, leading to the research problem of partial multi-view clustering. To address this issue, this paper proposes a simple yet effective Anchorbased Partial Multi-view Clustering (APMC) method, which utilizes anchors to reconstruct instance-to-instance relationships for clustering. APMC is conceptually simple and easy to implement in practice, besides it has clear intuitions and non-trivial empirical guarantees. Specifically, APMC firstly integrates intra- and inter- view similarities through anchors. Then, spectral clustering is performed on the fused similarities to obtain a unified clustering result. Compared with existing partial multi-view clustering methods, APMC has three notable advantages: 1) it can capture more non-linear relations among instances with the help of kernel-based similarities; 2) it has a much lower time complexity in virtue of a noniterative scheme; 3) it can inherently handle data with negative entries as well as be extended to more than two views. Finally, we extensively evaluate the proposed method on five benchmark datasets. Experimental results demonstrate the superiority of APMC over state-of-the-art approaches.

Download Full-text

The Fundamental Clustering and Projection Suite (FCPS): A Dataset Collection to Test the Performance of Clustering and Data Projection Algorithms

Data ◽

10.3390/data5010013 ◽

2020 ◽

Vol 5 (1) ◽

pp. 13 ◽

Cited By ~ 1

Author(s):

Alfred Ultsch ◽

Jörn Lötsch

Keyword(s):

Data Science ◽

Clustering Algorithms ◽

Cluster Structure ◽

Projection Methods ◽

Analysis Method ◽

Clustering Methods ◽

Projection Algorithms ◽

Science Data ◽

Data Density ◽

Data Points

In the context of data science, data projection and clustering are common procedures. The chosen analysis method is crucial to avoid faulty pattern recognition. It is therefore necessary to know the properties and especially the limitations of projection and clustering algorithms. This report describes a collection of datasets that are grouped together in the Fundamental Clustering and Projection Suite (FCPS). The FCPS contains 10 datasets with the names “Atom”, “Chainlink”, “EngyTime”, “Golfball”, “Hepta”, “Lsun”, “Target”, “Tetra”, “TwoDiamonds”, and “WingNut”. Common clustering methods occasionally identified non-existent clusters or assigned data points to the wrong clusters in the FCPS suite. Likewise, common data projection methods could only partially reproduce the data structure correctly on a two-dimensional plane. In conclusion, the FCPS dataset collection addresses general challenges for clustering and projection algorithms such as lack of linear separability, different or small inner class spacing, classes defined by data density rather than data spacing, no cluster structure at all, outliers, or classes that are in contact. This report describes a collection of datasets that are grouped together in the Fundamental Clustering and Projection Suite (FCPS). It is designed to address specific problems of structure discovery in high-dimensional spaces.

Download Full-text

Hypergraph Optimization for Multi-Structural Geometric Model Fitting

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33018730 ◽

2019 ◽

Vol 33 ◽

pp. 8730-8737

Author(s):

Shuyuan Lin ◽

Guobao Xiao ◽

Yan Yan ◽

David Suter ◽

Hanzi Wang

Keyword(s):

Spectral Clustering ◽

Input Data ◽

State Of The Art ◽

Geometric Model ◽

Model Fitting ◽

Synthetic Data ◽

Estimation Algorithm ◽

Sampling Efficiency ◽

Data Points ◽

Fitting In

Recently, some hypergraph-based methods have been proposed to deal with the problem of model fitting in computer vision, mainly due to the superior capability of hypergraph to represent the complex relationship between data points. However, a hypergraph becomes extremely complicated when the input data include a large number of data points (usually contaminated with noises and outliers), which will significantly increase the computational burden. In order to overcome the above problem, we propose a novel hypergraph optimization based model fitting (HOMF) method to construct a simple but effective hypergraph. Specifically, HOMF includes two main parts: an adaptive inlier estimation algorithm for vertex optimization and an iterative hyperedge optimization algorithm for hyperedge optimization. The proposed method is highly efficient, and it can obtain accurate model fitting results within a few iterations. Moreover, HOMF can then directly apply spectral clustering, to achieve good fitting performance. Extensive experimental results show that HOMF outperforms several state-of-the-art model fitting methods on both synthetic data and real images, especially in sampling efficiency and in handling data with severe outliers.

Download Full-text

Multiple Partitions Aligned Clustering

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/375 ◽

2019 ◽

Cited By ~ 1

Author(s):

Zhao Kang ◽

Zipeng Guo ◽

Shudong Huang ◽

Siying Wang ◽

Wenyu Chen ◽

...

Keyword(s):

State Of The Art ◽

Cluster Structure ◽

Consensus Clustering ◽

Consensus Cluster ◽

Clustering Methods ◽

Unified Framework ◽

Significant Information ◽

Heterogeneous Information ◽

Data Points ◽

Indicator Matrix

Multi-view clustering is an important yet challenging task due to the difficulty of integrating the information from multiple representations. Most existing multi-view clustering methods explore the heterogeneous information in the space where the data points lie. Such common practice may cause significant information loss because of unavoidable noise or inconsistency among views. Since different views admit the same cluster structure, the natural space should be all partitions. Orthogonal to existing techniques, in this paper, we propose to leverage the multi-view information by fusing partitions. Specifically, we align each partition to form a consensus cluster indicator matrix through a distinct rotation matrix. Moreover, a weight is assigned for each view to account for the clustering capacity differences of views. Finally, the basic partitions, weights, and consensus clustering are jointly learned in a unified framework. We demonstrate the effectiveness of our approach on several real datasets, where significant improvement is found over other state-of-the-art multi-view clustering methods.

Download Full-text

Hyperspectral Band Selection via Spatial-Spectral Weighted Region-wise Multiple Graph Fusion-Based Spectral Clustering

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/418 ◽

2021 ◽

Author(s):

Chang Tang ◽

Xinwang Liu ◽

En Zhu ◽

Lizhe Wang ◽

Albert Zomaya

Keyword(s):

Spectral Clustering ◽

Hyperspectral Image ◽

State Of The Art ◽

Principal Component ◽

Band Selection ◽

Public Datasets ◽

Multiple Graph ◽

Homogeneous Regions ◽

Reflection Characteristics ◽

Minimum Noise

In this paper, we propose a hyperspectral band selection method via spatial-spectral weighted region-wise multiple graph fusion-based spectral clustering, referred to as RMGF briefly. Considering that different objects have different reflection characteristics, we use a superpixel segmentation algorithm to segment the first principal component of original hyperspectral image cube into homogeneous regions. For each superpixel, we construct a corresponding similarity graph to reflect the similarity between band pairs. Then, a multiple graph diffusion strategy with theoretical convergence guarantee is designed to learn a unified graph for partitioning the whole hyperspectral cube into several subcubes via spectral clustering. During the graph diffusion process, the spatial and spectral information of each superpixel are embedded to make spatial/spectral similar superpixels contribute more to each other. Finally, the band containing minimum noise in each subcube is selected to represent the whole subcube. Extensive experiments are conducted on three public datasets to validate the superiority of the proposed method when compared with other state-of-the-art ones.

Download Full-text

Deep Spatial-Spectral Subspace Clustering for Hyperspectral Images Based on Contrastive Learning

Remote Sensing ◽

10.3390/rs13214418 ◽

2021 ◽

Vol 13 (21) ◽

pp. 4418

Author(s):

Xiang Hu ◽

Teng Li ◽

Tong Zhou ◽

Yuanxi Peng

Keyword(s):

Spectral Clustering ◽

Data Augmentation ◽

Hyperspectral Image ◽

State Of The Art ◽

Subspace Clustering ◽

Feature Space ◽

Hyperspectral Images ◽

Negative Sample ◽

Clustering Method ◽

Spectral Subspace

Hyperspectral image (HSI) clustering is a major challenge due to the redundant spectral information in HSIs. In this paper, we propose a novel deep subspace clustering method that extracts spatial–spectral features via contrastive learning. First, we construct positive and negative sample pairs through data augmentation. Then, the data pairs are projected into feature space using a CNN model. Contrastive learning is conducted by minimizing the distances of positive pairs and maximizing those of negative pairs. Finally, based on their features, spectral clustering is employed to obtain the final result. Experimental results gained over three HSI datasets demonstrate that our proposed method is superior to other state-of-the-art methods.

Download Full-text

The Hyperspectral Image Clustering Based on Spatial Information and Spectral Clustering

2019 IEEE 4th International Conference on Signal and Image Processing (ICSIP) ◽

10.1109/siprocess.2019.8868487 ◽

2019 ◽

Cited By ~ 1

Author(s):

Yiwei Wei ◽

Chao Niu ◽

Hongxia Wang ◽

Daizhi Liu

Keyword(s):

Spectral Clustering ◽

Spatial Information ◽

Hyperspectral Image ◽

Image Clustering

Download Full-text

CDIMC-net: Cognitive Deep Incomplete Multi-view Clustering Network

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/447 ◽

2020 ◽

Author(s):

Jie Wen ◽

Zheng Zhang ◽

Yong Xu ◽

Bob Zhang ◽

Lunke Fei ◽

...

Keyword(s):

State Of The Art ◽

Negative Influence ◽

Human Cognition ◽

Clustering Methods ◽

Clustering Problem ◽

Embedding Strategy ◽

Model Training ◽

High Level ◽

Almost All ◽

Incomplete Datasets

In recent years, incomplete multi-view clustering, which studies the challenging multi-view clustering problem on missing views, has received growing research interests. Although a series of methods have been proposed to address this issue, the following problems still exist: 1) Almost all of the existing methods are based on shallow models, which is difficult to obtain discriminative common representations. 2) These methods are generally sensitive to noise or outliers since the negative samples are treated equally as the important samples. In this paper, we propose a novel incomplete multi-view clustering network, called Cognitive Deep Incomplete Multi-view Clustering Network (CDIMC-net), to address these issues. Specifically, it captures the high-level features and local structure of each view by incorporating the view-specific deep encoders and graph embedding strategy into a framework. Moreover, based on the human cognition, \emph{i.e.}, learning from easy to hard, it introduces a self-paced strategy to select the most confident samples for model training, which can reduce the negative influence of outliers. Experimental results on several incomplete datasets show that CDIMC-net outperforms the state-of-the-art incomplete multi-view clustering methods.

Download Full-text