A Sparse Representation of High-Dimensional Input Spaces Based on an Augmented Growing Neural Gas

10.29007/jgjt ◽  
2018 ◽  
Author(s):  
Jochen Kerdels ◽  
Gabriele Peters

The growing neural gas (GNG) algorithm is an unsupervised learning method that approximates the structure of its input space with a network of prototypes. Each prototype represents a local input space region, and neighboring prototypes in the GNG network correspond to neighboring regions in input space. Here we address two problems that can arise when using the GNG algorithm. First, the GNG network structure becomes less and less meaningful with increasing dimensionality of the input space, as typical distance measures like the Euclidean distance lose their expressiveness in higher dimensions. Second, the GNG itself does not provide a form of output that retains the discovered neighborhood relations when compared with common distance measures. We show that a GNG augmented with local input space histograms can mitigate both of these problems. We define a sparse vector representation as output of the augmented GNG that preserves important neighborhood relations while pruning erroneous relations introduced by effects of high dimensionality.
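The prototype-learning loop the abstract builds on can be illustrated with a minimal sketch of the standard GNG algorithm (Fritzke's formulation, not the authors' augmented variant; all parameter values are illustrative defaults, and node removal is omitted for brevity):

```python
import numpy as np

def growing_neural_gas(data, max_nodes=30, max_age=25, lam=50,
                       eps_b=0.2, eps_n=0.006, alpha=0.5, d=0.995,
                       n_iter=3000, seed=0):
    """Minimal GNG sketch: returns prototype positions and the edge set."""
    rng = np.random.default_rng(seed)
    # start with two random prototypes
    W = [rng.standard_normal(data.shape[1]), rng.standard_normal(data.shape[1])]
    error = [0.0, 0.0]
    edges = {}  # (i, j) with i < j -> age

    for t in range(1, n_iter + 1):
        x = data[rng.integers(len(data))]
        dists = [np.linalg.norm(x - w) for w in W]
        s1, s2 = np.argsort(dists)[:2]          # winner and runner-up
        error[s1] += dists[s1] ** 2
        # move the winner and its topological neighbors toward x, age edges
        W[s1] = W[s1] + eps_b * (x - W[s1])
        for (i, j) in list(edges):
            if s1 in (i, j):
                n = j if i == s1 else i
                W[n] = W[n] + eps_n * (x - W[n])
                edges[(i, j)] += 1
        # refresh the edge between the two closest prototypes
        edges[tuple(sorted((s1, s2)))] = 0
        # prune edges that exceeded max_age (isolated-node removal omitted)
        edges = {e: a for e, a in edges.items() if a <= max_age}
        # periodically insert a node between the worst node and its worst neighbor
        if t % lam == 0 and len(W) < max_nodes:
            q = int(np.argmax(error))
            nbrs = [j if i == q else i for (i, j) in edges if q in (i, j)]
            if nbrs:
                f = max(nbrs, key=lambda n: error[n])
                W.append(0.5 * (W[q] + W[f]))
                r = len(W) - 1
                error[q] *= alpha; error[f] *= alpha
                error.append(error[q])
                edges.pop(tuple(sorted((q, f))), None)
                edges[tuple(sorted((q, r)))] = 0
                edges[tuple(sorted((f, r)))] = 0
        error = [e * d for e in error]
    return np.array(W), set(edges)
```

On low-dimensional data the resulting graph tracks the input topology well; the abstract's point is that in high dimensions the Euclidean distances driving this loop become less informative.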

Author(s):  
José García-Rodríguez ◽  
Francisco Flórez-Revuelta ◽  
Juan Manuel García-Chamizo

Self-organising neural networks try to preserve the topology of an input space by means of their competitive learning. This capacity has been used, among others, for the representation of objects and their motion. In this work we use a kind of self-organising network, the Growing Neural Gas, to represent deformations in objects along a sequence of images. As a result of an adaptive process, the objects are represented by a topology-representing graph that constitutes an induced Delaunay triangulation of their shapes. These maps adapt to changes in the objects' topology without resetting the learning process.
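The induced Delaunay triangulation mentioned above arises from competitive Hebbian learning: each input sample connects its two nearest prototypes. A minimal sketch, assuming fixed prototype positions for clarity (the full method also adapts the prototypes as it learns):

```python
import numpy as np

def induced_delaunay_edges(prototypes, samples):
    """Competitive Hebbian learning (Martinetz & Schulten): for each sample,
    connect its two nearest prototypes. The resulting graph is the induced
    Delaunay triangulation restricted to the sampled region."""
    edges = set()
    P = np.asarray(prototypes, dtype=float)
    for x in samples:
        d = np.linalg.norm(P - x, axis=1)
        i, j = np.argsort(d)[:2]                  # two nearest prototypes
        edges.add((min(int(i), int(j)), max(int(i), int(j))))
    return edges
```

Because edges are only created where samples actually fall, the graph follows the object's shape rather than its convex hull.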


2014 ◽  
Vol 2014 ◽  
pp. 1-11 ◽  
Author(s):  
Vicente Morell ◽  
Miguel Cazorla ◽  
Sergio Orts-Escolano ◽  
Jose Garcia-Rodriguez

Current RGB-D sensors provide a large amount of valuable information for mobile robotics tasks like 3D map reconstruction, but the storage and processing of the incremental data provided by the different sensors over time quickly become unmanageable. In this work, we focus on 3D map representation and propose the use of the Growing Neural Gas (GNG) network as a model to represent 3D input data. The GNG method can represent the input data with a desired number of neurons, or resolution, while preserving the topology of the input space. Experiments show that the GNG method yields a better input space adaptation than other state-of-the-art 3D map representation methods.


Author(s):  
Carlotta Domeniconi

In an effort to achieve improved classifier accuracy, extensive research has been conducted in classifier ensembles. Very recently, cluster ensembles have emerged. It is well known that off-the-shelf clustering methods may discover different structures in a given set of data. This is because each clustering algorithm has its own bias resulting from the optimization of different criteria. Furthermore, there is no ground truth against which the clustering result can be validated. Thus, no cross-validation technique can be carried out to tune input parameters involved in the clustering process. As a consequence, the user is not equipped with any guidelines for choosing the proper clustering method for a given dataset. Cluster ensembles offer a solution to challenges inherent to clustering arising from its ill-posed nature. Cluster ensembles can provide more robust and stable solutions by leveraging the consensus across multiple clustering results, while averaging out emergent spurious structures that arise due to the various biases to which each participating algorithm is tuned. In this chapter, we discuss the problem of combining multiple weighted clusters, discovered by a locally adaptive algorithm (Domeniconi, Papadopoulos, Gunopulos, & Ma, 2004), which detects clusters in different subspaces of the input space. We believe that our approach is the first attempt to design a cluster ensemble for subspace clustering (Al-Razgan & Domeniconi, 2006). Recently, several subspace clustering methods have been proposed (Parsons, Haque, & Liu, 2004). They all attempt to dodge the curse of dimensionality, which affects any algorithm in high-dimensional spaces. In high-dimensional spaces, it is highly likely that, for any given pair of points within the same cluster, there exist at least a few dimensions on which the points are far apart from each other. As a consequence, distance functions that use all input features equally may not be effective.
Furthermore, several clusters may exist in different subspaces comprised of different combinations of features. In many real-world problems, some points are correlated with respect to a given set of dimensions, while others are correlated with respect to different dimensions. Each dimension could be relevant to at least one of the clusters. Global dimensionality reduction techniques are unable to capture local correlations of data. Thus, a proper feature selection procedure should operate locally in input space. Local feature selection allows one to embed different distance measures in different regions of the input space; such distance metrics reflect local correlations of data. In (Domeniconi, Papadopoulos, Gunopulos, & Ma, 2004) we proposed a soft feature selection procedure (called LAC) that assigns weights to features according to the local correlations of data along each dimension. Dimensions along which data are loosely correlated receive a small weight, which has the effect of elongating distances along that dimension. Features along which data are strongly correlated receive a large weight, which has the effect of constricting distances along that dimension. Thus the learned weights perform a directional local reshaping of distances which allows a better separation of clusters, and therefore the discovery of different patterns in different subspaces of the original input space.
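The local weighting scheme described above can be sketched as follows. This is a simplified illustration of the idea behind LAC, not the algorithm itself; the function names and the bandwidth parameter `h` are assumptions:

```python
import numpy as np

def lac_weights(X, labels, h=1.0):
    """Per-cluster feature weights in the spirit of LAC: dimensions with
    small within-cluster dispersion (strong correlation) receive large
    weights, which constricts distances along them."""
    weights = {}
    for c in np.unique(labels):
        pts = X[labels == c]
        centroid = pts.mean(axis=0)
        disp = ((pts - centroid) ** 2).mean(axis=0)   # per-dimension dispersion
        w = np.exp(-disp / h)                         # small dispersion -> large weight
        weights[c] = w / np.sqrt((w ** 2).sum())      # normalize weight vector
    return weights

def weighted_dist(x, centroid, w):
    """Locally reshaped distance: each dimension scaled by its cluster weight."""
    return np.sqrt((w * (x - centroid) ** 2).sum())
```

With such weights, each cluster effectively carries its own distance metric, so clusters living in different subspaces can be separated even when a global metric fails.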


2016 ◽  
Vol 26 (04) ◽  
pp. 1650019 ◽  
Author(s):  
Esteban José Palomo ◽  
Ezequiel López-Rubio

In this work, a novel self-organizing model called the growing neural forest (GNF) is presented. It is based on the growing neural gas (GNG), which learns a general graph with no special provisions for datasets with separated clusters. In contrast, the proposed GNF learns a set of trees so that each tree represents a connected cluster of data. High-dimensional datasets often contain large empty regions among clusters, so this proposal is better suited to them than other self-organizing models because it represents these separated clusters as connected components made of neurons. Experimental results are reported that show the self-organization capabilities of the model, and its suitability for unsupervised clustering and foreground detection applications is demonstrated. In particular, the GNF is shown to correctly discover the connected component structure of some datasets, and it outperforms some well-known foreground detectors in both quantitative and qualitative terms.
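Once such a model has learned its prototype graph, reading off clusters reduces to finding connected components, since each tree of the forest is one component. A small, self-contained sketch using union-find (the function name and interface are illustrative, not the paper's API):

```python
def connected_components(n_nodes, edges):
    """Label each prototype by the connected component it belongs to,
    using union-find with path compression over the edge list."""
    parent = list(range(n_nodes))

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]  # path compression
            a = parent[a]
        return a

    for a, b in edges:
        parent[find(a)] = find(b)          # union the two components
    return [find(i) for i in range(n_nodes)]
```

Each distinct label in the result corresponds to one cluster of data, mirroring how the GNF maps separated clusters to separate trees.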


2015 ◽  
Vol 169 ◽  
pp. 272-280 ◽  
Author(s):  
Jochen Kerdels ◽  
Gabriele Peters

2014 ◽  
Vol 24 (3) ◽  
pp. 651-662
Author(s):  
Feng ZENG ◽  
Tong YANG ◽  
Shan YAO

Symmetry ◽  
2021 ◽  
Vol 13 (4) ◽  
pp. 645
Author(s):  
Muhammad Farooq ◽  
Sehrish Sarfraz ◽  
Christophe Chesneau ◽  
Mahmood Ul Hassan ◽  
Muhammad Ali Raza ◽  
...  

Expectiles have gained considerable attention in recent years due to wide applications in many areas. In this study, the k-nearest neighbours approach, together with the asymmetric least squares loss function, called ex-kNN, is proposed for computing expectiles. Firstly, the effect of various distance measures on ex-kNN in terms of test error and computational time is evaluated. It is found that the Canberra, Lorentzian, and Soergel distance measures lead to minimum test error, whereas Euclidean, Canberra, and the average of (L1, L∞) lead to a low computational cost. Secondly, the performance of ex-kNN is compared with the existing packages er-boost and ex-svm for computing expectiles on nine real-life examples. Depending on the nature of the data, ex-kNN showed two to ten times better performance than er-boost and comparable performance with ex-svm regarding test error. Computationally, ex-kNN is found to be two to five times faster than ex-svm and much faster than er-boost, particularly in the case of high-dimensional data.
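The core of the described approach can be sketched as follows: predict the tau-expectile of the responses of the k nearest neighbours under the asymmetric least squares loss. This is an illustrative reconstruction, not the authors' ex-kNN package; only the Euclidean distance is shown (the paper also evaluates Canberra, Lorentzian, Soergel, and others), and the fixed-point iteration is one common way to solve the asymmetric least squares problem:

```python
import numpy as np

def expectile(y, tau=0.5, n_iter=50):
    """tau-expectile of a sample: minimizer of the asymmetric least squares
    loss, found by iterating the asymmetrically weighted mean."""
    m = y.mean()
    for _ in range(n_iter):
        w = np.where(y > m, tau, 1.0 - tau)  # overshoot weighted by tau
        m = (w * y).sum() / w.sum()
    return m

def ex_knn_predict(X_train, y_train, x, k=5, tau=0.5):
    """Predict the tau-expectile of the k nearest training responses."""
    d = np.linalg.norm(X_train - x, axis=1)  # Euclidean distances to x
    idx = np.argsort(d)[:k]                  # indices of k nearest neighbours
    return expectile(y_train[idx], tau)
```

For tau = 0.5 the expectile reduces to the ordinary mean, so ex-kNN then coincides with standard kNN regression; other values of tau tilt the prediction toward the upper or lower tail.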

