Linear and Fisher Separability of Random Points in the d-Dimensional Spherical Layer and Inside the d-Dimensional Cube

Sergey Sidorov; Nikolai Zolotykh

doi:10.3390/e22111281

Linear and Fisher Separability of Random Points in the d-Dimensional Spherical Layer and Inside the d-Dimensional Cube

Entropy ◽

10.3390/e22111281 ◽

2020 ◽

Vol 22 (11) ◽

pp. 1281

Author(s):

Sergey Sidorov ◽

Nikolai Zolotykh

Keyword(s):

Intelligent Systems ◽

High Probability ◽

Dimensional Space ◽

Spherical Layer ◽

High Dimensional ◽

Separation Theorems ◽

Random Set ◽

Artificial Intelligent ◽

Dimensional Cube ◽

Set Of Points

Stochastic separation theorems play important roles in high-dimensional data analysis and machine learning. It turns out that in high dimensional space, any point of a random set of points can be separated from other points by a hyperplane with high probability, even if the number of points is exponential in terms of dimensions. This and similar facts can be used for constructing correctors for artificial intelligent systems, for determining the intrinsic dimensionality of data and for explaining various natural intelligence phenomena. In this paper, we refine the estimations for the number of points and for the probability in stochastic separation theorems, thereby strengthening some results obtained earlier. We propose the boundaries for linear and Fisher separability, when the points are drawn randomly, independently and uniformly from a d-dimensional spherical layer and from the cube. These results allow us to better outline the applicability limits of the stochastic separation theorems in applications.

Download Full-text

Experimental Evaluation of the Height of a Random Set of Points in a d-Dimensional Cube

Algorithm Engineering and Experimentation - Lecture Notes in Computer Science ◽

10.1007/3-540-44808-x_13 ◽

2001 ◽

pp. 161-171 ◽

Cited By ~ 1

Author(s):

Eric Breimer ◽

Mark Goldberg ◽

Brian Kolstad ◽

Malik Magdon-Ismail

Keyword(s):

Experimental Evaluation ◽

Random Set ◽

Dimensional Cube ◽

Set Of Points

Download Full-text

Methods for Binary Multidimensional Scaling

Neural Computation ◽

10.1162/089976602753633457 ◽

2002 ◽

Vol 14 (5) ◽

pp. 1195-1232 ◽

Cited By ~ 8

Author(s):

Douglas L. T. Rohde

Keyword(s):

Neural Networks ◽

Multidimensional Scaling ◽

Dimensional Space ◽

Discrete Space ◽

High Dimensional ◽

Continuous Space ◽

Low Dimensional ◽

Set Of Points ◽

Lower Dimensional Space ◽

Lower Dimensional

Multidimensional scaling (MDS) is the process of transforming a set of points in a high-dimensional space to a lower-dimensional one while preserving the relative distances between pairs of points. Although effective methods have been developed for solving a variety of MDS problems, they mainly depend on the vectors in the lower-dimensional space having real-valued components. For some applications, the training of neural networks in particular, it is preferable or necessary to obtain vectors in a discrete, binary space. Unfortunately, MDS into a low-dimensional discrete space appears to be a significantly harder problem than MDS into a continuous space. This article introduces and analyzes several methods for performing approximately optimized binary MDS.

Download Full-text

A Classification Algorithm with Reject Option Based on Adaptive Minimum Spanning Tree Covering Model in High-dimensional Space

JOURNAL OF ELECTRONICS INFORMATION TECHNOLOGY ◽

10.3724/sp.j.1146.2009.00021 ◽

2011 ◽

Vol 32 (12) ◽

pp. 2895-2900 ◽

Cited By ~ 1

Author(s):

Zheng-ping Hu ◽

Cheng-qian Xu ◽

Qian-wen Jia

Keyword(s):

Spanning Tree ◽

Minimum Spanning Tree ◽

Dimensional Space ◽

Classification Algorithm ◽

High Dimensional ◽

High Dimensional Space ◽

Reject Option ◽

Covering Model

Download Full-text

Neural networks trained with high-dimensional functions approximation data in high-dimensional space

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211417 ◽

2021 ◽

pp. 1-12

Author(s):

Jian Zheng ◽

Jianfeng Wang ◽

Yanping Chen ◽

Shuping Chen ◽

Jingjin Chen ◽

...

Keyword(s):

Neural Networks ◽

Dimensional Space ◽

Data Distribution ◽

High Dimensional ◽

Sufficient Information ◽

Sufficient Data ◽

High Dimensional Space ◽

Positive Effects ◽

The Neural Networks ◽

Using Data

Neural networks can approximate data because of owning many compact non-linear layers. In high-dimensional space, due to the curse of dimensionality, data distribution becomes sparse, causing that it is difficulty to provide sufficient information. Hence, the task becomes even harder if neural networks approximate data in high-dimensional space. To address this issue, according to the Lipschitz condition, the two deviations, i.e., the deviation of the neural networks trained using high-dimensional functions, and the deviation of high-dimensional functions approximation data, are derived. This purpose of doing this is to improve the ability of approximation high-dimensional space using neural networks. Experimental results show that the neural networks trained using high-dimensional functions outperforms that of using data in the capability of approximation data in high-dimensional space. We find that the neural networks trained using high-dimensional functions more suitable for high-dimensional space than that of using data, so that there is no need to retain sufficient data for neural networks training. Our findings suggests that in high-dimensional space, by tuning hidden layers of neural networks, this is hard to have substantial positive effects on improving precision of approximation data.

Download Full-text

Cyclostationary signals analysis methods based on high-dimensional space transformation under impulsive noise.

IEEE Signal Processing Letters ◽

10.1109/lsp.2021.3104996 ◽

2021 ◽

pp. 1-1

Author(s):

Qiancheng Zhang ◽

Hongbing Ji ◽

Yan Jin

Keyword(s):

Impulsive Noise ◽

Dimensional Space ◽

High Dimensional ◽

High Dimensional Space ◽

Analysis Methods ◽

Space Transformation ◽

Cyclostationary Signals

Download Full-text

Network intrusion detection in high dimensional space

2009 2nd International Symposium on Applied Sciences in Biomedical and Communication Technologies ◽

10.1109/isabel.2009.5373652 ◽

2009 ◽

Cited By ~ 2

Author(s):

Stanislav Marcek ◽

Martin Drozda ◽

Gabriel Juhas ◽

Fedor Lehocki

Keyword(s):

Intrusion Detection ◽

Dimensional Space ◽

High Dimensional ◽

Network Intrusion Detection ◽

High Dimensional Space ◽

Network Intrusion

Download Full-text

A System to Assess the Semantic Content of Student Essays

Journal of Educational Computing Research ◽

10.2190/g649-0r9c-c021-p6x3 ◽

2001 ◽

Vol 24 (3) ◽

pp. 305-320 ◽

Cited By ~ 35

Author(s):

Benoit Lemaire ◽

Philippe Dessus

Keyword(s):

Latent Semantic Analysis ◽

Semantic Analysis ◽

Dimensional Space ◽

Semantic Content ◽

High Dimensional ◽

High Dimensional Space

This paper presents Apex, a system that can automatically assess a student essay based on its content. It relies on Latent Semantic Analysis, a tool which is used to represent the meaning of words as vectors in a high-dimensional space. By comparing an essay and the text of a given course on a semantic basis, our system can measure how well the essay matches the text. Various assessments are presented to the student regarding the topic, the outline and the coherence of the essay. Our experiments yield promising results.

Download Full-text

Effective approximation of high-dimensional space using neural networks

The Journal of Supercomputing ◽

10.1007/s11227-021-04038-2 ◽

2021 ◽

Author(s):

Jian Zheng ◽

Jianfeng Wang ◽

Yanping Chen ◽

Shuping Chen ◽

Jingjin Chen ◽

...

Keyword(s):

Neural Networks ◽

Dimensional Space ◽

High Dimensional ◽

High Dimensional Space ◽

Effective Approximation

Download Full-text

A Novel Bat Algorithm Based on Differential Operator and Lévy Flights Trajectory

Computational Intelligence and Neuroscience ◽

10.1155/2013/453812 ◽

2013 ◽

Vol 2013 ◽

pp. 1-13 ◽

Cited By ~ 63

Author(s):

Jian Xie ◽

Yongquan Zhou ◽

Huan Chen

Keyword(s):

Differential Operator ◽

Dimensional Space ◽

Bat Algorithm ◽

High Dimensional ◽

Local Minima ◽

Lévy Flights ◽

Levy Flights ◽

Mutation Strategy ◽

Differential Algorithm ◽

Simulation Results

Aiming at the phenomenon of slow convergence rate and low accuracy of bat algorithm, a novel bat algorithm based on differential operator and Lévy flights trajectory is proposed. In this paper, a differential operator is introduced to accelerate the convergence speed of proposed algorithm, which is similar to mutation strategy “DE/best/2” in differential algorithm. Lévy flights trajectory can ensure the diversity of the population against premature convergence and make the algorithm effectively jump out of local minima. 14 typical benchmark functions and an instance of nonlinear equations are tested; the simulation results not only show that the proposed algorithm is feasible and effective, but also demonstrate that this proposed algorithm has superior approximation capabilities in high-dimensional space.

Download Full-text

Semi-Supervised Deep Learning for High-Dimensional Uncertainty Quantification

Volume 11A: 46th Design Automation Conference (DAC) ◽

10.1115/detc2020-22204 ◽

2020 ◽

Author(s):

Zequn Wang ◽

Mingyang Li

Keyword(s):

Uncertainty Quantification ◽

Reliability Analysis ◽

Supervised Learning ◽

Dimensional Space ◽

Limit State ◽

Failure Surface ◽

Simulation Method ◽

High Dimensional ◽

State Function ◽

Latent Space

Abstract Conventional uncertainty quantification methods usually lacks the capability of dealing with high-dimensional problems due to the curse of dimensionality. This paper presents a semi-supervised learning framework for dimension reduction and reliability analysis. An autoencoder is first adopted for mapping the high-dimensional space into a low-dimensional latent space, which contains a distinguishable failure surface. Then a deep feedforward neural network (DFN) is utilized to learn the mapping relationship and reconstruct the latent space, while the Gaussian process (GP) modeling technique is used to build the surrogate model of the transformed limit state function. During the training process of the DFN, the discrepancy between the actual and reconstructed latent space is minimized through semi-supervised learning for ensuring the accuracy. Both labeled and unlabeled samples are utilized for defining the loss function of the DFN. Evolutionary algorithm is adopted to train the DFN, then the Monte Carlo simulation method is used for uncertainty quantification and reliability analysis based on the proposed framework. The effectiveness is demonstrated through a mathematical example.

Download Full-text