On the estimation of latent distances using graph distances

Ery Arias-Castro; Antoine Channarond; Bruno Pelletier; Nicolas Verzelen

doi:10.1214/21-ejs1801

Criminal networks analysis in missing data scenarios through graph distances

PLoS ONE ◽

10.1371/journal.pone.0255067 ◽

2021 ◽

Vol 16 (8) ◽

pp. e0255067

Author(s):

Annamaria Ficara ◽

Lucia Cavallaro ◽

Francesco Curreri ◽

Giacomo Fiumara ◽

Pasquale De Meo ◽

...

Keyword(s):

Law Enforcement ◽

Incomplete Data ◽

Law Enforcement Agencies ◽

Criminal Networks ◽

Criminal Organizations ◽

Edge Removal ◽

Graph Distances ◽

Intentional Deception ◽

Node Removal ◽

The Impact

Data collected in criminal investigations may suffer from issues such as (i) incompleteness, due to the covert nature of criminal organizations; (ii) incorrectness, caused by either unintentional data collection errors or intentional deception by criminals; and (iii) inconsistency, when the same information is entered into law enforcement databases multiple times or in different formats. In this paper we analyze nine real criminal networks of different types (i.e., Mafia networks, criminal street gangs and terrorist organizations) in order to quantify the impact of incomplete data and to determine which network type is most affected by it. The networks are first pruned using two specific methods: (i) random edge removal, simulating the scenario in which the Law Enforcement Agencies fail to intercept some calls or to spot sporadic meetings among suspects; (ii) node removal, modeling the situation in which some suspects cannot be intercepted or investigated. Finally, we compute spectral distances (i.e., Adjacency, Laplacian and normalized Laplacian Spectral Distances) and matrix distances (i.e., Root Euclidean Distance) between the complete and pruned networks, which we compare using statistical analysis. Our investigation identifies two main features: first, the overall understanding of the criminal networks remains high even with incomplete data on criminal interactions (i.e., when 10% of edges are removed); second, leaving even a small fraction of suspects uninvestigated (i.e., removing 2% of nodes) may lead to significant misinterpretation of the overall network.
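The pruning-and-comparison procedure described in this abstract can be illustrated with a minimal sketch. It assumes a spectral distance defined as the Euclidean norm of the difference between sorted eigenvalue vectors, which is a common definition but is not confirmed by the abstract. The random test graph, the helper names (`laplacian`, `spectral_distance`, `remove_random_edges`) and the fixed seed are illustrative choices, not taken from the paper. Node removal is omitted because it changes the matrix size and needs an extra alignment convention.

```python
import numpy as np

def laplacian(adj):
    """Combinatorial Laplacian L = D - A of an undirected graph."""
    return np.diag(adj.sum(axis=1)) - adj

def spectral_distance(m1, m2):
    """Euclidean distance between the sorted spectra of two symmetric
    matrices of equal size (assumed definition, see lead-in)."""
    ev1 = np.sort(np.linalg.eigvalsh(m1))
    ev2 = np.sort(np.linalg.eigvalsh(m2))
    return float(np.linalg.norm(ev1 - ev2))

def remove_random_edges(adj, fraction, rng):
    """Return a copy of adj with the given fraction of edges removed."""
    pruned = adj.copy()
    rows, cols = np.nonzero(np.triu(adj, k=1))  # each edge listed once
    n_remove = int(round(fraction * len(rows)))
    drop = rng.choice(len(rows), size=n_remove, replace=False)
    pruned[rows[drop], cols[drop]] = 0
    pruned[cols[drop], rows[drop]] = 0  # keep the matrix symmetric
    return pruned

# Hypothetical stand-in for a real network: a random undirected graph.
rng = np.random.default_rng(0)
n = 30
upper = np.triu(rng.random((n, n)) < 0.2, k=1).astype(float)
adj = upper + upper.T

pruned = remove_random_edges(adj, 0.10, rng)  # drop 10% of edges
d_adj = spectral_distance(adj, pruned)
d_lap = spectral_distance(laplacian(adj), laplacian(pruned))
print(f"adjacency spectral distance: {d_adj:.3f}")
print(f"Laplacian spectral distance: {d_lap:.3f}")
```

In practice the pruning would be repeated over many random draws and removal fractions, so that the resulting distances can be compared statistically across networks.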


Modeling Network Populations via Graph Distances

Journal of the American Statistical Association ◽

10.1080/01621459.2020.1763803 ◽

2020 ◽

pp. 1-18

Author(s):

Simón Lunagómez ◽

Sofia C. Olhede ◽

Patrick J. Wolfe

Keyword(s):

Graph Distances


Shape-aware Stochastic Neighbour Embedding for Robust Data Visualisations

10.21203/rs.3.rs-668207/v1 ◽

2021 ◽

Author(s):

Tobias Wängberg ◽

Chun-Biu Li ◽

Joanna Tyrcha

Keyword(s):

Single Cell ◽

Cluster Structure ◽

Synthetic Data ◽

Image Data ◽

Superior Performance ◽

Test Cases ◽

Data Sets ◽

Transcriptomics Data ◽

Quantitative Validation ◽

Graph Distances

The t-distributed Stochastic Neighbour Embedding (t-SNE) method has emerged as one of the leading methods for visualising High Dimensional (HD) data in a wide variety of fields, especially for revealing cluster structure in HD single cell transcriptomics data. However, several shortcomings of the algorithm have been identified. Specifically, t-SNE is often unable to correctly represent hierarchical relationships between clusters, and spurious patterns may arise in the embedding due to incorrect parameter settings, which could lead to misinterpretations of the data. Here we combine t-SNE with shape-aware graph distances, a method termed shape-aware stochastic neighbour embedding (SASNE), to mitigate these limitations of t-SNE. The merits of SASNE are first demonstrated using synthetic data sets, where we see a significant improvement in embedding imbalanced and nonlinear clusters, as well as preservation of hierarchical structure, based on quantitative validation in clustering and dimensionality reduction. Moreover, we propose a data-driven parameter setting which we find consistently optimal in all test cases. Lastly, we demonstrate the superior performance of SASNE in embedding the MNIST image data and single cell transcriptomics gene expression data.


Multivariate Analysis of Orthogonal Range Searching and Graph Distances

Algorithmica ◽

10.1007/s00453-020-00680-z ◽

2020 ◽

Vol 82 (8) ◽

pp. 2292-2315

Author(s):

Karl Bringmann ◽

Thore Husfeldt ◽

Måns Magnusson

Keyword(s):

Multivariate Analysis ◽

Range Searching ◽

Orthogonal Range Searching ◽

Graph Distances


Correlating Intrusion Events and Building Attack Scenarios Through Attack Graph Distances

20th Annual Computer Security Applications Conference ◽

10.1109/csac.2004.11 ◽

2005 ◽

Cited By ~ 60

Author(s):

S. Noel ◽

E. Robertson ◽

S. Jajodia

Keyword(s):

Attack Graph ◽

Graph Distances


A Family of Tractable Graph Distances

Proceedings of the 2018 SIAM International Conference on Data Mining ◽

10.1137/1.9781611975321.38 ◽

2018 ◽

pp. 333-341 ◽

Cited By ~ 7

Author(s):

Jose Bento ◽

Stratis Ioannidis

Keyword(s):

Graph Distances


Massively Distributed Graph Distances

IEEE Transactions on Signal and Information Processing over Networks ◽

10.1109/tsipn.2020.3022003 ◽

2020 ◽

Vol 6 ◽

pp. 667-683

Author(s):

Armin Moharrer ◽

Jasmin Gao ◽

Shikun Wang ◽

Jose Bento ◽

Stratis Ioannidis

Keyword(s):

Graph Distances


Neural arbors are Pareto optimal

Proceedings of The Royal Society B Biological Sciences ◽

10.1098/rspb.2018.2727 ◽

2019 ◽

Vol 286 (1902) ◽

pp. 20182727 ◽

Cited By ~ 4

Author(s):

Arjun Chandrasekhar ◽

Saket Navlakha

Keyword(s):

Network Design ◽

Cell Body ◽

Pareto Front ◽

Cell Types ◽

Brain Regions ◽

Conduction Delay ◽

Pareto Optimal ◽

And Function ◽

Graph Distances ◽

Different Cell Types

Neural arbors (dendrites and axons) can be viewed as graphs connecting the cell body of a neuron to various pre- and post-synaptic partners. Several constraints have been proposed on the topology of these graphs, such as minimizing the amount of wire needed to construct the arbor (wiring cost), and minimizing the graph distances between the cell body and synaptic partners (conduction delay). These two objectives compete with each other—optimizing one results in poorer performance on the other. Here, we describe how well neural arbors resolve this network design trade-off using the theory of Pareto optimality. We develop an algorithm to generate arbors that near-optimally balance between these two objectives, and demonstrate that this algorithm improves over previous algorithms. We then use this algorithm to study how close neural arbors are to being Pareto optimal. Analysing 14 145 arbors across numerous brain regions, species and cell types, we find that neural arbors are much closer to being Pareto optimal than would be expected by chance and other reasonable baselines. We also investigate how the location of the arbor on the Pareto front, and the distance from the arbor to the Pareto front, can be used to classify between some arbor types (e.g. axons versus dendrites, or different cell types), highlighting a new potential connection between arbor structure and function. Finally, using this framework, we find that another biological branching structure—plant shoot architectures used to collect and distribute nutrients—are also Pareto optimal, suggesting shared principles of network design between two systems separated by millions of years of evolution.
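The two competing objectives named in this abstract can be made concrete with a small sketch. It assumes that wiring cost is the total Euclidean length of all edges and that conduction delay is the sum of path lengths from the cell body to every other node. The four-point layout, the node names and the two candidate trees are hypothetical, and every non-root node is treated as a synaptic partner for simplicity. This is not the authors' algorithm, only an illustration of the trade-off.

```python
import math

def edge_length(points, u, v):
    """Euclidean length of the edge between nodes u and v."""
    return math.dist(points[u], points[v])

def wiring_cost(points, parent):
    """Total length of all edges in a tree given as a child -> parent map."""
    return sum(edge_length(points, c, p) for c, p in parent.items())

def conduction_delay(points, parent, root):
    """Sum over all non-root nodes of the path length back to the root."""
    total = 0.0
    for node in parent:
        cur = node
        while cur != root:
            total += edge_length(points, cur, parent[cur])
            cur = parent[cur]
    return total

# Hypothetical layout: a cell body ("soma") and three synaptic partners.
points = {"soma": (0.0, 0.0), "a": (1.0, 0.0), "b": (2.0, 0.0), "c": (2.0, 1.0)}

# Two candidate arbors over the same points.
star = {"a": "soma", "b": "soma", "c": "soma"}  # direct wires: short paths
chain = {"a": "soma", "b": "a", "c": "b"}       # shared wire: less material

for name, tree in (("star", star), ("chain", chain)):
    w = wiring_cost(points, tree)
    d = conduction_delay(points, tree, "soma")
    print(f"{name}: wiring cost = {w:.3f}, conduction delay = {d:.3f}")
```

Here the chain uses less wire but has longer paths to the cell body, while the star does the opposite, so neither tree dominates the other on both objectives.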


Distribution of Graph-Distances in Boltzmann Ensembles of RNA Secondary Structures

Lecture Notes in Computer Science - Algorithms in Bioinformatics ◽

10.1007/978-3-642-40453-5_10 ◽

2013 ◽

pp. 112-125 ◽

Cited By ~ 1

Author(s):

Rolf Backofen ◽

Markus Fricke ◽

Manja Marz ◽

Jing Qin ◽

Peter F. Stadler

Keyword(s):

Secondary Structures ◽

Rna Secondary Structures ◽

Graph Distances


Social network analysis: the use of graph distances to compare artificial and criminal networks

10.20517/jsegc.2021.08 ◽

2021 ◽

Author(s):

Annamaria Ficara ◽

Francesco Curreri ◽

Lucia Cavallaro ◽

Pasquale De Meo ◽

Giacomo Fiumara ◽

...

Keyword(s):

Social Network ◽

Social Network Analysis ◽

Network Analysis ◽

Criminal Networks ◽

Graph Distances
