Artificial Benchmark for Community Detection (ABCD)—Fast random graph model with community structure

Network Science ◽

10.1017/nws.2020.45 ◽

2021 ◽

pp. 1-26

Author(s):

Bogumił Kamiński ◽

Paweł Prałat ◽

François Théberge

Keyword(s):

Community Structure ◽

Complex Networks ◽

Community Detection ◽

Random Graph ◽

Power Law ◽

Main Parameter ◽

Graph Model ◽

Machine Learning Algorithms ◽

Random Graph Model ◽

Mixing Parameter

Most of the current complex networks that are of interest to practitioners possess a certain community structure that plays an important role in understanding the properties of these networks. For instance, closely connected social communities exhibit a faster rate of transmission of information than loosely connected communities. Moreover, many machine learning algorithms and tools that are developed for complex networks try to take advantage of the existence of communities to improve their performance or speed. As a result, there are many competing algorithms for detecting communities in large networks. Unfortunately, these algorithms are often quite sensitive and so they cannot be fine-tuned for a given, but constantly changing, real-world network at hand. It is therefore important to test these algorithms in various scenarios, which can only be done using synthetic graphs that have built-in community structure, power law degree distribution, and other typical properties observed in complex networks. The standard and extensively used method for generating artificial networks is the LFR graph generator. Unfortunately, this model has some scalability limitations and it is challenging to analyze it theoretically. Finally, the mixing parameter μ, the main parameter of the model guiding the strength of the communities, has a non-obvious interpretation and so can lead to unnaturally defined networks. In this paper, we provide an alternative random graph model with community structure and power law distribution for both degrees and community sizes, the Artificial Benchmark for Community Detection (ABCD graph). The model generates graphs with properties similar to those of the LFR model, and its main parameter ξ can be tuned to mimic its counterpart in the LFR model, the mixing parameter μ. We show that the new model solves the three issues identified above and more. In particular, we test the speed of our algorithm and do a number of experiments comparing basic properties of both ABCD and LFR. The conclusion is that these models produce graphs with comparable properties but ABCD is fast, simple, and can be easily tuned to allow the user to make a smooth transition between the two extremes: pure (independent) communities and random graph with no community structure.
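
The transition between the two extremes can be illustrated with a toy generator. The sketch below is not the ABCD algorithm from the paper; it is a simplified stand-in that assumes each node sends a fraction xi of its edge stubs to a global pool and keeps the rest inside its community. The function name `toy_mixed_graph` and its arguments are hypothetical.

```python
import random

def toy_mixed_graph(communities, degree, xi, seed=0):
    """Toy illustration only (not the ABCD generator).

    communities: list of lists of node ids
    degree: dict mapping node id -> desired degree
    xi: fraction of each node's stubs sent to the global (background) pool
    """
    rng = random.Random(seed)
    edges = set()
    background = []

    def pair_up(stubs):
        # Random stub matching; self-loops and repeated edges are dropped.
        rng.shuffle(stubs)
        for a, b in zip(stubs[::2], stubs[1::2]):
            if a != b:
                edges.add((min(a, b), max(a, b)))

    for members in communities:
        local = []
        for v in members:
            n_background = round(xi * degree[v])
            background.extend([v] * n_background)
            local.extend([v] * (degree[v] - n_background))
        pair_up(local)       # edges inside this community
    pair_up(background)      # edges that ignore community labels
    return edges
```

With xi = 0 every edge stays inside a community, and with xi = 1 all stubs are matched without regard to community labels.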


A Random Graph Model for Power Law Graphs

Experimental Mathematics ◽

10.1080/10586458.2001.10504428 ◽

2001 ◽

Vol 10 (1) ◽

pp. 53-66 ◽

Cited By ~ 185

Author(s):

William Aiello ◽

Fan Chung ◽

Linyuan Lu

Keyword(s):

Random Graph ◽

Power Law ◽

Graph Model ◽

Random Graph Model


Scale-Free Property for Degrees and Weights in a Preferential Attachment Random Graph Model

Journal of Probability and Statistics ◽

10.1155/2013/707960 ◽

2013 ◽

Vol 2013 ◽

pp. 1-12 ◽

Cited By ~ 5

Author(s):

István Fazekas ◽

Bettina Porvázsnyik

Keyword(s):

Asymptotic Behaviour ◽

Random Graph ◽

Power Law ◽

Degree Distribution ◽

Preferential Attachment ◽

Graph Model ◽

Random Graph Model ◽

Scale Free ◽

Attachment Model ◽

Law Degree

A random graph evolution mechanism is defined. The evolution studied is a combination of the preferential attachment model and the interaction of four vertices. The asymptotic behaviour of the graph is described. It is proved that the graph exhibits a power law degree distribution; in other words, it is scale-free. It turns out that any exponent in (2, ∞) can be achieved. The proofs are based on martingale methods.


The diameter of KPKVB random graphs

Advances in Applied Probability ◽

10.1017/apr.2019.23 ◽

2019 ◽

Vol 51 (2) ◽

pp. 358-377 ◽

Cited By ~ 1

Author(s):

Tobias Müller ◽

Merlijn Staps

Keyword(s):

Complex Networks ◽

Power Law ◽

Degree Sequence ◽

Graph Model ◽

Clustering Coefficient ◽

Maximum Diameter ◽

Hyperbolic Distance ◽

Random Graph Model ◽

Power Law Exponent ◽

Law Degree

We consider a random graph model that was recently proposed as a model for complex networks by Krioukov et al. (2010). In this model, nodes are chosen randomly inside a disk in the hyperbolic plane and two nodes are connected if they are at most a certain hyperbolic distance from each other. It has previously been shown that this model has various properties associated with complex networks, including a power-law degree distribution and a strictly positive clustering coefficient. The model is specified using three parameters: the number of nodes $N$, which we think of as going to infinity, and $\alpha, \nu > 0$, which we think of as constant. Roughly speaking, $\alpha$ controls the power-law exponent of the degree sequence and $\nu$ the average degree. Earlier work of Kiwi and Mitsche (2015) has shown that, when $\alpha < 1$ (which corresponds to the exponent of the power law degree sequence being $< 3$), the diameter of the largest component is asymptotically almost surely (a.a.s.) at most polylogarithmic in $N$. Friedrich and Krohmer (2015) showed it was a.a.s. $\Omega(\log N)$ and improved the exponent of the polynomial in $\log N$ in the upper bound. Here we show the maximum diameter over all components is a.a.s. $O(\log N)$, thus giving a bound that is tight up to a multiplicative constant.
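
The construction described in this abstract can be sketched in a few lines. The abstract does not give the disk radius or the radial distribution, so the sketch assumes the parametrization commonly used in the literature: radius R = 2 ln(N/ν), radial density proportional to sinh(αr), uniform angles, and an edge whenever the hyperbolic distance is at most R. The function names are hypothetical.

```python
import math
import random

def hyperbolic_distance(r1, t1, r2, t2):
    """Distance between two points in polar coordinates (r, theta)."""
    cosh_d = (math.cosh(r1) * math.cosh(r2)
              - math.sinh(r1) * math.sinh(r2) * math.cos(t1 - t2))
    return math.acosh(max(1.0, cosh_d))  # guard against rounding below 1

def sample_hyperbolic_graph(n, alpha, nu, seed=0):
    """Sample points in a hyperbolic disk and connect close pairs."""
    rng = random.Random(seed)
    R = 2.0 * math.log(n / nu)  # assumed disk radius, requires n > nu
    points = []
    for _ in range(n):
        u = rng.random()
        # Inverse-CDF sampling for a density proportional to sinh(alpha * r).
        r = math.acosh(1.0 + u * (math.cosh(alpha * R) - 1.0)) / alpha
        points.append((r, rng.uniform(0.0, 2.0 * math.pi)))
    edges = [(i, j)
             for i in range(n) for j in range(i + 1, n)
             if hyperbolic_distance(*points[i], *points[j]) <= R]
    return R, points, edges
```

The quadratic loop over pairs is only meant for small N; it does nothing to address the asymptotic questions studied in the paper.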


Community Structure in Industrial SAT Instances

Journal of Artificial Intelligence Research ◽

10.1613/jair.1.11741 ◽

2019 ◽

Vol 66 ◽

pp. 443-472

Author(s):

Carlos Ansótegui ◽

Maria Luisa Bonet ◽

Jesús Giráldez-Cru ◽

Jordi Levy ◽

Laurent Simon

Keyword(s):

Community Structure ◽

Complex Networks ◽

Graph Model ◽

Underlying Structure ◽

Original Structure ◽

Sat Solving ◽

Random Graph Model ◽

Sat Solvers ◽

Sat Solver ◽

Remarkable Progress

Modern SAT solvers have made remarkable progress in solving industrial instances. It is believed that most of these successful techniques exploit the underlying structure of industrial instances. Recently, there have been some attempts to analyze the structure of industrial SAT instances in terms of complex networks, with the aim of explaining the success of SAT solving techniques, and possibly improving them. In this paper, we study the community structure, or modularity, of industrial SAT instances. In a graph with clear community structure, or high modularity, we can find a partition of its nodes into communities such that most edges connect variables of the same community. Representing SAT instances as graphs, we show that most application benchmarks are characterized by a high modularity. In contrast, random SAT instances are closer to the classical Erdős-Rényi random graph model, where no structure can be observed. We also analyze how this structure evolves under the execution of a CDCL SAT solver, and observe that new clauses learned by the solver during the search contribute to destroying the original structure of the formula. Motivated by this observation, we finally present an application that exploits the community structure to detect relevant learned clauses, and we show that detecting these clauses results in an improvement in the performance of the SAT solver. Empirically, we observe that this improves the performance of several SAT solvers on industrial SAT formulas, especially on satisfiable instances.
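
One way to represent a SAT instance as a graph is a variable incidence graph, where two variables are linked when they share a clause. The abstract does not specify the representation, so the sketch below is an assumption: clauses are lists of signed integers (DIMACS style), and each clause spreads a total weight of 1 evenly over its variable pairs. The function name is hypothetical.

```python
from collections import defaultdict
from itertools import combinations

def variable_incidence_graph(clauses):
    """Build a weighted variable graph from CNF clauses.

    clauses: iterable of clauses, each a list of non-zero ints
             (negative ints are negated literals).
    Returns a dict mapping (u, v) with u < v to an edge weight.
    """
    weights = defaultdict(float)
    for clause in clauses:
        variables = sorted({abs(lit) for lit in clause})
        if len(variables) < 2:
            continue  # unit clauses link no pair of variables
        n_pairs = len(variables) * (len(variables) - 1) // 2
        for u, v in combinations(variables, 2):
            # Assumed weighting: each clause contributes total weight 1.
            weights[(u, v)] += 1.0 / n_pairs
    return dict(weights)
```

A community detection method can then be run on the resulting weighted graph to estimate its modularity.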


Analysis of E-Commerce Product Graphs

10.36227/techrxiv.12814244.v1 ◽

2020 ◽

Author(s):

Shalin Shah

Keyword(s):

Random Graph ◽

Random Graphs ◽

Power Law ◽

Real World ◽

Degree Distribution ◽

Graph Model ◽

Graph Analysis ◽

Random Graph Model ◽

Product Graphs ◽

Clustering Coefficients

Consumer behavior in retail stores gives rise to product graphs based on co-purchasing or co-viewing behavior. These product graphs can be analyzed using the known methods of graph analysis. In this paper, we analyze the product graph at Target Corporation based on the Erdős-Rényi random graph model. In particular, we compute clustering coefficients of actual and random graphs, and we find that the clustering coefficients of actual graphs are much higher than random graphs. We conduct the analysis on the entire set of products and also on a per category basis and find interesting results. We also compute the degree distribution and we find that the degree distribution is a power law as expected from real world networks, contrasting with the ER random graph.
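
The random-graph baseline in such a comparison can be sketched as follows. This is a minimal illustration, not the author's code: it samples a G(n, p) graph and computes the average local clustering coefficient, taking the coefficient of a node with fewer than two neighbours to be 0 (a convention assumed here). The function names are hypothetical.

```python
import random
from itertools import combinations

def erdos_renyi(n, p, seed=0):
    """Sample G(n, p) as an adjacency dict of sets."""
    rng = random.Random(seed)
    adj = {v: set() for v in range(n)}
    for u, v in combinations(range(n), 2):
        if rng.random() < p:
            adj[u].add(v)
            adj[v].add(u)
    return adj

def average_clustering(adj):
    """Mean over nodes of (links among neighbours) / (possible links)."""
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue  # coefficient taken as 0 for such nodes
        links = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        total += 2.0 * links / (k * (k - 1))
    return total / len(adj)
```

In G(n, p) the expected local clustering is close to p, so a real graph with the same density and a much larger value has more triangles than chance alone would produce.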


Walk-modularity and community structure in networks

Network Science ◽

10.1017/nws.2015.20 ◽

2015 ◽

Vol 3 (3) ◽

pp. 348-360 ◽

Cited By ~ 6

Author(s):

David Mehrle ◽

Amy Strosser ◽

Anthony Harkin

Keyword(s):

Community Structure ◽

Social Science ◽

Community Detection ◽

Practical Interest ◽

Graph Model ◽

Natural Generalization ◽

Expected Number ◽

Random Graph Model ◽

Modularity Maximization ◽

The Difference

Modularity maximization has been one of the most widely used approaches in the last decade for discovering community structure in networks of practical interest in biology, computing, social science, statistical mechanics, and more. Modularity is a quality function that measures the difference between the number of edges found within clusters and the number of edges one would statistically expect to find based on some equivalent random graph model. We explore a natural generalization of modularity based on the difference between the actual and expected number of walks within clusters, which we refer to as walk-modularity. Walk-modularity can be expressed in matrix form, and community detection can be performed by finding the leading eigenvector of the walk-modularity matrix. We demonstrate community detection on both synthetic and real-world networks and find that walk-modularity maximization returns significantly improved results compared to traditional modularity maximization.
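
The traditional modularity that this abstract generalizes can be computed directly from an edge list. The sketch below assumes the standard Newman-Girvan form for an undirected, unweighted graph with the configuration model as the null model; it does not implement walk-modularity. The function name is hypothetical.

```python
from collections import defaultdict

def modularity(edges, community):
    """Newman-Girvan modularity of a partition.

    edges: list of (u, v) pairs, undirected, each edge listed once
    community: dict mapping node -> community label
    """
    m = len(edges)
    internal = defaultdict(int)    # edges with both ends in community c
    degree_sum = defaultdict(int)  # total degree of nodes in community c
    for u, v in edges:
        degree_sum[community[u]] += 1
        degree_sum[community[v]] += 1
        if community[u] == community[v]:
            internal[community[u]] += 1
    # Observed fraction of internal edges minus the expected fraction.
    return sum(internal[c] / m - (degree_sum[c] / (2 * m)) ** 2
               for c in degree_sum)
```

For two triangles joined by a single edge, with each triangle as one community, this gives 5/14, while placing all nodes in one community gives 0.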


Community Detection Under Exponential Random Graph Model: A Metaheuristic Approach

Lecture Notes in Computer Science - Advances in Swarm Intelligence ◽

10.1007/978-3-319-61833-3_10 ◽

2017 ◽

pp. 87-98 ◽

Cited By ~ 1

Author(s):

Tai-Chi Wang ◽

Frederick Kin Hing Phoa

Keyword(s):

Community Detection ◽

Random Graph ◽

Graph Model ◽

Exponential Random Graph Model ◽

Random Graph Model ◽

Exponential Random Graph


Random graph model with power-law distributed triangle subgraphs

Physical Review E ◽

10.1103/physreve.72.025103 ◽

2005 ◽

Vol 72 (2) ◽

Cited By ~ 8

Author(s):

Danilo Sergi

Keyword(s):

Random Graph ◽

Power Law ◽

Graph Model ◽

Random Graph Model


EGBTER: Capturing Degree Distribution, Clustering Coefficients, and Community Structure in a Single Random Graph Model

2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) ◽

10.1109/asonam.2018.8508598 ◽

2018 ◽

Author(s):

Omar El-Daghar ◽

Erik Lundberg ◽

Robert Bridges

Keyword(s):

Community Structure ◽

Random Graph ◽

Degree Distribution ◽

Graph Model ◽

Random Graph Model ◽

Clustering Coefficients


Analysis of E-Commerce Product Graphs

10.36227/techrxiv.12814244 ◽

2020 ◽

Author(s):

Shalin Shah

Keyword(s):

Random Graph ◽

Random Graphs ◽

Power Law ◽

Real World ◽

Degree Distribution ◽

Graph Model ◽

Graph Analysis ◽

Random Graph Model ◽

Product Graphs ◽

Clustering Coefficients
