The Stochastic Complexity of Spin Models: Are Pairwise Models Really Simple?

Entropy ◽  
2018 ◽  
Vol 20 (10) ◽  
pp. 739 ◽  
Author(s):  
Alberto Beretta ◽  
Claudia Battistin ◽  
Clélia de Mulatier ◽  
Iacopo Mastromatteo ◽  
Matteo Marsili

Models can be simple for different reasons: because they yield a simple and computationally efficient interpretation of a generic dataset (e.g., in terms of pairwise dependencies)—as in statistical learning—or because they capture the laws of a specific phenomenon—as, e.g., in physics—leading to non-trivial falsifiable predictions. In information theory, the simplicity of a model is quantified by its stochastic complexity, which measures the number of bits needed to encode its parameters. In order to understand what simple models look like, we study the stochastic complexity of spin models with interactions of arbitrary order. We show that bijections within the space of possible interactions preserve the stochastic complexity, which allows us to partition the space of all models into equivalence classes. We thus find that the simplicity of a model is not determined by the order of its interactions, but rather by their mutual arrangement. Models whose statistical dependencies are localized on non-overlapping groups of few variables are simple, affording predictions about independencies that are easy to falsify. By contrast, fully connected pairwise models, which are often used in statistical learning, appear to be highly complex because of their extended set of interactions, and they are hard to falsify.
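A minimal sketch of the bookkeeping behind these statements, assuming interactions are encoded as spin-subset bitmasks: the connected-group check illustrates why non-overlapping couplings factorize into independent simple pieces, and an invertible matrix over GF(2) gives one example of a bijection that maps one interaction set onto another of the same size. This is an illustration only, not the authors' construction and not a computation of the stochastic complexity itself.

```python
# Illustrative sketch: interactions as bitmasks over n spins (assumed encoding).
import itertools
import numpy as np

n = 4  # number of spins

def apply_gf2(T, mask):
    """Map an interaction bitmask through an invertible binary matrix T (mod 2)."""
    bits = np.array([(mask >> i) & 1 for i in range(n)])
    new_bits = T.dot(bits) % 2
    return int(sum(int(b) << i for i, b in enumerate(new_bits)))

def connected_groups(model):
    """Partition a model's interactions into groups of variables linked by shared spins."""
    comps = []
    for mask in model:
        touching = [g for g in comps if g & mask]
        merged = mask
        for g in touching:
            merged |= g
            comps.remove(g)
        comps.append(merged)
    return comps

# A "simple" model: two non-overlapping pairwise couplings (spins 0-1 and 2-3).
sparse_model = [0b0011, 0b1100]
# A fully connected pairwise model on 4 spins: all 6 pairs share variables.
dense_model = [(1 << i) | (1 << j) for i, j in itertools.combinations(range(n), 2)]

print(len(connected_groups(sparse_model)), "independent group(s) in the sparse model")  # 2
print(len(connected_groups(dense_model)), "independent group(s) in the dense model")    # 1

# An invertible matrix over GF(2) maps the interaction set bijectively onto
# another of the same size, illustrating (not proving) the equivalence-class idea.
T = np.array([[1, 1, 0, 0],
              [0, 1, 0, 0],
              [0, 0, 1, 1],
              [0, 0, 0, 1]])
print(sorted(apply_gf2(T, m) for m in dense_model))
```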

Entropy ◽  
2020 ◽  
Vol 22 (4) ◽  
pp. 438 ◽  
Author(s):  
Ibrahim Alabdulmohsin

In this paper, we introduce the notion of “learning capacity” for algorithms that learn from data, which is analogous to the Shannon channel capacity for communication systems. We show how “learning capacity” bridges the gap between statistical learning theory and information theory, and we use it to derive generalization bounds for finite hypothesis spaces, differential privacy, and countable domains, among others. Moreover, we prove that under the Axiom of Choice, the existence of an empirical risk minimization (ERM) rule that has a vanishing learning capacity is equivalent to the assertion that the hypothesis space has a finite Vapnik–Chervonenkis (VC) dimension, thus establishing an equivalence relation between two of the most fundamental concepts in statistical learning theory and information theory. In addition, we show how the learning capacity of an algorithm provides important qualitative results, such as the relation between generalization and algorithmic stability, information leakage, and data processing. Finally, we conclude by listing some open problems and suggesting future directions of research.
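For context, the classical finite-hypothesis-space result that bounds of this kind refine can be checked numerically. The sketch below illustrates the standard Hoeffding-plus-union bound, not the paper's learning-capacity derivation; the threshold classifiers, noise level, and sample size are made up for the example.

```python
# Numerical check of the classical bound: with probability >= 1 - delta,
# sup_h |R(h) - R_hat(h)| <= sqrt(ln(2|H|/delta) / (2 n)) for a finite class H.
import numpy as np

rng = np.random.default_rng(0)
thresholds = np.linspace(0.0, 1.0, 20)   # finite hypothesis space, |H| = 20 (assumed)
n, delta, trials = 200, 0.05, 2000
true_t, noise = 0.5, 0.10

def true_risk(t):
    # P(h_t(x) != y) for x ~ U[0,1], y = 1{x > 0.5} with labels flipped w.p. noise.
    return abs(t - true_t) * (1 - 2 * noise) + noise

bound = np.sqrt(np.log(2 * len(thresholds) / delta) / (2 * n))
violations = 0
for _ in range(trials):
    x = rng.uniform(0.0, 1.0, n)
    y = (x > true_t) ^ (rng.uniform(0.0, 1.0, n) < noise)
    gaps = [abs(np.mean((x > t) != y) - true_risk(t)) for t in thresholds]
    violations += max(gaps) > bound
print(f"bound = {bound:.3f}, empirical violation rate = {violations / trials:.4f} (target <= {delta})")
```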


2017 ◽  
Author(s):  
Tony E. Wong ◽  
Alexander Bakker ◽  
Kelsey Ruckert ◽  
Patrick Applegate ◽  
Aimée Slangen ◽  
...  

Abstract. Simple models can play pivotal roles in the quantification and framing of uncertainties surrounding climate change and sea-level rise. They are computationally efficient, transparent, and easier to reproduce. These qualities make simple models useful for uncertainty quantification and risk characterization. Simple model codes are increasingly distributed as open source, as well as actively shared and guided. Alas, computer codes used in the geosciences can often be hard to access, run, modify (e.g., with regards to assumptions and model components), and review. Here, we introduce a simple model framework for projections of global mean temperatures as well as regional sea levels and coastal flood risk (BRICK: Building blocks for Relevant Ice and Climate Knowledge). The BRICK model framework is written in R and Fortran and aims to help mitigate these issues, while maintaining a high degree of computational efficiency. We demonstrate the flexibility of this framework through simple model intercomparison experiments. Furthermore, we demonstrate that BRICK is suitable for risk assessment applications by using a didactic example in local flood risk management.


2017 ◽  
Vol 114 (44) ◽  
pp. 11591-11596 ◽  
Author(s):  
Philip S. Chodrow

Ethnoracial residential segregation is a complex, multiscalar phenomenon with immense moral and economic costs. Modeling the structure and dynamics of segregation is a pressing problem for sociology and urban planning, but existing methods have limitations. In this paper, we develop a suite of methods, grounded in information theory, for studying the spatial structure of segregation. We first advance existing profile and decomposition methods by proposing two related regionalization methods, which allow for profile curves with nonconstant spatial scale and decomposition analysis with nonarbitrary areal units. We then formulate a measure of local spatial scale, which may be used for both detailed, within-city analysis and intercity comparisons. These methods highlight detailed insights into the structure and dynamics of urban segregation that would otherwise be easy to miss or difficult to quantify. They are computationally efficient, applicable to a broad range of study questions, and freely available in open source software.
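As a concrete anchor for the information-theoretic approach, the sketch below computes the mutual information between spatial unit and group from a toy tract-by-group count table. This is the standard divergence-based segregation measure underlying profile and decomposition analyses of this kind, not the authors' released software, and the counts are invented for illustration.

```python
# Mutual information I(tract; group) in bits from a small counts table.
import numpy as np

# rows = census tracts, columns = ethnoracial groups (made-up illustrative counts)
counts = np.array([[120,  10],
                   [ 15, 100],
                   [ 60,  55]], dtype=float)

p = counts / counts.sum()                  # joint distribution P(tract, group)
p_tract = p.sum(axis=1, keepdims=True)     # marginal over groups
p_group = p.sum(axis=0, keepdims=True)     # marginal over tracts

nonzero = p > 0
mutual_info = np.sum(p[nonzero] * np.log2(p[nonzero] / (p_tract * p_group)[nonzero]))
print(f"I(tract; group) = {mutual_info:.3f} bits")
# 0 bits iff every tract mirrors the city-wide composition; the maximum is the
# entropy of the group distribution, attained under complete separation.
```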


2018 ◽  
Vol 68 (5) ◽  
pp. 1149-1172 ◽  
Author(s):  
Milan Stehlík ◽  
Ján Somorčík ◽  
Luboš Střelec ◽  
Jaromír Antoch

Abstract. In this paper we give a partial response to one of the most important statistical questions, namely, what optimal statistical decisions are and how they are related to (statistical) information theory. We exemplify the necessity of understanding the structure of information divergences and their approximations, which may in particular be understood through deconvolution. Deconvolution of information divergences is illustrated in the exponential family of distributions, leading to the optimal tests in the Bahadur sense. We provide a new approximation of I-divergences using the Fourier transformation, saddle point approximation, and uniform convergence of the Euler polygons. Uniform approximation of deconvoluted parts of I-divergences is also discussed. Our approach is illustrated on a real data example.
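A small worked example of an I-divergence inside the exponential family, useful for grounding the discussion: the Kullback-Leibler divergence between two exponential distributions has a simple closed form, which the sketch below checks against numerical quadrature. This is a generic illustration, not the paper's deconvolution or saddle point machinery, and the rate parameters are arbitrary.

```python
# KL(Exp(a) || Exp(b)) = ln(a/b) + b/a - 1 for rates a, b; verify by quadrature.
import numpy as np
from scipy.integrate import quad

a, b = 2.0, 0.7  # rate parameters of the two exponential densities (assumed values)

closed_form = np.log(a / b) + b / a - 1.0

# integrand = p(x) * log(p(x)/q(x)) with p = Exp(a), q = Exp(b)
integrand = lambda x: a * np.exp(-a * x) * (np.log(a / b) + (b - a) * x)
numerical, _ = quad(integrand, 0, np.inf)

print(f"closed form: {closed_form:.6f}, quadrature: {numerical:.6f}")
```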


2017 ◽  
Vol 10 (7) ◽  
pp. 2741-2760 ◽  
Author(s):  
Tony E. Wong ◽  
Alexander M. R. Bakker ◽  
Kelsey Ruckert ◽  
Patrick Applegate ◽  
Aimée B. A. Slangen ◽  
...  

Abstract. Simple models can play pivotal roles in the quantification and framing of uncertainties surrounding climate change and sea-level rise. They are computationally efficient, transparent, and easy to reproduce. These qualities also make simple models useful for the characterization of risk. Simple model codes are increasingly distributed as open source, as well as actively shared and guided. Alas, computer codes used in the geosciences can often be hard to access, run, modify (e.g., with regards to assumptions and model components), and review. Here, we describe the simple model framework BRICK (Building blocks for Relevant Ice and Climate Knowledge) v0.2 and its underlying design principles. The paper adds detail to an earlier published model setup and discusses the inclusion of a land water storage component. The framework largely builds on existing models and allows for projections of global mean temperature as well as regional sea levels and coastal flood risk. BRICK is written in R and Fortran. BRICK gives special attention to transparency, accessibility, and flexibility in order to mitigate the above-mentioned issues while maintaining a high degree of computational efficiency. We demonstrate the flexibility of this framework through simple model intercomparison experiments. Furthermore, we demonstrate that BRICK is suitable for risk assessment applications by using a didactic example in local flood risk management.
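To make the "building blocks" idea concrete, the sketch below chains a toy zero-dimensional energy-balance temperature component into a Rahmstorf-style semi-empirical sea-level component. It is a deliberately simplified illustration of the component-coupling pattern, not BRICK itself (which is written in R and Fortran), and every parameter value is illustrative rather than calibrated.

```python
# Toy component chain: forcing -> one-box temperature -> semi-empirical sea level.
import numpy as np

years = np.arange(1850, 2101)
# toy radiative forcing (W m^-2): linear ramp to 3.0 by 2000, then held constant
forcing = np.where(years < 2000, (years - 1850) / 150.0 * 3.0, 3.0)

def energy_balance(forcing, sensitivity=1.0, heat_capacity=8.0, dt=1.0):
    """One-box model: dT/dt = (F - T / sensitivity) / heat_capacity (annual step)."""
    T = np.zeros_like(forcing)
    for i in range(1, len(forcing)):
        T[i] = T[i - 1] + dt * (forcing[i - 1] - T[i - 1] / sensitivity) / heat_capacity
    return T

def semi_empirical_sea_level(T, alpha=3.0, T0=-0.2, dt=1.0):
    """Semi-empirical component: dS/dt = alpha * (T - T0), S in mm."""
    S = np.zeros_like(T)
    for i in range(1, len(T)):
        S[i] = S[i - 1] + dt * alpha * (T[i - 1] - T0)
    return S

temperature = energy_balance(forcing)
sea_level = semi_empirical_sea_level(temperature)
print(f"toy 2100 warming: {temperature[-1]:.2f} K, toy sea-level rise: {sea_level[-1]:.0f} mm")
```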


2015 ◽  
Vol 2015 ◽  
pp. 1-26 ◽  
Author(s):  
Christian Schröppel ◽  
Jens Wackerfuß

Mesh generation is an important step in many numerical methods. We present the “Hierarchical Graph Meshing” (HGM) method as a novel approach to mesh generation, based on algebraic graph theory. The HGM method can be used to systematically construct configurations exhibiting multiple hierarchies and complex symmetry characteristics. The hierarchical description of structures provided by the HGM method can be exploited to increase the efficiency of multiscale and multigrid methods. In this paper, the HGM method is employed for the systematic construction of super carbon nanotubes of arbitrary order, which present a pertinent example of structurally and geometrically complex, yet highly regular, structures. The HGM algorithm is computationally efficient and exhibits good scaling characteristics. In particular, it scales linearly for super carbon nanotube structures and works much faster than geometry-based methods employing neighborhood search algorithms. Its modular character makes it conducive to automation. For the generation of a mesh, the information about the geometry of the structure in a given configuration is added in a way that relates geometric symmetries to structural symmetries. The intrinsically hierarchic description of the resulting mesh greatly reduces the effort of determining mesh hierarchies for multigrid and multiscale applications and helps to exploit symmetry-related methods in the mechanical analysis of complex structures.
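One way to picture the algebraic-graph-theory construction of hierarchical, highly regular structures is through graph products, as in the sketch below. This is a generic illustration of that family of operations using networkx, not the authors' HGM algorithm; the hexagon-times-chain "tube" and the coarse quotient graph are hypothetical stand-ins for the hierarchy levels discussed.

```python
# Building a regular, hierarchically labeled graph from small generators via
# the Cartesian graph product: a standard algebraic-graph-theory construction.
import networkx as nx

hexagon = nx.cycle_graph(6)   # a small generator, e.g. one hexagonal ring
chain = nx.path_graph(20)     # a second generator encoding the next hierarchy level

# The product inherits structure from both factors: 6 * 20 nodes arranged as
# 20 linked copies of the hexagon, a tube-like regular lattice.
tube = nx.cartesian_product(hexagon, chain)
print(tube.number_of_nodes(), "nodes,", tube.number_of_edges(), "edges")

# Node labels are tuples (ring position, level), so the hierarchy is explicit
# and a coarse level for multigrid-style use can be read off by merging blocks.
coarse = nx.quotient_graph(chain, lambda u, v: u // 5 == v // 5)
print("coarse chain levels:", coarse.number_of_nodes())
```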

