Pointwise Partial Information Decomposition Using the Specificity and Ambiguity Lattices

Entropy ◽  
2018 ◽  
Vol 20 (4) ◽  
pp. 297 ◽  
Author(s):  
Conor Finn ◽  
Joseph Lizier

What are the distinct ways in which a set of predictor variables can provide information about a target variable? When does a variable provide unique information, when do variables share redundant information, and when do variables combine synergistically to provide complementary information? The redundancy lattice from the partial information decomposition of Williams and Beer provided a promising glimpse at the answer to these questions. However, this structure was constructed using a much-criticised measure of redundant information, and despite sustained research, no completely satisfactory replacement measure has been proposed. In this paper, we take a different approach, applying the axiomatic derivation of the redundancy lattice to a single realisation from a set of discrete variables. To overcome the difficulty associated with signed pointwise mutual information, we apply this decomposition separately to the unsigned entropic components of pointwise mutual information, which we refer to as the specificity and the ambiguity. This yields a separate redundancy lattice for each component. Then, based upon an operational interpretation of redundancy, we define measures of redundant specificity and ambiguity, enabling us to evaluate the partial information atoms in each lattice. These atoms can be recombined to yield the sought-after multivariate information decomposition. We apply this framework to canonical examples from the literature and discuss the results and the various properties of the decomposition. In particular, the pointwise decomposition using specificity and ambiguity satisfies a chain rule over target variables, which provides new insights into the so-called two-bit-copy example.
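The split described above can be sketched in a few lines. This is a minimal illustration, assuming the usual definitions of specificity as h(t) = −log₂ p(t) and ambiguity as h(t|s) = −log₂ p(t|s), whose difference is the signed pointwise mutual information; the distribution and function name are illustrative, not from the paper:

```python
import math

def pointwise_decomposition(p_joint, s, t):
    """Split the pointwise mutual information i(s; t) into its two unsigned
    entropic components: specificity h(t) = -log2 p(t) and ambiguity
    h(t|s) = -log2 p(t|s), so that i(s; t) = specificity - ambiguity."""
    p_t = sum(p for (s_, t_), p in p_joint.items() if t_ == t)
    p_s = sum(p for (s_, t_), p in p_joint.items() if s_ == s)
    p_t_given_s = p_joint[(s, t)] / p_s
    return -math.log2(p_t), -math.log2(p_t_given_s)

# A noisy binary channel: the target copies the source 80% of the time.
p = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.4}
spec, amb = pointwise_decomposition(p, 0, 0)
pmi = spec - amb  # log2(p(t|s) / p(t)) = log2(0.8 / 0.5)
```

Note that both components are non-negative even when their difference (the pointwise mutual information) is negative, which is what makes the two separate lattices tractable.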

Author(s):  
A. J. Gutknecht ◽  
M. Wibral ◽  
A. Makkeh

Partial information decomposition (PID) seeks to decompose the multivariate mutual information that a set of source variables contains about a target variable into basic pieces, the so-called ‘atoms of information’. Each atom describes a distinct way in which the sources may contain information about the target. For instance, some information may be contained uniquely in a particular source, some information may be shared by multiple sources and some information may only become accessible synergistically if multiple sources are combined. In this paper, we show that the entire theory of PID can be derived, firstly, from considerations of part-whole relationships between information atoms and mutual information terms, and secondly, based on a hierarchy of logical constraints describing how a given information atom can be accessed. In this way, the idea of a PID is developed on the basis of two of the most elementary relationships in nature: the part-whole relationship and the relation of logical implication. This unifying perspective provides insights into pressing questions in the field such as the possibility of constructing a PID based on concepts other than redundant information in the general n-sources case. Additionally, it admits of a particularly accessible exposition of PID theory.
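For the two-source case, the part-whole relationships between atoms and mutual information terms mentioned above amount to a small linear system: once a redundancy value is fixed by some chosen measure, the other atoms follow. A minimal sketch (the function name is illustrative; the consistency equations are the standard Williams-and-Beer ones):

```python
def pid_atoms_two_sources(i1, i2, i12, r):
    """Recover the remaining PID atoms from the part-whole equations of the
    two-source decomposition, given the mutual information terms
    I(S1;T), I(S2;T), I(S1,S2;T) and a redundancy value r supplied by
    some chosen redundancy measure:
        I(S1;T)    = U1 + R
        I(S2;T)    = U2 + R
        I(S1,S2;T) = U1 + U2 + R + Syn
    """
    u1 = i1 - r
    u2 = i2 - r
    syn = i12 - u1 - u2 - r
    return u1, u2, syn

# XOR target: each source alone carries 0 bits, the pair carries 1 bit,
# and redundancy is 0, so all the information is synergistic.
u1, u2, syn = pid_atoms_two_sources(0.0, 0.0, 1.0, 0.0)
```

For n sources the number of atoms grows with the antichains of the redundancy lattice, which is why the general case needs the hierarchy of logical constraints the paper develops rather than a simple system like this.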


2013 ◽  
Vol 20 (2) ◽  
pp. 185-234 ◽  
Author(s):  
AKIRA UTSUMI

This study examines the ability of a semantic space model to represent the meaning of noun compounds such as ‘information gathering’ or ‘heart disease.’ For a semantic space model to compute the meaning and the attributional similarity (or semantic relatedness) of unfamiliar noun compounds that do not occur in a corpus, the vector for a noun compound must be computed from the vectors of its constituent words using vector composition algorithms. Six composition algorithms (i.e., centroid, multiplication, circular convolution, predication, comparison, and dilation) are compared in terms of the quality of the computed attributional similarity for English and Japanese noun compounds. To evaluate this computation, the study uses three tasks (i.e., related word ranking, similarity correlation, and semantic classification) and two types of semantic spaces (i.e., latent semantic analysis-based and positive pointwise mutual information-based spaces). The results of these tasks show that the dilation algorithm is generally the most effective in computing the similarity of noun compounds, while the multiplication algorithm is best suited specifically to the positive pointwise mutual information-based space. In addition, the comparison algorithm works better for unfamiliar noun compounds that do not occur in the corpus. These findings indicate that a semantic space model in general, and the dilation, multiplication, and comparison algorithms in particular, have sufficient ability to compute the attributional similarity for noun compounds.
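Four of the six composition algorithms named above can be sketched in plain Python (predication and comparison require model-specific machinery and are omitted). The dilation formula follows the common Mitchell-and-Lapata-style form; whether the paper uses exactly this variant and this λ is an assumption:

```python
def centroid(u, v):
    """Additive composition: the average of the two constituent vectors."""
    return [(a + b) / 2 for a, b in zip(u, v)]

def multiplication(u, v):
    """Component-wise (Hadamard) product of the constituent vectors."""
    return [a * b for a, b in zip(u, v)]

def circular_convolution(u, v):
    """Circular convolution, as used in holographic representations."""
    n = len(u)
    return [sum(u[j] * v[(i - j) % n] for j in range(n)) for i in range(n)]

def dilation(u, v, lam=2.0):
    """Dilation: stretch v along the direction of u by a factor lam,
    p = (u.u) v + (lam - 1)(u.v) u."""
    uu = sum(a * a for a in u)
    uv = sum(a * b for a, b in zip(u, v))
    return [uu * b + (lam - 1) * uv * a for a, b in zip(u, v)]

# Convolving with a unit impulse leaves the other vector unchanged.
assert circular_convolution([1, 0, 0], [1, 2, 3]) == [1, 2, 3]
```

Multiplication and circular convolution depend on how the space assigns meaning to individual components, which is one reason multiplication pairs well with sparse, non-negative PPMI vectors.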


2021 ◽  
Author(s):  
Søren Wichmann

The present work is aimed at (1) developing a search engine adapted to the large DReaM corpus of linguistic descriptive literature and (2) gaining insight into how a data-driven ontology of linguistic terminology might be built. Starting from close to 20,000 text documents from the language-description literature, either born digital or scanned and OCR’d, we extract keywords and pass them through a pruning pipeline in which mainly keywords that can be considered to belong to linguistic terminology survive. Subsequently, we quantify the relations among those terms using Normalized Pointwise Mutual Information (NPMI) and use the resulting measures, in conjunction with Google PageRank (GPR), to build networks of linguistic terms.
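NPMI normalizes pointwise mutual information into a fixed range, which makes it usable as an edge weight when building term networks. A minimal sketch, assuming the standard Bouma-style normalization (the abstract does not spell out the formula used):

```python
import math

def npmi(p_xy, p_x, p_y):
    """Normalized Pointwise Mutual Information:
    npmi = pmi / (-log p(x, y)). The value is 0 under independence,
    1 for perfect co-occurrence, and tends to -1 as co-occurrence
    of x and y vanishes."""
    pmi = math.log(p_xy / (p_x * p_y))
    return pmi / (-math.log(p_xy))

# Perfect co-occurrence: x and y always appear together.
assert abs(npmi(0.1, 0.1, 0.1) - 1.0) < 1e-9
# Independence: p(x, y) = p(x) * p(y) gives an NPMI of 0.
assert abs(npmi(0.06, 0.2, 0.3)) < 1e-9
```

Because the bound is the same for every term pair, NPMI scores from frequent and rare terms are directly comparable, unlike raw PMI, which inflates for rare co-occurrences.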

