Selecting Summary Statistics in Approximate Bayesian Computation for Calibrating Stochastic Models

BioMed Research International ◽

10.1155/2013/210646 ◽

2013 ◽

Vol 2013 ◽

pp. 1-10 ◽

Cited By ~ 8

Author(s):

Tom Burr ◽

Alexei Skurikhin

Keyword(s):

Posterior Distribution ◽

Approximate Bayesian Computation ◽

Measurement Data ◽

Real Data ◽

Model Parameters ◽

Bayesian Computation ◽

Summary Statistics ◽

Full Data ◽

User Requirement ◽

Approximate Bayesian

Approximate Bayesian computation (ABC) is an approach for using measurement data to calibrate stochastic computer models, which are common in biology applications. ABC is becoming the “go-to” option when the data and/or parameter dimension is large because it relies on user-chosen summary statistics rather than the full data and is therefore computationally feasible. One technical challenge with ABC is that the quality of the approximation to the posterior distribution of model parameters depends on the user-chosen summary statistics. In this paper, the user requirement to choose effective summary statistics in order to accurately estimate the posterior distribution of model parameters is investigated and illustrated by example, using a model and corresponding real data of mitochondrial DNA population dynamics. We show that for some choices of summary statistics, the posterior distribution of model parameters is closely approximated and for other choices of summary statistics, the posterior distribution is not closely approximated. A strategy to choose effective summary statistics is suggested in cases where the stochastic computer model can be run at many trial parameter settings, as in the example.
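The dependence on user-chosen summary statistics described in this abstract can be illustrated with a minimal rejection-ABC sketch. It is not the authors' code or their mitochondrial DNA model: the toy Normal simulator, the Uniform(-5, 5) prior, the sample-mean summary, and the names `simulate`, `summary`, and `abc_rejection` are all assumptions made for illustration.

```python
import random
import statistics

def simulate(theta, n, rng):
    """Toy stochastic model (assumed for illustration): n draws from Normal(theta, 1)."""
    return [rng.gauss(theta, 1.0) for _ in range(n)]

def summary(data):
    """User-chosen summary statistic; here the sample mean."""
    return statistics.fmean(data)

def abc_rejection(observed, n_trials, tolerance, rng):
    """Keep prior draws whose simulated summary is within `tolerance` of the observed one."""
    s_obs = summary(observed)
    accepted = []
    for _ in range(n_trials):
        theta = rng.uniform(-5.0, 5.0)  # draw from the assumed Uniform(-5, 5) prior
        s_sim = summary(simulate(theta, len(observed), rng))
        if abs(s_sim - s_obs) <= tolerance:
            accepted.append(theta)
    return accepted

rng = random.Random(0)
observed = simulate(2.0, 50, rng)  # stand-in for real measurement data
posterior_sample = abc_rejection(observed, n_trials=5000, tolerance=0.1, rng=rng)
```

Replacing `summary` with a statistic that carries little information about `theta` (for example, the sample range) would leave the accepted draws close to the prior, which is the kind of poor approximation the abstract warns about.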

Component-wise Approximate Bayesian Computation via Gibbs-like steps

Biometrika ◽

10.1093/biomet/asaa090 ◽

2020 ◽

Author(s):

Grégoire Clarté ◽

Christian P Robert ◽

Robin J Ryder ◽

Julien Stoehr

Keyword(s):

Markov Chain ◽

Posterior Distribution ◽

Approximate Bayesian Computation ◽

Generative Models ◽

Bayesian Computation ◽

Summary Statistics ◽

Posterior Distributions ◽

Reduced Dimensions ◽

Standard Solution ◽

Approximate Bayesian

Approximate Bayesian computation methods are useful for generative models with intractable likelihoods. These methods are however sensitive to the dimension of the parameter space, requiring exponentially increasing resources as this dimension grows. To tackle this difficulty, we explore a Gibbs version of the Approximate Bayesian computation approach that runs component-wise approximate Bayesian computation steps aimed at the corresponding conditional posterior distributions, and based on summary statistics of reduced dimensions. While lacking the standard justifications for the Gibbs sampler, the resulting Markov chain is shown to converge in distribution under some partial independence conditions. The associated stationary distribution can further be shown to be close to the true posterior distribution and some hierarchical versions of the proposed mechanism enjoy a closed form limiting distribution. Experiments also demonstrate the gain in efficiency brought by the Gibbs version over the standard solution.

Automatic Tolerance Selection for Approximate Bayesian Computation

10.21203/rs.3.rs-391445/v1 ◽

2021 ◽

Author(s):

George Karabatsos

Keyword(s):

Posterior Distribution ◽

Approximate Bayesian Computation ◽

Parametric Bootstrap ◽

Bayesian Computation ◽

Summary Statistics ◽

Tolerance Level ◽

Intractable Likelihood ◽

Node Network ◽

Approximate Bayesian ◽

Pseudo Data

Approximate Bayesian Computation (ABC) can provide inferences from the (approximate) posterior distribution based on intractable likelihoods. The quality of ABC inferences relies on the choice of tolerance for the distance between the observed data summary statistics, and the pseudo-data summary statistics simulated from the likelihood, used within the context of an algorithm which samples from the approximate posterior. However, the ABC literature does not provide an automatic method to select the best tolerance level for the given dataset at hand, and in ABC practice finding the best tolerance level can be time consuming. This note introduces a fast automatic estimator of the tolerance, based on the parametric bootstrap. After the tolerance estimate is calculated, it can then be input into any suitable importance sampling or MCMC algorithm to approximate from the target approximate posterior distribution. This tolerance estimator is illustrated through ABC analyses of simulated and real datasets involving several intractable likelihood models. This includes the analysis of a real 23,000-node network dataset involving stochastic search model selection.

Weighted approximate Bayesian computation via Sanov’s theorem

Computational Statistics ◽

10.1007/s00180-021-01093-4 ◽

2021 ◽

Author(s):

Cecilia Viscardi ◽

Michele Boreale ◽

Fabio Corradi

Keyword(s):

Large Deviations ◽

Posterior Distribution ◽

Approximate Bayesian Computation ◽

Bayesian Computation ◽

Information Theoretic ◽

Discrete Random Variables ◽

Positive Weights ◽

Approximate Bayesian ◽

Information Theoretic Method ◽

Computational Resources

We consider the problem of sample degeneracy in Approximate Bayesian Computation. It arises when proposed values of the parameters, once given as input to the generative model, rarely lead to simulations resembling the observed data and are hence discarded. Such “poor” parameter proposals do not contribute at all to the representation of the parameter’s posterior distribution. This leads to a very large number of required simulations and/or a waste of computational resources, as well as to distortions in the computed posterior distribution. To mitigate this problem, we propose an algorithm, referred to as the Large Deviations Weighted Approximate Bayesian Computation algorithm, where, via Sanov’s Theorem, strictly positive weights are computed for all proposed parameters, thus avoiding the rejection step altogether. In order to derive a computable asymptotic approximation from Sanov’s result, we adopt the information theoretic “method of types” formulation of the method of Large Deviations, thus restricting our attention to models for i.i.d. discrete random variables. Finally, we experimentally evaluate our method through a proof-of-concept implementation.

Probabilistic Updating of Structural Models for Damage Assessment Using Approximate Bayesian Computation

Sensors ◽

10.3390/s20113197 ◽

2020 ◽

Vol 20 (11) ◽

pp. 3197 ◽

Cited By ~ 1

Author(s):

Zhouquan Feng ◽

Yang Lin ◽

Wenzan Wang ◽

Xugang Hua ◽

Zhengqing Chen

Keyword(s):

Damage Assessment ◽

Approximate Bayesian Computation ◽

Model Updating ◽

Likelihood Function ◽

Probabilistic Approach ◽

Model Parameters ◽

Bayesian Computation ◽

The Novel ◽

Subset Simulation ◽

Approximate Bayesian

A novel probabilistic approach for model updating based on approximate Bayesian computation with subset simulation (ABC-SubSim) is proposed for damage assessment of structures using modal data. The ABC-SubSim is a likelihood-free Bayesian approach in which the explicit expression of likelihood function is avoided and the posterior samples of model parameters are obtained using the technique of subset simulation. The novel contributions of this paper are on three fronts: one is the introduction of some new stopping criteria to find an appropriate tolerance level for the metric used in the ABC-SubSim; the second one is the employment of a hybrid optimization scheme to find finer optimal values for the model parameters; and the last one is the adoption of an iterative approach to determine the optimal weighting factors related to the residuals of modal frequency and mode shape in the metric. The effectiveness of this approach is demonstrated using three illustrative examples.

Using approximate Bayesian computation to infer photosynthesis model parameters

Chinese Journal of Plant Ecology ◽

10.17521/cjpe.2016.0067 ◽

2017 ◽

Vol 41 (3) ◽

pp. 378-385 ◽

Cited By ~ 1

Author(s):

ZENG Ji-Ye ◽

TAN Zheng-Hong ◽

SAIGUSA Nobuko

Keyword(s):

Approximate Bayesian Computation ◽

Model Parameters ◽

Bayesian Computation ◽

Photosynthesis Model ◽

Approximate Bayesian

Using approximate Bayesian computation to quantify cell-cell adhesion parameters in a cell migratory process

10.1101/068791 ◽

2016 ◽

Author(s):

Robert J. H. Ross ◽

R. E. Baker ◽

Andrew Parker ◽

M. J. Ford ◽

R. L. Mort ◽

...

Keyword(s):

Cell Adhesion ◽

Approximate Bayesian Computation ◽

Cell Interactions ◽

Model Parameters ◽

Bayesian Computation ◽

Accurate Identification ◽

A Cell ◽

Approximate Bayesian ◽

Set Up ◽

Cell Cell

In this work we implement approximate Bayesian computational methods to improve the design of a wound-healing assay used to quantify cell-cell interactions. This is important as cell-cell interactions, such as adhesion and repulsion, have been shown to play an important role in cell migration. Initially, we demonstrate with a model of an ideal experiment that we are able to identify model parameters for agent motility and adhesion, given we choose appropriate summary statistics. Following this, we replace our model of an ideal experiment with a model representative of a practically realisable experiment. We demonstrate that, given the current (and commonly used) experimental set-up, model parameters cannot be accurately identified using approximate Bayesian computation methods. We compare new experimental designs through simulation, and show more accurate identification of model parameters is possible by expanding the size of the domain upon which the experiment is performed, as opposed to increasing the number of experimental repeats. The results presented in this work therefore describe time and cost-saving alterations for a commonly performed experiment for identifying cell motility parameters. Moreover, the results presented in this work will be of interest to those concerned with performing experiments that allow for the accurate identification of parameters governing cell migratory processes, especially cell migratory processes in which cell-cell adhesion or repulsion are known to play a significant role.

Approximate Bayesian computation in controlled branching processes: the role of summary statistics

Revista de la Real Academia de Ciencias Exactas Físicas y Naturales Serie A Matemáticas ◽

10.1007/s13398-020-00839-x ◽

2020 ◽

Vol 114 (3) ◽

Author(s):

Miguel González ◽

Rodrigo Martínez ◽

Carmen Minuesa ◽

Inés del Puerto

Keyword(s):

Approximate Bayesian Computation ◽

Branching Processes ◽

Bayesian Computation ◽

Summary Statistics ◽

Approximate Bayesian ◽

Controlled Branching Processes

An automatic adaptive method to combine summary statistics in approximate Bayesian computation

PLoS ONE ◽

10.1371/journal.pone.0236954 ◽

2020 ◽

Vol 15 (8) ◽

pp. e0236954

Author(s):

Jonathan U. Harrison ◽

Ruth E. Baker

Keyword(s):

Approximate Bayesian Computation ◽

Adaptive Method ◽

Bayesian Computation ◽

Summary Statistics ◽

Approximate Bayesian

Faculty Opinions recommendation of Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.718441386.793496094 ◽

2014 ◽

Author(s):

Michael Stumpf

Keyword(s):

Approximate Bayesian Computation ◽

Bayesian Computation ◽

Summary Statistics ◽

Approximate Bayesian

Choice of Summary Statistic Weights in Approximate Bayesian Computation

Statistical Applications in Genetics and Molecular Biology ◽

10.2202/1544-6115.1586 ◽

2011 ◽

Vol 10 (1) ◽

Cited By ~ 9

Author(s):

Hsuan Jung ◽

Paul Marjoram

Keyword(s):

Genetic Algorithm ◽

Population Genetics ◽

Approximate Bayesian Computation ◽

Fundamental Problem ◽

Bayesian Computation ◽

Summary Statistics ◽

Summary Statistic ◽

Improved Performance ◽

Approximate Bayesian ◽

Computation Analysis

In this paper, we develop a Genetic Algorithm that can address the fundamental problem of how one should weight the summary statistics included in an approximate Bayesian computation analysis built around an accept/reject algorithm, and how one might choose the tolerance for that analysis. We then demonstrate that using weighted statistics, and a well-chosen tolerance, in such an approximate Bayesian computation approach can result in improved performance, when compared to unweighted analyses, using one example drawn purely from statistics and two drawn from the estimation of population genetics parameters.
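The role that weights and a tolerance play in an accept/reject ABC step can be sketched as follows. This is only an illustration of a weighted distance between summary-statistic vectors: the weighted Euclidean form, the hand-set weights, and the function names `weighted_distance` and `accept` are assumptions, and the Genetic Algorithm the paper uses to choose the weights is not reproduced here.

```python
import math

def weighted_distance(s_sim, s_obs, weights):
    """Weighted Euclidean distance between two vectors of summary statistics."""
    if not (len(s_sim) == len(s_obs) == len(weights)):
        raise ValueError("all inputs must have the same length")
    return math.sqrt(sum(w * (a - b) ** 2 for w, a, b in zip(weights, s_sim, s_obs)))

def accept(s_sim, s_obs, weights, tolerance):
    """Accept a simulated parameter draw if its summaries fall within the tolerance."""
    return weighted_distance(s_sim, s_obs, weights) <= tolerance

# Hypothetical values: the second statistic is given zero weight, so only the
# first statistic influences the accept/reject decision.
decision = accept(s_sim=[1.0, 2.0], s_obs=[1.05, 9.0], weights=[1.0, 0.0], tolerance=0.1)
```

Setting a weight to zero drops a statistic from the comparison entirely, while unequal positive weights let informative statistics dominate the distance.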
