Towards Scalable Gaussian Process Modeling

Author(s):  
Piyush Pandita ◽  
Jesper Kristensen ◽  
Liping Wang

Abstract Numerous engineering problems of interest to industry are characterized by expensive black-box objective function evaluations. These objective functions could be physical experiments or computer simulations. Obtaining a comprehensive idea of the problem and/or performing subsequent optimization generally requires hundreds of thousands of evaluations of the objective function, which is most often practically unachievable. Gaussian Process (GP) surrogate modeling replaces the expensive function with a cheap-to-evaluate data-driven probabilistic model. While the GP does not assume a functional form of the problem, it is defined by a set of parameters, called hyperparameters, that need to be learned from the data. The hyperparameters define the characteristics of the objective function, such as smoothness, magnitude, and periodicity. Accurately estimating these hyperparameters is a key ingredient in developing a reliable and generalizable surrogate model. Markov chain Monte Carlo (MCMC) is a ubiquitously used Bayesian method for estimating these hyperparameters. At GE's Global Research Center, a customized industry-strength Bayesian hybrid modeling framework built on the GP, called GEBHM, has been employed and validated over many years. GEBHM is very effective on problems of small and medium size, typically fewer than 1000 training points. However, the GP does not scale well in time with growing dataset size and problem dimensionality, which can be a major impediment in such problems. For some challenging industry applications, the predictive capability of the GP is required, but each second spent training the GP costs thousands of dollars. In this work, we apply a scalable MCMC-based methodology enabling the modeling of large-scale industry problems. Towards this, we extend and implement in GEBHM an Adaptive Sequential Monte Carlo (ASMC) methodology for training the GP. This implementation saves computational time (especially for large-scale problems) while sacrificing no predictive accuracy relative to the current MCMC implementation. We demonstrate the effectiveness and accuracy of GEBHM with ASMC on four mathematical problems and on two challenging industry applications of varying complexity.
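The MCMC hyperparameter training that the abstract describes can be sketched in miniature. The toy example below is not GEBHM: the RBF kernel, fixed noise level, flat prior on the log length-scale, and synthetic data are all illustrative assumptions. It learns a single GP length-scale by random-walk Metropolis over the log marginal likelihood:

```python
import numpy as np

def rbf_kernel(X, lengthscale, variance=1.0):
    # Squared-exponential kernel matrix for 1-D inputs.
    d2 = (X[:, None] - X[None, :]) ** 2
    return variance * np.exp(-0.5 * d2 / lengthscale ** 2)

def log_marginal_likelihood(X, y, lengthscale, noise=1e-2):
    # Standard GP log evidence via a Cholesky factorization.
    K = rbf_kernel(X, lengthscale) + noise * np.eye(len(X))
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    return (-0.5 * y @ alpha
            - np.log(np.diag(L)).sum()
            - 0.5 * len(X) * np.log(2 * np.pi))

def metropolis_lengthscale(X, y, n_steps=2000, step=0.2, seed=0):
    # Random-walk Metropolis over log(lengthscale); flat prior on the log scale.
    rng = np.random.default_rng(seed)
    log_ell = 0.0
    ll = log_marginal_likelihood(X, y, np.exp(log_ell))
    samples = []
    for _ in range(n_steps):
        prop = log_ell + step * rng.standard_normal()
        ll_prop = log_marginal_likelihood(X, y, np.exp(prop))
        if np.log(rng.uniform()) < ll_prop - ll:   # Metropolis accept/reject
            log_ell, ll = prop, ll_prop
        samples.append(np.exp(log_ell))
    return np.array(samples)

rng = np.random.default_rng(1)
X = np.linspace(0.0, 1.0, 30)
y = np.sin(2 * np.pi * X) + 0.1 * rng.standard_normal(30)
ell = metropolis_lengthscale(X, y)
posterior_mean = ell[500:].mean()   # discard burn-in
```

For a few thousand steps and a handful of hyperparameters this is cheap; the cost the abstract highlights comes from repeating the O(n³) Cholesky factorization at every MCMC step as the training set grows.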

2021 ◽  
pp. 1-11
Author(s):  
Piyush Pandita ◽  
Panagiotis Tsilifis ◽  
Sayan Ghosh ◽  
Liping Wang

Abstract Gaussian Process (GP) regression, or kriging, has been extensively applied in the engineering literature for building cheap-to-evaluate surrogates, within the contexts of multi-fidelity modeling, model calibration, and design optimization. With the ongoing automation of manufacturing and industrial practices as part of Industry 4.0, there has been a greater need to advance GP regression techniques to handle challenges such as high input dimensionality, data paucity, and big-data problems; responses to these challenges consist primarily of efficient designs of experiments, optimal data acquisition strategies, and other mathematical tricks. In this work, our attention is focused on the challenge of efficiently training a GP model, which, in the authors' opinion, has attracted very little attention and is to date poorly addressed. The performance of widely used training approaches such as maximum likelihood estimation and Markov Chain Monte Carlo (MCMC) sampling can deteriorate significantly in high-dimensional and big-data problems, leading to cost-inefficient implementations in many industrial applications where efficiency is of critical importance. Here, we compare an Adaptive Sequential Monte Carlo (ASMC) sampling algorithm to classic MCMC sampling strategies, and we demonstrate the effectiveness of our implementation on several mathematical problems and challenging industry applications of varying complexity. The computational time savings of our ASMC approach manifest in large-scale problems, helping us push the boundary of GP regression applicability and scalability in various domains of Industry 4.0, including but not limited to design automation, design engineering, predictive maintenance, and supply chain manufacturing.
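The adaptive tempering at the heart of ASMC-style samplers can be sketched on a one-dimensional toy posterior (the real target would be the GP hyperparameter posterior; the Gaussian prior, likelihood, ESS threshold, and move kernel here are all illustrative assumptions). The likelihood's power beta is raised from 0 to 1, with each increment chosen by bisection so the effective sample size stays near a target:

```python
import numpy as np

rng = np.random.default_rng(0)

def log_lik(theta):
    # Toy log-likelihood pulling the posterior toward 2.0 (illustration only).
    return -0.5 * ((theta - 2.0) / 0.5) ** 2

def adaptive_smc(n=2000, ess_target=0.5):
    theta = rng.normal(0.0, 3.0, n)        # particles drawn from the prior
    beta = 0.0
    while beta < 1.0:
        ll = log_lik(theta)

        def ess_at(b):
            # Effective sample size of the incremental weights at power b.
            w = np.exp((b - beta) * (ll - ll.max()))
            return w.sum() ** 2 / (w ** 2).sum()

        if ess_at(1.0) >= ess_target * n:
            beta_new = 1.0                 # final step reaches the target
        else:
            lo, hi = beta, 1.0
            for _ in range(30):            # bisect on the next temperature
                mid = 0.5 * (lo + hi)
                lo, hi = (mid, hi) if ess_at(mid) >= ess_target * n else (lo, mid)
            beta_new = lo
        w = np.exp((beta_new - beta) * (ll - ll.max()))
        w /= w.sum()
        theta = theta[rng.choice(n, n, p=w)]   # multinomial resampling
        beta = beta_new
        # One random-walk Metropolis move per particle to restore diversity,
        # targeting prior * likelihood^beta.
        prop = theta + 0.3 * rng.standard_normal(n)
        log_acc = beta * (log_lik(prop) - log_lik(theta)) \
            + 0.5 * ((theta / 3.0) ** 2 - (prop / 3.0) ** 2)
        theta = np.where(np.log(rng.uniform(size=n)) < log_acc, prop, theta)
    return theta

samples = adaptive_smc()
```

Because the particle updates within each tempering stage are independent, this structure parallelizes naturally, which is where the large-scale time savings claimed in the abstract come from.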


2000 ◽  
Vol 12 (4) ◽  
pp. 955-993 ◽  
Author(s):  
J. F. G. de Freitas ◽  
M. Niranjan ◽  
A. H. Gee ◽  
A. Doucet

We discuss a novel strategy for training neural networks using sequential Monte Carlo algorithms and propose a new hybrid gradient descent/sampling importance resampling algorithm (HySIR). In terms of computational time and accuracy, the hybrid SIR is a clear improvement over conventional sequential Monte Carlo techniques. The new algorithm may be viewed as a global optimization strategy that allows us to learn the probability distributions of the network weights and outputs in a sequential framework. It is well suited to applications involving on-line, nonlinear, and non-Gaussian signal processing. We show how the new algorithm outperforms extended Kalman filter training on several problems. In particular, we address the problem of pricing option contracts traded in financial markets. In this context, we are able to estimate the one-step-ahead probability density functions of the option prices.
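The hybrid gradient/SIR idea can be illustrated on a one-parameter "network". This is not the authors' HySIR implementation: the model y = tanh(w·x), the learning rate, and the jitter are illustrative assumptions. Each particle first takes a small gradient step on the newest observation (the hybrid part), then the cloud is reweighted and resampled (the SIR part):

```python
import numpy as np

rng = np.random.default_rng(0)

true_w, noise_sd, lr = 1.5, 0.1, 0.05
n_particles, n_obs = 500, 40

x = rng.uniform(-2.0, 2.0, n_obs)
y = np.tanh(true_w * x) + noise_sd * rng.standard_normal(n_obs)

w = rng.normal(0.0, 2.0, n_particles)      # initial particle cloud over the weight
for t in range(n_obs):
    # Gradient descent step on the squared error of the new observation;
    # d/dw tanh(w x) = (1 - tanh^2(w x)) * x.
    pred = np.tanh(w * x[t])
    w = w + lr * (y[t] - pred) * (1.0 - pred ** 2) * x[t]
    # Importance weights from the Gaussian observation likelihood.
    logw = -0.5 * ((y[t] - np.tanh(w * x[t])) / noise_sd) ** 2
    p = np.exp(logw - logw.max())
    p /= p.sum()
    w = w[rng.choice(n_particles, n_particles, p=p)]   # resampling
    w += 0.02 * rng.standard_normal(n_particles)       # jitter against collapse

estimate = w.mean()
```

The gradient nudge concentrates particles in high-likelihood regions before resampling, which is the mechanism behind the accuracy/time improvement the abstract reports over plain SIR.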


2014 ◽  
Vol 501-504 ◽  
pp. 1067-1070
Author(s):  
Li Feng Peng ◽  
Guo Shao Su ◽  
Wei Zhao

The performance function of a large-scale, complicated engineering structure is usually highly nonlinear and implicit, and its reliability must be evaluated through a time-consuming Finite Element Method (FEM) analysis. A new method, a Gaussian process classification (GPC) dynamic response surface based on Monte Carlo Simulation (MCS), is proposed. Small training samples are created using FEM and a Markov chain. The most probable point (MPP) is then predicted quickly using MCS without any extra FEM analysis. Furthermore, an iterative algorithm is presented that reduces the errors of the GPC by using information from the MPP to constantly improve the reconstruction precision. The Monte Carlo method combined with the GPC surface is then applied to obtain the probability of failure. Results from several examples demonstrate the efficiency and robustness of the proposed method compared with common reliability methods.
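The MCS side of the method can be sketched with an analytic limit state standing in for the expensive FEM evaluations and their GPC surrogate (the function g, the input distribution, and the sample size are illustrative assumptions). Failure is g(x) < 0; the failure probability and an MCS-based MPP estimate come from the same sample population:

```python
import numpy as np

rng = np.random.default_rng(0)

def g(x):
    # Illustrative linear limit state in standard normal space; the exact
    # failure probability is P(x1 + x2 > 3) = Phi(-3/sqrt(2)) ~ 0.017.
    return 3.0 - x.sum(axis=-1)

n = 200_000
X = rng.standard_normal((n, 2))   # standard normal input variables
fail = g(X) < 0.0
pf = fail.mean()                  # Monte Carlo estimate of the failure probability

# Most probable point (MPP): the failed sample closest to the origin in
# standard normal space, read directly off the same MCS population.
mpp = X[fail][np.argmin(np.linalg.norm(X[fail], axis=1))]
```

In the paper's setting each g evaluation would be an FEM run, which is why the cheap GPC surrogate is trained first and the raw MCS loop is run on the surrogate instead.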


2020 ◽  
pp. 1471082X2091608 ◽  
Author(s):  
Danilo Alvares ◽  
Carmen Armero ◽  
Anabel Forte ◽  
Nicolas Chopin

The statistical analysis of the information generated by medical follow-up is a very important challenge in the field of personalized medicine. As a patient's disease evolves, medical follow-up generates more and more information that should be processed immediately in order to review and update the prognosis and treatment. Hence, we focus on this update process through sequential inference methods for joint models of longitudinal and time-to-event data from a Bayesian perspective. More specifically, we propose the use of sequential Monte Carlo (SMC) methods for static-parameter joint models with the intention of reducing the computational time of each update of the full Bayesian inferential process. Our proposal is very general and can be easily applied to most popular joint modelling approaches. We illustrate the use of the presented sequential methodology in a joint model with competing risk events for a real scenario involving patients on mechanical ventilation in intensive care units (ICUs).
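The sequential-update idea can be illustrated with importance reweighting of static-parameter particles as follow-up measurements arrive. This is a toy Gaussian biomarker model, not the joint longitudinal/survival model of the paper; all numbers are illustrative assumptions. The point is that each new visit only adds a weight increment, with no refitting from scratch:

```python
import numpy as np

rng = np.random.default_rng(0)

n = 5000
theta = rng.normal(0.0, 2.0, n)   # prior particles for the patient's mean level
logw = np.zeros(n)                # running log importance weights

true_mean, sd = 1.2, 0.5
for visit in range(10):
    y = true_mean + sd * rng.standard_normal()        # new follow-up measurement
    logw += -0.5 * ((y - theta) / sd) ** 2            # incremental likelihood update

w = np.exp(logw - logw.max())
w /= w.sum()
posterior_mean = (w * theta).sum()   # current posterior summary for the prognosis
```

A full SMC implementation would interleave resampling and move steps when the weights degenerate, exactly as the methods cited in the abstract prescribe; this sketch keeps only the reweighting core.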


2015 ◽  
Vol 733 ◽  
pp. 156-160
Author(s):  
Xia Yan ◽  
Jun Li ◽  
Hui Zhao

A novel and simple parameterization method using an ensemble of unconditional model realizations is applied to reduce the dimension of the misfit objective function in large-scale history matching problems. The major advantage of this parameterization method is that the singular value decomposition (SVD) calculation is avoided entirely, which saves the time and cost of huge matrix decompositions and eigenvector computations in the parameterization process. After the objective function is transformed from a higher-dimensional to a lower-dimensional domain by the parameterization, a Monte Carlo approach is introduced to evaluate the gradient information in the lower-dimensional domain. Unlike adjoint-gradient algorithms, the gradient in our method is estimated by a stochastic Monte Carlo method, which can be easily coupled with different numerical simulators and avoids complicated adjoint code. Once the estimated gradient information is obtained, any gradient-based algorithm can be used to optimize the objective function. The Monte Carlo algorithm combined with the parameterization method is applied to the Brugge reservoir field. The results show that our method gives a good estimation of reservoir properties and decreases the geological uncertainty without SVD, with a lower final objective function value, providing a more efficient and useful way to perform history matching in large-scale fields.
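The simulator-agnostic stochastic gradient can be sketched with random directional perturbations: averaging directional finite differences over random directions u recovers the gradient without any adjoint code. An analytic quadratic misfit stands in for the reservoir simulator here, and the ensemble size, perturbation size, and step size are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

def objective(m):
    # Toy misfit standing in for the history-matching objective; each call
    # would be one (expensive) forward simulation in practice.
    return ((m - 1.0) ** 2).sum()

def mc_gradient(f, m, n_ens=50, eps=1e-2):
    # E[u * (f(m + eps*u) - f(m)) / eps] over standard normal directions u
    # approximates the gradient of f at m, since E[u u^T] = I.
    g = np.zeros_like(m)
    f0 = f(m)
    for _ in range(n_ens):
        u = rng.standard_normal(m.shape)
        g += u * (f(m + eps * u) - f0) / eps
    return g / n_ens

m = np.zeros(5)                      # reduced-dimension parameter vector
for _ in range(200):                 # any gradient-based optimizer would do
    m -= 0.05 * mc_gradient(objective, m)
```

Each gradient estimate costs n_ens + 1 forward runs and nothing else, which is what makes the approach easy to couple with an arbitrary black-box simulator.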


2021 ◽  
Vol 2 ◽  
Author(s):  
Lawrence M. Murray ◽  
Sumeetpal S. Singh ◽  
Anthony Lee

Abstract Monte Carlo algorithms simulate a prescribed number of samples, taking a random amount of real time to complete the necessary computations. This work considers the converse: to impose a real-time budget on the computation, so that the number of samples simulated becomes random. To complicate matters, the real time taken for each simulation may depend on the sample produced, so that the samples themselves are not independent of their number, and a length bias with respect to compute time appears. This is especially problematic when a Markov chain Monte Carlo (MCMC) algorithm is used and the final state of the Markov chain, rather than an average over all states, is required, as is the case in parallel tempering implementations of MCMC. The length bias does not diminish with the compute budget in this case. It also occurs in sequential Monte Carlo (SMC) algorithms, which are the focus of this paper. We propose an anytime framework to address this concern, using a continuous-time Markov jump process to study the progress of the computation in real time. We first show that for any MCMC algorithm, the length bias of the final state's distribution due to the imposed real-time computing budget can be eliminated by using a multiple-chain construction. The utility of this construction is then demonstrated on a large-scale SMC^2 implementation, using four billion particles distributed across a cluster of 128 graphics processing units on the Amazon EC2 service. The anytime framework imposes a real-time budget on the MCMC move steps within the SMC^2 algorithm, ensuring that all processors are simultaneously ready for the resampling step, demonstrably reducing idleness due to waiting times and providing substantial control over the total compute budget.
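The length bias the paper addresses can be reproduced in a few lines. Consider a two-state Markov chain with uniform stationary distribution, where simulating a step from state 0 costs 1s of compute and from state 1 costs 3s (the chain, costs, and budget are illustrative assumptions, not the paper's experiment). Under a real-time budget, the final state is over-represented in proportion to its compute cost, roughly 3/(1+3) = 0.75 for state 1 instead of the stationary 0.5:

```python
import numpy as np

rng = np.random.default_rng(0)

cost = np.array([1.0, 3.0])    # per-step compute time depends on the state
budget, n_runs = 200.0, 2000
final_states = []
for _ in range(n_runs):
    state, t = int(rng.integers(2)), 0.0
    while t + cost[state] <= budget:   # stop when the next step would overrun
        t += cost[state]               # pay the compute time for this step
        if rng.uniform() < 0.5:        # symmetric transition kernel
            state = 1 - state
    final_states.append(state)

biased_freq = float(np.mean(final_states))   # fraction of runs ending in state 1
```

Increasing the budget does not remove the bias, it only makes the ~0.75 estimate sharper, which is the phenomenon the anytime multiple-chain construction is designed to eliminate.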

