Concurrent Surrogate Model Selection (COSMOS) Based on Predictive Estimation of Model Fidelity

Author(s):  
Souma Chowdhury ◽  
Ali Mehmani ◽  
Achille Messac

One of the primary drawbacks plaguing wider acceptance of surrogate models is their low fidelity in general. This issue can be in a large part attributed to the lack of automated model selection techniques, particularly ones that do not make limiting assumptions regarding the choice of model types and kernel types. A novel model selection technique was recently developed to perform optimal model search concurrently at three levels: (i) optimal model type (e.g., RBF), (ii) optimal kernel type (e.g., multiquadric), and (iii) optimal values of hyper-parameters (e.g., shape parameter) that are conventionally kept constant. The error measures to be minimized in this optimal model selection process are determined by the Predictive Estimation of Model Fidelity (PEMF) method, which has been shown to be significantly more accurate than typical cross-validation-based error metrics. In this paper, we make the following important advancements to the PEMF-based model selection framework, now called the Concurrent Surrogate Model Selection or COSMOS framework: (i) The optimization formulation is modified through binary coding to allow surrogates with differing numbers of candidate kernels and kernels with differing numbers of hyper-parameters (which was previously not allowed). (ii) A robustness criterion, based on the variance of errors, is added to the existing criteria for model selection. (iii) A larger candidate pool of 16 surrogate-kernel combinations is considered for selection — possibly making COSMOS one of the most comprehensive surrogate model selection framework (in theory and implementation) currently available. The effectiveness of the COSMOS framework is demonstrated by successfully applying it to four benchmark problems (with 2–30 variables) and an airfoil design problem. The optimal model selection results illustrate how diverse models provide important tradeoffs for different problems.

2010 ◽  
Vol 19 (17) ◽  
pp. 3603-3619 ◽  
Author(s):  
A. J. SHIRK ◽  
D. O. WALLIN ◽  
S. A. CUSHMAN ◽  
C. G. RICE ◽  
K. I. WARHEIT

AIChE Journal ◽  
2017 ◽  
Vol 64 (3) ◽  
pp. 822-834 ◽  
Author(s):  
Hong Zhao ◽  
Chunhui Zhao ◽  
Chengxia Yu ◽  
Eyal Dassau

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Mona Fuhrländer ◽  
Sebastian Schöps

Abstract In this paper an efficient and reliable method for stochastic yield estimation is presented. Since one main challenge of uncertainty quantification is the computational feasibility, we propose a hybrid approach where most of the Monte Carlo sample points are evaluated with a surrogate model, and only a few sample points are reevaluated with the original high fidelity model. Gaussian process regression is a non-intrusive method which is used to build the surrogate model. Without many prerequisites, this gives us not only an approximation of the function value, but also an error indicator that we can use to decide whether a sample point should be reevaluated or not. For two benchmark problems, a dielectrical waveguide and a lowpass filter, the proposed methods outperform classic approaches.


2021 ◽  
Author(s):  
Yuanzhi Liu ◽  
Jie Zhang

Abstract Vehicle velocity forecasting plays a critical role in scheduling the operations of varying systems and devices in a passenger vehicle. This paper first generates a repeated urban driving cycle dataset at a fixed route in the Dallas area, aiming to contribute to the improvement of vehicle energy efficiency for commuting routes. The generated driving cycles are divided into cycle segments based on intersection/stop identification, deceleration and reacceleration identification, and waiting time estimation, which could be used for better evaluating the effectiveness of model localization. Then, a segment-based vehicle velocity forecasting model is developed, where a machine learning model is trained/developed at each segment, using the hidden Markov chain (HMM) model and long short-term memory network (LSTM). To further improve the forecasting accuracy, a localized model selection framework is developed, which can dynamically choose a forecasting model (i.e., HMM or LSTM) for each segment. Results show that (i) the segment-based forecast could improve the forecasting accuracy by up to 24%, compared the whole cycle-based forecast; and (ii) the localized model selection framework could further improve the forecasting accuracy by 6.8%, compared to the segment-based LSTM model. Moreover, the potential of leveraging the stopping location at an intersection to estimate the waiting time is also evaluated in this study.


Economies ◽  
2020 ◽  
Vol 8 (2) ◽  
pp. 49 ◽  
Author(s):  
Waqar Badshah ◽  
Mehmet Bulut

Only unstructured single-path model selection techniques, i.e., Information Criteria, are used by Bounds test of cointegration for model selection. The aim of this paper was twofold; one was to evaluate the performance of these five routinely used information criteria {Akaike Information Criterion (AIC), Akaike Information Criterion Corrected (AICC), Schwarz/Bayesian Information Criterion (SIC/BIC), Schwarz/Bayesian Information Criterion Corrected (SICC/BICC), and Hannan and Quinn Information Criterion (HQC)} and three structured approaches (Forward Selection, Backward Elimination, and Stepwise) by assessing their size and power properties at different sample sizes based on Monte Carlo simulations, and second was the assessment of the same based on real economic data. The second aim was achieved by the evaluation of the long-run relationship between three pairs of macroeconomic variables, i.e., Energy Consumption and GDP, Oil Price and GDP, and Broad Money and GDP for BRICS (Brazil, Russia, India, China and South Africa) countries using Bounds cointegration test. It was found that information criteria and structured procedures have the same powers for a sample size of 50 or greater. However, BICC and Stepwise are better at small sample sizes. In the light of simulation and real data results, a modified Bounds test with Stepwise model selection procedure may be used as it is strongly theoretically supported and avoids noise in the model selection process.


2015 ◽  
Vol 80 (2) ◽  
pp. 397-407 ◽  
Author(s):  
Adrian V. Bell ◽  
Thomas E. Currie ◽  
Geoffrey Irwin ◽  
Christopher Bradbury

Migration is a key driver of human cultural and genetic evolution, with recent theoretical advances calling for work to accurately identify factors behind early colonization patterns. However, inferring prehistoric migration strategies is a controversial field of inquiry that largely relies on interpreting settlement chronologies and constructing plausible narratives around environmental factors. Model selection approaches, along with new statistical models that match the dynamic nature of colonization, offers a more rigorous framework to test competing theories. We demonstrate the utility of this approach by developing an Island-Level Model of Colonization adapted from epidemiology in a Bayesian model-selection framework. Using model selection techniques, we assess competing colonization theories of Near and Remote Oceania, showing that models of exploration angles and risk performed considerably better than models using inter-island distance, suggesting early seafarers were already adept at long-distance travel. These results are robust after artificially increasing the uncertainty around settlement times. We show how decades of thinking on colonization strategies can be brought together and assessed in one statistical framework, providing us with greater interpretive power to understand a fundamental feature of our past.


2021 ◽  
Author(s):  
Delnavaz Mobedpour

With the proliferation of web services, the selection process, especially the one based on the non-functional properties (e.g. Quality of Service – QoS attributes) has become a more and more important step to help requestors locate a desired service. There have been many research works proposing various QoS description languages and selection models. However, the end user is not generally the focal point of their designs and the user support is either missing or lacking in these systems. The QoS language sometimes is not flexible enough to accommodate users’ various requirements and is too complex so that it puts extra burden on users. In order to solve this problem, in this thesis we design a more expressive and flexible QoS query language (QQL) targeted for non-expert users, together with the user support on formulating queries and understanding services in the registry. An enhanced selection model based on Mixed Integer Programming (MIP) is also proposed to handle the QQL queries. We performed experiments with a real QoS dataset to show the effectiveness of our framework.


Author(s):  
S. O. Ongbali ◽  
A. C. Igboanugo ◽  
S. A. Afolalu ◽  
M. O. Udo ◽  
I. P. Okokpujie

Sign in / Sign up

Export Citation Format

Share Document