Benchmarking segmentation results using a Markov model and a Bayes information criterion

Author(s):  
Fionn D. Murtagh ◽  
Xiaoyu Qiao ◽  
Danny Crookes ◽  
Paul Walsh ◽  
P. A. M. Basheer ◽  
...  
Risks ◽  
2020 ◽  
Vol 9 (1) ◽  
pp. 5
Author(s):  
Karim Barigou ◽  
Stéphane Loisel ◽  
Yahia Salhi

Predicting the evolution of mortality rates plays a central role for life insurance and pension funds. Standard single population models typically suffer from two major drawbacks: on the one hand, they use a large number of parameters compared to the sample size and, on the other hand, model choice is still often based on in-sample criterion, such as the Bayes information criterion (BIC), and therefore not on the ability to predict. In this paper, we develop a model based on a decomposition of the mortality surface into a polynomial basis. Then, we show how regularization techniques and cross-validation can be used to obtain a parsimonious and coherent predictive model for mortality forecasting. We analyze how COVID-19-type effects can affect predictions in our approach and in the classical one. In particular, death rates forecasts tend to be more robust compared to models with a cohort effect, and the regularized model outperforms the so-called P-spline model in terms of prediction and stability.


Author(s):  
C. Olivier ◽  
T. Paquet ◽  
M. Avila ◽  
Y. Lecourtier

The aim of this study is to show that the optimal order of Markov Model of cursive words can be rigorously stated in order to fit the structural properties of the observed data using Akaike information criterion. The method has been tested on French Postal check amounts up to order 4. An original structural representation of cursive words based on graphemes is used. The conditional probability to have a word model given an observed sequence of graphemes is computed independently of the length of the sequence. The recognition results obtained confirm the optimal order found using Akaike criterion.


2017 ◽  
Author(s):  
Lekha Patel ◽  
Nils Gustafsson ◽  
Yu Lin ◽  
Raimund Ober ◽  
Ricardo Henriques ◽  
...  

AbstractFluorescing molecules (fluorophores) that stochastically switch between photon-emitting and dark states underpin some of the most celebrated advancements in super-resolution microscopy. While this stochastic behavior has been heavily exploited, full characterization of the underlying models can potentially drive forward further imaging methodologies. Under the assumption that fluorophores move between fluorescing and dark states as continuous time Markov processes, the goal is to use a sequence of images to select a model and estimate the transition rates. We use a hidden Markov model to relate the observed discrete time signal to the hidden continuous time process. With imaging involving several repeat exposures of the fluorophore, we show the observed signal depends on both the current and past states of the hidden process, producing emission probabilities that depend on the transition rate parameters to be estimated. To tackle this unusual coupling of the transition and emission probabilities, we conceive transmission (transition-emission) matrices that capture all dependencies of the model. We provide a scheme of computing these matrices and adapt the forward-backward algorithm to compute a likelihood which is readily optimized to provide rate estimates. When confronted with several model proposals, combining this procedure with the Bayesian Information Criterion provides accurate model selection.


2020 ◽  
Vol 39 (1) ◽  
Author(s):  
Farshid Mehrdoust

This paper presents bid and ask formulas for cap and floor contracts prices byusing Wang transform under a Liouville fractional Vasicek (LfVasicek) interest rate model. To do this, the parameters of the model are calibrated by using the Newton-Raphson (NR) method. Then the standard and Liouville fractional versions of the Vasicek model are compared by the Bayes information criterion (BIC). Finally, we obtain the bid-ask boundaries for interest rate amount and cap and foor prices.


2020 ◽  
Vol 1 (2) ◽  
pp. 65
Author(s):  
Vieri Koerniawan ◽  
Nurtiti Sunusi ◽  
Raupong Raupong

The Poisson hidden Markov model is a model that consists of two parts. The first part is the cause of events that are hidden or cannot be observed directly and form a Markov chain, while the second part is the process of observation or observable parts that depend on the cause of the event and following the Poisson distribution. The Poisson hidden Markov model parameters are estimated using the Maximum Likelihood Estimator (MLE). But it is difficult to find analytical solutions from the ln-likelihood function. Therefore, the Expectation Maximization (EM) algorithm is used to obtain its numerical solutions which are then applied to life insurance data. The best model is obtained with 2 states or m = 2 based on the smallest Bayesian Information Criterion (BIC) value of 338,778 and the average predicted number of claims arrivals is 0.385 per day.


2020 ◽  
Author(s):  
Sarah Kristine Nørgaard ◽  
Kristoffer Linder-Steinlein ◽  
Anders Ulrik Eliasen ◽  
Jakob Stokholm ◽  
Bo L. Chawez ◽  
...  

Integration of unstructured and very diverse data is often required for a deeper understanding of complex biological systems. In order to uncover communalities between heterogeneous data, the data is often harmonized by constructing a kernel and numerical integration is performed. In this study we propose a method for data integration in the framework of an undirected graphical model, where the nodes represent individual data sources of varying nature in terms of complexity and underlying distribution, and where the edges represent the partial correlation between two blocks of data. We propose a modified GLASSO for estimation of the graph, with a combination of cross-validation and extended Bayes Information Criterion for sparsity tuning. Furthermore, hierarchical clustering on the weighted consensus kernels from a fixed network is used to partitioning the samples into different classes. Simulations show increasing ability to uncover true edges with increasing sample size and signal to noise. Likewise, identification of non existing edges towards disconnected nodes is feasible. The framework is demonstrated for integration of longitudinal symptom burden data from the 2nd and 3rd year of life with 21 diseases precursors as well as the development of asthma and eczema at the age of 6 years from 403 children from the COPSAC2010 mother-child cohort, suggesting that maternal predisposition as well as being born preterm indirectly lead to higher risk of asthma via increased respiratory symptom burden.


2015 ◽  
Vol 47 (2) ◽  
pp. 521-531 ◽  
Author(s):  
Javier Almorox ◽  
Jürgen Grieser

The Penman–Monteith equation (FAO-56) is accepted as the standard model for estimating reference evapotranspiration (ETo). However, the major obstacle to using FAO-56 widely is that it requires numerous climatic data. The Hargreaves–Samani (HS) method is frequently used for the calculation of ETo since it is based on measurements of daily minimum and maximum air temperature alone. Those are commonly recorded at many meteorological stations throughout the world. It is the objective of this paper to evaluate the quality of HS and calibrate the coefficients of this method for different climates as represented by the Köppen classification. Estimated values are compared with Penman–Monteith ETo values in terms of the coefficient of efficiency Ceff as well as the root mean square error, the mean absolute error and the Bayes information criterion. The Penman–Monteith equation for ETo (FAO-56) is based on physics and known to provide best estimates of ETo. The results of our work show that the correlation between long-term monthly means of HS and FAO-56 can be improved significantly by introducing climate-class specific coefficients.


Sign in / Sign up

Export Citation Format

Share Document