scholarly journals A general goodness-of-fit test for survival analysis

2017 ◽  
Author(s):  
Michael Holton Price ◽  
James Holland Jones

AbstractExisting goodness-of-fit tests for survival data are either exclusively graphical in nature or only test specific model assumptions, such as the proportional hazards assumption. We describe a flexible, parameter-free goodness-of-fit test that provides a simple numerical assessment of a model’s suitability regardless of the structure of the underlying model. Intuitively, the goodness-of-fit test utilizes the fact that for a good model early event occurrence is predicted to be just as likely as late event occurrence, whereas a bad model has a bias towards early or late events. Formally, the goodness-of-fit test is based on a novel generalized Martingale residual which we call the martingale survival residual. The martingale survival residual has a uniform probability density function defined on the interval −0.5 to +0.5 if censoring is either absent or accounted for as one outcome in a competing hazards framework. For a good model, the set of calculated residuals is statistically indistinguishable from the uniform distribution, which is tested using the Kolmogorov-Smirnov statistic.

1985 ◽  
Vol 13 (2) ◽  
pp. 552-563 ◽  
Author(s):  
Robert J. Gray ◽  
Donald A. Pierce

2020 ◽  
Vol 3 (1) ◽  
pp. 189-207
Author(s):  
Sandeep Samantaray ◽  
Abinash Sahoo

Abstract Estimating stream flow has a substantial financial influence, because this can be of assistance in water resources management and provides safety from scarcity of water and conceivable flood destruction. Four common statistical methods, namely, Normal, Gumbel max, Log-Pearson III (LP III), and Gen. extreme value method are employed for 10, 20, 30, 35, 40, 50, 60, 70, 75, 100, 150 years to forecast stream flow. Monthly flow data from four stations on Mahanadi River, in Eastern Central India, namely, Rampur, Sundargarh, Jondhra, and Basantpur, are used in the study. Results show that Gumbel max gives better flow discharge value than the Normal, LP III, and Gen. extreme value methods for all four gauge stations. Estimated flood values for Rampur, Sundargarh, Jondhra, and Basantpur stations are 372.361 m3/sec, 530.415 m3/sec, 2,133.888 m3/sec, and 3,836.22 m3/sec, respectively, considering Gumbel max. Goodness-of-fit tests for four statistical distribution techniques applied in the present study are also evaluated using Kolmogorov–Smirov, Anderson–Darling, Chi-squared tests at critical value 0.05 for the four proposed gauge stations. Goodness-of-fit test results show that Gen. extreme value gives best results at Rampur, Sundergarh, and Jondhra gauge stations followed by LP III, whereas LP III is the best fit for Basantpur, followed by Gen. extreme value.


Author(s):  
Naz Saud ◽  
Sohail Chand

A class of goodness of fit tests for Marshal-Olkin Extended Rayleigh distribution with estimated parameters is proposed. The tests are based on the empirical distribution function. For determination of asymptotic percentage points, Kolomogorov-Sminrov, Cramer-von-Mises, Anderson-Darling,Watson, and Liao-Shimokawa test statistic are used. This article uses Monte Carlo simulations to obtain asymptotic percentage points for Marshal-Olkin extended Rayleigh distribution. Moreover, power of the goodness of fit test statistics is investigated for this lifetime model against several alternatives.


2019 ◽  
Vol 22 (3) ◽  
pp. 207-222
Author(s):  
Kuangyu Wen ◽  
Ximing Wu

Summary We have developed a customizable goodness-of-fit test of a parametric density based on its distance to a consistently estimated density. This consistent estimate is obtained via a nonparametric density estimator with a parametric start, wherein the start is set to be the hypothesized parametric density. To cope with the influence of nonparametric estimation bias, nonparametric goodness-of-fit tests have resorted to remedies such as undersmoothing or convolution of the hypothesized density. Our test requires no such devices and possesses enhanced powers against alternative densities because the guided density estimator is free of the typical nonparametric bias under the null hypothesis and attains bias reduction when the underlying density is in a broad nonparametric neighborhood of the hypothesized density. Here, we establish the statistical properties of our test and use Monte Carlo simulations to demonstrate its finite sample performance. We use this test to examine the goodness-of-fit of normal mixtures to the distributions of log income of U.S. states. Although normality is rejected decisively, our results suggest that normal mixtures with two or three components suffice for all but one state.


2011 ◽  
Vol 150 (3) ◽  
pp. 319-334 ◽  
Author(s):  
L. D. B. SURIYAGODA ◽  
M. H. RYAN ◽  
H. LAMBERS ◽  
M. RENTON

SUMMARYThe present paper compares standard and novel methods for analysing aggregated patterns of plant death in designed field experiments; these methods include binomial (BN), beta-binomial (BBN), logistic-normal-binomial (LNB), BN models with random blocks, BN models with smooth-scale spatial components and principal coordinates of neighbour matrices (PCNM). PCNM is a relatively new technique used in ecology to determine how much observed variability can be explained by spatial and environmental variables, and has not yet been applied to agricultural studies. The survival data of two pasture species, collected from a designed field experiment that was replicated at multiple locations, were used. First, the occurrence of overdispersion was tested using the BN and BBN distributions. Goodness-of-fit tests proved that the BBN model provided a better description (better fit) of the observed data in some cases than did the BN distribution, indicating overdispersion was present. When overdispersion was not present, the BN distribution was adequate to describe the data, and the use of the BBN distribution was superfluous. It is then shown that the PCNM approach, the BN model with smooth-scale spatial components and the LNB model were able to account for some of the variation as spatial variability, thus reducing the species effect compared with that explained under the standard BN model. The amount of variation among species according to the BN model and the BN model with random blocks was similar. Therefore, it is argued that the novel PCNM approach warrants further testing when exploring the spatial variability in designed experiments in agriculture and using LNB, PCNM and BN with smooth-scale spatial components may provide better predictions of species effects than do other, more conventional, approaches.


Sign in / Sign up

Export Citation Format

Share Document