The performance of multiple imputation for missing covariate data within the context of regression relative survival analysis

Roch Giorgi; Aurélien Belot; Jean Gaudart; Guy Launoy

doi:10.1002/sim.3476

The performance of multiple imputation for missing covariate data within the context of regression relative survival analysis

Statistics in Medicine ◽

10.1002/sim.3476 ◽

2008 ◽

Vol 27 (30) ◽

pp. 6310-6331 ◽

Cited By ~ 21

Author(s):

Roch Giorgi ◽

Aurélien Belot ◽

Jean Gaudart ◽

Guy Launoy

Keyword(s):

Survival Analysis ◽

Multiple Imputation ◽

Relative Survival ◽

Covariate Data ◽

Missing Covariate Data


A stochastic multiple imputation algorithm for missing covariate data in tree-structured survival analysis

Statistics in Medicine ◽

10.1002/sim.4079 ◽

2010 ◽

Vol 29 (29) ◽

pp. 3004-3016 ◽

Cited By ~ 4

Author(s):

Meredith L. Wallace ◽

Stewart J. Anderson ◽

Sati Mazumdar

Keyword(s):

Survival Analysis ◽

Multiple Imputation ◽

Covariate Data ◽

Missing Covariate Data


Analysing Mark–Recapture–Recovery Data in the Presence of Missing Covariate Data Via Multiple Imputation

Journal of Agricultural, Biological, and Environmental Statistics ◽

10.1007/s13253-014-0184-z ◽

2014 ◽

Vol 20 (1) ◽

pp. 28-46 ◽

Cited By ~ 7

Author(s):

Hannah Worthington ◽

Ruth King ◽

Stephen T. Buckland

Keyword(s):

Multiple Imputation ◽

Mark Recapture ◽

Covariate Data ◽

Missing Covariate Data


First Use of Multiple Imputation with the National Tuberculosis Surveillance System

Epidemiology Research International ◽

10.1155/2013/875234 ◽

2013 ◽

Vol 2013 ◽

pp. 1-6 ◽

Cited By ~ 4

Author(s):

Christopher Vinnard ◽

E. Paul Wileyto ◽

Gregory P. Bisson ◽

Carla A. Winston

Keyword(s):

Multiple Imputation ◽

Odds Ratio ◽

Surveillance System ◽

High Rate ◽

Isoniazid Resistance ◽

Imputation Methods ◽

Covariate Data ◽

Missing Covariate Data ◽

National Tuberculosis ◽

Control And Prevention

Aims. The purpose of this study was to compare methods for handling missing data in analysis of the National Tuberculosis Surveillance System of the Centers for Disease Control and Prevention. Because of the high rate of missing human immunodeficiency virus (HIV) infection status in this dataset, we used multiple imputation methods to minimize the bias that may result from less sophisticated methods. Methods. We compared analysis based on multiple imputation methods with analysis based on deleting subjects with missing covariate data from regression analysis (case exclusion), and determined whether the use of increasing numbers of imputed datasets would lead to changes in the estimated association between isoniazid resistance and death. Results. Following multiple imputation, the odds ratio for initial isoniazid resistance and death was 2.07 (95% CI 1.30, 3.29); with case exclusion, this odds ratio decreased to 1.53 (95% CI 0.83, 2.83). The use of more than 5 imputed datasets did not substantively change the results. Conclusions. Our experience with the National Tuberculosis Surveillance System dataset supports the use of multiple imputation methods in epidemiologic analysis, but also demonstrates that close attention should be paid to the potential impact of missing covariates at each step of the analysis.
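The abstract above compares estimates pooled across imputed datasets. As a minimal sketch of how such pooling is commonly done, the following applies Rubin's rules to log odds ratios. The abstract does not state the pooling method or software the authors used, so the function name `pool_rubin` and all input numbers are hypothetical and for illustration only.

```python
import math


def pool_rubin(estimates, variances):
    """Pool m point estimates and their variances with Rubin's rules.

    estimates: one point estimate per imputed dataset (here, log odds ratios)
    variances: the squared standard error of each estimate
    Returns the pooled estimate and its total variance.
    """
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two estimates and matching variances")
    q_bar = sum(estimates) / m                               # pooled point estimate
    within = sum(variances) / m                              # within-imputation variance
    between = sum((q - q_bar) ** 2 for q in estimates) / (m - 1)  # between-imputation variance
    total = within + (1 + 1 / m) * between                   # total variance
    return q_bar, total


# Hypothetical log odds ratios and standard errors from m = 5 imputed datasets
# (made-up values, not taken from the study).
log_ors = [0.70, 0.75, 0.72, 0.68, 0.74]
std_errs = [0.24, 0.23, 0.25, 0.24, 0.23]

q, t = pool_rubin(log_ors, [s ** 2 for s in std_errs])
se = math.sqrt(t)

# Normal-approximation interval for simplicity; Rubin's t reference
# distribution with adjusted degrees of freedom is omitted here.
pooled_or = math.exp(q)
ci_low, ci_high = math.exp(q - 1.96 * se), math.exp(q + 1.96 * se)
print(f"pooled OR = {pooled_or:.2f}, approx. 95% CI ({ci_low:.2f}, {ci_high:.2f})")
```

Rerunning the same pooling with a larger number of imputed datasets is one way to check whether the pooled estimate is stable, which is the kind of comparison the abstract describes.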


Propensity Score Estimation Using Classification and Regression Trees in the Presence of Missing Covariate Data

Epidemiologic Methods ◽

10.1515/em-2017-0020 ◽

2018 ◽

Vol 7 (1) ◽

Cited By ~ 2

Author(s):

Bas B.L. Penning de Vries ◽

Maarten van Smeden ◽

Rolf H.H. Groenwold

Keyword(s):

Logistic Regression ◽

Missing Data ◽

Propensity Score ◽

Multiple Imputation ◽

Incomplete Data ◽

Regression Trees ◽

Covariate Data ◽

Missing Covariate Data ◽

Classification And Regression ◽

Cart Algorithm

Abstract. Data mining and machine learning techniques such as classification and regression trees (CART) represent a promising alternative to conventional logistic regression for propensity score estimation. Whereas incomplete data preclude the fitting of a logistic regression on all subjects, CART is appealing in part because some implementations allow for incomplete records to be incorporated in the tree fitting and provide propensity score estimates for all subjects. Based on theoretical considerations, we argue that the automatic handling of missing data by CART may, however, not be appropriate. Using a series of simulation experiments, we examined the performance of different approaches to handling missing covariate data: (i) applying the CART algorithm directly to the (partially) incomplete data, (ii) complete case analysis, and (iii) multiple imputation. Performance was assessed in terms of bias in estimating exposure-outcome effects among the exposed, standard error, mean squared error and coverage. Applying the CART algorithm directly to incomplete data resulted in bias, even in scenarios where data were missing completely at random. Overall, multiple imputation followed by CART resulted in the best performance. Our study showed that automatic handling of missing data in CART can cause serious bias and does not outperform multiple imputation as a means to account for missing data.
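The performance measures named in the abstract above (bias, standard error, mean squared error, coverage) can be computed from the results of repeated simulation runs. The sketch below shows one common way to do so. It assumes "standard error" means the empirical standard deviation of the estimates across runs and that coverage uses normal-approximation 95% intervals; the abstract does not specify either, and the function name `performance` is hypothetical.

```python
import math


def performance(estimates, std_errors, truth, z=1.96):
    """Summarise simulation results for one missing-data method.

    estimates:  effect estimate from each simulation run
    std_errors: model-based standard error reported in each run
    truth:      the true effect used to generate the data
    """
    n = len(estimates)
    mean_est = sum(estimates) / n
    bias = mean_est - truth
    # Empirical standard error: spread of the estimates across runs.
    empirical_se = math.sqrt(sum((e - mean_est) ** 2 for e in estimates) / (n - 1))
    # Mean squared error around the true value.
    mse = sum((e - truth) ** 2 for e in estimates) / n
    # Coverage: share of runs whose interval contains the truth.
    covered = sum(
        1 for e, s in zip(estimates, std_errors) if e - z * s <= truth <= e + z * s
    )
    return {
        "bias": bias,
        "empirical_se": empirical_se,
        "mse": mse,
        "coverage": covered / n,
    }
```

Calling `performance` once per approach (direct CART, complete case analysis, multiple imputation) on that approach's simulated estimates would give comparable summaries of the kind the abstract reports.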


Large sample results for frequentist multiple imputation for Cox regression with missing covariate data

Annals of the Institute of Statistical Mathematics ◽

10.1007/s10463-019-00716-4 ◽

2019 ◽

Vol 72 (4) ◽

pp. 969-996

Author(s):

Frank Eriksson ◽

Torben Martinussen ◽

Søren Feodor Nielsen

Keyword(s):

Multiple Imputation ◽

Cox Regression ◽

Large Sample ◽

Covariate Data ◽

Missing Covariate Data


Non-ignorable missing covariate data in survival analysis: a case-study of an International Breast Cancer Study Group trial

Journal of the Royal Statistical Society Series C (Applied Statistics) ◽

10.1046/j.1467-9876.2003.05168.x ◽

2004 ◽

Vol 53 (2) ◽

pp. 293-310 ◽

Cited By ~ 13

Author(s):

Amy H. Herring ◽

Joseph G. Ibrahim ◽

Stuart R. Lipsitz

Keyword(s):

Breast Cancer ◽

Survival Analysis ◽

Study Group ◽

Covariate Data ◽

Cancer Study ◽

Missing Covariate Data ◽

Cancer Study Group ◽

Breast Cancer Study ◽

Group Trial


Simultaneous confidence bands for nonparametric regression with missing covariate data

Annals of the Institute of Statistical Mathematics ◽

10.1007/s10463-021-00784-5 ◽

2021 ◽

Author(s):

Li Cai ◽

Lijie Gu ◽

Qihua Wang ◽

Suojin Wang

Keyword(s):

Nonparametric Regression ◽

Confidence Bands ◽

Covariate Data ◽

Missing Covariate Data ◽

Simultaneous Confidence Bands


Survival Analysis vs Longitudinal Modeling With Multiple Imputation—a False Dichotomy—Reply

JAMA Ophthalmology ◽

10.1001/jamaophthalmol.2021.0518 ◽

2021 ◽

Author(s):

Catey Bunce ◽

Dun Jack Fu ◽

Irene Stratton

Keyword(s):

Survival Analysis ◽

Multiple Imputation ◽

Longitudinal Modeling ◽

False Dichotomy


SP1-75 Multiple imputation and survival analysis: an example using cancer registry data

Journal of Epidemiology & Community Health ◽

10.1136/jech.2011.142976n.52 ◽

2011 ◽

Vol 65 (Suppl 1) ◽

pp. A395-A395

Author(s):

M. van Laar ◽

D. P. Stark ◽

R. G. Feltbower

Keyword(s):

Survival Analysis ◽

Multiple Imputation ◽

Cancer Registry ◽

Registry Data ◽

Cancer Registry Data


Missing covariate data within cancer prognostic studies: a review of current reporting and proposed guidelines

British Journal of Cancer ◽

10.1038/sj.bjc.6601907 ◽

2004 ◽

Vol 91 (1) ◽

pp. 4-8 ◽

Cited By ~ 121

Author(s):

A Burton ◽

D G Altman

Keyword(s):

Covariate Data ◽

Missing Covariate Data
