Assessing the adequacy of the logistic regression model for matched case-control studies

Suresh H. Moolgavkar; Edward D. Lustbader; David J. Venzon

doi:10.1002/sim.4780040404

Assessing the adequacy of the logistic regression model for matched case-control studies

Statistics in Medicine ◽

10.1002/sim.4780040404 ◽

1985 ◽

Vol 4 (4) ◽

pp. 425-435 ◽

Cited By ~ 11

Author(s):

Suresh H. Moolgavkar ◽

Edward D. Lustbader ◽

David J. Venzon

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model ◽

Case Control ◽

Case Control Studies ◽

Matched Case

Download Full-text

Goodness-of-Fit Tests for the Logistic Regression Model for Matched Case-Control Studies

Biometrical Journal ◽

10.1002/bimj.4710270506 ◽

1985 ◽

Vol 27 (5) ◽

pp. 511-520 ◽

Cited By ~ 2

Author(s):

David W. Hosmer ◽

Stanley Lemeshow

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Goodness Of Fit ◽

Logistic Regression Model ◽

Case Control ◽

Case Control Studies ◽

Goodness Of Fit Tests ◽

Matched Case

Download Full-text

Sparse estimation for case–control studies with multiple disease subtypes

Biostatistics ◽

10.1093/biostatistics/kxz063 ◽

2020 ◽

Author(s):

Nadim Ballout ◽

Cedric Garcia ◽

Vivian Viallon

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model ◽

Multinomial Logistic Regression ◽

Conditional Logistic Regression ◽

Case Control ◽

Case Control Studies ◽

Conditional Logistic Regression Model ◽

Symmetric Formulation ◽

Disease Subtypes

Summary The analysis of case–control studies with several disease subtypes is increasingly common, e.g. in cancer epidemiology. For matched designs, a natural strategy is based on a stratified conditional logistic regression model. Then, to account for the potential homogeneity among disease subtypes, we adapt the ideas of data shared lasso, which has been recently proposed for the estimation of stratified regression models. For unmatched designs, we compare two standard methods based on $L_1$-norm penalized multinomial logistic regression. We describe formal connections between these two approaches, from which practical guidance can be derived. We show that one of these approaches, which is based on a symmetric formulation of the multinomial logistic regression model, actually reduces to a data shared lasso version of the other. Consequently, the relative performance of the two approaches critically depends on the level of homogeneity that exists among disease subtypes: more precisely, when homogeneity is moderate to high, the non-symmetric formulation with controls as the reference is not recommended. Empirical results obtained from synthetic data are presented, which confirm the benefit of properly accounting for potential homogeneity under both matched and unmatched designs, in terms of estimation and prediction accuracy, variable selection and identification of heterogeneities. We also present preliminary results from the analysis of a case–control study nested within the EPIC (European Prospective Investigation into Cancer and nutrition) cohort, where the objective is to identify metabolites associated with the occurrence of subtypes of breast cancer.

Download Full-text

Testing goodness-of-fit of the logistic regression model in case–control studies using sample reweighting

Statistics in Medicine ◽

10.1002/sim.1997 ◽

2004 ◽

Vol 24 (1) ◽

pp. 121-130 ◽

Cited By ~ 15

Author(s):

Nico Nagelkerke ◽

Jeroen Smits ◽

Saskia le Cessie ◽

Hans van Houwelingen

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Goodness Of Fit ◽

Logistic Regression Model ◽

Case Control ◽

Case Control Studies

Download Full-text

Unconditional or Conditional Logistic Regression Model for Age-Matched Case–Control Data?

Frontiers in Public Health ◽

10.3389/fpubh.2018.00057 ◽

2018 ◽

Vol 6 ◽

Cited By ~ 20

Author(s):

Chia-Ling Kuo ◽

Yinghui Duan ◽

James Grady

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model ◽

Conditional Logistic Regression ◽

Case Control ◽

Control Data ◽

Conditional Logistic Regression Model ◽

Matched Case

Download Full-text

Logistic Regression for Matched Case-Control Studies

Applied Logistic Regression ◽

10.1002/0471722146.ch7 ◽

2005 ◽

pp. 223-259 ◽

Cited By ~ 2

Keyword(s):

Logistic Regression ◽

Case Control ◽

Case Control Studies ◽

Matched Case

Download Full-text

Testing goodness-of-fit of a logistic regression model with case–control data

Journal of Statistical Planning and Inference ◽

10.1016/s0378-3758(03)00207-6 ◽

2004 ◽

Vol 124 (2) ◽

pp. 409-422 ◽

Cited By ~ 3

Author(s):

K.F. Cheng ◽

L.C. Chen

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Goodness Of Fit ◽

Logistic Regression Model ◽

Case Control ◽

Control Data

Download Full-text

RE: “POLYCHOTOMOUS LOGISTIC REGRESSION METHODS FOR MATCHED CASE-CONTROL STUDIES WITH MULTIPLE CASE OR CONTROL GROUPS”

American Journal of Epidemiology ◽

10.1093/oxfordjournals.aje.a114990 ◽

1988 ◽

Vol 128 (2) ◽

pp. 445-446 ◽

Cited By ~ 5

Author(s):

Bruce Levin

Keyword(s):

Logistic Regression ◽

Case Control ◽

Case Control Studies ◽

Control Groups ◽

Regression Methods ◽

Multiple Case ◽

Matched Case

Download Full-text

Matched versus Unmatched Analysis of Matched Case-Control Studies

American Journal of Epidemiology ◽

10.1093/aje/kwab056 ◽

2021 ◽

Author(s):

Fei Wan ◽

Graham A Colditz ◽

Siobhan Sutcliffe

Keyword(s):

Logistic Regression ◽

Functional Form ◽

Conditional Logistic Regression ◽

Control Sample ◽

Target Population ◽

Case Control ◽

Model Specification ◽

Exact Matching ◽

Case Control Studies ◽

Matched Case

Abstract Although the need for addressing matching in the analysis of matched case-control studies is well established, debate remains as to the most appropriate analytic method when matching on at least one continuous factor. We compare the bias and efficiency of unadjusted and adjusted conditional logistic regression (CLR) and unconditional logistic regression (ULR) in the setting of both exact and non-exact matching. To demonstrate that case-control matching distorts the association between the matching variables and the outcome in the matched sample relative to the target population, we derive the logit model for the matched case-control sample under exact matching. We conduct simulations to validate our theoretical conclusions and to explore different ways of adjusting for the matching variables in CLR and ULR to reduce biases. When matching is exact, CLR is unbiased in all settings. When matching is not exact, unadjusted CLR tends to be biased and this bias increases with increasing matching caliper size. Spline smoothing of the matching variables in CLR can alleviate biases. Regardless of exact or non-exact matching, adjusted ULR is generally biased unless the functional form of the matched factors is modelled correctly. The validity of adjusted ULR is vulnerable to model specification error. CLR should remain the primary analytic approach.

Download Full-text

Assessing the Fit of the Logistic Regression Model to Individual Matched Sets of Case-Control Data

Biometrics ◽

10.2307/2533139 ◽

1996 ◽

Vol 52 (1) ◽

pp. 1 ◽

Cited By ~ 6

Author(s):

Edward J. Bedrick ◽

Joe R. Hill

Keyword(s):

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model ◽

Case Control ◽

Control Data

Download Full-text

Comparison of the Missing-Indicator Method and Conditional Logistic Regression in 1:m Matched Case-Control Studies with Missing Exposure Values

American Journal of Epidemiology ◽

10.1093/aje/kwh075 ◽

2004 ◽

Vol 159 (6) ◽

pp. 603-610 ◽

Cited By ~ 10

Author(s):

X. Li

Keyword(s):

Logistic Regression ◽

Conditional Logistic Regression ◽

Case Control ◽

Case Control Studies ◽

Indicator Method ◽

Matched Case

Download Full-text