A cross-validation procedure for general pedigrees and matched odds ratio fitness metric implemented for the multifactor dimensionality reduction pedigree disequilibrium test

Todd L. Edwards; Eric Torstensen; Scott Dudek; Eden R. Martin; Marylyn D. Ritchie

doi:10.1002/gepi.20447

Odds ratio based multifactor-dimensionality reduction method for detecting gene-gene interactions

Bioinformatics ◽

10.1093/bioinformatics/btl557 ◽

2006 ◽

Vol 23 (1) ◽

pp. 71-76 ◽

Cited By ~ 97

Author(s):

Y. Chung ◽

S. Y. Lee ◽

R. C. Elston ◽

T. Park

Keyword(s):

Dimensionality Reduction ◽

Odds Ratio ◽

Multifactor Dimensionality Reduction ◽

Reduction Method ◽

Gene Interactions ◽

Multifactor Dimensionality Reduction Method ◽

Dimensionality Reduction Method


The effect of reduction in cross-validation intervals on the performance of multifactor dimensionality reduction

Genetic Epidemiology ◽

10.1002/gepi.20166 ◽

2006 ◽

Vol 30 (6) ◽

pp. 546-555 ◽

Cited By ~ 35

Author(s):

Alison A. Motsinger ◽

Marylyn D. Ritchie

Keyword(s):

Dimensionality Reduction ◽

Multifactor Dimensionality Reduction ◽

Cross Validation


A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype

10.21203/rs.3.rs-90733/v2 ◽

2021 ◽

Author(s):

JungWun Lee ◽

Seungyeoun Lee

Keyword(s):

Dimensionality Reduction ◽

Regression Model ◽

Multifactor Dimensionality Reduction ◽

Cross Validation ◽

Gene Interactions ◽

Covariate Effect ◽

Indicator Variable ◽

Main Effect ◽

Simulation Results ◽

Better Than

Abstract Background: For gene-gene interaction analysis, the multifactor dimensionality reduction (MDR) method has been widely employed to reduce multi-levels of gene-gene interactions into high- or low-risk groups using a binary attribute. For the survival phenotype, the Cox-MDR method has been proposed using a martingale residual of a Cox model since Surv-MDR was first proposed using a log-rank test statistic. Recently, the KM-MDR method was proposed using the Kaplan-Meier median survival time as a classifier. All three methods used the cross-validation procedure to identify single nucleotide polymorphism (SNP) using SNP interactions among all possible SNP pairs. Furthermore, these methods require the permutation test to verify the significance of the selected SNP pairs. However, the unified model-based multifactor dimensionality reduction method (UM-MDR) overcomes this shortcoming of MDR by unifying the significance testing with the MDR algorithm within the framework of the regression model. Neither cross-validation nor permutation testing is required to identify SNP by SNP interactions in the UM-MDR method. The UM-MDR method comprises two steps: in the first step, multi-level genotypes are classified into high- or low-risk groups, and an indicator variable for the high-risk group is defined. In the second step, the significance of the indicator variable of the high-risk group is tested in the regression model included with other adjusting covariates. The Cox-UMMDR method was recently proposed by combining Cox-MDR with UM-MDR to identify gene-gene interactions associated with the survival phenotype. In this study, we propose two simple methods either by combining KM-MDR with UM-MDR, called KM-UMMDR or by modifying Cox-UMMDR by adjusting for the covariate effect in step 1, rather than in step 2, a process called Cox2-UMMDR. 
The KM-UMMDR method allows the covariate effect to be adjusted for in the regression model of step 2, although KM-MDR cannot adjust for the covariate effect in the classification procedure of step 1. In contrast, Cox2-UMMDR differs from Cox-UMMDR in the sense that the martingale residuals are obtained from a Cox model by adjusting for the covariate effect in step 1 of Cox2-UMMDR whereas Cox-UMMDR adjusts for the covariate effect in the regression model in step 2. We performed simulation studies to compare the power of several methods such as KM-UMMDR, Cox-UMMDR, Cox2-UMMDR, Cox-MDR, and KM-MDR by considering the effect of covariates and the marginal effect of SNPs. We also analyzed a real example of Korean leukemia patient data for illustration and a short discussion is provided. Results: In the simulation study, two different scenarios are considered: the first scenario compares the power of the cases with and without the covariate effect. The second scenario is to compare the power of cases with the main effect of SNPs versus without the main effect of SNPs. From the simulation results, Cox-UMMDR performs the best across all scenarios among KM-UMMDR, Cox2-UMMDR, Cox-MDR and KM-MDR. As expected, both Cox-UMMDR and Cox-MDR perform better than KM-UMMDR and KM-MDR when a covariate effect exists because the former adjusts for the covariate effect but the latter cannot. However, Cox2-UMMDR behaves similarly to KM-UMMDR and KM-MDR even though there is a covariate effect. This implies that the covariate effect would be more efficiently adjusted for in the regression model of the second step rather than under the classification procedure of the first step. When there is a main effect of any SNP, Cox-UMMDR, Cox2-UMMDR and KM-UMMDR perform better than Cox-MDR and KM-MDR if the main effects of SNPs are properly adjusted for in the regression model.
From the simulation results of two different scenarios, Cox-UMMDR seems to be the most robust when there is either a covariate effect to adjust for or an SNP that has a main effect on the survival phenotype. In addition, the power of all methods decreased as the censoring fraction increased from 0.1 to 0.3, but increased as heritability increased. The power of all methods seems to be greater under MAF = 0.2 than under MAF = 0.4. For illustration, both KM-UMMDR and Cox2-UMMDR were applied to a real dataset of Korean leukemia patients to identify SNP by SNP interactions associated with the survival phenotype. Conclusion: Both KM-UMMDR and Cox2-UMMDR were easily implemented by combining KM-MDR and Cox-MDR with UM-MDR, respectively, to detect significant gene-gene interactions associated with survival time without cross-validation and permutation testing. The simulation results demonstrate the utility of KM-UMMDR, Cox2-UMMDR and Cox-UMMDR, which outperform Cox-MDR and KM-MDR when some SNPs with only marginal effects might mask the detection of causal epistasis. In addition, Cox-UMMDR, Cox2-UMMDR and Cox-MDR performed better than KM-UMMDR and KM-MDR when there were potentially confounding covariate effects.
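
The first step described in this abstract (classifying multi-level genotypes into high- or low-risk groups and defining an indicator variable) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a binary case/control phenotype and the classic MDR rule (a genotype cell is high-risk when its case-to-control ratio exceeds the overall ratio), whereas the survival methods above classify cells using martingale residuals or Kaplan-Meier median survival times. The function name `high_risk_indicator` and the toy data are hypothetical.

```python
from collections import defaultdict


def high_risk_indicator(snp_a, snp_b, is_case):
    """Label each two-SNP genotype cell high- or low-risk (classic MDR rule).

    snp_a, snp_b: genotype codes (e.g. 0, 1, 2) per subject.
    is_case: 1 for cases, 0 for controls (binary phenotype is an assumption).
    Returns (indicator per subject, set of high-risk cells).
    """
    n_cases = sum(is_case)
    n_controls = len(is_case) - n_cases

    # counts[cell] = [number of controls, number of cases]
    counts = defaultdict(lambda: [0, 0])
    for a, b, y in zip(snp_a, snp_b, is_case):
        counts[(a, b)][y] += 1

    # High-risk when cases/controls in the cell exceeds the overall ratio.
    # Cross-multiplying avoids dividing by zero in cells with no controls.
    high_risk = {
        cell
        for cell, (controls, cases) in counts.items()
        if cases * n_controls > controls * n_cases
    }

    indicator = [1 if (a, b) in high_risk else 0 for a, b in zip(snp_a, snp_b)]
    return indicator, high_risk


# Hypothetical toy data: eight subjects, two SNPs.
snp_a = [0, 0, 1, 1, 2, 2, 0, 1]
snp_b = [0, 0, 1, 1, 0, 0, 1, 0]
is_case = [1, 1, 0, 0, 1, 0, 0, 1]
indicator, cells = high_risk_indicator(snp_a, snp_b, is_case)
print(indicator, sorted(cells))
```

In the second step, the resulting indicator would enter a regression model alongside the adjusting covariates, and its coefficient would be tested for significance; that step is omitted here because it depends on the chosen regression model.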


A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype

10.21203/rs.3.rs-90733/v1 ◽

2020 ◽

Author(s):

JungWun Lee ◽

Seungyeoun Lee

Keyword(s):

Dimensionality Reduction ◽

Regression Model ◽

Multifactor Dimensionality Reduction ◽

Cross Validation ◽

Gene Interactions ◽

Covariate Effect ◽

Indicator Variable ◽

Main Effect ◽

Simulation Results ◽

Better Than

Abstract Background: For gene-gene interaction analysis, the multifactor dimensionality reduction (MDR) method has been widely employed to reduce multi-levels of gene-gene interactions into high- or low-risk groups using a binary attribute. For the survival phenotype, the Cox-MDR method has been proposed using a martingale residual of a Cox model since Surv-MDR was first proposed using a log-rank test statistic. Recently, the KM-MDR method was proposed using the Kaplan-Meier median survival time as a classifier. All three methods used the cross-validation procedure to identify single nucleotide polymorphism (SNP) using SNP interactions among all possible SNP pairs. Furthermore, these methods require the permutation test to verify the significance of the selected SNP pairs. However, the unified model-based multifactor dimensionality reduction method (UM-MDR) overcomes this shortcoming of MDR by unifying the significance testing with the MDR algorithm within the framework of the regression model. Neither cross-validation nor permutation testing is required to identify SNP by SNP interactions in the UM-MDR method. The UM-MDR method comprises two steps: in the first step, multi-level genotypes are classified into high- or low-risk groups, and an indicator variable for the high-risk group is defined. In the second step, the significance of the indicator variable of the high-risk group is tested in the regression model included with other adjusting covariates. The Cox-UMMDR method was recently proposed by combining Cox-MDR with UM-MDR to identify gene-gene interactions associated with the survival phenotype. In this study, we propose two simple methods either by combining KM-MDR with UM-MDR, called KM-UMMDR or by modifying Cox-UMMDR by adjusting for the covariate effect in step 1, rather than in step 2, a process called Cox2-UMMDR.
The KM-UMMDR method allows the covariate effect to be adjusted for in the regression model of step 2, although KM-MDR cannot adjust for the covariate effect in the classification procedure of step 1. In contrast, Cox2-UMMDR differs from Cox-UMMDR in the sense that the martingale residuals are obtained from a Cox model by adjusting for the covariate effect in step 1 of Cox2-UMMDR whereas Cox-UMMDR adjusts for the covariate effect in the regression model in step 2. We performed simulation studies to compare the power of several methods such as KM-UMMDR, Cox-UMMDR, Cox2-UMMDR, Cox-MDR, and KM-MDR by considering the effect of covariates and the marginal effect of SNPs. We also analyzed a real example of Korean leukemia patient data for illustration and a short discussion is provided. Results: In the simulation study, two different scenarios are considered: the first scenario compares the power of the cases with and without the covariate effect. The second scenario is to compare the power of cases with the main effect of SNPs versus without the main effect of SNPs. From the simulation results, Cox-UMMDR performs the best across all scenarios among KM-UMMDR, Cox2-UMMDR, Cox-MDR and KM-MDR. As expected, both Cox-UMMDR and Cox-MDR perform better than KM-UMMDR and KM-MDR when a covariate effect exists because the former adjusts for the covariate effect but the latter cannot. However, Cox2-UMMDR behaves similarly to KM-UMMDR and KM-MDR even though there is a covariate effect. This implies that the covariate effect would be more efficiently adjusted for in the regression model of the second step rather than under the classification procedure of the first step. When there is a main effect of any SNP, Cox-UMMDR, Cox2-UMMDR and KM-UMMDR perform better than Cox-MDR and KM-MDR if the main effects of SNPs are properly adjusted for in the regression model.
From the simulation results of two different scenarios, Cox-UMMDR seems to be the most robust when there is either a covariate effect to adjust for or an SNP that has a main effect on the survival phenotype. In addition, the power of all methods decreased as the censoring fraction increased from 0.1 to 0.3, but increased as heritability increased. The power of all methods seems to be greater under MAF = 0.2 than under MAF = 0.4. For illustration, both KM-UMMDR and Cox2-UMMDR were applied to a real dataset of Korean leukemia patients to identify SNP by SNP interactions associated with the survival phenotype. Conclusion: Both KM-UMMDR and Cox2-UMMDR were easily implemented by combining KM-MDR and Cox-MDR with UM-MDR, respectively, to detect significant gene-gene interactions associated with survival time without cross-validation and permutation testing. The intensive simulation results demonstrate the utility of KM-UMMDR, Cox2-UMMDR and Cox-UMMDR, which outperform Cox-MDR and KM-MDR when some SNPs with only marginal effects might mask the detection of causal epistasis. In addition, Cox-UMMDR, Cox2-UMMDR and Cox-MDR performed better than KM-UMMDR and KM-MDR when there were potentially confounding covariate effects.


XRCC1 632 as a candidate for cancer predisposition via a complex interaction with genetic variants of base excision repair and double strand break repair genes

Future Oncology ◽

10.2217/fon-2019-0297 ◽

2019 ◽

Vol 15 (33) ◽

pp. 3845-3859 ◽

Cited By ~ 1

Author(s):

Amrita Singh ◽

Navneet Singh ◽

Digambar Behera ◽

Siddharth Sharma

Keyword(s):

Regression Analysis ◽

High Risk ◽

Dimensionality Reduction ◽

Odds Ratio ◽

Multifactor Dimensionality Reduction ◽

Excision Repair ◽

Five Factor Model ◽

Genetic Alterations ◽

Repair System ◽

Multifactor Dimensionality Reduction Analysis

Aim: The DNA repair system safeguards the integrity of DNA. Genetic alterations cause improper repair, which in conjunction with other factors ultimately results in carcinogenesis. Materials & methods: PCR-restriction fragment length polymorphism was used for genotyping, followed by statistical analysis using logistic regression, multifactor dimensionality reduction, and classification and regression tree analysis to assess the association with lung cancer. Results: The combination of XRCC1 632 and OGG1 326 showed an approximately eightfold higher risk (odds ratio: 7.92; 95% CI: 2.68–23.4; p = 0.0002; false discovery rate (FDR) p = 0.002). Similarly, XRCC1 632 and MUTYH 324 (odds ratio: 5.07; 95% CI: 2.6–9.67; p < 0.0001; FDR p = 0.002) showed a high risk. Multifactor dimensionality reduction analysis identified a five-factor model as the best model, with a prediction error of 0.37 (p = 0.02). Conclusion: There was a clear indication that high-order interactions played a major role in the study.


A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype

BioData Mining ◽

10.1186/s13040-021-00248-9 ◽

2021 ◽

Vol 14 (1) ◽

Author(s):

Jung Wun Lee ◽

Seungyeoun Lee

Keyword(s):

Dimensionality Reduction ◽

Regression Model ◽

Multifactor Dimensionality Reduction ◽

Cross Validation ◽

Gene Interactions ◽

Covariate Effect ◽

Indicator Variable ◽

Main Effect ◽

Simulation Results ◽

Better Than

Abstract Background: For gene-gene interaction analysis, the multifactor dimensionality reduction (MDR) method has been widely employed to reduce multi-levels of gene-gene interactions into high- or low-risk groups using a binary attribute. For the survival phenotype, the Cox-MDR method has been proposed using a martingale residual of a Cox model since Surv-MDR was first proposed using a log-rank test statistic. Recently, the KM-MDR method was proposed using the Kaplan-Meier median survival time as a classifier. All three methods used the cross-validation procedure to identify single nucleotide polymorphism (SNP) using SNP interactions among all possible SNP pairs. Furthermore, these methods require the permutation test to verify the significance of the selected SNP pairs. However, the unified model-based multifactor dimensionality reduction method (UM-MDR) overcomes this shortcoming of MDR by unifying the significance testing with the MDR algorithm within the framework of the regression model. Neither cross-validation nor permutation testing is required to identify SNP by SNP interactions in the UM-MDR method. The UM-MDR method comprises two steps: in the first step, multi-level genotypes are classified into high- or low-risk groups, and an indicator variable for the high-risk group is defined. In the second step, the significance of the indicator variable of the high-risk group is tested in the regression model included with other adjusting covariates. The Cox-UMMDR method was recently proposed by combining Cox-MDR with UM-MDR to identify gene-gene interactions associated with the survival phenotype. In this study, we propose two simple methods either by combining KM-MDR with UM-MDR, called KM-UMMDR or by modifying Cox-UMMDR by adjusting for the covariate effect in step 1, rather than in step 2, a process called Cox2-UMMDR.
The KM-UMMDR method allows the covariate effect to be adjusted for in the regression model of step 2, although KM-MDR cannot adjust for the covariate effect in the classification procedure of step 1. In contrast, Cox2-UMMDR differs from Cox-UMMDR in the sense that the martingale residuals are obtained from a Cox model by adjusting for the covariate effect in step 1 of Cox2-UMMDR whereas Cox-UMMDR adjusts for the covariate effect in the regression model in step 2. We performed simulation studies to compare the power of several methods such as KM-UMMDR, Cox-UMMDR, Cox2-UMMDR, Cox-MDR, and KM-MDR by considering the effect of covariates and the marginal effect of SNPs. We also analyzed a real example of Korean leukemia patient data for illustration and a short discussion is provided. Results: In the simulation study, two different scenarios are considered: the first scenario compares the power of the cases with and without the covariate effect. The second scenario is to compare the power of cases with the main effect of SNPs versus without the main effect of SNPs. From the simulation results, Cox-UMMDR performs the best across all scenarios among KM-UMMDR, Cox2-UMMDR, Cox-MDR and KM-MDR. As expected, both Cox-UMMDR and Cox-MDR perform better than KM-UMMDR and KM-MDR when a covariate effect exists because the former adjusts for the covariate effect but the latter cannot. However, Cox2-UMMDR behaves similarly to KM-UMMDR and KM-MDR even though there is a covariate effect. This implies that the covariate effect would be more efficiently adjusted for in the regression model of the second step rather than under the classification procedure of the first step. When there is a main effect of any SNP, Cox-UMMDR, Cox2-UMMDR and KM-UMMDR perform better than Cox-MDR and KM-MDR if the main effects of SNPs are properly adjusted for in the regression model.
From the simulation results of two different scenarios, Cox-UMMDR seems to be the most robust when there is either a covariate effect to adjust for or an SNP that has a main effect on the survival phenotype. In addition, the power of all methods decreased as the censoring fraction increased from 0.1 to 0.3, but increased as heritability increased. The power of all methods seems to be greater under MAF = 0.2 than under MAF = 0.4. For illustration, both KM-UMMDR and Cox2-UMMDR were applied to a real dataset of Korean leukemia patients to identify SNP by SNP interactions associated with the survival phenotype. Conclusion: Both KM-UMMDR and Cox2-UMMDR were easily implemented by combining KM-MDR and Cox-MDR with UM-MDR, respectively, to detect significant gene-gene interactions associated with survival time without cross-validation and permutation testing. The simulation results demonstrate the utility of KM-UMMDR, Cox2-UMMDR and Cox-UMMDR, which outperform Cox-MDR and KM-MDR when some SNPs with only marginal effects might mask the detection of causal epistasis. In addition, Cox-UMMDR, Cox2-UMMDR and Cox-MDR performed better than KM-UMMDR and KM-MDR when there were potentially confounding covariate effects.


The study on risk factors for diagnosis of metabolic syndrome and odds ratio using multifactor dimensionality reduction method

Journal of the Korean Data and Information Science Society ◽

10.7465/jkdi.2013.24.4.867 ◽

2013 ◽

Vol 24 (4) ◽

pp. 867-876 ◽

Cited By ~ 1

Author(s):

Mi-Hyun Jin ◽

Jea-Young Lee

Keyword(s):

Risk Factors ◽

Metabolic Syndrome ◽

Dimensionality Reduction ◽

Odds Ratio ◽

Multifactor Dimensionality Reduction ◽

Reduction Method ◽

Multifactor Dimensionality Reduction Method ◽

Dimensionality Reduction Method


CONTRIBUTION OF POLYMORPHIC VARIATION OF TP53 AND XRCC1 GENES TO THE SUSCEPTIBILITY TO BREAST CANCER FOR WOMEN OF KYRGYZ AND BELARUSIAN NATIONALITY - A COMPARATIVE STUDY ON MULTIFACTOR DIMENSIONALITY REDUCTION METHODS

Problems in oncology ◽

10.37469/0507-3758-2018-64-1-95-101 ◽

2018 ◽

Vol 64 (1) ◽

pp. 95-101

Author(s):

Nazira Aldasheva ◽

Vyacheslav Kipen ◽

Zhaynagul Isakova ◽

Sergey Melnov ◽

Raisa Smolyakova ◽

...

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Dimensionality Reduction ◽

Multifactor Dimensionality Reduction ◽

Rflp Analysis ◽

Kyrgyz Republic ◽

Increased Risk ◽

Polymorphic Variants ◽

The Republic

Using the multifactor dimensionality reduction method, we showed that the polymorphic variants p.Q399R (rs25487, XRCC1) and p.P72R (rs1042522, TP53) correlated with an increased risk of breast cancer in women from the Kyrgyz Republic and the Republic of Belarus. The study cohort included patients with clinically verified breast cancer: 117 women from the Kyrgyz Republic (nationality: Kyrgyz) and 169 from the Republic of Belarus (nationality: Belarusian). The comparison group (healthy individuals without a history of cancer at the time of blood sampling) included 102 individuals from the Kyrgyz Republic and 185 from the Republic of Belarus. Genotyping of the polymorphic variants p.Q399R (rs25487, XRCC1) and p.P72R (rs1042522, TP53) was done by PCR-RFLP. Intergenic interactions were analyzed with MDR 3.0.2 software. Both ethnic groups showed an increased breast cancer risk in the presence of the Gln allele of p.Q399R (XRCC1) in the heterozygous state: for the Kyrgyz group, OR = 2.78 (95% CI = [1.60-4.82]), p = 0.001; for the Belarusian group, OR = 1.85 (95% CI = [1.21-2.82]), p = 0.004. Carriers of the combination of alleles Gln (p.Q399R, XRCC1) and Pro (p.P72R, TP53) showed a statistically significant increase in breast cancer risk both among patients from the Kyrgyz Republic (OR = 2.89, 95% CI = [1.33-6.31]) and among patients from the Republic of Belarus (OR = 3.01, 95% CI = [0.79-11.56]).
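
Odds ratios with 95% confidence intervals of the kind quoted in this abstract are commonly derived from a 2×2 table of exposure by case status. The abstract does not say how its intervals were computed, so the following is only a sketch of one standard approach (a normal approximation on the log-odds scale, often called the Woolf method); the function name `odds_ratio_ci` and the example counts are hypothetical and are not taken from the study.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and approximate 95% CI from a 2x2 table.

    a: exposed cases,   b: exposed controls,
    c: unexposed cases, d: unexposed controls.
    All four counts must be positive for this approximation.
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the normal approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, for illustration only.
or_, lo, hi = odds_ratio_ci(20, 10, 10, 20)
print(f"OR = {or_:.2f}, 95% CI = [{lo:.2f}-{hi:.2f}]")
```

Because the interval is built on the log scale, it is symmetric around the odds ratio multiplicatively: the product of the two bounds equals the square of the odds ratio. An interval whose lower bound falls below 1 does not exclude the absence of an association at the 5% level.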


Multifactor-dimensionality reduction reveals interaction of important gene variants involved in allergy

International Journal of Immunogenetics ◽

10.1111/iji.12200 ◽

2015 ◽

Vol 42 (3) ◽

pp. 182-189 ◽

Cited By ~ 9

Author(s):

R. M. de Guia ◽

M. D. J. Echavez ◽

E. L. C. Gaw ◽

M. R. R. Gomez ◽

K. A. J. Lopez ◽

...

Keyword(s):

Dimensionality Reduction ◽

Multifactor Dimensionality Reduction ◽

Gene Variants


Exploring the Performance of Multifactor Dimensionality Reduction in Large Scale SNP Studies and in the Presence of Genetic Heterogeneity among Epistatic Disease Models

Human Heredity ◽

10.1159/000181157 ◽

2008 ◽

Vol 67 (3) ◽

pp. 183-192 ◽

Cited By ~ 24

Author(s):

Todd L. Edwards ◽

Kenneth Lewis ◽

Digna R. Velez ◽

Scott Dudek ◽

Marylyn D. Ritchie

Keyword(s):

Dimensionality Reduction ◽

Genetic Heterogeneity ◽

Multifactor Dimensionality Reduction ◽

Large Scale ◽

Disease Models
