scholarly journals The use of negative control outcomes in Mendelian randomization to detect potential population stratification

Author(s):  
Eleanor Sanderson ◽  
Tom G Richardson ◽  
Gibran Hemani ◽  
George Davey Smith

Abstract A key assumption of Mendelian randomization (MR) analysis is that there is no association between the genetic variants used as instruments and the outcome other than through the exposure of interest. One way in which this assumption can be violated is through population stratification, which can introduce confounding of the relationship between the genetic variants and the outcome and so induce an association between them. Negative control outcomes are increasingly used to detect unobserved confounding in observational epidemiological studies. Here we consider the use of negative control outcomes in MR studies to detect confounding of the genetic variants and the exposure or outcome. As a negative control outcome in an MR study, we propose the use of phenotypes which are determined before the exposure and outcome but which are likely to be subject to the same confounding as the exposure or outcome of interest. We illustrate our method with a two-sample MR analysis of a preselected set of exposures on self-reported tanning ability and hair colour. Our results show that, of the 33 exposures considered, genome-wide association studies (GWAS) of adiposity and education-related traits are likely to be subject to population stratification that is not controlled for through adjustment, and so any MR study including these traits may be subject to bias that cannot be identified through standard pleiotropy robust methods. Negative control outcomes should therefore be used regularly in MR studies to detect potential population stratification in the data used.

2020 ◽  
Author(s):  
Eleanor Sanderson ◽  
Tom G Richardson ◽  
Gibran Hemani ◽  
George Davey Smith

AbstractA key assumption of Mendelian randomisation (MR) analysis is that there is no association between the genetic variants used as instruments and the outcome other than through the exposure of interest. Two ways in which this assumption can be violated are through population stratification and selection bias which can introduce confounding of the relationship between the genetic variants and the outcome and so induce an association between them. Negative control outcomes are increasingly used to detect unobserved confounding in observational epidemiological studies. Here we consider the use of negative control outcomes in MR studies. As a negative control outcome in an MR study we propose the use of phenotypes which are determined before the exposure and outcome but which are likely to be subject to the same confounding as the exposure or outcome of interest. We illustrate our method with a two-sample MR analysis of a preselected set of exposures on self-reported tanning ability and hair colour. Our results show that, of the 33 exposures considered, GWAS studies of adiposity and education related traits are likely to be subject to population stratification and/or selection bias that is not controlled for through adjustment and so any MR study including these traits may be subject to bias that cannot be identified through standard pleiotropy robust methods.


2021 ◽  
Vol 22 (11) ◽  
pp. 6083
Author(s):  
Aintzane Rueda-Martínez ◽  
Aiara Garitazelaia ◽  
Ariadna Cilleros-Portet ◽  
Sergi Marí ◽  
Rebeca Arauzo ◽  
...  

Endometriosis is a common gynecological disorder that has been associated with endometrial, breast and epithelial ovarian cancers in epidemiological studies. Since complex diseases are a result of multiple environmental and genetic factors, we hypothesized that the biological mechanism underlying their comorbidity might be explained, at least in part, by shared genetics. To assess their potential genetic relationship, we performed a two-sample mendelian randomization (2SMR) analysis on results from public genome-wide association studies (GWAS). This analysis confirmed previously reported genetic pleiotropy between endometriosis and endometrial cancer. We present robust evidence supporting a causal genetic association between endometriosis and ovarian cancer, particularly with the clear cell and endometrioid subtypes. Our study also identified genetic variants that could explain those associations, opening the door to further functional experiments. Overall, this work demonstrates the value of genomic analyses to support epidemiological data, and to identify targets of relevance in multiple disorders.


Author(s):  
Fernando Pires Hartwig ◽  
Kate Tilling ◽  
George Davey Smith ◽  
Deborah A Lawlor ◽  
Maria Carolina Borges

Abstract Background Two-sample Mendelian randomization (MR) allows the use of freely accessible summary association results from genome-wide association studies (GWAS) to estimate causal effects of modifiable exposures on outcomes. Some GWAS adjust for heritable covariables in an attempt to estimate direct effects of genetic variants on the trait of interest. One, both or neither of the exposure GWAS and outcome GWAS may have been adjusted for covariables. Methods We performed a simulation study comprising different scenarios that could motivate covariable adjustment in a GWAS and analysed real data to assess the influence of using covariable-adjusted summary association results in two-sample MR. Results In the absence of residual confounding between exposure and covariable, between exposure and outcome, and between covariable and outcome, using covariable-adjusted summary associations for two-sample MR eliminated bias due to horizontal pleiotropy. However, covariable adjustment led to bias in the presence of residual confounding (especially between the covariable and the outcome), even in the absence of horizontal pleiotropy (when the genetic variants would be valid instruments without covariable adjustment). In an analysis using real data from the Genetic Investigation of ANthropometric Traits (GIANT) consortium and UK Biobank, the causal effect estimate of waist circumference on blood pressure changed direction upon adjustment of waist circumference for body mass index. Conclusions Our findings indicate that using covariable-adjusted summary associations in MR should generally be avoided. When that is not possible, careful consideration of the causal relationships underlying the data (including potentially unmeasured confounders) is required to direct sensitivity analyses and interpret results with appropriate caution.


2021 ◽  
Vol 12 ◽  
Author(s):  
Yuquan Wang ◽  
Tingting Li ◽  
Liwan Fu ◽  
Siqian Yang ◽  
Yue-Qing Hu

Mendelian randomization makes use of genetic variants as instrumental variables to eliminate the influence induced by unknown confounders on causal estimation in epidemiology studies. However, with the soaring genetic variants identified in genome-wide association studies, the pleiotropy, and linkage disequilibrium in genetic variants are unavoidable and may produce severe bias in causal inference. In this study, by modeling the pleiotropic effect as a normally distributed random effect, we propose a novel mixed-effects regression model-based method PLDMR, pleiotropy and linkage disequilibrium adaptive Mendelian randomization, which takes linkage disequilibrium into account and also corrects for the pleiotropic effect in causal effect estimation and statistical inference. We conduct voluminous simulation studies to evaluate the performance of the proposed and existing methods. Simulation results illustrate the validity and advantage of the novel method, especially in the case of linkage disequilibrium and directional pleiotropic effects, compared with other methods. In addition, by applying this novel method to the data on Atherosclerosis Risk in Communications Study, we conclude that body mass index has a significant causal effect on and thus might be a potential risk factor of systolic blood pressure. The novel method is implemented in R and the corresponding R code is provided for free download.


2017 ◽  
Author(s):  
Jorien L. Treur ◽  
Mark Gibson ◽  
Amy E Taylor ◽  
Peter J Rogers ◽  
Marcus R Munafò

AbstractStudy Objectives:Higher caffeine consumption has been linked to poorer sleep and insomnia complaints. We investigated whether these observational associations are the result of genetic risk factors influencing both caffeine consumption and poorer sleep, and/or whether they reflect (possibly bidirectional) causal effects.Methods:Summary-level data were available from genome-wide association studies (GWAS) on caffeine consumption (n=91,462), sleep duration, and chronotype (i.e., being a ‘morning’ versus an ‘evening’ person) (both n=128,266), and insomnia complaints (n=113,006). Linkage disequilibrium (LD) score regression was used to calculate genetic correlations, reflecting the extent to which genetic variants influencing caffeine consumption and sleep behaviours overlap. Causal effects were tested with bidirectional, two-sample Mendelian randomization (MR), an instrumental variable approach that utilizes genetic variants robustly associated with an exposure variable as an instrument to test causal effects. Estimates from individual genetic variants were combined using inverse-variance weighted meta-analysis, weighted median regression and MR Egger regression methods.Results:There was no clear evidence for genetic correlation between caffeine consumption and sleep duration (rg=0.000,p=0.998), chronotype (rg=0.086,p=0.192) or insomnia (rg=-0.034,p=0.700). Two-sample Mendelian randomization analyses did not support causal effects from caffeine consumption to sleep behaviours, or the other way around.Conclusions:We found no evidence in support of genetic correlation or causal effects between caffeine consumption and sleep. While caffeine may have acute effects on sleep when taken shortly before habitual bedtime, our findings suggest that a more sustained pattern of high caffeine consumption is likely associated with poorer sleep through shared environmental factors.


2019 ◽  
Author(s):  
Tom G Richardson ◽  
Gibran Hemani ◽  
Tom R Gaunt ◽  
Caroline L Relton ◽  
George Davey Smith

AbstractBackgroundDeveloping insight into tissue-specific transcriptional mechanisms can help improve our understanding of how genetic variants exert their effects on complex traits and disease. By applying the principles of Mendelian randomization, we have undertaken a systematic analysis to evaluate transcriptome-wide associations between gene expression across 48 different tissue types and 395 complex traits.ResultsOverall, we identified 100,025 gene-trait associations based on conventional genome-wide corrections (P < 5 × 10−08) that also provided evidence of genetic colocalization. These results indicated that genetic variants which influence gene expression levels in multiple tissues are more likely to influence multiple complex traits. We identified many examples of tissue-specific effects, such as genetically-predicted TPO, NR3C2 and SPATA13 expression only associating with thyroid disease in thyroid tissue. Additionally, FBN2 expression was associated with both cardiovascular and lung function traits, but only when analysed in heart and lung tissue respectively.We also demonstrate that conducting phenome-wide evaluations of our results can help flag adverse on-target side effects for therapeutic intervention, as well as propose drug repositioning opportunities. Moreover, we find that exploring the tissue-dependency of associations identified by genome-wide association studies (GWAS) can help elucidate the causal genes and tissues responsible for effects, as well as uncover putative novel associations.ConclusionsThe atlas of tissue-dependent associations we have constructed should prove extremely valuable to future studies investigating the genetic determinants of complex disease. The follow-up analyses we have performed in this study are merely a guide for future research. Conducting similar evaluations can be undertaken systematically at http://mrcieu.mrsoftware.org/Tissue_MR_atlas/.


2020 ◽  
Vol 2 (2) ◽  
Author(s):  
Qing Cheng ◽  
Yi Yang ◽  
Xingjie Shi ◽  
Kar-Fu Yeung ◽  
Can Yang ◽  
...  

Abstract The proliferation of genome-wide association studies (GWAS) has prompted the use of two-sample Mendelian randomization (MR) with genetic variants as instrumental variables (IVs) for drawing reliable causal relationships between health risk factors and disease outcomes. However, the unique features of GWAS demand that MR methods account for both linkage disequilibrium (LD) and ubiquitously existing horizontal pleiotropy among complex traits, which is the phenomenon wherein a variant affects the outcome through mechanisms other than exclusively through the exposure. Therefore, statistical methods that fail to consider LD and horizontal pleiotropy can lead to biased estimates and false-positive causal relationships. To overcome these limitations, we proposed a probabilistic model for MR analysis in identifying the causal effects between risk factors and disease outcomes using GWAS summary statistics in the presence of LD and to properly account for horizontal pleiotropy among genetic variants (MR-LDP) and develop a computationally efficient algorithm to make the causal inference. We then conducted comprehensive simulation studies to demonstrate the advantages of MR-LDP over the existing methods. Moreover, we used two real exposure–outcome pairs to validate the results from MR-LDP compared with alternative methods, showing that our method is more efficient in using all-instrumental variants in LD. By further applying MR-LDP to lipid traits and body mass index (BMI) as risk factors for complex diseases, we identified multiple pairs of significant causal relationships, including a protective effect of high-density lipoprotein cholesterol on peripheral vascular disease and a positive causal effect of BMI on hemorrhoids.


2019 ◽  
Author(s):  
Qian Yang ◽  
Eleanor Sanderson ◽  
Kate Tilling ◽  
M Carolina Borges ◽  
Deborah A Lawlor

AbstractBackgroundOur aim is to produce guidance on exploring and mitigating possible bias when genetic instrumental variables (IVs) associate with traits other than the exposure of interest in Mendelian randomization (MR) studies.MethodsWe use causal diagrams to illustrate scenarios that could result in IVs being related to (non-exposure) traits. We recommend that MR studies explore possible IV-non-exposure associations across a much wider range of traits than is usually the case. Where associations are found, confounding by population stratification should be assessed through adjusting for relevant population structure variables. To distinguish vertical from horizontal pleiotropy we suggest using bidirectional MR between the exposure and non-exposure traits and MR of the effect of the non-exposure traits on the outcome of interest. If vertical pleiotropy is plausible, standard MR methods should be unbiased. If horizontal pleiotropy is plausible, we recommend using multivariable MR to control for observed pleiotropic traits and conducting sensitivity analyses which do not require prior knowledge of specific invalid IVs or pleiotropic paths.ResultsWe applied our recommendations to an illustrative example of the effect of maternal insomnia on offspring birthweight in the UK Biobank. We found little evidence that unexpected IV-non-exposure associations were driven by population stratification. Three out of six observed non-exposure traits plausibly reflected horizontal pleiotropy. Multivariable MR and sensitivity analyses suggested an inverse association of insomnia with birthweight, but effects were imprecisely estimated in some of these analyses.ConclusionsWe provide guidance for MR studies where genetic IVs associate with non-exposure traits.Key messagesGenetic variants are increasingly found to associate with more than one social, behavioural or biological trait at genome-wide significance, which is a challenge in Mendelian randomization (MR) studies.Four broad scenarios (i.e. population stratification, vertical pleiotropy, horizontal pleiotropy and reverse causality) could result in an IV-non-exposure trait association.Population stratification can be assessed through adjusting for population structure with individual data, while two-sample MR studies should check whether the original genome-wide association studies have used robust methods to properly account for it.We apply currently available MR methods for discriminating between vertical and horizontal pleiotropy and mitigating against horizontal pleiotropy to an example exploring the effect of maternal insomnia on offspring birthweight.Our study highlights the pros and cons of relying more on sensitivity analyses without considering particular pleiotropic paths versus systematically exploring and controlling for potential pleiotropic paths via known characteristics.


Author(s):  
Xichang Wang ◽  
Xiaotong Gao ◽  
Yutong Han ◽  
Fan Zhang ◽  
Zheyu Lin ◽  
...  

Abstract Context The association between serum thyroid-stimulating hormone (TSH) and obesity traits has been investigated previously in several epidemiological studies. However, the underlying causal association has not been established. Objective To determine and analyze the causal association between serum TSH level and obesity-related traits (BMI and obesity). Design, Setting, Participants The latest genome-wide association studies (GWASs) on TSH, BMI and obesity were searched to obtain full statistics. Bidirectional two-sample Mendelian randomization (MR) was performed to explore the causal relationship between serum TSH and BMI and obesity. The inverse variance-weighted (IVW) and MR-Egger methods were used to combine the estimation for each SNP. Based on the preliminary MR results, free thyroxine (fT4) and free triiodothyronine (fT3) levels were also set as outcomes to further analyze the impact of BMI on them. Main Outcome Measures BMI and obesity were treated as the outcomes to evaluate the effect of serum TSH on them, and TSH was set as the outcome to estimate the effect of BMI and obesity on it. Results Both IVW and MR-Egger results indicated that genetically driven serum TSH did not causally lead to changes in BMI or obesity. Moreover, the IVW method showed that the TSH level could be significantly elevated by genetically predicted high BMI (β=0.038, se=0.013, p=0.004). In further MR analysis, the IVW method indicated that BMI could causally increase the fT3 (β=10.123, se=2.523, p&lt;0.001) while not significantly affecting the fT4 level. Conclusion Together with fT3, TSH can be significantly elevated by an increase in genetically driven BMI.


2015 ◽  
Vol 4 (4) ◽  
pp. 249-260 ◽  
Author(s):  
Ali Abbasi

Many biomarkers are associated with type 2 diabetes (T2D) risk in epidemiological observations. The aim of this study was to identify and summarize current evidence for causal effects of biomarkers on T2D. A systematic literature search in PubMed and EMBASE (until April 2015) was done to identify Mendelian randomization studies that examined potential causal effects of biomarkers on T2D. To replicate the findings of identified studies, data from two large-scale, genome-wide association studies (GWAS) were used: DIAbetes Genetics Replication And Meta-analysis (DIAGRAMv3) for T2D and the Meta-Analyses of Glucose and Insulin-related traits Consortium (MAGIC) for glycaemic traits. GWAS summary statistics were extracted for the same genetic variants (or proxy variants), which were used in the original Mendelian randomization studies. Of the 21 biomarkers (from 28 studies), ten have been reported to be causally associated with T2D in Mendelian randomization. Most biomarkers were investigated in a single cohort study or population. Of the ten biomarkers that were identified, nominally significant associations with T2D or glycaemic traits were reached for those genetic variants related to bilirubin, pro-B-type natriuretic peptide, delta-6 desaturase and dimethylglycine based on the summary data from DIAGRAMv3 or MAGIC. Several Mendelian randomization studies investigated the nature of associations of biomarkers with T2D. However, there were only a few biomarkers that may have causal effects on T2D. Further research is needed to broadly evaluate the causal effects of multiple biomarkers on T2D and glycaemic traits using data from large-scale cohorts or GWAS including many different genetic variants.


Sign in / Sign up

Export Citation Format

Share Document