scholarly journals Thousands of missing variants in the UK BioBank are recoverable by genome realignment

2019 ◽  
Author(s):  
Tongqiu Jia ◽  
Brenton Munson ◽  
Hana Lango Allen ◽  
Trey Ideker ◽  
Amit R. Majithia

AbstractThe UK Biobank is an unprecedented resource for human disease research. In March 2019, 49,997 exomes were made publicly available to investigators. Here we note that thousands of variant calls are unexpectedly absent from the current dataset, with 641 genes showing zero variation. We show that the reason for this was an erroneous read alignment to the GRCh38 reference. The missing variants can be recovered by modifying read alignment parameters to correctly handle the expanded set of contigs available in the human genome reference.

2015 ◽  
Vol 70 (1) ◽  
pp. 184-195 ◽  
Author(s):  
John W. Galloway ◽  
Bart J. F. Keijser ◽  
David M. Williams
Keyword(s):  

2021 ◽  
Author(s):  
Shaan Khurshid ◽  
Lu-Chen Weng ◽  
Victor Nauffal ◽  
James P Pirruccello ◽  
Rachael A Venn ◽  
...  

Physical activity is favorable to health but the relations with human disease and causal effects are poorly quantified. Previous studies have largely relied on self-reported estimates1-3 which are subject to recall bias, confounding, and reverse causality. Using wrist-worn accelerometer measurements from the UK Biobank, we tested associations between moderate-to-vigorous physical activity (MVPA) - both total MVPA minutes and whether MVPA was above a guideline-based threshold of ≥150 minutes/week4-6 - and incidence of over 1,200 diseases. In 96,466 adults who wore accelerometers for one week (mean age 62±8 years), MVPA was associated with 401 (33%) tested diseases at a false discovery rate of 1% during a median of 6.2 years follow-up. Greater MVPA was overwhelmingly associated with reduced disease risk (98% of associations) with hazard ratios (HRs) ranging from 0.10-0.95 per standard deviation (SD) and associations spanning all 18 disease categories tested. A similar pattern of associations was observed when assessing the guideline-based threshold of ≥150 MVPA minutes/week. We examined a polygenic risk score for MVPA as an instrumental variable for activity within a separate UK Biobank sample (N=392,058, mean age 57±8 years). Greater genetically inferred MVPA was associated with reduced risk of 182 (14%) incident diseases (HR range 0.74-0.97 per 1 SD), and included strong associations conferring reduced risks of obstructive bronchitis, peripheral vascular disease, type 2 diabetes, and gastritis/duodenitis. Objective physical activity is broadly associated with lower disease incidence and many associations are consistent with a causal effect.


2019 ◽  
Author(s):  
Elizabeth Curtis ◽  
Justin Liu ◽  
Kate Ward ◽  
Karen Jameson ◽  
Zahra Raisi-Estabragh ◽  
...  

2020 ◽  
Author(s):  
John E. McGeary ◽  
Chelsie Benca-Bachman ◽  
Victoria Risner ◽  
Christopher G Beevers ◽  
Brandon Gibb ◽  
...  

Twin studies indicate that 30-40% of the disease liability for depression can be attributed to genetic differences. Here, we assess the explanatory ability of polygenic scores (PGS) based on broad- (PGSBD) and clinical- (PGSMDD) depression summary statistics from the UK Biobank using independent cohorts of adults (N=210; 100% European Ancestry) and children (N=728; 70% European Ancestry) who have been extensively phenotyped for depression and related neurocognitive phenotypes. PGS associations with depression severity and diagnosis were generally modest, and larger in adults than children. Polygenic prediction of depression-related phenotypes was mixed and varied by PGS. Higher PGSBD, in adults, was associated with a higher likelihood of having suicidal ideation, increased brooding and anhedonia, and lower levels of cognitive reappraisal; PGSMDD was positively associated with brooding and negatively related to cognitive reappraisal. Overall, PGS based on both broad and clinical depression phenotypes have modest utility in adult and child samples of depression.


SLEEP ◽  
2021 ◽  
Vol 44 (Supplement_2) ◽  
pp. A273-A273
Author(s):  
Xi Zheng ◽  
Ma Cherrysse Ulsa ◽  
Peng Li ◽  
Lei Gao ◽  
Kun Hu

Abstract Introduction While there is emerging evidence for acute sleep disruption in the aftermath of coronavirus disease 2019 (COVID-19), it is unknown whether sleep traits contribute to mortality risk. In this study, we tested whether earlier-life sleep duration, chronotype, insomnia, napping or sleep apnea were associated with increased 30-day COVID-19 mortality. Methods We included 34,711 participants from the UK Biobank, who presented for COVID-19 testing between March and October 2020 (mean age at diagnosis: 69.4±8.3; range 50.2–84.6). Self-reported sleep duration (less than 6h/6-9h/more than 9h), chronotype (“morning”/”intermediate”/”evening”), daytime dozing (often/rarely), insomnia (often/rarely), napping (often/rarely) and presence of sleep apnea (ICD-10 or self-report) were obtained between 2006 and 2010. Multivariate logistic regression models were used to adjust for age, sex, education, socioeconomic status, and relevant risk factors (BMI, hypertension, diabetes, respiratory diseases, smoking, and alcohol). Results The mean time between sleep measures and COVID-19 testing was 11.6±0.9 years. Overall, 5,066 (14.6%) were positive. In those who were positive, 355 (7.0%) died within 30 days (median = 8) after diagnosis. Long sleepers (>9h vs. 6-9h) [20/103 (19.4%) vs. 300/4,573 (6.6%); OR 2.09, 95% 1.19–3.64, p=0.009), often daytime dozers (OR 1.68, 95% 1.04–2.72, p=0.03), and nappers (OR 1.52, 95% 1.04–2.23, p=0.03) were at greater odds of mortality. Prior diagnosis of sleep apnea also saw a two-fold increased odds (OR 2.07, 95% CI: 1.25–3.44 p=0.005). No associations were seen for short sleepers, chronotype or insomnia with COVID-19 mortality. Conclusion Data across all current waves of infection show that prior sleep traits/disturbances, in particular long sleep duration, daytime dozing, napping and sleep apnea, are associated with increased 30-day mortality after COVID-19, independent of health-related risk factors. While sleep health traits may reflect unmeasured poor health, further work is warranted to examine the exact underlying mechanisms, and to test whether sleep health optimization offers resilience to severe illness from COVID-19. Support (if any) NIH [T32GM007592 and R03AG067985 to L.G. RF1AG059867, RF1AG064312, to K.H.], the BrightFocus Foundation A2020886S to P.L. and the Foundation of Anesthesia Education and Research MRTG-02-15-2020 to L.G.


BMJ ◽  
2021 ◽  
pp. n214
Author(s):  
Weedon MN ◽  
Jackson L ◽  
Harrison JW ◽  
Ruth KS ◽  
Tyrrell J ◽  
...  

Abstract Objective To determine whether the sensitivity and specificity of SNP chips are adequate for detecting rare pathogenic variants in a clinically unselected population. Design Retrospective, population based diagnostic evaluation. Participants 49 908 people recruited to the UK Biobank with SNP chip and next generation sequencing data, and an additional 21 people who purchased consumer genetic tests and shared their data online via the Personal Genome Project. Main outcome measures Genotyping (that is, identification of the correct DNA base at a specific genomic location) using SNP chips versus sequencing, with results split by frequency of that genotype in the population. Rare pathogenic variants in the BRCA1 and BRCA2 genes were selected as an exemplar for detailed analysis of clinically actionable variants in the UK Biobank, and BRCA related cancers (breast, ovarian, prostate, and pancreatic) were assessed in participants through use of cancer registry data. Results Overall, genotyping using SNP chips performed well compared with sequencing; sensitivity, specificity, positive predictive value, and negative predictive value were all above 99% for 108 574 common variants directly genotyped on the SNP chips and sequenced in the UK Biobank. However, the likelihood of a true positive result decreased dramatically with decreasing variant frequency; for variants that are very rare in the population, with a frequency below 0.001% in UK Biobank, the positive predictive value was very low and only 16% of 4757 heterozygous genotypes from the SNP chips were confirmed with sequencing data. Results were similar for SNP chip data from the Personal Genome Project, and 20/21 individuals analysed had at least one false positive rare pathogenic variant that had been incorrectly genotyped. For pathogenic variants in the BRCA1 and BRCA2 genes, which are individually very rare, the overall performance metrics for the SNP chips versus sequencing in the UK Biobank were: sensitivity 34.6%, specificity 98.3%, positive predictive value 4.2%, and negative predictive value 99.9%. Rates of BRCA related cancers in UK Biobank participants with a positive SNP chip result were similar to those for age matched controls (odds ratio 1.31, 95% confidence interval 0.99 to 1.71) because the vast majority of variants were false positives, whereas sequence positive participants had a significantly increased risk (odds ratio 4.05, 2.72 to 6.03). Conclusions SNP chips are extremely unreliable for genotyping very rare pathogenic variants and should not be used to guide health decisions without validation.


Sign in / Sign up

Export Citation Format

Share Document