Imputation aware tag SNP selection to improve power for multi-ethnic association studies

Mapping Intimacies ◽

10.1101/105551 ◽

2017 ◽

Cited By ~ 2

Author(s):

Genevieve L. Wojcik ◽

Christian Fuchsberger ◽

Daniel Taliun ◽

Ryan Welch ◽

Alicia R Martin ◽

...

Keyword(s):

Rare Variant ◽

Association Studies ◽

Genotype Imputation ◽

Genomic Research ◽

Array Design ◽

Rare Variant Association ◽

Tag Snps ◽

Tag Snp ◽

Tag Snp Selection ◽

The Impact

AbstractThe emergence of very large cohorts in genomic research has facilitated a focus on genotype-imputation strategies to power rare variant association. Consequently, a new generation of genotyping arrays are being developed designed with tag single nucleotide polymorphisms (SNPs) to improve rare variant imputation. Selection of these tag SNPs poses several challenges as rare variants tend to be continentally-or even population-specific and reflect fine-scale linkage disequilibrium (LD) structure impacted by recent demographic events. To explore the landscape of tag-able variation and guide design considerations for large-cohort and biobank arrays, we developed a novel pipeline to select tag SNPs using the 26 population reference panel from Phase of the 1000 Genomes Project. We evaluate our approach using leave-one-out internal validation via standard imputation methods that allows the direct comparison of tag SNP performance by estimating the correlation of the imputed and real genotypes for each iteration of potential array sites. We show how this approach allows for an assessment of array design and performance that can take advantage of the development of deeper and more diverse sequenced reference panels. We quantify the impact of demography on tag SNP performance across populations and provide population-specific guidelines for tag SNP selection. We also examine array design strategies that target single populations versus multi-ethnic cohorts, and demonstrate a boost in performance for the latter can be obtained by prioritizing tag SNPs that contribute information across multiple populations simultaneously. Finally, we demonstrate the utility of improved array design to provide meaningful improvements in power, particularly in trans-ethnic studies. The unified framework presented will enable investigators to make informed decisions for the design of new arrays, and help empower the next phase of rare variant association for global health.

Download Full-text

EFFECTIVE ALGORITHMS FOR TAG SNP SELECTION

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720005001521 ◽

2005 ◽

Vol 03 (05) ◽

pp. 1089-1106 ◽

Cited By ~ 2

Author(s):

TIE-FEI LIU ◽

WING-KIN SUNG ◽

YI LI ◽

JIAN-JUN LIU ◽

ANKUSH MITTAL ◽

...

Keyword(s):

Search Algorithm ◽

Association Studies ◽

Strong Association ◽

Genetic Association Studies ◽

Approximate Solutions ◽

Computational Time ◽

Nucleotide Polymorphisms ◽

Tag Snps ◽

Tag Snp ◽

Tag Snp Selection

Single nucleotide polymorphisms (SNPs), due to their abundance and low mutation rate, are very useful genetic markers for genetic association studies. However, the current genotyping technology cannot afford to genotype all common SNPs in all the genes. By making use of linkage disequilibrium, we can reduce the experiment cost by genotyping a subset of SNPs, called Tag SNPs, which have a strong association with the ungenotyped SNPs, while are as independent from each other as possible. The problem of selecting Tag SNPs is NP-complete; when there are large number of SNPs, in order to avoid extremely long computational time, most of the existing Tag SNP selection methods first partition the SNPs into blocks based on certain block definitions, then Tag SNPs are selected in each block by brute-force search. The size of the Tag SNP set obtained in this way may usually be reduced further due to the inter-dependency among blocks. This paper proposes two algorithms, TSSA and TSSD, to tackle the block-independent Tag SNP selection problem. TSSA is based on A* search algorithm, and TSSD is a heuristic algorithm. Experiments show that TSSA can find the optimal solutions for medium-sized problems in reasonable time, while TSSD can handle very large problems and report approximate solutions very close to the optimal ones.

Download Full-text

Imputation-Aware Tag SNP Selection To Improve Power for Large-Scale, Multi-ethnic Association Studies

G3 Genes|Genome|Genetics ◽

10.1534/g3.118.200502 ◽

2018 ◽

Vol 8 (10) ◽

pp. 3255-3267 ◽

Cited By ~ 10

Author(s):

Genevieve L. Wojcik ◽

Christian Fuchsberger ◽

Daniel Taliun ◽

Ryan Welch ◽

Alicia R Martin ◽

...

Keyword(s):

Large Scale ◽

Association Studies ◽

Tag Snp ◽

Tag Snp Selection

Download Full-text

Intersection analysis and rare variant association studies

Computational Exome and Genome Analysis ◽

10.1201/9781315154770-21 ◽

2017 ◽

pp. 315-320

Author(s):

Peter N. Robinson ◽

Rosario M. Piro ◽

Marten Jäger

Keyword(s):

Rare Variant ◽

Association Studies ◽

Rare Variant Association ◽

Intersection Analysis

Download Full-text

Increasing the Power of Association Studies by Imputation-based Sparse Tag SNP Selection

Communications in Information and Systems ◽

10.4310/cis.2009.v9.n3.a4 ◽

2009 ◽

Vol 9 (3) ◽

pp. 269-282 ◽

Cited By ~ 1

Author(s):

Ofir Davidovich ◽

Eran Halperin ◽

Gad Kimmel ◽

Ron Shamir

Keyword(s):

Association Studies ◽

Tag Snp ◽

Tag Snp Selection

Download Full-text

A novel statistical method for rare-variant association studies in general pedigrees

BMC Proceedings ◽

10.1186/s12919-016-0029-6 ◽

2016 ◽

Vol 10 (S7) ◽

Author(s):

Huanhuan Zhu ◽

Zhenchuan Wang ◽

Xuexia Wang ◽

Qiuying Sha

Keyword(s):

Statistical Method ◽

Rare Variant ◽

Association Studies ◽

Rare Variant Association

Download Full-text

The impact of a fine-scale population stratification on rare variant association test results

PLoS ONE ◽

10.1371/journal.pone.0207677 ◽

2018 ◽

Vol 13 (12) ◽

pp. e0207677 ◽

Cited By ~ 13

Author(s):

Elodie Persyn ◽

Richard Redon ◽

Lise Bellanger ◽

Christian Dina

Keyword(s):

Rare Variant ◽

Population Stratification ◽

Association Test ◽

Fine Scale ◽

Test Results ◽

Rare Variant Association ◽

Scale Population ◽

Rare Variant Association Test ◽

The Impact

Download Full-text

Strategies to improve the performance of rare variant association studies by optimizing the selection of controls

Bioinformatics ◽

10.1093/bioinformatics/btv457 ◽

2015 ◽

pp. btv457

Author(s):

Na Zhu ◽

Verena Heinrich ◽

Thorsten Dickhaus ◽

Jochen Hecht ◽

Peter N. Robinson ◽

...

Keyword(s):

Rare Variant ◽

Association Studies ◽

Rare Variant Association ◽

Selection Of

Download Full-text

Tag SNP selection for candidate gene association studies using HapMap and gene resequencing data

European Journal of Human Genetics ◽

10.1038/sj.ejhg.5201875 ◽

2007 ◽

Vol 15 (10) ◽

pp. 1063-1070 ◽

Cited By ~ 28

Author(s):

Zongli Xu ◽

Norman L Kaplan ◽

Jack A Taylor

Keyword(s):

Candidate Gene ◽

Association Studies ◽

Gene Association ◽

Tag Snp ◽

Candidate Gene Association ◽

Selection For ◽

Tag Snp Selection

Download Full-text

An adaptive test for meta‐analysis of rare variant association studies

Genetic Epidemiology ◽

10.1002/gepi.22273 ◽

2019 ◽

Vol 44 (1) ◽

pp. 104-116

Author(s):

Tianzhong Yang ◽

Junghi Kim ◽

Chong Wu ◽

Yiding Ma ◽

Peng Wei ◽

...

Keyword(s):

Rare Variant ◽

Association Studies ◽

Meta Analysis ◽

Rare Variant Association ◽

Adaptive Test

Download Full-text

Quantitative Proteomics of Human Heart Samples Collected In Vivo Reveal the Remodeled Protein Landscape of Dilated Left Atrium Without Atrial Fibrillation

Molecular & Cellular Proteomics ◽

10.1074/mcp.ra119.001878 ◽

2020 ◽

Vol 19 (7) ◽

pp. 1132-1144

Author(s):

Nora Linscheid ◽

Pi Camilla Poulsen ◽

Ida Dalgaard Pedersen ◽

Emilie Gregers ◽

Jesper Hastrup Svendsen ◽

...

Keyword(s):

Atrial Fibrillation ◽

Mitral Valve ◽

Protein Expression ◽

High Resolution Mass Spectrometry ◽

Association Studies ◽

Genomic Research ◽

Genome Wide Association Studies ◽

History Of ◽

The Impact

Genetic and genomic research has greatly advanced our understanding of heart disease. Yet, comprehensive, in-depth, quantitative maps of protein expression in hearts of living humans are still lacking. Using samples obtained during valve replacement surgery in patients with mitral valve prolapse (MVP), we set out to define inter-chamber differences, the intersect of proteomic data with genetic or genomic datasets, and the impact of left atrial dilation on the proteome of patients with no history of atrial fibrillation (AF).We collected biopsies from right atria (RA), left atria (LA) and left ventricle (LV) of seven male patients with mitral valve regurgitation with dilated LA but no history of AF. Biopsy samples were analyzed by high-resolution mass spectrometry (MS), where peptides were pre-fractionated by reverse phase high-pressure liquid chromatography prior to MS measurement on a Q-Exactive-HF Orbitrap instrument. We identified 7,314 proteins based on 130,728 peptides. Results were confirmed in an independent set of biopsies collected from three additional individuals. Comparative analysis against data from post-mortem samples showed enhanced quantitative power and confidence level in samples collected from living hearts. Our analysis, combined with data from genome wide association studies suggested candidate gene associations to MVP, identified higher abundance in ventricle for proteins associated with cardiomyopathies and revealed the dilated LA proteome, demonstrating differential representation of molecules previously associated with AF, in non-AF hearts.This is the largest dataset of cardiac protein expression from human samples collected in vivo. It provides a comprehensive resource that allows insight into molecular fingerprints of MVP and facilitates novel inferences between genomic data and disease mechanisms. We propose that over-representation of proteins in ventricle is consequent not to redundancy but to functional need, and conclude that changes in abundance of proteins known to associate with AF are not sufficient for arrhythmogenesis.

Download Full-text