Tagging SNP-set selection with maximum information based on linkage disequilibrium structure in genome-wide association studies

Shudong Wang; Sicheng He; Fayou Yuan; Xinjie Zhu

doi:10.1093/bioinformatics/btx151

Tagging SNP-set selection with maximum information based on linkage disequilibrium structure in genome-wide association studies

Bioinformatics ◽

10.1093/bioinformatics/btx151 ◽

2017 ◽

Vol 33 (14) ◽

pp. 2078-2081 ◽

Cited By ~ 3

Author(s):

Shudong Wang ◽

Sicheng He ◽

Fayou Yuan ◽

Xinjie Zhu

Keyword(s):

Linkage Disequilibrium ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Linkage Disequilibrium Structure ◽

Tagging Snp ◽

Maximum Information ◽

Genome Wide

Download Full-text

The Impact of Incomplete Linkage Disequilibrium and Genetic Model Choice on the Analysis and Interpretation of Genome-wide Association Studies

Annals of Human Genetics ◽

10.1111/j.1469-1809.2010.00579.x ◽

2010 ◽

Vol 74 (4) ◽

pp. 375-379 ◽

Cited By ~ 6

Author(s):

Mark M. Iles

Keyword(s):

Linkage Disequilibrium ◽

Genetic Model ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Model Choice ◽

Genome Wide ◽

The Impact

Download Full-text

A New Diversity Panel for Winter Rapeseed (Brassica napus L.) Genome-Wide Association Studies

Agronomy ◽

10.3390/agronomy10122006 ◽

2020 ◽

Vol 10 (12) ◽

pp. 2006

Author(s):

David P. Horvath ◽

Michael Stamm ◽

Zahirul I. Talukder ◽

Jason Fiedler ◽

Aidan P. Horvath ◽

...

Keyword(s):

Linkage Disequilibrium ◽

Brassica Napus ◽

Association Studies ◽

Decay Rates ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

High Quality ◽

Brassica Napus L ◽

Genome Wide ◽

Quality Markers

A diverse population (429 member) of canola (Brassica napus L.) consisting primarily of winter biotypes was assembled and used in genome-wide association studies. Genotype by sequencing analysis of the population identified and mapped 290,972 high-quality markers ranging from 18.5 to 82.4% missing markers per line and an average of 36.8%. After interpolation, 251,575 high-quality markers remained. After filtering for markers with low minor allele counts (count > 5), we were left with 190,375 markers. The average distance between these markers is 4463 bases with a median of 69 and a range from 1 to 281,248 bases. The heterozygosity among the imputed population ranges from 0.9 to 11.0% with an average of 5.4%. The filtered and imputed dataset was used to determine population structure and kinship, which indicated that the population had minimal structure with the best K value of 2–3. These results also indicated that the majority of the population has substantial sequence from a single population with sub-clusters of, and admixtures with, a very small number of other populations. Analysis of chromosomal linkage disequilibrium decay ranged from ~7 Kb for chromosome A01 to ~68 Kb for chromosome C01. Local linkage decay rates determined for all 500 kb windows with a 10kb sliding step indicated a wide range of linkage disequilibrium decay rates, indicating numerous crossover hotspots within this population, and provide a resource for determining the likely limits of linkage disequilibrium from any given marker in which to identify candidate genes. This population and the resources provided here should serve as helpful tools for investigating genetics in winter canola.

Download Full-text

A hierarchical Bayesian network approach for linkage disequilibrium modeling and data-dimensionality reduction prior to genome-wide association studies

BMC Bioinformatics ◽

10.1186/1471-2105-12-16 ◽

2011 ◽

Vol 12 (1) ◽

Cited By ~ 26

Author(s):

Raphaël Mourad ◽

Christine Sinoquet ◽

Philippe Leray

Keyword(s):

Linkage Disequilibrium ◽

Dimensionality Reduction ◽

Bayesian Network ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Hierarchical Bayesian ◽

Network Approach ◽

Genome Wide ◽

Data Dimensionality Reduction

Download Full-text

Faculty Opinions recommendation of Magnitude and distribution of linkage disequilibrium in population isolates and implications for genome-wide association studies.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1032179.497533 ◽

2006 ◽

Author(s):

Tony Long

Keyword(s):

Linkage Disequilibrium ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide

Download Full-text

Faculty Opinions recommendation of Magnitude and distribution of linkage disequilibrium in population isolates and implications for genome-wide association studies.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1032179.373886 ◽

2006 ◽

Author(s):

Karin Schmitt

Keyword(s):

Linkage Disequilibrium ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide

Download Full-text

Improving the detection of pathways in genome-wide association studies by combined effects of SNPs from Linkage Disequilibrium blocks

Scientific Reports ◽

10.1038/s41598-017-03826-2 ◽

2017 ◽

Vol 7 (1) ◽

Cited By ~ 4

Author(s):

Huiying Zhao ◽

Dale R. Nyholt ◽

Yuanhao Yang ◽

Jihua Wang ◽

Yuedong Yang

Keyword(s):

Linkage Disequilibrium ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Combined Effects ◽

Genome Wide

Download Full-text

A method combining a random forest-based technique with the modeling of linkage disequilibrium through latent variables, to run multilocus genome-wide association studies

BMC Bioinformatics ◽

10.1186/s12859-018-2054-0 ◽

2018 ◽

Vol 19 (1) ◽

Cited By ~ 3

Author(s):

Christine Sinoquet

Keyword(s):

Linkage Disequilibrium ◽

Random Forest ◽

Latent Variables ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide

Download Full-text

Joint Genotype- and Ancestry-based Genome-wide Association Studies in Admixed Populations

10.1101/062554 ◽

2016 ◽

Cited By ~ 2

Author(s):

Piotr Szulc ◽

Malgorzata Bogdan ◽

Florian Frommlet ◽

Hua Tang

Keyword(s):

Linkage Disequilibrium ◽

Complex Traits ◽

Association Studies ◽

Real Data ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Single Marker Analysis ◽

Marker Analysis ◽

Genome Wide ◽

Single Marker

AbstractIn Genome-Wide Association Studies (GWAS) genetic loci that influence complex traits are localized by inspecting associations between genotypes of genetic markers and the values of the trait of interest. On the other hand Admixture Mapping, which is performed in case of populations consisting of a recent mix of two ancestral groups, relies on the ancestry information at each locus (locus-specific ancestry).Recently it has been proposed to jointly model genotype and locus-specific ancestry within the framework of single marker tests. Here we extend this approach for population-based GWAS in the direction of multi marker models. A modified version of the Bayesian Information Criterion is developed for building a multi-locus model, which accounts for the differential correlation structure due to linkage disequilibrium and admixture linkage disequilibrium. Simulation studies and a real data example illustrate the advantages of this new approach compared to single-marker analysis and modern model selection strategies based on separately analyzing genotype and ancestry data, as well as to single-marker analysis combining genotypic and ancestry information. Depending on the signal strength our procedure automatically chooses whether genotypic or locus-specific ancestry markers are added to the model. This results in a good compromise between the power to detect causal mutations and the precision of their localization. The proposed method has been implemented in R and is available at http://www.math.uni.wroc.pl/~mbogdan/admixtures/.

Download Full-text

Genome wide linkage disequilibrium in Chinese asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm: implications for domestication history and genome wide association studies

Heredity ◽

10.1038/hdy.2012.8 ◽

2012 ◽

Vol 109 (1) ◽

pp. 34-40 ◽

Cited By ~ 20

Author(s):

P Xu ◽

X Wu ◽

B Wang ◽

J Luo ◽

Y Liu ◽

...

Keyword(s):

Linkage Disequilibrium ◽

Vigna Unguiculata ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Asparagus Bean ◽

Genome Wide Linkage Disequilibrium

Download Full-text

Expectation of the intercept from bivariate LD score regression in the presence of population stratification

10.1101/310565 ◽

2018 ◽

Cited By ~ 10

Author(s):

Loic Yengo ◽

Jian Yang ◽

Peter M. Visscher

Keyword(s):

Linkage Disequilibrium ◽

Population Stratification ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Popular Method ◽

Extended Theory ◽

Genome Wide ◽

Estimate Heritability ◽

Theoretical Results

Linkage disequilibrium (LD) score regression is an increasingly popular method used to quantify the level of confounding in genome-wide association studies (GWAS) or to estimate heritability and genetic correlation between traits. When applied to a pair of GWAS, the LD score regression (LDSC) methodology produces a statistic, referred to as the bivariate LDSC intercept, which deviation from 0 is classically interpreted as an indication of sample overlap between the two GWAS. Here we propose an extension of the theory underlying the bivariate LDSC methodology, which accounts for population stratification within and between GWAS. Our extended theory predicts an inflation of the bivariate LDSC intercept when sample sizes and heritability are large, even in the absence of sample overlap. We illustrate our theoretical results with simulations based on actual SNP genotypes and we propose a re-interpretation of previously published results in the light of our extended theory.

Download Full-text