scholarly journals SNP allele frequency estimation in DNA pools and variance components analysis

BioTechniques ◽  
2004 ◽  
Vol 36 (5) ◽  
pp. 840-845 ◽  
Author(s):  
Kate Downes ◽  
Bryan J. Barratt ◽  
Pelin Akan ◽  
Sue J. Bumpstead ◽  
Stacey D. Taylor ◽  
...  
2003 ◽  
Vol 23 (1) ◽  
pp. 92-97 ◽  
Author(s):  
Catharina Lavebratt ◽  
Selim Sengul ◽  
Marten Jansson ◽  
Martin Schalling

2019 ◽  
Author(s):  
Ali Pazokitoroudi ◽  
Yue Wu ◽  
Kathryn S. Burch ◽  
Kangcheng Hou ◽  
Aaron Zhou ◽  
...  

AbstractVariance components analysis has emerged as a powerful tool in complex trait genetics, with applications ranging from heritability estimation to association mapping. While the application of these methods to large-scale genetic datasets can potentially reveal important insights into genetic architecture, existing methods for fitting variance components do not scale well to these datasets. Here, we present a new algorithm for variance components analysis that is accurate and highly efficient, capable of estimating one hundred variance components on a million individuals genotyped at a million SNPs in a few hours. We illustrate the utility of our method in estimating variation in a trait explained by genotyped SNPs (SNP heritability) as well in partitioning heritability across population and functional genomic annotations. Analyzing 22 diverse traits with genotypes from 300, 000 individuals across about 8 million common and low frequency SNPs (minor allele frequency > 0.1%), we observe that the allelic effect size increases with decreasing MAF (minor allele frequency) and LD (linkage disequilibrium) across the analyzed traits consistent with the action of negative selection. Partitioning heritability across 28 functional annotations, we observe enrichment of heritability in FANTOM5 enhancers in asthma, eczema, thyroid and autoimmune disorders.


2002 ◽  
Vol 16 (6) ◽  
pp. 429-434 ◽  
Author(s):  
Sagiv Shifman ◽  
Anne Pisanté-Shalom ◽  
Benjamin Yakir ◽  
Ariel Darvasi

2000 ◽  
Vol 107 (5) ◽  
pp. 488-493 ◽  
Author(s):  
Bastiaan Hoogendoorn ◽  
Nadine Norton ◽  
George Kirov ◽  
Nigel Williams ◽  
Marian Hamshere ◽  
...  

BMC Genomics ◽  
2012 ◽  
Vol 13 (1) ◽  
pp. 16 ◽  
Author(s):  
Michael P Mullen ◽  
Christopher J Creevey ◽  
Donagh P Berry ◽  
Matt S McCabe ◽  
David A Magee ◽  
...  

2021 ◽  
Author(s):  
Michael Schneider ◽  
Asis Shrestha ◽  
Agim Ballvora ◽  
Jens Leon

Abstract BackgroundThe identification of environmentally specific alleles and the observation of evolutional processes is a goal of conservation genomics. By generational changes of allele frequencies in populations, questions regarding effective population size, gene flow, drift, and selection can be addressed. The observation of such effects often is a trade-off of costs and resolution, when a decent sample of genotypes should be genotyped for many loci. Pool genotyping approaches can derive a high resolution and precision in allele frequency estimation, when high coverage sequencing is utilized. Still, pool high coverage pool sequencing of big genomes comes along with high costs.ResultsHere we present a reliable method to estimate a barley population’s allele frequency at low coverage sequencing. Three hundred genotypes were sampled from a barley backcross population to estimate the entire population’s allele frequency. The allele frequency estimation accuracy and yield were compared for three next generation sequencing methods. To reveal accurate allele frequency estimates on a low coverage sequencing level, a haplotyping approach was performed. Low coverage allele frequency of positional connected single polymorphisms were aggregated to a single haplotype allele frequency, resulting in two to 271 times higher depth and increased precision. We compared different haplotyping tactics, showing that gene and chip marker-based haplotypes perform on par or better than simple contig haplotype windows. The comparison of multiple pool samples and the referencing against an individual sequencing approach revealed whole genome pool resequencing having the highest correlation to individual genotyping (up to 0.97), while transcriptomics and genotyping by sequencing indicated higher error rates and lower correlations.ConclusionUsing the proposed method allows to identify the allele frequency of populations with high accuracy at low cost. This is particularly interesting for conservation genomics in species with big genomes, like barley or wheat. Whole genome low coverage resequencing at 10x coverage can deliver a highly accurate estimation of the allele frequency, when a loci-based haplotyping approach is applied. Using annotated haplotypes allows to capitalize from biological background and statistical robustness.


2018 ◽  
Author(s):  
Lin Zhang ◽  
Lei Sun

AbstractFor genetic association studies with related individuals, standard linear mixed-effect model is the most popular approach. The model treats a complex trait (phenotype) as the response variable while a genetic variant (genotype) as a covariate. An alternative approach is to reverse the roles of phenotype and genotype. This class of tests includes quasi-likelihood based score tests. In this work, after reviewing these existing methods, we propose a general, unifying ‘reverse’ regression framework. We then show that the proposed method can also explicitly adjust for potential departure from Hardy–Weinberg equilibrium. Lastly, we demonstrate the additional flexibility of the proposed model on allele frequency estimation, as well as its connection with earlier work of best linear unbiased allele-frequency estimator. We conclude the paper with supporting evidence from simulation and application studies.


Sign in / Sign up

Export Citation Format

Share Document