SNP allele frequency estimation in DNA pools and variance components analysis

Kate Downes; Bryan J. Barratt; Pelin Akan; Sue J. Bumpstead; Stacey D. Taylor; David G. Clayton; Panos Deloukas

doi:10.2144/04365rr01

Single nucleotide polymorphism (SNP) allele frequency estimation in DNA pools using Pyrosequencing™

Nature Protocols ◽

10.1038/nprot.2006.442 ◽

2006 ◽

Vol 1 (6) ◽

pp. 2573-2582 ◽

Cited By ~ 34

Author(s):

Catharina Lavebratt ◽

Selim Sengul

Keyword(s):

Single Nucleotide Polymorphism ◽

Allele Frequency ◽

Frequency Estimation ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Allele Frequency Estimation ◽

Dna Pools

Download Full-text

Pyrosequencing?-based SNP allele frequency estimation in DNA pools

Human Mutation ◽

10.1002/humu.10292 ◽

2003 ◽

Vol 23 (1) ◽

pp. 92-97 ◽

Cited By ~ 33

Author(s):

Catharina Lavebratt ◽

Selim Sengul ◽

Marten Jansson ◽

Martin Schalling

Keyword(s):

Allele Frequency ◽

Frequency Estimation ◽

Allele Frequency Estimation ◽

Dna Pools

Download Full-text

Efficient variance components analysis across millions of genomes

10.1101/522003 ◽

2019 ◽

Cited By ~ 3

Author(s):

Ali Pazokitoroudi ◽

Yue Wu ◽

Kathryn S. Burch ◽

Kangcheng Hou ◽

Aaron Zhou ◽

...

Keyword(s):

Allele Frequency ◽

Minor Allele Frequency ◽

Variance Components ◽

Large Scale ◽

Low Frequency ◽

Complex Trait ◽

Minor Allele ◽

Variance Components Analysis ◽

Heritability Estimation ◽

Components Analysis

AbstractVariance components analysis has emerged as a powerful tool in complex trait genetics, with applications ranging from heritability estimation to association mapping. While the application of these methods to large-scale genetic datasets can potentially reveal important insights into genetic architecture, existing methods for fitting variance components do not scale well to these datasets. Here, we present a new algorithm for variance components analysis that is accurate and highly efficient, capable of estimating one hundred variance components on a million individuals genotyped at a million SNPs in a few hours. We illustrate the utility of our method in estimating variation in a trait explained by genotyped SNPs (SNP heritability) as well in partitioning heritability across population and functional genomic annotations. Analyzing 22 diverse traits with genotypes from 300, 000 individuals across about 8 million common and low frequency SNPs (minor allele frequency > 0.1%), we observe that the allelic effect size increases with decreasing MAF (minor allele frequency) and LD (linkage disequilibrium) across the analyzed traits consistent with the action of negative selection. Partitioning heritability across 28 functional annotations, we observe enrichment of heritability in FANTOM5 enhancers in asthma, eczema, thyroid and autoimmune disorders.

Download Full-text

Quantitative technologies for allele frequency estimation of SNPs in DNA pools

Molecular and Cellular Probes ◽

10.1006/mcpr.2002.0440 ◽

2002 ◽

Vol 16 (6) ◽

pp. 429-434 ◽

Cited By ~ 32

Author(s):

Sagiv Shifman ◽

Anne Pisanté-Shalom ◽

Benjamin Yakir ◽

Ariel Darvasi

Keyword(s):

Allele Frequency ◽

Frequency Estimation ◽

Allele Frequency Estimation ◽

Dna Pools

Download Full-text

Cheap, accurate and rapid allele frequency estimation of single nucleotide polymorphisms by primer extension and DHPLC in DNA pools

Human Genetics ◽

10.1007/s004390000397 ◽

2000 ◽

Vol 107 (5) ◽

pp. 488-493 ◽

Cited By ~ 116

Author(s):

Bastiaan Hoogendoorn ◽

Nadine Norton ◽

George Kirov ◽

Nigel Williams ◽

Marian Hamshere ◽

...

Keyword(s):

Single Nucleotide Polymorphisms ◽

Allele Frequency ◽

Frequency Estimation ◽

Primer Extension ◽

Nucleotide Polymorphisms ◽

Single Nucleotide ◽

Allele Frequency Estimation ◽

Dna Pools

Download Full-text

Determination of detection and quantification limits for SNP allele frequency estimation in DNA pools using real time PCR

Nucleic Acids Research ◽

10.1093/nar/gnh020 ◽

2004 ◽

Vol 32 (3) ◽

pp. 24e-24 ◽

Cited By ~ 40

Author(s):

G. Schwarz

Keyword(s):

Real Time ◽

Allele Frequency ◽

Real Time Pcr ◽

Frequency Estimation ◽

Allele Frequency Estimation ◽

Dna Pools ◽

Detection And Quantification

Download Full-text

Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples

BMC Genomics ◽

10.1186/1471-2164-13-16 ◽

2012 ◽

Vol 13 (1) ◽

pp. 16 ◽

Cited By ~ 12

Author(s):

Michael P Mullen ◽

Christopher J Creevey ◽

Donagh P Berry ◽

Matt S McCabe ◽

David A Magee ◽

...

Keyword(s):

Dna Sequencing ◽

Allele Frequency ◽

High Throughput ◽

Frequency Estimation ◽

Allele Frequency Estimation ◽

Polymorphism Discovery ◽

Pooled Dna ◽

High Throughput Dna Sequencing

Download Full-text

High throughput crop genome genotyping by a combination of pool next generation sequencing and haplotype-based data processing

10.21203/rs.3.rs-415602/v1 ◽

2021 ◽

Author(s):

Michael Schneider ◽

Asis Shrestha ◽

Agim Ballvora ◽

Jens Leon

Keyword(s):

Next Generation Sequencing ◽

Allele Frequency ◽

Frequency Estimation ◽

Whole Genome ◽

Next Generation ◽

Conservation Genomics ◽

High Coverage ◽

Allele Frequency Estimation ◽

Low Coverage ◽

Generation Sequencing

Abstract BackgroundThe identification of environmentally specific alleles and the observation of evolutional processes is a goal of conservation genomics. By generational changes of allele frequencies in populations, questions regarding effective population size, gene flow, drift, and selection can be addressed. The observation of such effects often is a trade-off of costs and resolution, when a decent sample of genotypes should be genotyped for many loci. Pool genotyping approaches can derive a high resolution and precision in allele frequency estimation, when high coverage sequencing is utilized. Still, pool high coverage pool sequencing of big genomes comes along with high costs.ResultsHere we present a reliable method to estimate a barley population’s allele frequency at low coverage sequencing. Three hundred genotypes were sampled from a barley backcross population to estimate the entire population’s allele frequency. The allele frequency estimation accuracy and yield were compared for three next generation sequencing methods. To reveal accurate allele frequency estimates on a low coverage sequencing level, a haplotyping approach was performed. Low coverage allele frequency of positional connected single polymorphisms were aggregated to a single haplotype allele frequency, resulting in two to 271 times higher depth and increased precision. We compared different haplotyping tactics, showing that gene and chip marker-based haplotypes perform on par or better than simple contig haplotype windows. The comparison of multiple pool samples and the referencing against an individual sequencing approach revealed whole genome pool resequencing having the highest correlation to individual genotyping (up to 0.97), while transcriptomics and genotyping by sequencing indicated higher error rates and lower correlations.ConclusionUsing the proposed method allows to identify the allele frequency of populations with high accuracy at low cost. This is particularly interesting for conservation genomics in species with big genomes, like barley or wheat. Whole genome low coverage resequencing at 10x coverage can deliver a highly accurate estimation of the allele frequency, when a loci-based haplotyping approach is applied. Using annotated haplotypes allows to capitalize from biological background and statistical robustness.

Download Full-text

On a Unifying ‘Reverse’ Regression for Robust Association Studies and Allele Frequency Estimation with Related Individuals

10.1101/470328 ◽

2018 ◽

Cited By ~ 1

Author(s):

Lin Zhang ◽

Lei Sun

Keyword(s):

Allele Frequency ◽

Frequency Estimation ◽

Association Studies ◽

Genetic Association Studies ◽

Linear Mixed Effect Model ◽

Supporting Evidence ◽

Mixed Effect ◽

Allele Frequency Estimation ◽

Reverse Regression ◽

Related Individuals

AbstractFor genetic association studies with related individuals, standard linear mixed-effect model is the most popular approach. The model treats a complex trait (phenotype) as the response variable while a genetic variant (genotype) as a covariate. An alternative approach is to reverse the roles of phenotype and genotype. This class of tests includes quasi-likelihood based score tests. In this work, after reviewing these existing methods, we propose a general, unifying ‘reverse’ regression framework. We then show that the proposed method can also explicitly adjust for potential departure from Hardy–Weinberg equilibrium. Lastly, we demonstrate the additional flexibility of the proposed model on allele frequency estimation, as well as its connection with earlier work of best linear unbiased allele-frequency estimator. We conclude the paper with supporting evidence from simulation and application studies.

Download Full-text

P4070 SNP discovery and allele frequency estimation in indigenous breeds of South Africa

Journal of Animal Science ◽

10.2527/jas2016.94supplement4114x ◽

2016 ◽

Vol 94 (suppl_4) ◽

pp. 114-114

Author(s):

A. Zwane ◽

A. A. Maiwashe ◽

E. van Marle-Koster

Keyword(s):

South Africa ◽

Allele Frequency ◽

Frequency Estimation ◽

Snp Discovery ◽

Allele Frequency Estimation

Download Full-text