scholarly journals The effect of strong purifying selection on genetic diversity

2017 ◽  
Author(s):  
Ivana Cvijović ◽  
Benjamin H. Good ◽  
Michael M. Desai

Purifying selection reduces genetic diversity, both at sites under direct selection and at linked neutral sites. This process, known as background selection, is thought to play an important role in shaping genomic diversity in natural populations. Yet despite its importance, the effects of background selection are not fully understood. Previous theoretical analyses of this process have taken a backwards-time approach based on the structured coalescent. While they provide some insight, these methods are either limited to very small samples or are computationally prohibitive. Here, we present a new forward-time analysis of the trajectories of both neutral and deleterious mutations at a nonrecombining locus. We find that strong purifying selection leads to remarkably rich dynamics: neutral mutations can exhibit sweep-like behavior, and deleterious mutations can reach substantial frequencies even when they are guaranteed to eventually go extinct. Our analysis of these dynamics allows us to calculate analytical expressions for the full site frequency spectrum. We find that whenever background selection is strong enough to lead to a reduction in genetic diversity, it also results in substantial distortions to the site frequency spectrum, which can mimic the effects of population expansions or positive selection. Because these distortions are most pronounced in the low and high frequency ends of the spectrum, they become particularly important in larger samples, but may have small effects in smaller samples. We also apply our forward-time framework to calculate other quantities, such as the ultimate fates of polymorphisms or the fitnesses of their ancestral backgrounds.

Genetics ◽  
2001 ◽  
Vol 158 (2) ◽  
pp. 657-665 ◽  
Author(s):  
Peter Andolfatto ◽  
Molly Przeworski

AbstractA correlation between diversity levels and rates of recombination is predicted both by models of positive selection, such as hitchhiking associated with the rapid fixation of advantageous mutations, and by models of purifying selection against strongly deleterious mutations (commonly referred to as “background selection”). With parameter values appropriate for Drosophila populations, only the first class of models predicts a marked skew in the frequency spectrum of linked neutral variants, relative to a neutral model. Here, we consider 29 loci scattered throughout the Drosophila melanogaster genome. We show that, in African populations, a summary of the frequency spectrum of polymorphic mutations is positively correlated with the meiotic rate of crossing over. This pattern is demonstrated to be unlikely under a model of background selection. Models of weakly deleterious selection are not expected to produce both the observed correlation and the extent to which nucleotide diversity is reduced in regions of low (but nonzero) recombination. Thus, of existing models, hitchhiking due to the recurrent fixation of advantageous variants is the most plausible explanation for the data.


2019 ◽  
Author(s):  
Kimberly J. Gilbert ◽  
Fanny Pouyet ◽  
Laurent Excoffier ◽  
Stephan Peischl

SummaryLinked selection is a major driver of genetic diversity. Selection against deleterious mutations removes linked neutral diversity (background selection, BGS, Charlesworth et al. 1993), creating a positive correlation between recombination rates and genetic diversity. Purifying selection against recessive variants, however, can also lead to associative overdominance (AOD, Ohta 1971, Zhao & Charlesworth, 2016), due to an apparent heterozygote advantage at linked neutral loci that opposes the loss of neutral diversity by BGS. Zhao & Charlesworth (2016) identified the conditions when AOD should dominate over BGS in a single-locus model and suggested that the effect of AOD could become stronger if multiple linked deleterious variants co-segregate. We present a model describing how and under which conditions multi-locus dynamics can amplify the effects of AOD. We derive the conditions for a transition from BGS to AOD due to pseudo-overdominance (Ohta & Kimura 1970), i.e. a form of balancing selection that maintains complementary deleterious haplotypes that mask the effect of recessive deleterious mutations. Simulations confirm these findings and show that multi-locus AOD can increase diversity in low recombination regions much more strongly than previously appreciated. While BGS is known to drive genome-wide diversity in humans (Pouyet et al. 2018), the observation of a resurgence of genetic diversity in regions of very low recombination is indicative of AOD. We identify 21 such regions in the human genome showing clear signals of multi-locus AOD. Our results demonstrate that AOD may play an important role in the evolution of low recombination regions of many species.


2021 ◽  
Author(s):  
David Murphy ◽  
Eyal Elyashiv ◽  
Guy Amster ◽  
Guy Sella

Analyses of genetic variation in many taxa have established that neutral genetic diversity is shaped by natural selection at linked sites. Whether the source of selection is primarily the fixation of strongly beneficial alleles (selective sweeps) or purifying selection on deleterious mutations (background selection) remains unknown, however. We address this question in humans by fitting a model of the joint effects of selective sweeps and background selection to autosomal polymorphism data from the 1000 Genomes Project. After controlling for variation in mutation rates along the genome, a model of background selection alone explains ~60% of the variance in diversity levels at the megabase scale. Adding the effects of selective sweeps driven by adaptive substitutions to the model does not improve the fit, and when both modes of selection are considered jointly, selective sweeps are estimated to have had little or no effect on linked neutral diversity. The regions under purifying selection are best predicted by phylogenetic conservation, with ~80% of the deleterious mutations affecting neutral diversity occurring in non-exonic regions. Thus, background selection is the dominant mode of linked selection in humans, with marked effects on diversity levels throughout autosomes.


Genetics ◽  
1999 ◽  
Vol 153 (1) ◽  
pp. 497-506 ◽  
Author(s):  
Rasmus Nielsen ◽  
Daniel M Weinreich

Abstract McDonald/Kreitman tests performed on animal mtDNA consistently reveal significant deviations from strict neutrality in the direction of an excess number of polymorphic nonsynonymous sites, which is consistent with purifying selection acting on nonsynonymous sites. We show that under models of recurrent neutral and deleterious mutations, the mean age of segregating neutral mutations is greater than the mean age of segregating selected mutations, even in the absence of recombination. We develop a test of the hypothesis that the mean age of segregating synonymous mutations equals the mean age of segregating nonsynonymous mutations in a sample of DNA sequences. The power of this age-of-mutation test and the power of the McDonald/Kreitman test are explored by computer simulations. We apply the new test to 25 previously published mitochondrial data sets and find weak evidence for selection against nonsynonymous mutations.


1994 ◽  
Vol 63 (3) ◽  
pp. 213-227 ◽  
Author(s):  
Brian Charlesworth

SummaryThis paper analyses the effects of selection against deleterious alleles maintained by mutation (‘ background selection’) on rates of evolution and levels of genetic diversity at weakly selected, completely linked, loci. General formulae are derived for the expected rates of gene substitution and genetic diversity, relative to the neutral case, as a function of selection and dominance coefficients at the loci in question, and of the frequency of gametes that are free of deleterious mutations with respect to the loci responsible for background selection. As in the neutral case, most effects of background selection can be predicted by considering the effective size of the population to be multiplied by the frequency of mutation-free gametes. Levels of genetic diversity can be sharply reduced by background selection, with the result that values for sites under selection approach those for neutral variants subject to the same regime of background selection. Rates of fixation of slightly deleterious mutations are increased by background selection, and rates of fixation of advantageous mutations are reduced. The properties of sex-linked and autosomal asexual and self-fertilizing populations are considered. The implications of these results for the interpretation of studies of molecular evolution and variation are discussed.


2015 ◽  
Author(s):  
Bendix Koopmann ◽  
Johannes Müeller ◽  
Aurélien Tellier ◽  
Daniel Živković

AbstractSeed banks are a common characteristics to many plant species, which allow storage of genetic diversity in the soil as dormant seeds for various periods of time. We investigate an above-ground population following a Fisher-Wright model with selection coupled with a deterministic seed bank assuming the length of the seed bank is kept constant and the number of seeds is large. To assess the combined impact of seed banks and selection on genetic diversity, we derive a general diffusion model. We compute the equilibrium solution of the site-frequency spectrum and derive the times to fixation of an allele with and without selection. Finally, it is demonstrated that seed banks enhance the effect of selection onto the site-frequency spectrum while slowing down the time until the mutation-selection equilibrium is reached.


eLife ◽  
2018 ◽  
Vol 7 ◽  
Author(s):  
Fanny Pouyet ◽  
Simon Aeschbacher ◽  
Alexandre Thiéry ◽  
Laurent Excoffier

Disentangling the effect on genomic diversity of natural selection from that of demography is notoriously difficult, but necessary to properly reconstruct the history of species. Here, we use high-quality human genomic data to show that purifying selection at linked sites (i.e. background selection, BGS) and GC-biased gene conversion (gBGC) together affect as much as 95% of the variants of our genome. We find that the magnitude and relative importance of BGS and gBGC are largely determined by variation in recombination rate and base composition. Importantly, synonymous sites and non-transcribed regions are also affected, albeit to different degrees. Their use for demographic inference can lead to strong biases. However, by conditioning on genomic regions with recombination rates above 1.5 cM/Mb and mutation types (C↔G, A↔T), we identify a set of SNPs that is mostly unaffected by BGS or gBGC, and that avoids these biases in the reconstruction of human history.


2021 ◽  
Author(s):  
Robert Horvath ◽  
Mitra Menon ◽  
Michelle C Stitzer ◽  
Jeffrey Ross-Ibarra

Recognition of the important role of transposable elements (TEs) in eukaryotic genomes quickly led to a burgeoning literature modeling and estimating the effects of selection on TEs. Much of the empirical work on selection has focused on analyzing the site frequency spectrum (SFS) of TEs. But TEs differ from standard evolutionary models in a number of ways that can impact the power and interpretation of the SFS. For example, rather than mutating under a clock-like model, transposition often occurs in bursts which can inflate particular frequency categories compared to expectations under a standard neutral model. If a TE burst has been recent, the excess of low frequency polymorphisms can mimic the effect of purifying selection. Here, we investigate how transposition bursts affect the frequency distribution of TEs and the correlation between age and allele frequency. Using information on the TE age distribution, we propose an age-adjusted site frequency spectrum to compare TEs and neutral polymorphisms to more effectively evaluate whether TEs are under selective constraints. We show that our approach can minimize instances of false inference of selective constraint, but also allows for a correct identification of even weak selection affecting TEs which experienced a transposition burst and is robust to at least simple demographic changes. The results presented here will help researchers working on TEs to more reliably identify the effects of selection on TEs without having to rely on the assumption of a constant transposition rate.


1996 ◽  
Vol 68 (2) ◽  
pp. 131-149 ◽  
Author(s):  
Brian Charlesworth

SummaryTheoretical models of the effects of selection against deleterious mutations on variation at linked neutral sites (background selection) are used to predict the relations between chromosomal location and genetic variability at the DNA level, in Drosophila melanogaster. The sensitivity of the predictions to variation in the mutation, selection and recombination parameters on which they are based is examined. It is shown that many features of the observed relations between chromosomal location and level of genetic diversity in D. melanogaster can be explained by background selection, especially if the weak selective forces acting on transposable elements are taken into account. In particular, the gradient in diversity in the distal portion of the X chromosome, and the lack of diversity on chromosome 4 and at the bases of the major chromosomes, can be fully accounted for. There are, however, discrepancies between predicted and observed values for some loci in D. melanogaster, which may reflect the effects of forces other than background selection.


2018 ◽  
Author(s):  
Daniel P. Rice ◽  
John Novembre ◽  
Michael M. Desai

AbstractDemographic inference methods in population genetics typically assume that the ancestry of a sample can be modeled by the Kingman coalescent. A defining feature of this stochastic process is that it generates genealogies that are binary trees: no more than two ancestral lineages may coalesce at the same time. However, this assumption breaks down under several scenarios. For example, pervasive natural selection and extreme variation in offspring number can both generate genealogies with “multiple-merger” events in which more than two lineages coalesce instantaneously. Therefore, detecting multiple mergers is important both for understanding which forces have shaped the diversity of a population and for avoiding fitting misspecified models to data. Current methods to detect multiple mergers in genomic data rely on the site frequency spectrum (SFS). However, the signatures of multiple mergers in the SFS are also consistent with a Kingman coalescent with a time-varying population size. Here, we present a new method for detecting multiple mergers based on the pointwise mutual information of the two-site frequency spectrum for pairs of linked sites. Unlike the SFS, the pointwise mutual information depends mostly on the topologies of genealogies rather than on their branch lengths and is therefore largely insensitive to population size change. This statistic is global in the sense that it can detect when the genome-wide genetic diversity is inconsistent with the Kingman coalescent, rather than detecting outlier regions, as in selection scan methods. Finally, we demonstrate a graphical model-checking procedure based on the point-wise mutual information using genomic diversity data from Drosophila melanogaster.


Sign in / Sign up

Export Citation Format

Share Document