scholarly journals An approximate full-likelihood method for inferring selection and allele frequency trajectories from DNA sequence data

2019 ◽  
Author(s):  
Aaron J. Stern ◽  
Peter R. Wilton ◽  
Rasmus Nielsen

AbstractMost current methods for detecting natural selection from DNA sequence data are limited in that they are either based on summary statistics or a composite likelihood, and as a consequence, do not make full use of the information available in DNA sequence data. We here present a new importance sampling approach for approximating the full likelihood function for the selection coefficient. The method treats the ancestral recombination graph (ARG) as a latent variable that is integrated out using previously published Markov Chain Monte Carlo (MCMC) methods. The method can be used for detecting selection, estimating selection coefficients, testing models of changes in the strength of selection, estimating the time of the start of a selective sweep, and for inferring the allele frequency trajectory of a selected or neutral allele. We perform extensive simulations to evaluate the method and show that it uniformly improves power to detect selection compared to current popular methods such as nSL and SDS, under various demographic models and can provide reliable inferences of allele frequency trajectories under many conditions. We also explore the potential of our method to detect extremely recent changes in the strength of selection. We use the method to infer the past allele frequency trajectory for a lactase persistence SNP (MCM6) in Europeans. We also study a set of 11 pigmentation-associated variants. Several genes show evidence of strong selection particularly within the last 5,000 years, including ASIP, KITLG, and TYR. However, selection on OCA2/HERC2 seems to be much older and, in contrast to previous claims, we find no evidence of selection on TYRP1.Author summaryCurrent methods to study natural selection using modern population genomic data are limited in their power and flexibility. Here, we present a new method to infer natural selection that builds on recent methodological advances in estimating genome-wide genealogies. By using importance sampling we are able to efficiently estimate the likelihood function of the selection coefficient. We show our method improves power to test for selection over competing methods across a diverse range of scenarios, and also accurately infers the selection coefficient. We also demonstrate a novel capability of our model, using it to infer the allele’s frequency over time. We validate these results with a study of a lactase persistence SNP in Europeans, and also study a set of 11 pigmentation-associated variants.

Genetics ◽  
2009 ◽  
Vol 183 (3) ◽  
pp. 757-766 ◽  
Author(s):  
Brian Charlesworth ◽  
Deborah Charlesworth

Darwin's theory of natural selection lacked an adequate account of inheritance, making it logically incomplete. We review the interaction between evolution and genetics, showing how, unlike Mendel, Darwin's lack of a model of the mechanism of inheritance left him unable to interpret his own data that showed Mendelian ratios, even though he shared with Mendel a more mathematical and probabilistic outlook than most biologists of his time. Darwin's own “pangenesis” model provided a mechanism for generating ample variability on which selection could act. It involved, however, the inheritance of characters acquired during an organism's life, which Darwin himself knew could not explain some evolutionary situations. Once the particulate basis of genetics was understood, it was seen to allow variation to be passed intact to new generations, and evolution could then be understood as a process of changes in the frequencies of stable variants. Evolutionary genetics subsequently developed as a central part of biology. Darwinian principles now play a greater role in biology than ever before, which we illustrate with some examples of studies of natural selection that use DNA sequence data and with some recent advances in answering questions first asked by Darwin.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Heleen Plaisier ◽  
Thomas R. Meagher ◽  
Daniel Barker

Abstract Objective Visualisation methods, primarily color-coded representation of sequence data, have been a predominant means of representation of DNA data. Algorithmic conversion of DNA sequence data to sound—sonification—represents an alternative means of representation that uses a different range of human sensory perception. We propose that sonification has value for public engagement with DNA sequence information because it has potential to be entertaining as well as informative. We conduct preliminary work to explore the potential of DNA sequence sonification in public engagement with bioinformatics. We apply a simple sonification technique for DNA, in which each DNA base is represented by a specific note. Additionally, a beat may be added to indicate codon boundaries or for musical effect. We report a brief analysis from public engagement events we conducted that featured this method of sonification. Results We report on use of DNA sequence sonification at two public events. Sonification has potential in public engagement with bioinformatics, both as a means of data representation and as a means to attract audience to a drop-in stand. We also discuss further directions for research on integration of sonification into bioinformatics public engagement and education.


Zootaxa ◽  
2020 ◽  
Vol 4766 (3) ◽  
pp. 472-484
Author(s):  
HANNAH E. SOM ◽  
L. LEE GRISMER ◽  
PERRY L. JR. WOOD ◽  
EVAN S. H. QUAH ◽  
RAFE M. BROWN ◽  
...  

Liopeltis is a genus of poorly known, infrequently sampled species of colubrid snakes in tropical Asia. We collected a specimen of Liopeltis from Pulau Tioman, Peninsular Malaysia, that superficially resembled L. philippina, a rare species that is endemic to the Palawan Pleistocene Aggregate Island Complex, western Philippines. We analyzed morphological and mitochondrial DNA sequence data from the Pulau Tioman specimen and found distinct differences to L. philippina and all other congeners. On the basis of these corroborated lines of evidence, the Pulau Tioman specimen is described as a new species, L. tiomanica sp. nov. The new species occurs in sympatry with L. tricolor on Pulau Tioman, and our description of L. tiomanica sp. nov. brings the number of endemic amphibians and reptiles on Pulau Tioman to 12. 


2007 ◽  
Vol 3 ◽  
pp. 193-197 ◽  
Author(s):  
Kou Amano ◽  
Hiroaki Ichikawa ◽  
Hidemitsu Nakamura ◽  
Hisataka Numa ◽  
Kaoru Fukami-Kobayashi ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document