Detection and Polarization of Introgression in a Five-taxon Phylogeny

2014
Author(s):
James B Pease
Matthew W. Hahn

In clades of closely related taxa, discordant genealogies due to incomplete lineage sorting (ILS) can complicate the detection of introgression. The D-statistic (a.k.a. the ABBA/BABA test) was proposed to infer introgression in the presence of ILS for a four-taxon clade. However, the original D-statistic cannot be directly applied to a symmetric five-taxon phylogeny, and the direction of introgression cannot be inferred for any tree topology. Here we explore the issues associated with previous methods for adapting the D-statistic to a larger tree topology, and propose new “DFOIL” tests to infer both the taxa involved in and the direction of introgressions for a symmetric five-taxon phylogeny. Using theory and simulations, we find that previous modifications of the D-statistic to five-taxon phylogenies incorrectly identify both the pairs of taxa exchanging migrants and the direction of introgression. The DFOIL statistics are shown to overcome this deficiency and to correctly determine the direction of introgressions. The DFOIL tests are relatively simple and computationally inexpensive to calculate, and can be easily applied to various phylogenomic datasets. In addition, our general approach to the problem of introgression detection could be adapted to larger tree topologies and other models of sequence evolution.
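
To make the four-taxon D-statistic mentioned in this abstract concrete, here is a minimal Python sketch of the classic ABBA/BABA calculation, D = (ABBA - BABA) / (ABBA + BABA), for a tree of the form (((P1, P2), P3), Outgroup). It does not implement the five-taxon DFOIL tests. The function name `d_statistic`, the use of plain aligned strings as input, and the rule of skipping any site with a non-ACGT character are illustrative assumptions, not details taken from the paper.

```python
def d_statistic(p1: str, p2: str, p3: str, outgroup: str) -> float:
    """Classic four-taxon D for the tree (((P1, P2), P3), Outgroup).

    Hypothetical helper: inputs are assumed to be equal-length aligned
    sequences, and the outgroup allele is treated as ancestral ("A").
    """
    if not (len(p1) == len(p2) == len(p3) == len(outgroup)):
        raise ValueError("sequences must be aligned to the same length")

    valid = set("ACGT")
    abba = 0
    baba = 0
    for a, b, c, o in zip(p1.upper(), p2.upper(), p3.upper(), outgroup.upper()):
        # Assumption: skip gaps and ambiguity codes entirely.
        if not {a, b, c, o} <= valid:
            continue
        if a == o and b == c and b != o:
            abba += 1  # pattern A B B A: P2 and P3 share the derived allele
        elif b == o and a == c and a != o:
            baba += 1  # pattern B A B A: P1 and P3 share the derived allele

    if abba + baba == 0:
        raise ValueError("no informative ABBA/BABA sites found")
    return (abba - baba) / (abba + baba)
```

Under ILS alone the two patterns are expected in equal numbers, so D should be close to zero. A value far from zero is consistent with gene flow between P3 and either P2 (positive) or P1 (negative). A significance test, for example a block jackknife, would be needed in practice and is not shown here.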

2019
Vol 68 (6)
pp. 937-955
Author(s):
Alison Cloutier
Timothy B Sackton
Phil Grayson
Michele Clamp
Allan J Baker
...

Abstract Palaeognathae represent one of the two basal lineages in modern birds, and comprise the volant (flighted) tinamous and the flightless ratites. Resolving palaeognath phylogenetic relationships has historically proved difficult, and short internal branches separating major palaeognath lineages in previous molecular phylogenies suggest that extensive incomplete lineage sorting (ILS) might have accompanied a rapid ancient divergence. Here, we investigate palaeognath relationships using genome-wide data sets of three types of noncoding nuclear markers, together totaling 20,850 loci and over 41 million base pairs of aligned sequence data. We recover a fully resolved topology placing rheas as the sister to kiwi and emu + cassowary that is congruent across marker types for two species tree methods (MP-EST and ASTRAL-II). This topology is corroborated by patterns of insertions for 4274 CR1 retroelements identified from multispecies whole-genome screening, and is robustly supported by phylogenomic subsampling analyses, with MP-EST demonstrating particularly consistent performance across subsampling replicates as compared to ASTRAL. In contrast, analyses of concatenated data supermatrices recover rheas as the sister to all other nonostrich palaeognaths, an alternative that lacks retroelement support and shows inconsistent behavior under subsampling approaches. While statistically supporting the species tree topology, conflicting patterns of retroelement insertions also occur and imply high amounts of ILS across short successive internal branches, consistent with observed patterns of gene tree heterogeneity. 
Coalescent simulations and topology tests indicate that the majority of observed topological incongruence among gene trees is consistent with coalescent variation rather than arising from gene tree estimation error alone, and estimated branch lengths for short successive internodes in the inferred species tree fall within the theoretical range encompassing the anomaly zone. Distributions of empirical gene trees confirm that the most common gene tree topology for each marker type differs from the species tree, signifying the existence of an empirical anomaly zone in palaeognaths.


2020
Vol 20 (1)
Author(s):
Ting Ren
Zi-Xuan Li
Deng-Feng Xie
Ling-Jian Gui
Chang Peng
...

Abstract Background: The genus Ligusticum consists of approximately 60 species distributed in the Northern Hemisphere. It is one of the most taxonomically difficult taxa within Apiaceae, largely due to its varied morphological characteristics. To investigate the plastome evolution and phylogenetic relationships of Ligusticum, we determined the complete plastome sequences of eight Ligusticum species using a de novo assembly approach. Results: Through a comprehensive comparative analysis, we found that the eight plastomes were similar in terms of repeat sequences, SSRs, codon usage, and RNA editing sites. However, compared with the other seven species, L. delavayi exhibited striking differences in genome size, gene number, IR/SC borders, and sequence identity. Most of the genes remained under purifying selection, whereas four genes showed relaxed selection, namely ccsA, rpoA, ycf1, and ycf2. Non-monophyly of Ligusticum species was inferred from phylogenetic analyses of the plastomes and internal transcribed spacer (ITS) sequences. Conclusion: The plastome tree and ITS tree produced incongruent topologies, which may be attributed to hybridization and incomplete lineage sorting. Our study highlights the advantage of plastomes, with their large number of informative sites, in resolving phylogenetic relationships. Moreover, in combination with previous studies, we consider that the current taxonomic system of Ligusticum needs to be improved and revised. In summary, our study provides new insights into the plastome evolution, phylogeny, and taxonomy of Ligusticum species.


2015
Author(s):
Leonardo de Oliveira Martins
David Posada

The history of particular genes and that of the species that carry them can differ for several reasons. In particular, gene trees and species trees can truly differ due to well-known evolutionary processes like gene duplication and loss, lateral gene transfer, or incomplete lineage sorting. Different species tree reconstruction methods have been developed to take this incongruence into account, and these can be divided broadly into supertree and supermatrix approaches. Here, we introduce a new Bayesian hierarchical model, recently developed and implemented in the program Guenomu, that considers multiple sources of gene tree/species tree disagreement. Guenomu takes as input the posterior distributions of unrooted gene tree topologies for multiple gene families in order to estimate the posterior distribution of rooted species tree topologies.


2019
Vol 9 (1)
pp. 9-69
Author(s):
Annemarie Verkerk

Abstract Recent applications of phylogenetic methods to historical linguistics have been criticized for assuming a tree structure in which ancestral languages differentiate and split up into daughter languages, while language evolution is inherently non-tree-like (François 2014; Blench 2015: 32–33). This article attempts to contribute to this debate by discussing the use of the multiple topologies method (Pagel & Meade 2006a) implemented in BayesPhylogenies (Pagel & Meade 2004). This method is applied to lexical datasets from four different language families: Austronesian (Gray, Drummond & Greenhill 2009), Sinitic (Ben Hamed & Wang 2006), Indo-European (Bouckaert et al. 2012), and Japonic (Lee & Hasegawa 2011). Evidence for multiple topologies is found in all families except, surprisingly, Austronesian. It is suggested that reticulation may arise from a number of processes, including dialect chain break-up, borrowing (both shortly after language splits and later on), incomplete lineage sorting, and characteristics of lexical datasets. It is shown that the multiple topologies method is a useful tool to study the dynamics of language evolution.


2020
Author(s):
G Churakov
A Kuritzin
K Chukharev
F Zhang
F Wünnemann
...

Abstract Retrophylogenomics makes use of genome-wide retrotransposon presence/absence insertion patterns to resolve questions in phylogeny and population genetics. In the genomics era, evaluating high-throughput data requires the associated development of appropriately powerful statistical tools. The currently used KKSC 3-lineage statistical test for evaluating the significance of data is limited by the number of possible tree topologies it can assess in one step. To improve on this, we have now extended the analysis to simultaneously compare four lineages, which enables us to evaluate ten distinct presence/absence insertion patterns for 26 possible tree topologies plus 129 trees with different incidences of hybridization. Moreover, the new tool includes statistics for multiple ancestral hybridizations, ancestral incomplete lineage sorting, bifurcation, and polytomy. The test is embedded in a user-friendly web R-application (http://retrogenomics.uni-muenster.de:3838/hammlet/) and is available for use by the general scientific community.


2018
Author(s):
Alison Cloutier
Timothy B. Sackton
Phil Grayson
Michele Clamp
Allan J. Baker
...

Abstract Palaeognathae represent one of the two basal lineages in modern birds, and comprise the volant (flighted) tinamous and the flightless ratites. Resolving palaeognath phylogenetic relationships has historically proved difficult, and short internal branches separating major palaeognath lineages in previous molecular phylogenies suggest that extensive incomplete lineage sorting (ILS) might have accompanied a rapid ancient divergence. Here, we investigate palaeognath relationships using genome-wide data sets of three types of noncoding nuclear markers, together totalling 20,850 loci and over 41 million base pairs of aligned sequence data. We recover a fully resolved topology placing rheas as the sister to kiwi and emu + cassowary that is congruent across marker types for two species tree methods (MP-EST and ASTRAL-II). This topology is corroborated by patterns of insertions for 4,274 CR1 retroelements identified from multi-species whole genome screening, and is robustly supported by phylogenomic subsampling analyses, with MP-EST demonstrating particularly consistent performance across subsampling replicates as compared to ASTRAL. In contrast, analyses of concatenated data supermatrices recover rheas as the sister to all other non-ostrich palaeognaths, an alternative that lacks retroelement support and shows inconsistent behavior under subsampling approaches. While statistically supporting the species tree topology, conflicting patterns of retroelement insertions also occur and imply high amounts of ILS across short successive internal branches, consistent with observed patterns of gene tree heterogeneity. Coalescent simulations indicate that the majority of observed topological incongruence among gene trees is consistent with coalescent variation rather than arising from gene tree estimation error alone, and estimated branch lengths for short successive internodes in the inferred species tree fall within the theoretical range encompassing the anomaly zone. 
Distributions of empirical gene trees confirm that the most common gene tree topology for each marker type differs from the species tree, signifying the existence of an empirical anomaly zone in palaeognaths.


2020
Author(s):
Liming Cai
Zhenxiang Xi
Emily Moriarty Lemmon
Alan R Lemmon
Austin Mast
...

Abstract The genomic revolution offers renewed hope of resolving rapid radiations in the Tree of Life. The development of the multispecies coalescent (MSC) model and improved gene tree estimation methods can better accommodate gene tree heterogeneity caused by incomplete lineage sorting (ILS) and gene tree estimation error stemming from the short internal branches. However, the relative influence of these factors in species tree inference is not well understood. Using anchored hybrid enrichment, we generated a data set including 423 single-copy loci from 64 taxa representing 39 families to infer the species tree of the flowering plant order Malpighiales. This order includes nine of the top ten most unstable nodes in angiosperms, which have been hypothesized to arise from the rapid radiation during the Cretaceous. Here, we show that coalescent-based methods do not resolve the backbone of Malpighiales and concatenation methods yield inconsistent estimations, providing evidence that gene tree heterogeneity is high in this clade. Despite high levels of ILS and gene tree estimation error, our simulations demonstrate that these two factors alone are insufficient to explain the lack of resolution in this order. To explore this further, we examined triplet frequencies among empirical gene trees and discovered some of them deviated significantly from those attributed to ILS and estimation error, suggesting gene flow as an additional and previously unappreciated phenomenon promoting gene tree variation in Malpighiales. Finally, we applied a novel method to quantify the relative contribution of these three primary sources of gene tree heterogeneity and demonstrated that ILS, gene tree estimation error, and gene flow contributed to 10.0%, 34.8%, and 21.4% of the variation, respectively. 
Together, our results suggest that a perfect storm of factors likely influence this lack of resolution, and further indicate that recalcitrant phylogenetic relationships like the backbone of Malpighiales may be better represented as phylogenetic networks. Thus, reducing such groups solely to existing models that adhere strictly to bifurcating trees greatly oversimplifies reality, and obscures our ability to more clearly discern the process of evolution.
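
The triplet-frequency reasoning in this abstract rests on a standard expectation: under ILS alone, the two minor (discordant) topologies of a rooted triplet should appear in roughly equal numbers among gene trees, so a strong imbalance hints at gene flow. The Python sketch below illustrates that idea with a one-degree-of-freedom chi-square test on the two discordant counts. It is a simplified illustration, not the method used in the study (which also accounts for gene tree estimation error through simulation); the function name `minor_triplet_asymmetry` and its inputs are hypothetical.

```python
import math
from typing import Tuple


def minor_triplet_asymmetry(n_minor_1: int, n_minor_2: int) -> Tuple[float, float]:
    """Chi-square test (1 d.f.) that two discordant triplet counts are equal.

    Hypothetical helper: n_minor_1 and n_minor_2 are the numbers of gene
    trees showing each of the two topologies that disagree with the
    species tree for one triplet of taxa.
    Returns (chi-square statistic, p-value).
    """
    if n_minor_1 < 0 or n_minor_2 < 0:
        raise ValueError("counts must be non-negative")
    total = n_minor_1 + n_minor_2
    if total == 0:
        raise ValueError("no discordant gene trees to test")

    expected = total / 2.0  # equal frequencies expected under ILS alone
    chi2 = ((n_minor_1 - expected) ** 2 + (n_minor_2 - expected) ** 2) / expected

    # For 1 degree of freedom, the chi-square survival function
    # equals erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value
```

For example, `minor_triplet_asymmetry(80, 20)` gives a chi-square statistic of 36.0 with a very small p-value, whereas balanced counts such as `(50, 50)` give a statistic of 0.0. When many triplets are tested, a multiple-testing correction would be needed.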


Insects
2021
Vol 12 (5)
pp. 455
Author(s):
Na Ra Jeong
Min Jee Kim
Sung-Soo Kim
Sei-Woong Choi
Iksoo Kim

Conogethes pinicolalis has long been considered as a Pinaceae-feeding type of the yellow peach moth, C. punctiferalis, in Korea. In this study, the divergence of C. pinicolalis from the fruit-feeding moth C. punctiferalis was analyzed in terms of morphology, ecology, and genetics. C. pinicolalis differs from C. punctiferalis in several morphological features. Through field observation, we confirmed that pine trees are the host plants for the first generation of C. pinicolalis larvae, in contrast to fruit-feeding C. punctiferalis larvae. We successfully reared C. pinicolalis larvae to adults by providing them pine needles as a diet. From a genetic perspective, the sequences of mitochondrial COI of these two species substantially diverged by an average of 5.46%; moreover, phylogenetic analysis clearly assigned each species to an independent clade. On the other hand, nuclear EF1α showed a lower sequence divergence (2.10%) than COI. Overall, EF1α-based phylogenetic analysis confirmed each species as an independent clade, but a few haplotypes of EF1α indicated incomplete lineage sorting between these two species. In conclusion, our results demonstrate that C. pinicolalis is an independent species according to general taxonomic criteria; however, analysis of the EF1α sequence revealed a short divergence time.

