Construction and annotation of large phylogenetic trees

2007 ◽  
Vol 20 (4) ◽  
pp. 287 ◽  
Author(s):  
Michael J. Sanderson

Broad availability of molecular sequence data allows construction of phylogenetic trees with 1000s or even 10 000s of taxa. This paper reviews methodological, technological and empirical issues raised in phylogenetic inference at this scale. Numerous algorithmic and computational challenges have been identified surrounding the core problem of reconstructing large trees accurately from sequence data, but many other obstacles, both upstream and downstream of this step, are less well understood. Before phylogenetic analysis, data must be generated de novo or extracted from existing databases, compiled into blocks of homologous data with controlled properties, aligned, examined for the presence of gene duplications or other kinds of complicating factors, and finally, combined with other evidence via supermatrix or supertree approaches. After phylogenetic analysis, confidence assessments are usually reported, along with other kinds of annotations, such as clade names, or annotations requiring additional inference procedures, such as trait evolution or divergence time estimates. Prospects for partial automation of large-tree construction are also discussed, as well as risks associated with ‘outsourcing’ phylogenetic inference beyond the systematics community.

2017 ◽  
Author(s):  
Mario dos Reis ◽  
Gregg F. Gunnell ◽  
José Barba-Montoya ◽  
Alex Wilkins ◽  
Ziheng Yang ◽  
...  

AbstractPrimates have long been a test case for the development of phylogenetic methods for divergence time estimation. Despite a large number of studies, however, the timing of origination of crown Primates relative to the K-Pg boundary and the timing of diversification of the main crown groups remain controversial. Here we analysed a dataset of 372 taxa (367 Primates and 5 outgroups, 61 thousand base pairs) that includes nine complete primate genomes (3.4 million base pairs). We systematically explore the effect of different interpretations of fossil calibrations and molecular clock models on primate divergence time estimates. We find that even small differences in the construction of fossil calibrations can have a noticeable impact on estimated divergence times, especially for the oldest nodes in the tree. Notably, choice of molecular rate model (auto-correlated or independently distributed rates) has an especially strong effect on estimated times, with the independent rates model producing considerably more ancient estimates for the deeper nodes in the phylogeny. We implement thermodynamic integration, combined with Gaussian quadrature, in the program MCMCTree, and use it to calculate Bayes factors for clock models. Bayesian model selection indicates that the auto-correlated rates model fits the primate data substantially better, and we conclude that time estimates under this model should be preferred. We show that for eight core nodes in the phylogeny, uncertainty in time estimates is close to the theoretical limit imposed by fossil uncertainties. Thus, these estimates are unlikely to be improved by collecting additional molecular sequence data. All analyses place the origin of Primates close to the K-Pg boundary, either in the Cretaceous or straddling the boundary into the Palaeogene.


2006 ◽  
Vol 2 (4) ◽  
pp. 543-547 ◽  
Author(s):  
Per G.P Ericson ◽  
Cajsa L Anderson ◽  
Tom Britton ◽  
Andrzej Elzanowski ◽  
Ulf S Johansson ◽  
...  

Patterns of diversification and timing of evolution within Neoaves, which includes almost 95% of all bird species, are virtually unknown. On the other hand, molecular data consistently indicate a Cretaceous origin of many neoavian lineages and the fossil record seems to support an Early Tertiary diversification. Here, we present the first well-resolved molecular phylogeny for Neoaves, together with divergence time estimates calibrated with a large number of stratigraphically and phylogenetically well-documented fossils. Our study defines several well-supported clades within Neoaves. The calibration results suggest that Neoaves, after an initial split from Galloanseres in Mid-Cretaceous, diversified around or soon after the K/T boundary. Our results thus do not contradict palaeontological data and show that there is no solid molecular evidence for an extensive pre-Tertiary radiation of Neoaves.


2016 ◽  
Vol 47 (2) ◽  
pp. 149-179 ◽  
Author(s):  
Werner P. Strümpher ◽  
Catherine L. Sole ◽  
Martin H. Villet ◽  
Clarke H. Scholtz

The namePhoberus capensis(Scholtz) is applied to a small flightless, keratinophagous beetle endemic to the Cape Floristic Region of South Africa. Its gross distribution stretches from roughly 1000 km from the Cederberg (32°24’22” S, 19°04’50” E) to Grahamstown (33°20’07” S, 26°32’50” E). The populations are spatially discrete, restricted to relict forests of the southern Cape and disjunct high montane refugia of the Cape Fold Mountains. We test the hypothesis that there is more than one distinct species nested within the nameP.capensis. Phylogenetic relationships among populations were inferred using molecular sequence data. The results support three distinct evolutionary lineages, which were also supported by morphological characters. Divergence time estimates suggest Pliocene-Pleistocene diversification. Based on these results, it is suggested that theP.capensislineage experienced climatically-driven allopatric speciation with sheltered Afrotemperate forests and high mountain peaks serving as important refugia in response to climatic ameliorations. TheP. capensiscomplex thus represents a speciation process in which flight-restricted populations evolved in close allopatry, possibly as recently as the Pleistocene. Two divergent and geographically distinct lineages are described as novel species: The new species,P.disjunctussp. n. andP.herminaesp. n., are illustrated by photographs of habitus and male aedeagi.


2017 ◽  
Author(s):  
Charles S. P. Foster ◽  
Simon Y. W. Ho

AbstractEvolutionary timescales can be inferred from molecular sequence data using a Bayesian phylogenetic approach. In these methods, the molecular clock is often calibrated using fossil data. The uncertainty in these fossil calibrations is important because it determines the limiting posterior distribution for divergence-time estimates as the sequence length tends to infinity. Here we investigate how the accuracy and precision of Bayesian divergence-time estimates improve with the increased clock-partitioning of genome-scale data into clock-subsets. We focus on a data set comprising plastome-scale sequences of 52 angiosperm taxa. There was little difference among the Bayesian date estimates whether we chose clock-subsets based on patterns of among-lineage rate heterogeneity or relative rates across genes, or by random assignment. Increasing the degree of clock-partitioning usually led to an improvement in the precision of divergence-time estimates, but this increase was asymptotic to a limit presumably imposed by fossil calibrations. Our clock-partitioning approaches yielded highly precise age estimates for several key nodes in the angiosperm phylogeny. For example, when partitioning the data into 20 clock-subsets based on patterns of among-lineage rate heterogeneity, we inferred crown angiosperms to have arisen 198–178 Ma. This demonstrates that judicious clock-partitioning can improve the precision of molecular dating based on phylogenomic data, but the meaning of this increased precision should be considered critically.


Paleobiology ◽  
1994 ◽  
Vol 20 (3) ◽  
pp. 259-273 ◽  
Author(s):  
Andrew B. Smith ◽  
D. T. J. Littlewood

Molecular data are becoming an indispensable tool for the reconstruction of phylogenies. Fossil molecular data remain scarce, but have the potential to resolve patterns of deep branching and provide empirical tests of tree reconstruction techniques. A total evidence approach, combining and comparing complementary morphological, molecular and stratigraphical data from both recent and fossil taxa, is advocated as the most promising way forward because there are several well-established problems that can afflict the analysis of molecular sequence data sometimes resulting in spurious tree topologies. The integration of evidence allows us to: (1) choose suitable taxa for molecular phylogenetic analysis for the question at hand; (2) discriminate between conflicting hypotheses of taxonomic relationship and phylogeny; (3) evaluate procedures and assumptions underlying methods of building trees; and (4) estimate rates of molecular evolution in the geological past. Paleontology offers a set of independent data for comparison and corroboration of analyses and provides the only direct means of calibrating molecular trees, thus giving insight into rates of molecular evolution in the geological past.


2020 ◽  
Vol 55 (4) ◽  
pp. 520-546
Author(s):  
Chengcai Si ◽  
Keke Chen ◽  
Ruisong Tao ◽  
Chengyong Su ◽  
Junye Ma ◽  
...  

Abstract Parnassius (Lepidoptera: Papilionidae) is a genus of attractive butterflies mainly distributed in the mountainous areas of Central Asia, the Himalayas, and western China. In this study, we used the internal transcribed spacer (ITS1 and ITS2) sequence data as DNA barcodes to characterize the genetic differentiation and conduct the phylogenetic analysis and divergence time estimation of the 17 Parnassius species collected in China. Species identification and genetic differentiation analysis suggest that the ITS barcode is an effective marker for Parnassius species identification; additionally, a relatively high level of genetic diversity and low level of gene flow were detected in the five Parnassius species with diverse geographic populations. Phylogenetic analysis indicates that the 17 species studied were clustered in six clades (subgenera), with subgenus Parnassius at the basal position in the phylogenetic trees. Bayesian divergence time estimation shows that the genus originated about 18 million years ago during the early Miocene, correlated with orogenic events in the distribution region, probably southwestern China about 20–10 million years ago. Our estimated phylochronology also suggests that the Parnassius interspecific and intraspecific divergences were probably related with the rapid rising of the Qinghai-Tibet Plateau, the Tibet Movement, the Kunlun-Yellow River Tectonic Movement, and global cooling associated with intensified glaciation in the region during the Quaternary Period.


Fossil Record ◽  
2017 ◽  
Vol 20 (2) ◽  
pp. 147-157 ◽  
Author(s):  
Kathrin Feldberg ◽  
Jiří Váňa ◽  
Alfons Schäfer-Verwimp ◽  
Michael Krings ◽  
Carsten Gröhn ◽  
...  

Abstract. A revision of the Baltic and Bitterfeld amber fossils assigned to Cylindrocolea dimorpha (Cephaloziellaceae) has yielded evidence of the presence of multicellular, bifid underleaves, which have not previously been reported for this species and conflict with the current circumscription of the family. We transfer the fossil species to Odontoschisma (sect. Iwatsukia) and propose the new combination O. dimorpha of the Cephaloziaceae. Characteristics of the fossil include an overall small size of the plant, entire-margined, bifid leaves and underleaves, more or less equally thickened leaf cell walls, ventral branching that includes stoloniform branches with reduced leaves, and the lack of a stem hyalodermis and gemmae. Placement of the fossil in Cephaloziaceae profoundly affects divergence time estimates for liverworts based on DNA sequence variation with integrated information from the fossil record. Our reclassification concurs with hypotheses on the divergence times of Cephaloziaceae derived from DNA sequence data that provide evidence of a late Early Cretaceous to early Eocene age of the Odontoschisma crown group and an origin of O. sect. Iwatsukia in the Late Cretaceous to Oligocene.


2000 ◽  
Vol 74 (5) ◽  
pp. 839-852 ◽  
Author(s):  
D. M. Haasl

Phylogenetic relationships within the neogastropod family Nassariidae are poorly understood as are relationships between the Nassariidae and other fossil and extant buccinid taxa. The poor resolution of nassariid and buccinoidean relationships is due to: 1) the complex distribution among these gastropods of characters commonly used in classification; 2) a number of Mesozoic and Paleogene genera whose relationships to extant buccinoidean lineages are poorly constrained; and 3) a lack of previous efforts to address these problems on a rigorous, phylogenetic basis.The results of a phylogenetic analysis of nassariid genera did not decisively support the monophyly of the family. The buccinid subfamily Photinae was an extant sister group to the Nassariinae in a phylogenetic analysis of extant taxa and on many cladograms from an analysis combining fossil and extant taxa. In addition, Buccitriton (representing the Paleogene Tritiaria group) was a sister taxon to the Nassariinae in all analyses in which it was included, regardless of the identity of the extant nassariine sister group. This suggests that the photines, which likely arose from a Tritiaria ancestor, are the closest living relatives to the Nassariinae. Many Paleogene fossil “buccinoid” taxa appear to be more distantly related to the Nassariinae and possibly to the rest of the nassariids as well. Stratigraphic range data combined with the results of this study suggest that the Nassariinae diversified rapidly in the early Miocene and achieved a cosmopolitan distribution early in their history. A largely Indo-Pacific subclade was consistently deeply-nested within the Nassariinae, suggesting that nassariines invaded the Indo-Pacific region most recently. The timing of this invasion is difficult to estimate but had occurred by the end of the Miocene. Further analyses using molecular sequence data, relative stratigraphic position, or focusing in more detail on the Paleogene taxa are required to resolve the identity of the sister group to the Nassariinae with greater confidence.


2013 ◽  
Vol 280 (1755) ◽  
pp. 20122686 ◽  
Author(s):  
Sophie Cardinal ◽  
Bryan N. Danforth

Reliable estimates on the ages of the major bee clades are needed to further understand the evolutionary history of bees and their close association with flowering plants. Divergence times have been estimated for a few groups of bees, but no study has yet provided estimates for all major bee lineages. To date the origin of bees and their major clades, we first perform a phylogenetic analysis of bees including representatives from every extant family, subfamily and almost all tribes, using sequence data from seven genes. We then use this phylogeny to place 14 time calibration points based on information from the fossil record for an uncorrelated relaxed clock divergence time analysis taking into account uncertainties in phylogenetic relationships and the fossil record. We explore the effect of placing a hard upper age bound near the root of the tree and the effect of different topologies on our divergence time estimates. We estimate that crown bees originated approximately 123 Ma (million years ago) (113–132 Ma), concurrently with the origin or diversification of the eudicots, a group comprising 75 per cent of angiosperm species. All of the major bee clades are estimated to have originated during the Middle to Late Cretaceous, which is when angiosperms became the dominant group of land plants.


MycoKeys ◽  
2019 ◽  
Vol 49 ◽  
pp. 99-129 ◽  
Author(s):  
Sheng-Nan Zhang ◽  
Kevin D. Hyde ◽  
E.B. Gareth Jones ◽  
Rajesh Jeewon ◽  
Ratchadawan Cheewangkoon ◽  
...  

Palms represent the most morphological diverse monocotyledonous plants and support a vast array of fungi. Recent examinations of palmicolous fungi in Thailand led to the discovery of a group of morphologically similar and interesting taxa. A polyphasic approach based on morphology, multi-gene phylogenetic analyses and divergence time estimates supports the establishment of a novel pleosporalean family Striatiguttulaceae, which diversified approximately 39 (20–63) MYA (crown age) and 60 (35–91) MYA (stem age). Striatiguttulaceae is characterized by stromata or ascomata with a short to long neck, trabeculate pseudoparaphyses and fusiform to ellipsoidal, 1–3-septate ascospores, with longitudinal striations and paler end cells, surrounded by a mucilaginous sheath. Multi-gene phylogenetic analysis showed that taxa of Striatiguttulaceae form a well-supported and distinct monophyletic clade in Pleosporales, and related to Ligninsphaeriaceae and Pseudoastrosphaeriellaceae. However, these families can be morphologically demarcated by the slit-like ascomata and extremely large ascospores in Ligninsphaeriaceae and the rather narrow fusiform ascospores in Pseudoastrosphaeriellaceae. Eight strains of Striatiguttulaceae formed two monophyletic sub-clades, which can be recognized as Longicorpusgen. nov. and Striatiguttulagen. nov. Morphologically, the genus Longicorpus can be differentiated from Striatiguttula by its elongated immersed ascomata and fusiform ascospores with relatively larger middle cells and paler end cells. Two new species Striatiguttulanypae and S.phoenicis, and one new combination, Longicorpusstriataspora are introduced with morphological details, and phylogenetic relationships are discussed based on DNA sequence data.


Sign in / Sign up

Export Citation Format

Share Document