scholarly journals Divergence dating using mixed effects clock modelling: An application to HIV-1

2019 ◽  
Vol 5 (2) ◽  
Author(s):  
Magda Bletsa ◽  
Marc A Suchard ◽  
Xiang Ji ◽  
Sophie Gryseels ◽  
Bram Vrancken ◽  
...  

Abstract The need to estimate divergence times in evolutionary histories in the presence of various sources of substitution rate variation has stimulated a rich development of relaxed molecular clock models. Viral evolutionary studies frequently adopt an uncorrelated clock model as a generic relaxed molecular clock process, but this may impose considerable estimation bias if discrete rate variation exists among clades or lineages. For HIV-1 group M, rate variation among subtypes has been shown to result in inconsistencies in time to the most recent common ancestor estimation. Although this calls into question the adequacy of available molecular dating methods, no solution to this problem has been offered so far. Here, we investigate the use of mixed effects molecular clock models, which combine both fixed and random effects in the evolutionary rate, to estimate divergence times. Using simulation, we demonstrate that this model outperforms existing molecular clock models in a Bayesian framework for estimating time-measured phylogenies in the presence of mixed sources of rate variation, while also maintaining good performance in simpler scenarios. By analysing a comprehensive HIV-1 group M complete genome data set we confirm considerable rate variation among subtypes that is not adequately modelled by uncorrelated relaxed clock models. The mixed effects clock model can accommodate this rate variation and results in a time to the most recent common ancestor of HIV-1 group M of 1920 (1915–25), which is only slightly earlier than the uncorrelated relaxed clock estimate for the same data set. The use of complete genome data appears to have a more profound impact than the molecular clock model because it reduces the credible intervals by 50 per cent relative to similar estimates based on short envelope gene sequences.

2010 ◽  
Vol 7 (11) ◽  
pp. 3387-3402 ◽  
Author(s):  
S. Trajanovski ◽  
C. Albrecht ◽  
K. Schreiber ◽  
R. Schultheiß ◽  
T. Stadler ◽  
...  

Abstract. Ancient Lake Ohrid on the Balkan Peninsula is considered to be the oldest ancient lake in Europe with a suggested Plio-/Pleistocene age. Its exact geological age, however, remains unknown. Therefore, molecular clock data of Lake Ohrid biota may serve as an independent constraint of available geological data, and may thus help to refine age estimates. Such evolutionary data may also help unravel potential biotic and abiotic factors that promote speciation events. Here, mitochondrial sequencing data of one of the largest groups of endemic taxa in the Ohrid watershed, the leech genus Dina, is used to test whether it represents an ancient lake species flock, to study the role of potential horizontal and vertical barriers in the watershed for evolutionary events, to estimate the onset of diversification in this group based on molecular clock analyses, and to compare this data with data from other endemic species for providing an approximate time frame for the origin of Lake Ohrid. Based on the criteria speciosity, monophyly and endemicity, it can be concluded that Dina spp. from the Ohrid watershed, indeed, represents an ancient lake species flock. Lineage sorting of its species, however, does not seem to be complete and/or hybridization may occur. Analyses of population structures of Dina spp. in the Ohrid watershed indicate a horizontal zonation of haplotypes from spring and lake populations, corroborating the role of lake-side springs, particularly the southern feeder springs, for evolutionary processes in endemic Ohrid taxa. Vertical differentiation of lake taxa, however, appears to be limited, though differences between populations from the littoral and the profundal are apparent. Molecular clock analyses indicate that the most recent common ancestor of extant species of this flock is approximately 1.99 ± 0.83 million years (Ma) old, whereas the split of the Ohrid Dina flock from a potential sister taxon outside the lake is estimated at 8.30 ± 3.60 Ma. Comparisons with other groups of endemic Ohrid species indicated that in all cases, diversification within the watershed started ≤2 Ma ago. Thus, this estimate may provide information on a minimum age for the origin of Lake Ohrid. Maximum ages are less consistent and generally less reliable. But cautiously, a maximum age of 3 Ma is suggested. Interestingly, this time frame of approximately 2–3 Ma ago for the origin of Lake Ohrid, generated based on genetic data, well fits the time frame most often used in the literature by geologists.


2010 ◽  
Vol 7 (4) ◽  
pp. 5011-5045 ◽  
Author(s):  
S. Trajanovski ◽  
C. Albrecht ◽  
K. Schreiber ◽  
R. Schultheiß ◽  
T. Stadler ◽  
...  

Abstract. Ancient Lake Ohrid on the Balkan Peninsula is considered to be the oldest ancient lake in Europe with a suggested Plio-Pleistocene age. Its exact geological age, however, remains unknown. Therefore, molecular clock data of Lake Ohrid biota may serve as an independent constraint of available geological data, and may thus also help to refine age estimates. Such evolutionary data may also help unravel potential biotic and abiotic factors that promote speciation events. Here, mitochondrial sequencing data of one of the largest groups of endemic taxa in Lake Ohrid, the leech genus Dina, is used to test whether it represents an ancient lake species flock, to study the role of horizontal and vertical barriers in Lake Ohrid for evolutionary events, to estimate the onset of intralacustrine diversification in this group based on molecular clock analyses, and to compare this data with data from other endemic species for providing an approximate time frame for the origin of Lake Ohrid. Based on the criteria speciosity, monophyly and endemicity, it can be concluded that Lake Ohrid Dina, indeed, represents an ancient lake species flock. Lineage sorting of its species, however, does not seem to be complete. Analyses of population structures of Dina spp. in the Ohrid watershed indicate a horizontal zonation of haplotypes from spring and lake populations, corroborating the role of lake-side springs, particularly the southern feeder springs, for evolutionary processes in endemic Ohrid taxa. Vertical differentiation of lake taxa, however, appears to be limited, though differences between populations from the littoral and the profundal are apparent. Molecular clock analyses indicate that the most recent common ancestor of extant species of this flock is approximately 1.99±0.83 Ma old, whereas the split of the Lake Ohrid Dina flock from a potential sister taxon outside the lake is estimated at 8.30±3.60 Ma. Comparisons with other groups of endemic Ohrid species indicated that in all cases, intralacustrine diversification started ≤2 Ma ago. Thus, this estimate may provide information on a minimum age for the origin of Lake Ohrid. Maximum ages are less consistent and generally less reliable. But cautiously, a maximum age of 3 Ma is suggested. Interestingly, this time frame of approximately 2–3 Ma for the origin of Lake Ohrid, generated based solely on evolutionary data, well fits the time frame most often used in the literature by geologists. Future studies must show whether this concurrence holds true.


2005 ◽  
Vol 79 (3) ◽  
pp. 1595-1604 ◽  
Author(s):  
Leen Vijgen ◽  
Els Keyaerts ◽  
Elien Moës ◽  
Inge Thoelen ◽  
Elke Wollants ◽  
...  

ABSTRACT Coronaviruses are enveloped, positive-stranded RNA viruses with a genome of approximately 30 kb. Based on genetic similarities, coronaviruses are classified into three groups. Two group 2 coronaviruses, human coronavirus OC43 (HCoV-OC43) and bovine coronavirus (BCoV), show remarkable antigenic and genetic similarities. In this study, we report the first complete genome sequence (30,738 nucleotides) of the prototype HCoV-OC43 strain (ATCC VR759). Complete genome and open reading frame (ORF) analyses were performed in comparison to the BCoV genome. In the region between the spike and membrane protein genes, a 290-nucleotide deletion is present, corresponding to the absence of BCoV ORFs ns4.9 and ns4.8. Nucleotide and amino acid similarity percentages were determined for the major HCoV-OC43 ORFs and for those of other group 2 coronaviruses. The highest degree of similarity is demonstrated between HCoV-OC43 and BCoV in all ORFs with the exception of the E gene. Molecular clock analysis of the spike gene sequences of BCoV and HCoV-OC43 suggests a relatively recent zoonotic transmission event and dates their most recent common ancestor to around 1890. An evolutionary rate in the order of 4 × 10−4 nucleotide changes per site per year was estimated. This is the first animal-human zoonotic pair of coronaviruses that can be analyzed in order to gain insights into the processes of adaptation of a nonhuman coronavirus to a human host, which is important for understanding the interspecies transmission events that led to the origin of the severe acute respiratory syndrome outbreak.


2019 ◽  
Vol 17 (2) ◽  
pp. 114-125 ◽  
Author(s):  
Dmitry Neshumaev ◽  
Aleksey Lebedev ◽  
Marina Malysheva ◽  
Anatoly Boyko ◽  
Sergey Skudarnov ◽  
...  

Background:The information about the dynamics of the viral population and migration events that affect the epidemic in different parts of the Russia is insufficient. Possibly, the huge size of the country and limited transport accessibility to certain territories may determine unique traits of the HIV-1 evolutionary history in different regions.Objective:The aim of this study was to explore the genetic diversity of HIV-1 in the Krasnoyarsk region and reconstruct spatial-temporal dynamics of the infection in the region.Methods:The demographic and virologic data from 281 HIV-infected individuals in Krasnoyarsk region collected during 2011-2016 were analyzed. The time to the most recent common ancestor, evolutionary rates, population growth, and ancestral geographic movements was estimated using Bayesian coalescent-based methods.Results:The study revealed moderate diversity of the HIV-1 subtypes found in the region, which included A6 (92.3%), CRF063_02A (4.3%), B (1.1%), and unique recombinants (2.5%). Phylogenetic reconstruction revealed that the A6 subtype was introduced into Krasnoyarsk region by one viral lineage, which arose around 1996.9 (1994.5-1999.5). The phylogeography analysis pointed to Krasnoyarsk city as the geographical center of the epidemic, which further spread to central neighboring districts of the region. At least two epidemic growth phases of subtype A6 were identified which included exponential growth in early-2000s followed by the decline in the mid/late 2010s.Conclusion:This study demonstrates a change in the genetic diversity of HIV-1 in the Krasnoyarsk region. At the beginning of the epidemic, subtype A6 prevailed, subtypes B and CRF063_02A appeared in the region later.


2004 ◽  
Vol 78 (19) ◽  
pp. 10501-10506 ◽  
Author(s):  
Simon A. A. Travers ◽  
Jonathan P. Clewley ◽  
Judith R. Glynn ◽  
Paul E. M. Fine ◽  
Amelia C. Crampin ◽  
...  

ABSTRACT Human immunodeficiency virus type 1 (HIV-1) subtype C is responsible for more than 55% of HIV-1 infections worldwide. When this subtype first emerged is unknown. We have analyzed all available gag (p17 and p24) and env (C2-V3) subtype C sequences with known sampling dates, which ranged from 1983 to 2000. The majority of these sequences come from the Karonga District in Malawi and include some of the earliest known subtype C sequences. Linear regression analyses of sequence divergence estimates (with four different approaches) were plotted against sample year to estimate the year in which there was zero divergence from the reconstructed ancestral sequence. Here we suggest that the most recent common ancestor of subtype C appeared in the mid- to late 1960s. Sensitivity analyses, by which possible biases due to oversampling from one district were explored, gave very similar estimates.


Viruses ◽  
2021 ◽  
Vol 13 (1) ◽  
pp. 93
Author(s):  
Javier E. Cañada ◽  
Elena Delgado ◽  
Horacio Gil ◽  
Mónica Sánchez ◽  
Sonia Benito ◽  
...  

The extraordinary genetic variability of human immunodeficiency virus type 1 (HIV-1) group M has led to the identification of 10 subtypes, 102 circulating recombinant forms (CRFs) and numerous unique recombinant forms. Among CRFs, 11 derived from subtypes B and C have been identified in China, Brazil, and Italy. Here we identify a new HIV-1 CRF_BC in Northern Spain. Originally, a phylogenetic cluster of 15 viruses of subtype C in protease-reverse transcriptase was identified in an HIV-1 molecular surveillance study in Spain, most of them from individuals from the Basque Country and heterosexually transmitted. Analyses of near full-length genome sequences from six viruses from three cities revealed that they were BC recombinant with coincident mosaic structures different from known CRFs. This allowed the definition of a new HIV-1 CRF designated CRF108_BC, whose genome is predominantly of subtype C, with four short subtype B fragments. Phylogenetic analyses with database sequences supported a Brazilian ancestry of the parental subtype C strain. Coalescent Bayesian analyses estimated the most recent common ancestor of CRF108_BC in the city of Vitoria, Basque Country, around 2000. CRF108_BC is the first CRF_BC identified in Spain and the second in Europe, after CRF60_BC, both phylogenetically related to Brazilian subtype C strains.


2015 ◽  
Vol 90 (3) ◽  
pp. 1244-1258 ◽  
Author(s):  
Natalie N. Kinloch ◽  
Daniel R. MacMillan ◽  
Anh Q. Le ◽  
Laura A. Cotton ◽  
David R. Bangsberg ◽  
...  

ABSTRACTHuman leukocyte antigen (HLA) class I-associated polymorphisms in HIV-1 that persist upon transmission to HLA-mismatched hosts may spread in the population as the epidemic progresses. Transmission of HIV-1 sequences containing such adaptations may undermine cellular immune responses to the incoming virus in future hosts. Building upon previous work, we investigated the extent of HLA-associated polymorphism accumulation in HIV-1 polymerase (Pol) through comparative analysis of linked HIV-1/HLA class I genotypes sampled during historic (1979 to 1989;n= 338) and modern (2001 to 2011;n= 278) eras from across North America (Vancouver, BC, Canada; Boston, MA; New York, NY; and San Francisco, CA). Phylogenies inferred from historic and modern HIV-1 Pol sequences were star-like in shape, with an inferred most recent common ancestor (epidemic founder virus) sequence nearly identical to the modern North American subtype B consensus sequence. Nevertheless, modern HIV-1 Pol sequences exhibited roughly 2-fold-higher patristic (tip-to-tip) genetic distances than historic sequences, with HLA pressures likely driving ongoing diversification. Moreover, the frequencies of published HLA-associated polymorphisms in individuals lacking the selecting HLA class I allele was on average ∼2.5-fold higher in the modern than in the historic era, supporting their spread in circulation, though some remained stable in frequency during this time. Notably, polymorphisms restricted by protective HLA alleles appear to be spreading to a greater relative extent than others, though these increases are generally of modest absolute magnitude. However, despite evidence of polymorphism spread, North American hosts generally remain at relatively low risk of acquiring an HIV-1 polymerase sequence substantially preadapted to their HLA profiles, even in the present era.IMPORTANCEHLA class I-restricted cytotoxic T-lymphocyte (CTL) escape mutations in HIV-1 that persist upon transmission may accumulate in circulation over time, potentially undermining host antiviral immunity to the transmitted viral strain. We studied >600 experimentally collected HIV-1 polymerase sequences linked to host HLA information dating back to 1979, along with phylogenetically reconstructed HIV-1 sequences dating back to the virus' introduction into North America. Overall, our results support the gradual spread of many—though not all—HIV-1 polymerase immune escape mutations in circulation over time. This is consistent with recent observations from other global regions, though the extent of polymorphism accumulation in North America appears to be lower than in populations with high seroprevalence, older epidemics, and/or limited HLA diversity. Importantly, the risk of acquiring an HIV-1 polymerase sequence at transmission that is substantially preadapted to one's HLA profile remains relatively low in North America, even in the present era.


2003 ◽  
Vol 17 (4) ◽  
pp. 605 ◽  
Author(s):  
Philip S. Ward ◽  
Seán G. Brady

We investigated phylogenetic relationships among the 'primitive' Australian ant genera Myrmecia and Nothomyrmecia (stat. rev.) and the Baltic amber fossil genus Prionomyrmex, using a combination of morphological and molecular data. Outgroups for the analysis included representatives from a variety of potential sister-groups, including five extant subfamilies of ants and one extinct group (Sphecomyrminae). Parsimony analysis of the morphological data provides strong support (~95% bootstrap proportions) for the monophyly of (1) genus Myrmecia, (2) genus Prionomyrmex, and (3) a clade containing those two genera plus Nothomyrmecia. A group comprising Nothomyrmecia and Prionomyrmex is also upheld (85% bootstrap support). Molecular sequence data (~2200 base pairs from the 18S and 28S ribosomal RNA genes) corroborate these findings for extant taxa, with Myrmecia and Nothomyrmecia appearing as sister-groups with ~100% bootstrap support under parsimony, neighbour-joining and maximum-likelihood analyses. Neither the molecular nor the morphological data set allows us to identify unambiguously the sister-group of (Myrmecia + (Nothomyrmecia + Prionomyrmex)). Rather, Myrmecia and relatives are part of an unresolved polytomy that encompasses most of the ant subfamilies. Taken as a whole, our results support the contention that many of the major lineages of ants – including a clade that later came to contain Myrmecia, Nothomyrmecia and Prionomyrmex – arose at around the same time during a bout of diversification in the middle or late Cretaceous. On the basis of Bayesian dating analysis, the estimated age of the most recent common ancestor of Myrmecia and Nothomyrmecia is 74 million years (95% confidence limits, 53–101�million years), a result consistent with the origin of the myrmeciine stem lineage in the Cretaceous. The ant subfamily Myrmeciinae is redefined to contain two tribes, Myrmeciini (genus Myrmecia) and Prionomyrmecini (Nothomyrmecia and Prionomyrmex). Phylogenetic analysis of the enigmatic Argentine fossils Ameghinoia and Polanskiella demonstrates that they are also members of the Myrmeciinae, probably more closely related to Prionomyrmecini than to Myrmeciini. Thus, the myrmeciine ants appear to be a formerly widespread group that retained many ancestral formicid characteristics and that became extinct everywhere except in the Australian region.


2018 ◽  
Author(s):  
Abayomi S Olabode ◽  
Mariano Avino ◽  
Tammy Ng ◽  
Faisal Abu-Sardanah ◽  
David W Dick ◽  
...  

AbstractReconstructing the early dynamics of the HIV-1 pandemic can provide crucial insights into the socioeconomic drivers of emerging infectious diseases in human populations, including the roles of urbanization and transportation networks. Current evidence indicates that the global pandemic comprising almost entirely of HIV-1/M originated around the 1920s in central Africa. However, these estimates are based on molecular clock estimates that are assumed to apply uniformly across the virus genome. There is growing evidence that recombination has played a significant role in the early history of the HIV-1 pandemic, such that different regions of the HIV-1 genome have different evolutionary histories. In this study, we have conducted a dated-tip analysis of all near full-length HIV-1/M genome sequences that were published in the GenBank database. We used a sliding window approach similar to the ‘bootscanning’ method for detecting breakpoints in intersubtype recombinant sequences. We found evidence of substantial variation in estimated root dates among windows, with an estimated mean time to the most recent common ancestor (tMRCA) of 1922. Estimates were significantly autocorrelated, which was more consistent with an early recombination event than with stochastic error variation in phylogenetic reconstruction and dating analyses. A piecewise regression analysis supported the existence of at least one recombination breakpoint in the HIV-1/M genome with interval-specific means around 1929 and 1913, respectively. This analysis demonstrates that a sliding window approach can accommodate early recombination events outside the established nomenclature of HIV-1/M subtypes, although it is difficult to incorporate the earliest available samples due to their limited genome coverage.


Sign in / Sign up

Export Citation Format

Share Document