scholarly journals Structural rearrangements drive extensive genome divergence between symbiotic and free-living Symbiodinium

2019 ◽  
Author(s):  
Raúl A. González-Pech ◽  
Timothy G. Stephens ◽  
Yibi Chen ◽  
Amin R. Mohamed ◽  
Yuanyuan Cheng ◽  
...  

AbstractSymbiodiniaceae are predominantly symbiotic dinoflagellates critical to corals and other reef organisms. Symbiodinium is a basal symbiodiniacean lineage and includes symbiotic and free-living taxa. However, the molecular mechanisms underpinning these distinct lifestyles remain little known. Here, we present high-quality de novo genome assemblies for the symbiotic Symbiodinium tridacnidorum CCMP2592 (genome size 1.3 Gbp) and the free-living Symbiodinium natans CCMP2548 (genome size 0.74 Gbp). These genomes display extensive sequence divergence, sharing only ~1.5% conserved regions (≥90% identity). We predicted 45,474 and 35,270 genes for S. tridacnidorum and S. natans, respectively; of the 58,541 homologous gene families, 28.5% are common to both genomes. We recovered a greater extent of gene duplication and higher abundance of repeats, transposable elements and pseudogenes in the genome of S. tridacnidorum than in that of S. natans. These findings demonstrate that genome structural rearrangements are pertinent to distinct lifestyles in Symbiodinium, and may contribute to the vast genetic diversity within the genus, and more broadly in Symbiodiniaceae. Moreover, the results from our whole-genome comparisons against a free-living outgroup support the notion that the symbiotic lifestyle is a derived trait in, and that the free-living lifestyle is ancestral to, Symbiodinium.

BMC Biology ◽  
2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Raúl A. González-Pech ◽  
Timothy G. Stephens ◽  
Yibi Chen ◽  
Amin R. Mohamed ◽  
Yuanyuan Cheng ◽  
...  

Abstract Background Dinoflagellates in the family Symbiodiniaceae are important photosynthetic symbionts in cnidarians (such as corals) and other coral reef organisms. Breakdown of the coral-dinoflagellate symbiosis due to environmental stress (i.e. coral bleaching) can lead to coral death and the potential collapse of reef ecosystems. However, evolution of Symbiodiniaceae genomes, and its implications for the coral, is little understood. Genome sequences of Symbiodiniaceae remain scarce due in part to their large genome sizes (1–5 Gbp) and idiosyncratic genome features. Results Here, we present de novo genome assemblies of seven members of the genus Symbiodinium, of which two are free-living, one is an opportunistic symbiont, and the remainder are mutualistic symbionts. Integrating other available data, we compare 15 dinoflagellate genomes revealing high sequence and structural divergence. Divergence among some Symbiodinium isolates is comparable to that among distinct genera of Symbiodiniaceae. We also recovered hundreds of gene families specific to each lineage, many of which encode unknown functions. An in-depth comparison between the genomes of the symbiotic Symbiodinium tridacnidorum (isolated from a coral) and the free-living Symbiodinium natans reveals a greater prevalence of transposable elements, genetic duplication, structural rearrangements, and pseudogenisation in the symbiotic species. Conclusions Our results underscore the potential impact of lifestyle on lineage-specific gene-function innovation, genome divergence, and the diversification of Symbiodinium and Symbiodiniaceae. The divergent features we report, and their putative causes, may also apply to other microbial eukaryotes that have undergone symbiotic phases in their evolutionary history.


2020 ◽  
Vol 10 (10) ◽  
pp. 3467-3478 ◽  
Author(s):  
Peter M. Thielen ◽  
Amanda L. Pendleton ◽  
Robert A. Player ◽  
Kenneth V. Bowden ◽  
Thomas J. Lawton ◽  
...  

Setaria viridis (green foxtail) is an important model system for improving cereal crops due to its diploid genome, ease of cultivation, and use of C4 photosynthesis. The S. viridis accession ME034V is exceptionally transformable, but the lack of a sequenced genome for this accession has limited its utility. We present a 397 Mb highly contiguous de novo assembly of ME034V using ultra-long nanopore sequencing technology (read N50 = 41kb). We estimate that this genome is largely complete based on our updated k-mer based genome size estimate of 401 Mb for S. viridis. Genome annotation identified 37,908 protein-coding genes and >300k repetitive elements comprising 46% of the genome. We compared the ME034V assembly with two other previously sequenced Setaria genomes as well as to a diversity panel of 235 S. viridis accessions. We found the genome assemblies to be largely syntenic, but numerous unique polymorphic structural variants were discovered. Several ME034V deletions may be associated with recent retrotransposition of copia and gypsy LTR repeat families, as evidenced by their low genotype frequencies in the sampled population. Lastly, we performed a phylogenomic analysis to identify gene families that have expanded in Setaria, including those involved in specialized metabolism and plant defense response. The high continuity of the ME034V genome assembly validates the utility of ultra-long DNA sequencing to improve genetic resources for emerging model organisms. Structural variation present in Setaria illustrates the importance of obtaining the proper genome reference for genetic experiments. Thus, we anticipate that the ME034V genome will be of significant utility for the Setaria research community.


2019 ◽  
Author(s):  
Raúl A. González-Pech ◽  
Yibi Chen ◽  
Timothy G. Stephens ◽  
Sarah Shah ◽  
Amin R. Mohamed ◽  
...  

AbstractDinoflagellates of the family Symbiodiniaceae (Order Suessiales) are predominantly symbiotic, and many are known for their association with corals. The genetic and functional diversity among Symbiodiniaceae is well acknowledged, but the genome-wide sequence divergence among these lineages remains little known. Here, we present de novo genome assemblies of five isolates from the basal genus Symbiodinium, encompassing distinct ecological niches. Incorporating existing data from Symbiodiniaceae and other Suessiales (15 genome datasets in total), we investigated genome features that are common or unique to these Symbiodiniaceae, to genus Symbiodinium, and to the individual species S. microadriaticum and S. tridacnidorum. Our whole-genome comparisons reveal extensive sequence divergence, with no sequence regions common to all 15. Based on similarity of k-mers from whole-genome sequences, the distances among Symbiodinium isolates are similar to those between isolates of distinct genera. We observed extensive structural rearrangements among symbiodiniacean genomes; those from two distinct Symbiodinium species share the most (853) syntenic gene blocks. Functions enriched in genes core to Symbiodiniaceae are also enriched in those core to Symbiodinium. Gene functions related to symbiosis and stress response exhibit similar relative abundance in all analysed genomes. Our results suggest that structural rearrangements contribute to genome sequence divergence in Symbiodiniaceae even within a same species, but the gene functions have remained largely conserved in Suessiales. This is the first comprehensive comparison of Symbiodiniaceae based on whole-genome sequence data, including comparisons at the intra-genus and intra-species levels.


2021 ◽  
Vol 118 (42) ◽  
pp. e2025711118
Author(s):  
Quanjun Hu ◽  
Yazhen Ma ◽  
Terezie Mandáková ◽  
Sheng Shi ◽  
Chunlin Chen ◽  
...  

Deserts exert strong selection pressures on plants, but the underlying genomic drivers of ecological adaptation and subsequent speciation remain largely unknown. Here, we generated de novo genome assemblies and conducted population genomic analyses of the psammophytic genus Pugionium (Brassicaceae). Our results indicated that this bispecific genus had undergone an allopolyploid event, and the two parental genomes were derived from two ancestral lineages with different chromosome numbers and structures. The postpolyploid expansion of gene families related to abiotic stress responses and lignin biosynthesis facilitated environmental adaptations of the genus to desert habitats. Population genomic analyses of both species further revealed their recent divergence with continuous gene flow, and the most divergent regions were found to be centered on three highly structurally reshuffled chromosomes. Genes under selection in these regions, which were mainly located in one of the two subgenomes, contributed greatly to the interspecific divergence in microhabitat adaptation.


2020 ◽  
Author(s):  
Ke Cao ◽  
Zhen Peng ◽  
Xing Zhao ◽  
Yong Li ◽  
Kuozhan Liu ◽  
...  

AbstractAs a foundation to understand the molecular mechanisms of peach evolution and high-altitude adaptation, we performed de novo genome assembling of four wild relatives of P. persica, P. mira, P. kansuensis, P. davidiana and P. ferganensis. Through comparative genomic analysis, abundant genetic variations were identified in four wild species when compared to P. persica. Among them, a deletion, located at the promoter of Prupe.2G053600 in P. kansuensis, was validated to regulate the resistance to nematode. Next, a pan-genome was constructed which comprised 15,216 core gene families among four wild peaches and P. perisca. We identified the expanded and contracted gene families in different species and investigated their roles during peach evolution. Our results indicated that P. mira was the primitive ancestor of cultivated peach, and peach evolution was non-linear and a cross event might have occurred between P. mira and P. dulcis during the process. Combined with the selective sweeps identified using accessions of P. mira originating from different altitude regions, we proposed that nitrogen recovery was essential for high-altitude adaptation of P. mira through increasing its resistance to low temperature. The pan-genome constructed in our study provides a valuable resource for developing elite cultivars, studying the peach evolution, and characterizing the high-altitude adaptation in perennial crops.


2017 ◽  
Author(s):  
Rachael E. Workman ◽  
Alexander M. Myrka ◽  
Elizabeth Tseng ◽  
G. William Wong ◽  
Kenneth C. Welch ◽  
...  

AbstractHummingbirds can support their high metabolic rates exclusively by oxidizing ingested sugars, which is unsurprising given their sugar-rich nectar diet and use of energetically expensive hovering flight. However, they cannot rely on dietary sugars as a fuel during fasting periods, such as during the night, at first light, or when undertaking long-distance migratory flights, and must instead rely exclusively on onboard lipids. This metabolic flexibility is remarkable both in that the birds can switch between exclusive use of each fuel type within minutes and in that de novo lipogenesis from dietary sugar precursors is the principle way in which fat stores are built, sometimes at exceptionally high rates, such as during the few days prior to a migratory flight. The hummingbird hepatopancreas is the principle location of de novo lipogenesis and likely plays a key role in fuel selection, fuel switching, and glucose homeostasis. Yet understanding how this tissue, and the whole organism, achieves and moderates high rates of energy turnover is hampered by a fundamental lack of information regarding how genes coding for relevant enzymes differ in their sequence, expression, and regulation in these unique animals. To address this knowledge gap, we generated a de novo transcriptome of the hummingbird liver using PacBio full-length cDNA sequencing (Iso-Seq), yielding a total of 8.6Gb of sequencing data, or 2.6M reads from 4 different size fractions. We analyzed data using the SMRTAnalysis v3.1 Iso-Seq pipeline, including classification of reads and clustering of isoforms (ICE) followed by error-correction (Arrow). With COGENT, we clustered different isoforms into gene families to generate de novo gene contigs. We performed orthology analysis to identify closely related sequences between our transcriptome and other avian and human gene sets. We also aligned our transcriptome against the Calypte anna genome where possible. Finally, we closely examined homology of critical lipid metabolic genes between our transcriptome data and avian and human genomes. We confirmed high levels of sequence divergence within hummingbird lipogenic enzymes, suggesting a high probability of adaptive divergent function in the hepatic lipogenic pathways. Our results have leveraged cutting-edge technology and a novel bioinformatics pipeline to provide a compelling first direct look at the transcriptome of this incredible organism.


Genes ◽  
2021 ◽  
Vol 12 (11) ◽  
pp. 1777
Author(s):  
Xuan Zhuang ◽  
C.-H. Christina Cheng

The de novo birth of functional genes from non-coding DNA as an important contributor to new gene formation is increasingly supported by evidence from diverse eukaryotic lineages. However, many uncertainties remain, including how the incipient de novo genes would continue to evolve and the molecular mechanisms underlying their evolutionary trajectory. Here we address these questions by investigating evolutionary history of the de novo antifreeze glycoprotein (AFGP) gene and gene family in gadid (codfish) lineages. We examined AFGP phenotype on a phylogenetic framework encompassing a broad sampling of gadids from freezing and non-freezing habitats. In three select species representing different AFGP-bearing clades, we analyzed all AFGP gene family members and the broader scale AFGP genomic regions in detail. Codon usage analyses suggest that motif duplication produced the intragenic AFGP tripeptide coding repeats, and rapid sequence divergence post-duplication stabilized the recombination-prone long repetitive coding region. Genomic loci analyses support AFGP originated once from a single ancestral genomic origin, and shed light on how the de novo gene proliferated into a gene family. Results also show the processes of gene duplication and gene loss are distinctive in separate clades, and both genotype and phenotype are commensurate with differential local selective pressures.


2020 ◽  
Author(s):  
Peter M. Thielen ◽  
Amanda L. Pendleton ◽  
Robert A. Player ◽  
Kenneth V. Bowden ◽  
Thomas J. Lawton ◽  
...  

ABSTRACTSetaria viridis (green foxtail) is an important model system for improving cereal crops due to its diploid genome, ease of cultivation, and use of C4 photosynthesis. The S. viridis cultivar ME034V is exceptionally transformable, but the lack of a sequenced genome for this cultivar has limited its utility. We present a 397 Mb highly contiguous de novo assembly of ME034V using ultra-long nanopore sequencing technology (read N50=41kb). We estimate that this genome is largely complete based on our updated k-mer based genome size estimate of 401 Mb for S. viridis. Genome annotation identified 37,908 protein-coding genes and >300k repetitive elements comprising 46% of the genome. We compared the ME034V assembly with two other previously sequenced Setaria genomes as well as to a diversity panel of 235 S. viridis cultivars. We found the genome assemblies to be largely syntenic, but numerous unique polymorphic structural variants were discovered. Several ME034V deletions may be associated with recent retrotransposition of copia and gypsy LTR repeat families, as evidenced by their low genotype frequencies in the sampled population. Lastly, we performed a phylogenomic analysis to identify gene families that have expanded in Setaria, including those involved in specialized metabolism and plant defense response. The high continuity of the ME034V genome assembly validates the utility of ultra-long DNA sequencing to improve genetic resources for emerging model organisms. Structural variation present in Setaria illustrates the importance of obtaining the proper genome reference for genetic experiments. Thus, we anticipate that the ME034V genome will be of significant utility for the Setaria research community.


BMC Genomics ◽  
2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Min Shi ◽  
Zhizhi Wang ◽  
Xiqian Ye ◽  
Hongqing Xie ◽  
Fei Li ◽  
...  

Abstract Background Parasitic insects are well-known biological control agents for arthropod pests worldwide. They are capable of regulating their host’s physiology, development and behaviour. However, many of the molecular mechanisms involved in host-parasitoid interaction remain unknown. Results We sequenced the genomes of two parasitic wasps (Cotesia vestalis, and Diadromus collaris) that parasitize the diamondback moth Plutella xylostella using Illumina and Pacbio sequencing platforms. Genome assembly using SOAPdenovo produced a 178 Mb draft genome for C. vestalis and a 399 Mb draft genome for D. collaris. A total set that contained 11,278 and 15,328 protein-coding genes for C. vestalis and D. collaris, respectively, were predicted using evidence (homology-based and transcriptome-based) and de novo prediction methodology. Phylogenetic analysis showed that the braconid C. vestalis and the ichneumonid D. collaris diverged approximately 124 million years ago. These two wasps exhibit gene gains and losses that in some cases reflect their shared life history as parasitic wasps and in other cases are unique to particular species. Gene families with functions in development, nutrient acquisition from hosts, and metabolism have expanded in each wasp species, while genes required for biosynthesis of some amino acids and steroids have been lost, since these nutrients can be directly obtained from the host. Both wasp species encode a relative higher number of neprilysins (NEPs) thus far reported in arthropod genomes while several genes encoding immune-related proteins and detoxification enzymes were lost in both wasp genomes. Conclusions We present the annotated genome sequence of two parasitic wasps C. vestalis and D. collaris, which parasitize a common host, the diamondback moth, P. xylostella. These data will provide a fundamental source for studying the mechanism of host control and will be used in parasitoid comparative genomics to study the origin and diversification of the parasitic lifestyle.


2019 ◽  
Author(s):  
Yibi Chen ◽  
Raúl A. González-Pech ◽  
Timothy G. Stephens ◽  
Debashish Bhattacharya ◽  
Cheong Xin Chan

AbstractComparative algal genomics often relies on predicted gene models from de novo assembled genomes. However, the artifacts introduced by different gene-prediction approaches, and their impact on comparative genomic analysis, remain poorly understood. Here, using available genome data from six dinoflagellate species in Symbiodiniaceae, we identified potential methodological biases in the published gene models that were predicted using different approaches. We developed and applied a comprehensive customized workflow to predict genes from these genomes. The observed variation among predicted gene models resulting from our workflow agreed with current understanding of phylogenetic relationships among these taxa, whereas those published earlier were largely biased by the distinct approaches used in each instance. Importantly, these biases mislead the inference of homologous gene families and synteny among genomes, thus impacting biological interpretation of these data. Our results demonstrate that a consistent gene-prediction approach is critical for comparative genomics, particularly for non-model algal genomes.


Sign in / Sign up

Export Citation Format

Share Document