scholarly journals The First Transcriptome Assembly of Yenyuan Stream Salamander (Batrachuperus yenyuanensis) Provides Novel Insights into Its Molecular Evolution

2019 ◽  
Vol 20 (7) ◽  
pp. 1529 ◽  
Author(s):  
Jianli Xiong ◽  
Yunyun Lv ◽  
Yong Huang ◽  
Qiangqiang Liu

The Yenyuan stream salamander (Batrachuperus yenyuanensis) has been previously evaluated with regards to phylogeny, population genetics, and hematology, but genomic information is sparse due to the giant genome size of salamanders which contain highly repetitive sequences, thus resulting in the lack of a complete reference genome. This study evaluates the encoding genetic sequences and provides the first transcriptome assembly of Yenyuan stream salamander based on mixed samples from the liver, spermary, muscle and spleen tissues. Using this transcriptome assembly and available encoding sequences from other vertebrates, the gene families, phylogenetic status, and species divergence time were compared or estimated. A total of 13,750 encoding sequences were successfully obtained from the transcriptome assembly of Yenyuan stream salamander, estimated to contain 40.1% of the unigenes represented in tetrapod databases. A total of 88.79% of these genes could be annotated to a biological function by current databases. Through gene family clustering, we found multiple possible isoforms of the Scribble gene—whose function is related to regeneration—based on sequence similarity. Meanwhile, we constructed a robust phylogenetic tree based on 56 single-copy orthologues, which indicates that based on phylogenetic position, the Yenyuan stream salamander presents the closest relationship with the Chinese giant salamander (Andrias davidianus) of the investigated vertebrates. Based on the fossil-calibrated phylogeny, we estimated that the lineage divergence between the ancestral Yenyuan stream salamander and the Chinese giant salamander may have occurred during the Cretaceous period (~78.4 million years ago). In conclusion, this study not only provides a candidate gene that is valuable for exploring the remarkable capacity of regeneration in the future, but also gives an interesting insight into the understanding of Yenyuan stream salamander by this first transcriptome assembly.

2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Chi Song ◽  
Fangfang Fu ◽  
Lulu Yang ◽  
Yan Niu ◽  
Zhaoyang Tian ◽  
...  

AbstractTaxol, a natural product derived from Taxus, is one of the most effective natural anticancer drugs and the biosynthetic pathway of Taxol is the basis of heterologous bio-production. Here, we report a high-quality genome assembly and annotation of Taxus yunnanensis based on 10.7 Gb sequences assembled into 12 chromosomes with contig N50 and scaffold N50 of 2.89 Mb and 966.80 Mb, respectively. Phylogenomic analyses show that T. yunnanensis is most closely related to Sequoiadendron giganteum among the sampled taxa, with an estimated divergence time of 133.4−213.0 MYA. As with most gymnosperms, and unlike most angiosperms, there is no evidence of a recent whole-genome duplication in T. yunnanensis. Repetitive sequences, especially long terminal repeat retrotransposons, are prevalent in the T. yunnanensis genome, contributing to its large genome size. We further integrated genomic and transcriptomic data to unveil clusters of genes involved in Taxol synthesis, located on the chromosome 12, while gene families encoding hydroxylase in the Taxol pathway exhibited significant expansion. Our study contributes to the further elucidation of gymnosperm relationships and the Taxol biosynthetic pathway.


2016 ◽  
Author(s):  
Congrui Sun ◽  
Jie Li ◽  
Xiaogang Dai ◽  
Yingnan Chen

By screening sequence reads from the chloroplast (cp) genome of S. suchowensis that generated by the next generation sequencing platforms, we built the complete circular pseudomolecule for its cp genome. This pseudomolecule is 155,508 bp in length, which has a typical quadripartite structure containing two single copy regions, a large single copy region (LSC 84,385 bp), and a small single copy region (SSC 16,209 bp) separated by inverted repeat regions (IRs 27,457 bp). Gene annotation revealed that the cp genome of S. suchowensis encoded 119 unique genes, including 4 ribosome RNA genes, 30 transfer RNA genes, 82 protein-coding genes and 3 pseudogenes. Analyzing the repetitive sequences detected 15 tandem repeats, 16 forward repeats and 5 palindromic repeats. In addition, a total of 188 perfect microsatellites were detected, which were characterized as A/T predominance in nucleotide compositions. Significant shifting of the IR/SSC boundaries was revealed by comparing this cp genome with that of other rosids plants. We also built phylogenetic trees to demonstrate the phylogenetic position of S. suchowensis in Rosidae, with 66 orthologous protein-coding genes presented in the cp genomes of 32 species. By sequencing 30 amplicons based on the pseudomolecule, experimental verification achieved accuracy up to 99.84% for the cp genome assembly of S. suchowensis. In conclusion, this study built a high quality pseudomolecule for the cp genome of S. suchowensis, which is a useful resource for facilitating the development of this shrub willow into a more productive bioenergy crop.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e6663 ◽  
Author(s):  
Liqiang Wang ◽  
Hui Zhang ◽  
Mei Jiang ◽  
Haimei Chen ◽  
Linfang Huang ◽  
...  

Plastome-based phylogenetic study has largely resolved the phylogeny of Icacinaceae. However, no single complete plastome sequence is available for Icacinaceae species, thereby limiting the further phylogenomics analysis of the members of this family. Here, we obtained the complete plastome sequence of Iodes cirrhosa Turcz., which is the first in Icacinaceae, by using the next-generation sequencing technology. The genome was annotated and compared with other closely related plastomes by using mVISTA. The divergence time of six Iodes species was analyzed using the BEAST software. The plastome of I. cirrhosa was 151,994 bp long, with a pair of inverted repeats (IRs, 24,973 bp) separated by a large single-copy (LSC, 84,527 bp) region and a small single-copy (SSC, 17,521 bp) region. The plastome encoded 112 unique genes, including 80 protein-coding, 28 tRNA, and four rRNA genes. Approximately 59 repeat sequences and 188 simple sequence repeats were identified. Four pairs of partially overlapped genes, namely, psbD/psbC, ndhF/Ψycf1, atpB/atpE, and rpl22/rps3, were observed. A comparison of the boundaries of the LSC, SSC, and IR regions with four other plastomes from Aquifoliales and Sapindales exhibited a high overall degree of sequence similarity. Four most highly variable regions, namely, trnH-GUG/psbA, psbM/trnD-GUC, petA/psbJ, and rps16/trnQ-UUG, were found. Using the plastome of I. cirrhosa as reference, we reassembled the plastomes of five Iodes species. Ka/Ks ratio analyses revealed that 27 genes and 52 amino acid residue sites from 11 genes had undergone strong positive selection in the Iodes branch, with the most abundant proteins being the NDH and ribosomal proteins. Divergence-time analysis indicated that Iodes species were first formed 34.40 million years ago. Results revealed that the ancestor of the six species was likely to have split in the late Eocene epoch. In summary, the first complete plastome sequence of I. cirrhosa provided valuable information regarding the evolutionary processes of Iodes species.


2016 ◽  
Author(s):  
Congrui Sun ◽  
Jie Li ◽  
Xiaogang Dai ◽  
Yingnan Chen

By screening sequence reads from the chloroplast (cp) genome of S. suchowensis that generated by the next generation sequencing platforms, we built the complete circular pseudomolecule for its cp genome. This pseudomolecule is 155,508 bp in length, which has a typical quadripartite structure containing two single copy regions, a large single copy region (LSC 84,385 bp), and a small single copy region (SSC 16,209 bp) separated by inverted repeat regions (IRs 27,457 bp). Gene annotation revealed that the cp genome of S. suchowensis encoded 119 unique genes, including 4 ribosome RNA genes, 30 transfer RNA genes, 82 protein-coding genes and 3 pseudogenes. Analyzing the repetitive sequences detected 15 tandem repeats, 16 forward repeats and 5 palindromic repeats. In addition, a total of 188 perfect microsatellites were detected, which were characterized as A/T predominance in nucleotide compositions. Significant shifting of the IR/SSC boundaries was revealed by comparing this cp genome with that of other rosids plants. We also built phylogenetic trees to demonstrate the phylogenetic position of S. suchowensis in Rosidae, with 66 orthologous protein-coding genes presented in the cp genomes of 32 species. By sequencing 30 amplicons based on the pseudomolecule, experimental verification achieved accuracy up to 99.84% for the cp genome assembly of S. suchowensis. In conclusion, this study built a high quality pseudomolecule for the cp genome of S. suchowensis, which is a useful resource for facilitating the development of this shrub willow into a more productive bioenergy crop.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Huai-Jun Xue ◽  
Yi-Wei Niu ◽  
Kari A. Segraves ◽  
Rui-E Nie ◽  
Ya-Jing Hao ◽  
...  

Abstract Background Altica (Coleoptera: Chrysomelidae) is a highly diverse and taxonomically challenging flea beetle genus that has been used to address questions related to host plant specialization, reproductive isolation, and ecological speciation. To further evolutionary studies in this interesting group, here we present a draft genome of a representative specialist, Altica viridicyanea, the first Alticinae genome reported thus far. Results The genome is 864.8 Mb and consists of 4490 scaffolds with a N50 size of 557 kb, which covered 98.6% complete and 0.4% partial insect Benchmarking Universal Single-Copy Orthologs. Repetitive sequences accounted for 62.9% of the assembly, and a total of 17,730 protein-coding gene models and 2462 non-coding RNA models were predicted. To provide insight into host plant specialization of this monophagous species, we examined the key gene families involved in chemosensation, detoxification of plant secondary chemistry, and plant cell wall-degradation. Conclusions The genome assembled in this work provides an important resource for further studies on host plant adaptation and functionally affiliated genes. Moreover, this work also opens the way for comparative genomics studies among closely related Altica species, which may provide insight into the molecular evolutionary processes that occur during ecological speciation.


Plants ◽  
2020 ◽  
Vol 9 (12) ◽  
pp. 1692
Author(s):  
Li Gu ◽  
Ting Su ◽  
Ming-Tai An ◽  
Guo-Xiong Hu

Oreocharis esquirolii, a member of Gesneriaceae, is known as Thamnocharis esquirolii, which has been regarded a synonym of the former. The species is endemic to Guizhou, southwestern China, and is evaluated as vulnerable (VU) under the International Union for Conservation of Nature (IUCN) criteria. Until now, the sequence and genome information of O. esquirolii remains unknown. In this study, we assembled and characterized the complete chloroplast (cp) genome of O. esquirolii using Illumina sequencing data for the first time. The total length of the cp genome was 154,069 bp with a typical quadripartite structure consisting of a pair of inverted repeats (IRs) of 25,392 bp separated by a large single copy region (LSC) of 85,156 bp and a small single copy region (SSC) of18,129 bp. The genome comprised 114 unique genes with 80 protein-coding genes, 30 tRNA genes, and four rRNA genes. Thirty-one repeat sequences and 74 simple sequence repeats (SSRs) were identified. Genome alignment across five plastid genomes of Gesneriaceae indicated a high sequence similarity. Four highly variable sites (rps16-trnQ, trnS-trnG, ndhF-rpl32, and ycf 1) were identified. Phylogenetic analysis indicated that O. esquirolii grouped together with O. mileensis, supporting resurrection of the name Oreocharis esquirolii from Thamnocharisesquirolii. The complete cp genome sequence will contribute to further studies in molecular identification, genetic diversity, and phylogeny.


2020 ◽  
Vol 12 (6) ◽  
pp. 905-910 ◽  
Author(s):  
Ruoyu Liu ◽  
Kun Wang ◽  
Jun Liu ◽  
Wenjie Xu ◽  
Yang Zhou ◽  
...  

Abstract Cold seeps, characterized by the methane, hydrogen sulfide, and other hydrocarbon chemicals, foster one of the most widespread chemosynthetic ecosystems in deep sea that are densely populated by specialized benthos. However, scarce genomic resources severely limit our knowledge about the origin and adaptation of life in this unique ecosystem. Here, we present a genome of a deep-sea limpet Bathyacmaea lactea, a common species associated with the dominant mussel beds in cold seeps. We yielded 54.6 gigabases (Gb) of Nanopore reads and 77.9-Gb BGI-seq raw reads, respectively. Assembly harvested a 754.3-Mb genome for B. lactea, with 3,720 contigs and a contig N50 of 1.57 Mb, covering 94.3% of metazoan Benchmarking Universal Single-Copy Orthologs. In total, 23,574 protein-coding genes and 463.4 Mb of repetitive elements were identified. We analyzed the phylogenetic position, substitution rate, demographic history, and TE activity of B. lactea. We also identified 80 expanded gene families and 87 rapidly evolving Gene Ontology categories in the B. lactea genome. Many of these genes were associated with heterocyclic compound metabolism, membrane-bounded organelle, metal ion binding, and nitrogen and phosphorus metabolism. The high-quality assembly and in-depth characterization suggest the B. lactea genome will serve as an essential resource for understanding the origin and adaptation of life in the cold seeps.


1989 ◽  
Vol 9 (3) ◽  
pp. 935-945
Author(s):  
L A Johnston ◽  
M A Kotarski ◽  
D J Jerry ◽  
L P Kozak

While studying the organization of the mouse glycerol-phosphate dehydrogenase gene (Gdc-1 on chromosome 15), we identified a novel transcriptional unit located only 3.4 kilobases (kb) upstream of the 5' end of the Gdc-1 gene. This gene has been provisionally named D15Kz1. The unusual proximity of these two genes led us to investigate the pattern of expression and sequence characteristics of the new gene for comparison with those of Gdc-1. D15Kz1 was found to have transcripts of 3.2 and 3.4 kb in length. The 3.4-kb transcript was expressed at low levels in all tissues examined, whereas the 3.2-kb transcript was detected only in the cerebral cortex and the brown fat. D15Kz1 and Gdc-1 are not coordinately regulated, as evidenced by the characteristics of their expression in several tissues and in differentiating 3T3-F442A adipocyte cultures. A cDNA sequence of 3,105 bases isolated from an embryonal carcinoma lambda gt10 cDNA library had a large open reading frame of 461 amino acids at one end followed by 1.6 kb of sequence with multiple stop codons. Algorithms used to search the protein and nucleic acid data bases detected no significant sequence similarity to any other protein or gene. Southern blot analysis of genomic DNA using the D15Kz1 cDNA as a probe indicated that D15Kz1 is a single-copy gene in the mouse genome and that it is conserved in humans, rats, and chickens. This conservation of gene sequences suggests that D15Kz1 encodes a protein with an important cellular function.


2007 ◽  
Vol 73 (17) ◽  
pp. 5566-5573 ◽  
Author(s):  
Rok Kostanjšek ◽  
Jasna Štrus ◽  
Gorazd Avguštin

ABSTRACT Pointed, rod-shaped bacteria colonizing the cuticular surface of the hindgut of the terrestrial isopod crustacean Porcellio scaber (Crustacea: Isopoda) were investigated by comparative 16S rRNA gene sequence analysis and electron microscopy. The results of phylogenetic analysis, and the absence of a cell wall, affiliated these bacteria with the class Mollicutes, within which they represent a novel and deeply branched lineage, sharing less than 82.6% sequence similarity to known Mollicutes. The lineage has been positioned as a sister group to the clade comprising the Spiroplasma group, the Mycoplasma pneumoniae group, and the Mycoplasma hominis group. The specific signature sequence was identified and used as a probe in in situ hybridization, which confirmed that the retrieved sequences originate from the attached rod-shaped bacteria from the hindgut of P. scaber and made it possible to detect these bacteria in their natural environment. Scanning and transmission electron microscopy revealed a spherically shaped structure at the tapered end of the rod-shaped bacteria, enabling their specific and exclusive attachment to the tip of the cuticular spines on the inner surface of the gut. Specific adaptation to the gut environment, as well as phylogenetic positioning, indicate the long-term association and probable coevolution of the bacteria and the host. Taking into account their pointed, rod-shaped morphology and their phylogenetic position, the name “Candidatus Bacilloplasma” has been proposed for this new lineage of bacteria specifically associated with the gut surface of P. scaber.


2015 ◽  
Vol 65 (Pt_12) ◽  
pp. 4410-4416 ◽  
Author(s):  
Soon Dong Lee

A novel actinobacterium, designated strain C4-31T, was isolated from soil collected from a cave. Cells were aerobic, Gram-reaction-positive, oxidase-negative, catalase-positive and non-motile cocci. Comparison of 16S rRNA gene sequences showed that the organism occupied a distinct phylogenetic position within the suborder Frankineae, with sequence similarity values of less than 93.2 % to members of this suborder. The diagnostic diamino acid in the cell-wall peptidoglycan was meso-diaminopimelic acid. The major menaquinone was MK-9(H4). The polar lipids were diphosphatidylglycerol, phosphatidylethanolamine, phosphatidylinositol, phosphatidylinositol mannoside, an unknown aminophospholipid and an unknown phospholipid. The major fatty acids were iso-C16 : 0, C17 : 1ω6c and C16 : 0. The G+C content of the DNA was 62.8 mol%. On the basis of morphological and chemotaxonomic data as well as phylogenetic evidence, strain C4-31T ( = KCTC 39556T = DSM 100065T) is considered to represent the type strain of a novel species of a new genus in the suborder Frankineae, for which the name Antricoccus suffuscus gen. nov., sp. nov. is proposed.


Sign in / Sign up

Export Citation Format

Share Document