scholarly journals Plastid genomics of Nicotiana (Solanaceae): insights into molecular evolution, positive selection and the origin of the maternal genome of Aztec tobacco (Nicotiana rustica)

PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e9552
Author(s):  
Furrukh Mehmood ◽  
Abdullah ◽  
Zartasha Ubaid ◽  
Iram Shahzadi ◽  
Ibrar Ahmed ◽  
...  

Species of the genus Nicotiana (Solanaceae), commonly referred to as tobacco plants, are often cultivated as non-food crops and garden ornamentals. In addition to the worldwide production of tobacco leaves, they are also used as evolutionary model systems due to their complex development history tangled by polyploidy and hybridization. Here, we assembled the plastid genomes of five tobacco species: N. knightiana, N. rustica, N. paniculata, N. obtusifolia and N. glauca. De novo assembled tobacco plastid genomes had the typical quadripartite structure, consisting of a pair of inverted repeat (IR) regions (25,323–25,369 bp each) separated by a large single-copy (LSC) region (86,510–86,716 bp) and a small single-copy (SSC) region (18,441–18,555 bp). Comparative analyses of Nicotiana plastid genomes with currently available Solanaceae genome sequences showed similar GC and gene content, codon usage, simple sequence and oligonucleotide repeats, RNA editing sites, and substitutions. We identified 20 highly polymorphic regions, mostly belonging to intergenic spacer regions (IGS), which could be suitable for the development of robust and cost-effective markers for inferring the phylogeny of the genus Nicotiana and family Solanaceae. Our comparative plastid genome analysis revealed that the maternal parent of the tetraploid N. rustica was the common ancestor of N. paniculata and N. knightiana, and the later species is more closely related to N. rustica. Relaxed molecular clock analyses estimated the speciation event between N. rustica and N. knightiana appeared 0.56 Ma (HPD 0.65–0.46). Biogeographical analysis supported a south-to-north range expansion and diversification for N. rustica and related species, where N. undulata and N. paniculata evolved in North/Central Peru, while N. rustica developed in Southern Peru and separated from N. knightiana, which adapted to the Southern coastal climatic regimes. We further inspected selective pressure on protein-coding genes among tobacco species to determine if this adaptation process affected the evolution of plastid genes. These analyses indicate that four genes involved in different plastid functions, including DNA replication (rpoA) and photosynthesis (atpB, ndhD and ndhF), came under positive selective pressure as a result of specific environmental conditions. Genetic mutations in these genes might have contributed to better survival and superior adaptations during the evolutionary history of tobacco species.

Author(s):  
Furrukh Mehmood ◽  
Abdullah ◽  
Zartasha Ubaid ◽  
Iram Shahzadi ◽  
Ibrar Ahmed ◽  
...  

AbstractThe genus Nicotiana of the family Solanaceae, commonly referred to as tobacco plants, are a group cultivated as garden ornamentals. Besides their use in the worldwide production of tobacco leaves, they are also used as evolutionary model systems due to their complex development history, which is tangled by polyploidy and hybridization. Here, we assembled the plastid genomes of five tobacco species, namely N. knightiana, N. rustica, N. paniculata, N. obtusifolia and N. glauca. De novo assembled tobacco plastid genomes showed typical quadripartite structure, consisting of a pair of inverted repeats (IR) regions (25,323–25,369 bp each) separated by a large single copy (LSC) region (86,510 –86,716 bp) and a small single copy (SSC) region (18,441–18,555 bp). Comparative analyses of Nicotiana plastid genomes showed similar GC content, gene content, codon usage, simple sequence repeats, oligonucleotide repeats, RNA editing sites and substitutions with currently available Solanaceae genomes sequences. We identified twenty highly polymorphic regions mostly belonging to intergenic spacer regions (IGS), which could be appropriate for the development of robust and cost-effective markers to infer the phylogeny of genus Nicotiana and family Solanaceae. Our comparative plastid genome analysis revealed that the maternal parent of the tetraploid N. rustica was the common ancestor of N. paniculata and N. knightiana, and the later species is more closely related to N. rustica. The relaxed molecular clock analyses estimated that the speciation event between N. rustica and knightiana appeared 0.56 Ma (HPD 0.65–0.46). The biogeographical analysis showed a south-to-north range expansion and diversification for N. rustica and related species, where N. undulata and N. paniculata evolved in North/Central Peru, while N. rustica developed in Southern Peru and separated from N. knightiana, which adapted to the Southern coastal climatic regimes. We further inspected selective pressure on protein-coding genes among tobacco species to determine if this adaptation process affected the evolution of plastid genes. These analyses indicated that four genes involved in different plastid functions, such as DNA replication (rpoA) and photosynthesis (atpB, ndhD and ndhF), came under positive selective pressure as a result of specific environmental conditions. Genetic mutations of the following genes might have contributed to the survival and better adaptation during the evolutionary history of tobacco species.


2021 ◽  
Vol 12 ◽  
Author(s):  
Fenghua Tian ◽  
Changtian Li ◽  
Yu Li

Yuanmo [Sarcomyxa edulis (Y.C. Dai, Niemelä & G.F. Qin) T. Saito, Tonouchi & T. Harada] is an important edible and medicinal mushroom endemic to Northeastern China. Here we report the de novo sequencing and assembly of the S. edulis genome using single-molecule real-time sequencing technology. The whole genome was approximately 35.65 Mb, with a G + C content of 48.31%. Genome assembly generated 41 contigs with an N50 length of 1,772,559 bp. The genome comprised 9,364 annotated protein-coding genes, many of which encoded enzymes involved in the modification, biosynthesis, and degradation of glycoconjugates and carbohydrates or enzymes predicted to be involved in the biosynthesis of secondary metabolites such as terpene, type I polyketide, siderophore, and fatty acids, which are responsible for the pharmacodynamic activities of S. edulis. We also identified genes encoding 1,3-β-glucan synthase and endo-1,3(4)-β-glucanase, which are involved in polysaccharide and uridine diphosphate glucose biosynthesis. Phylogenetic and comparative analyses of Basidiomycota fungi based on a single-copy orthologous protein indicated that the Sarcomyxa genus is an independent group that evolved from the Pleurotaceae family. The annotated whole-genome sequence of S. edulis can serve as a reference for investigations of bioactive compounds with medicinal value and the development and commercial production of superior S. edulis varieties.


2019 ◽  
Vol 42 (4) ◽  
pp. 601-611 ◽  
Author(s):  
Yan Li ◽  
Liukun Jia ◽  
Zhihua Wang ◽  
Rui Xing ◽  
Xiaofeng Chi ◽  
...  

Abstract Saxifraga sinomontana J.-T. Pan & Gornall belongs to Saxifraga sect. Ciliatae subsect. Hirculoideae, a lineage containing ca. 110 species whose phylogenetic relationships are largely unresolved due to recent rapid radiations. Analyses of complete chloroplast genomes have the potential to significantly improve the resolution of phylogenetic relationships in this young plant lineage. The complete chloroplast genome of S. sinomontana was de novo sequenced, assembled and then compared with that of other six Saxifragaceae species. The S. sinomontana chloroplast genome is 147,240 bp in length with a typical quadripartite structure, including a large single-copy region of 79,310 bp and a small single-copy region of 16,874 bp separated by a pair of inverted repeats (IRs) of 25,528 bp each. The chloroplast genome contains 113 unique genes, including 79 protein-coding genes, four rRNAs and 30 tRNAs, with 18 duplicates in the IRs. The gene content and organization are similar to other Saxifragaceae chloroplast genomes. Sixty-one simple sequence repeats were identified in the S. sinomontana chloroplast genome, mostly represented by mononucleotide repeats of polyadenine or polythymine. Comparative analysis revealed 12 highly divergent regions in the intergenic spacers, as well as coding genes of matK, ndhK, accD, cemA, rpoA, rps19, ndhF, ccsA, ndhD and ycf1. Phylogenetic reconstruction of seven Saxifragaceae species based on 66 protein-coding genes received high bootstrap support values for nearly all identified nodes, suggesting a promising opportunity to resolve infrasectional relationships of the most species-rich section Ciliatae of Saxifraga.


2018 ◽  
Author(s):  
Weixue Mu ◽  
Ting Yang ◽  
Xin Liu

AbstractBrassicales is a diverse angiosperm order with about 4,700 recognized species. Here, we assembled and described the complete plastid genomes from four species of Brassicales: Capparis urophylla F.Chun (Capparaceae), Carica papaya L. (Caricaceae), Cleome rutidosperma DC. (Cleomaceae), and Moringa oleifera Lam. (Moringaceae), including two plastid genomes newly assembled for two families (Capparaceae and Moringaceae). The four plastid genomes are 159,680 base pairs on average in length and encode 78 protein-coding genes. The genomes each contains a typical structure of a Large Single-Copy (LSC) region and a Small Single-Copy (SSC) region separated by two Inverted Repeat (IR) regions. We performed the maximum-likelihood (ML) phylogenetic analysis using three different data sets of 66 protein-coding genes (ntAll, ntNo3rd and AA). Our phylogenetic results from different dataset are congruent, and are consistent with previous phylogenetic studies of Brassiales.


Plants ◽  
2021 ◽  
Vol 10 (12) ◽  
pp. 2699
Author(s):  
Joan Pere Pascual-Díaz ◽  
Sònia Garcia ◽  
Daniel Vitales

Plastid genomes are in general highly conserved given their slow evolutionary rate, and thus large changes in their structure are unusual. However, when specific rearrangements are present, they are often phylogenetically informative. Asteraceae is a highly diverse family whose evolution is long driven by polyploidy (up to 48x) and hybridization, both processes usually complicating systematic inferences. In this study, we generated one of the most comprehensive plastome-based phylogenies of family Asteraceae, providing information about the structure, genetic diversity and repeat composition of these sequences. By comparing the whole-plastome sequences obtained, we confirmed the double inversion located in the long single-copy region, for most of the species analyzed (with the exception of basal tribes), a well-known feature for Asteraceae plastomes. We also showed that genome size, gene order and gene content are highly conserved along the family. However, species representative of the basal subfamily Barnadesioideae—as well as in the sister family Calyceraceae—lack the pseudogene rps19 located in one inverted repeat. The phylogenomic analysis conducted here, based on 63 protein-coding genes, 30 transfer RNA genes and 21 ribosomal RNA genes from 36 species of Asteraceae, were overall consistent with the general consensus for the family’s phylogeny while resolving the position of tribe Senecioneae and revealing some incongruences at tribe level between reconstructions based on nuclear and plastid DNA data.


2019 ◽  
Author(s):  
Siuli Mitra ◽  
Ara Sreenivas ◽  
Divya Tej Sowpati ◽  
Amitha Sampat Kumar ◽  
Gowri Awasthi ◽  
...  

AbstractWe report the first draft of the whole genome assembly of a male Asiatic lion, Atul and whole transcriptomes of five Asiatic lion individuals. Evaluation of genetic diversity placed the Asiatic lion in the lowest bracket of genomic diversity index highlighting the gravity of its conservation status. Comparative analysis with other felids and mammalian genomes unraveled the evolutionary history of Asiatic lion and its position among other felids. The genome is estimated to be 2.3 Gb (Gigabase) long with 62X sequence coverage and is found to have 20,543 protein-coding genes. About 2.66% of the genome is covered by simple sequence repeats (SSRs) and 0.4% is estimated to have segmental duplications. Comparison with seven well annotated genomes indicates the presence of 6,295 single copy orthologs, 4 co-orthologs, 21 paralogs uniquely present in Asiatic lion and 8,024 other orthologs. Assessment of male and female transcriptomes gave a list of genes specifically expressed in the male.Our genomic analyses provide candidates for phenotypes characteristic to felids and lion, inviting further confirmation of their contribution through population genetic studies. An Asiatic lion-specific expansion is detected in the Cysteine Dioxygenase-I (CDO-I) family that is responsible for taurine biosynthesis in cats. Wilm’s tumor-associated protein (WT1) family, a non-Y chromosome genetic factor underlying male-sex determination and differentiation is found to have undergone expansion, interestingly like that of the human genome. Another protein family, translation machinery-associated protein 7 (TMA7) that has undergone expansion in humans, also expanded in Asiatic lion and can be further investigated as a candidate responsible for mane in lions because of its role in hair follicle morphogenesis.


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Nikolaos Vakirlis ◽  
Omer Acar ◽  
Brian Hsu ◽  
Nelson Castilho Coelho ◽  
S. Branden Van Oss ◽  
...  

AbstractRecent evidence demonstrates that novel protein-coding genes can arise de novo from non-genic loci. This evolutionary innovation is thought to be facilitated by the pervasive translation of non-genic transcripts, which exposes a reservoir of variable polypeptides to natural selection. Here, we systematically characterize how these de novo emerging coding sequences impact fitness in budding yeast. Disruption of emerging sequences is generally inconsequential for fitness in the laboratory and in natural populations. Overexpression of emerging sequences, however, is enriched in adaptive fitness effects compared to overexpression of established genes. We find that adaptive emerging sequences tend to encode putative transmembrane domains, and that thymine-rich intergenic regions harbor a widespread potential to produce transmembrane domains. These findings, together with in-depth examination of the de novo emerging YBR196C-A locus, suggest a novel evolutionary model whereby adaptive transmembrane polypeptides emerge de novo from thymine-rich non-genic regions and subsequently accumulate changes molded by natural selection.


2017 ◽  
Author(s):  
Mariana B. Grizante ◽  
Marc Tollis ◽  
Juan J. Rodriguez ◽  
Ofir Levy ◽  
Michael J. Angilletta ◽  
...  

AbstractBackgroundThe eastern fence lizard (Sceloporus undulatus) has been a model species for ecological and evolutionary research. Genomic and transcriptomic resources for this species would promote investigation of genetic mechanisms that underpin plastic responses to environmental stress, such as climate warming. Moreover, such resources would aid comparative studies of complex traits at the molecular level, such as the transition from oviparous to viviparous reproduction, which happened at least four times within Sceloporus.FindingsA de novo transcriptome assembly for Sceloporus undulatus, Sund_v1.0, was generated using over 179 million Illumina reads obtained from three tissues (whole brain, skeletal muscle, and embryo) as well as previously reported liver sequences. The Sund_v1.0 assembly had an average contig length of 782 nucleotides and an E90N50 statistic of 2,550 nucleotides. Comparing S. undulatus transcripts with the benchmarking universal single-copy orthologs (BUSCO) for tetrapod species yielded 97.2% gene representation. A total of 13,422 protein-coding orthologs were identified in comparison to the genome of the green anole lizard, Anolis carolinensis, which is the closest related species with genomic data available.ConclusionsThe multi-tissue transcriptome of S. undulatus is the first for a member of the family Phrynosomatidae, offering an important resource to advance studies of adaptation in this species and genomic research in reptiles.


Author(s):  
Joan Pere Pascual-Díaz ◽  
Sònia Garcia ◽  
Daniel Vitales

Plastid genomes are in general highly conserved given their slow evolutionary rate, thus large changes in their structure are unusual. However, when specific rearrangements are present, they are often phylogenetically informative. Asteraceae is a highly diverse family whose evolution is long driven by polyploidy (up to 48x) and hybridisation, both processes usually complicating systematic inferences. In this study, we have generated one of the most comprehensive plastome-based phylogenies of family Asteraceae, providing information about the structure, genetic diversity, and repeat composition of these sequences. By comparing the whole plastome sequences obtained, we confirmed the double inversion located in the long single copy region, for most of the species analysed (with the exception of basal tribes), a well-known feature for Asteraceae plastomes. We also show that genome size, gene order and gene content are highly conserved along the family. However, species representative of the basal subfamily Barnadesioideae -as well as in the sister family Calyceraceae - are lacking the pseudogene rps19 located in one inverted repeat. The phylogenomic analysis conducted here, based on 63 protein-coding genes, 30 transfer RNA genes and 21 ribosomal RNA genes from 36 species of Asteraceae, are overall consistent with the general consensus for the family’s phylogeny, while resolving the position of tribe Senecioneae and revealing some incongruences at tribe level between reconstructions based on nuclear and plastid DNA data.


2021 ◽  
Vol 104 (4) ◽  
pp. 003685042110599
Author(s):  
Dhafer Alzahrani ◽  
Enas Albokhari ◽  
Abidina Abba ◽  
Samaila Yaradua

Caylusea hexagyna and Ochradenus baccatus are two species in the Resedaceae family. In this study, we analysed the complete plastid genomes of these two species using high-throughput sequencing technology and compared their genomic data. The length of the plastid genome of C. hexagyna was 154,390 bp while that of O. baccatus was 153,380 bp. The lengths of the inverted repeats (IR) regions were 26,526 bp and 26,558 bp, those of the large single copy (LSC) regions were 83,870 bp and 83,023 bp; and those of the small single copy (SSC) regions were 17,468 bp and 17,241 bp in C. hexagyna and O. baccatus, respectively. Both genomes consisted of 113 genes: 79 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Repeat analysis showed that the plastid genome included all types of repeats, with more frequent occurrences of palindromic sequences. Comparative studies of SSR markers showed that there were 256 markers in C. hexagyna and 255 in O. baccatus; the majority of the SSRs in these plastid genomes were mononucleotide repeats (A/T). All the clusters in the phylogenetic tree had high support. This study reported the first complete plastid genomes of the genera Caylusea and Ochradenus and the first for the Resedaceae family.


Sign in / Sign up

Export Citation Format

Share Document