The complete chloroplast genome sequence of strawberry (Fragaria × ananassa Duch.) and comparison with related species of Rosaceae

PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e3919 ◽  
Author(s):  
Hui Cheng ◽  
Jinfeng Li ◽  
Hong Zhang ◽  
Binhua Cai ◽  
Zhihong Gao ◽  
...  

Compared with other members of the family Rosaceae, the chloroplast genomes of Fragaria species exhibit low variation, and this situation has limited phylogenetic analyses; thus, complete chloroplast genome sequencing of Fragaria species is needed. In this study, we sequenced the complete chloroplast genome of F. × ananassa ‘Benihoppe’ using the Illumina HiSeq 2500-PE150 platform and then performed a combination of de novo assembly and reference-guided mapping of contigs to generate complete chloroplast genome sequences. The chloroplast genome exhibits a typical quadripartite structure with a pair of inverted repeats (IRs, 25,936 bp) separated by large (LSC, 85,531 bp) and small (SSC, 18,146 bp) single-copy (SC) regions. The length of the F. × ananassa ‘Benihoppe’ chloroplast genome is 155,549 bp, representing the smallest Fragaria chloroplast genome observed to date. The genome encodes 112 unique genes, comprising 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Comparative analysis of the overall nucleotide sequence identity among ten complete chloroplast genomes confirmed that for both coding and non-coding regions in Rosaceae, SC regions exhibit higher sequence variation than IRs. The Ka/Ks ratio of most genes was less than 1, suggesting that most genes are under purifying selection. Moreover, the mVISTA results also showed a high degree of conservation in genome structure, gene order and gene content in Fragaria, particularly among three octoploid strawberries which were F. × ananassa ‘Benihoppe’, F. chiloensis (GP33) and F. virginiana (O477). However, when the sequences of the coding and non-coding regions of F. × ananassa ‘Benihoppe’ were compared in detail with those of F. chiloensis (GP33) and F. virginiana (O477), a number of SNPs and InDels were revealed by MEGA 7. Six non-coding regions (trnK-matK, trnS-trnG, atpF-atpH, trnC-petN, trnT-psbD and trnP-psaJ) with a percentage of variable sites greater than 1% and no less than five parsimony-informative sites were identified and may be useful for phylogenetic analysis of the genus Fragaria.
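The region lengths quoted in this abstract can be cross-checked against the reported genome size. The following is a minimal sketch, assuming the 25,936 bp figure is the length of each of the two inverted repeat copies; the function name `quadripartite_length` is illustrative and does not come from the paper.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies.

    Assumes `ir` is the length of ONE inverted repeat copy (an assumption
    about how the abstract reports it).
    """
    return lsc + ssc + 2 * ir


# Region lengths in bp, as quoted in the abstract above.
LSC, SSC, IR = 85_531, 18_146, 25_936

total = quadripartite_length(LSC, SSC, IR)
print(total)  # 155549, matching the reported genome length
```

Under that assumption the four regions sum exactly to the 155,549 bp stated for the 'Benihoppe' chloroplast genome.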

Plants ◽  
2020 ◽  
Vol 9 (6) ◽  
pp. 737 ◽  
Author(s):  
Abdullah ◽  
Claudia L. Henriquez ◽  
Furrukh Mehmood ◽  
Iram Shahzadi ◽  
Zain Ali ◽  
...  

The chloroplast genome provides insight into the evolution of plant species. We de novo assembled and annotated chloroplast genomes of four genera representing three subfamilies of Araceae: Lasia spinosa (Lasioideae), Stylochaeton bogneri, Zamioculcas zamiifolia (Zamioculcadoideae), and Orontium aquaticum (Orontioideae), and performed comparative genomics using these chloroplast genomes. The sizes of the chloroplast genomes ranged from 163,770 bp to 169,982 bp. These genomes comprise 113 unique genes, including 79 protein-coding, 4 rRNA, and 30 tRNA genes. Among these genes, 17–18 genes are duplicated in the inverted repeat (IR) regions, comprising 6–7 protein-coding (including trans-splicing gene rps12), 4 rRNA, and 7 tRNA genes. The total number of genes ranged between 130 and 131. The infA gene was found to be a pseudogene in all four genomes reported here. These genomes exhibited high similarities in codon usage, amino acid frequency, RNA editing sites, and microsatellites. The oligonucleotide repeats and junctions JSB (IRb/SSC) and JSA (SSC/IRa) were highly variable among the genomes. The patterns of IR contraction and expansion were shown to be homoplasious, and therefore unsuitable for phylogenetic analyses. Signatures of positive selection were seen in three genes in S. bogneri, including ycf2, clpP, and rpl36. This study is a valuable addition to the evolutionary history of chloroplast genome structure in Araceae.
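The gene counts in this abstract fit together arithmetically. As a rough sketch, assuming the total gene number is simply the unique genes plus the genes duplicated in the IR regions, the tally looks as follows; the `GeneCounts` class and `total_gene_range` helper are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GeneCounts:
    """Unique gene counts by class, as listed in an abstract."""
    protein_coding: int
    rrna: int
    trna: int

    @property
    def unique(self) -> int:
        # Unique genes are the sum of the three classes.
        return self.protein_coding + self.rrna + self.trna


def total_gene_range(unique: int, duplicated: tuple[int, int]) -> tuple[int, int]:
    """Total genes = unique genes + genes duplicated in the IR (assumed model)."""
    low, high = duplicated
    return unique + low, unique + high


# Figures quoted in the abstract above.
araceae = GeneCounts(protein_coding=79, rrna=4, trna=30)
print(araceae.unique)                              # 113
print(total_gene_range(araceae.unique, (17, 18)))  # (130, 131)
```

The result agrees with the 113 unique genes and the total of 130 to 131 genes given in the text.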


2020 ◽  
Vol 2020 ◽  
pp. 1-9
Author(s):  
Junjun Yao ◽  
Fangyu Zhao ◽  
Yuanjiang Xu ◽  
Kaihui Zhao ◽  
Hong Quan ◽  
...  

Dracocephalum tanguticum and Dracocephalum moldavica are important herbs from Lamiaceae and have great medicinal value. We used Illumina sequencing technology to sequence the complete chloroplast genomes of D. tanguticum and D. moldavica and then conducted de novo assembly. The two chloroplast genomes have a typical quadripartite structure, with gene lengths of 82,221 bp and 81,450 bp, large single-copy (LSC) region lengths of 82,221 bp and 81,450 bp, small single-copy (SSC) region lengths of 17,363 bp and 17,066 bp, and inverted repeat (IR) region lengths of 51,370 bp and 51,352 bp, respectively. The GC content of the two chloroplast genomes was 37.80% and 37.83%, respectively. The chloroplast genomes of the two plants encode 133 and 132 genes, respectively, among which there are 88 and 87 protein-coding genes, respectively, as well as 37 tRNA genes and 8 rRNA genes. Among them, the rps2 gene is unique to D. tanguticum and is not found in D. moldavica. Through SSR analysis, we also found 6 mutation hotspot regions, which can be used as molecular markers for taxonomic studies. Phylogenetic analysis showed that Dracocephalum was more closely related to Mentha.


2020 ◽  
Author(s):  
Abdullah ◽  
Claudia L. Henriquez ◽  
Furrukh Mehmood ◽  
Iram Shahzadi ◽  
Zain Ali ◽  
...  

The chloroplast genome provides insight into the evolution of plant species. We de novo assembled and annotated chloroplast genomes of the first representatives of four genera representing three subfamilies: Lasia spinosa (Lasioideae), Stylochaeton bogneri, Zamioculcas zamiifolia (Zamioculcadoideae), and Orontium aquaticum (Orontioideae), and performed comparative genomics using the plastomes. The size of the chloroplast genomes ranged from 163,770 to 169,982 bp. These genomes comprise 114 unique genes, including 80 protein-coding, 4 rRNA, and 30 tRNA genes. These genomes exhibited high similarities in codon usage, amino acid frequency, RNA editing sites, and microsatellites. The junctions JSB (IRb/SSC) and JSA (SSC/IRa) are highly variable among the genomes, as is the oligonucleotide repeat content. The patterns of inverted repeat contraction and expansion were shown to be homoplasious and therefore unsuitable for phylogenetic analyses. Signatures of positive selection were shown for several genes in S. bogneri. This study is a valuable addition to the evolutionary history of chloroplast genome structure in Araceae.


Plants ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 270
Author(s):  
Dhafer A. Alzahrani

Abutilon fruticosum is one of the endemic plants with high medicinal and economic value in Saudi Arabia and belongs to the family Malvaceae. However, the plastome sequence and phylogenetic position have not been reported until this study. In this research, the complete chloroplast genome of A. fruticosum was sequenced and assembled, and comparative and phylogenetic analyses within the Malvaceae family were conducted. The chloroplast genome (cp genome) has a circular and quadripartite structure with a total length of 160,357 bp and contains 114 unique genes (80 protein-coding genes, 30 tRNA genes and 4 rRNA genes). The repeat analyses indicate that all the types of repeats (palindromic, complement, forward and reverse) were present in the genome, with palindromic occurring more frequently. A total number of 212 microsatellites were identified in the plastome, of which the majority are mononucleotides. Comparative analyses with other species of Malvaceae indicate a high level of resemblance in gene content and structural organization and a significant level of variation in the position of genes in single copy and inverted repeat borders. The analyses also reveal variable hotspots in the genomes that can serve as barcodes and tools for inferring phylogenetic relationships in the family: the regions include trnH-psbA, trnK-rps16, psbI-trnS, atpH-atpI, trnT-trnL, matK, ycf1 and ndhH. Phylogenetic analysis indicates that A. fruticosum is closely related to Althaea officinalis, which disagrees with the previous systematic position of the species. This study provides insights into the systematic position of A. fruticosum and valuable resources for further phylogenetic and evolutionary studies of the species and the Malvaceae family to resolve ambiguous issues within the taxa.


Plants ◽  
2020 ◽  
Vol 9 (1) ◽  
pp. 61 ◽  
Author(s):  
Huyen-Trang Vu ◽  
Ngan Tran ◽  
Thanh-Diem Nguyen ◽  
Quoc-Luan Vu ◽  
My-Huyen Bui ◽  
...  

Paphiopedilum delenatii is a native orchid of Vietnam with highly attractive floral traits. Unfortunately, it is now listed as a critically endangered species with a few hundred individuals remaining in nature. In this study, we performed next-generation sequencing of P. delenatii and assembled its complete chloroplast genome. The whole chloroplast genome of P. delenatii was 160,955 bp in size, 35.6% of which was GC content, and exhibited typical quadripartite structure of plastid genomes with four distinct regions, including the large and small single-copy regions and a pair of inverted repeat regions. There were, in total, 130 genes annotated in the genome: 77 coding genes, 39 tRNA genes, 8 rRNA genes, and 6 pseudogenes. The loss of ndh genes and variation in inverted repeat (IR) boundaries as well as data of simple sequence repeats (SSRs) and divergent hotspots provided useful information for identification applications and phylogenetic studies of Paphiopedilum species. Whole chloroplast genomes could be used as an effective super barcode for species identification or for developing other identification markers, which subsequently serves the conservation of Paphiopedilum species.


2020 ◽  
Author(s):  
Aziz Ebrahimi ◽  
Jennifer D. Antonides ◽  
Cornelia C. Pinchot ◽  
James M. Slavicek ◽  
Charles E. Flower ◽  
...  

American elm, Ulmus americana L., was cultivated widely in the USA and Canada as a landscape tree, but the genome of this important species is poorly characterized. For the first time, we describe the sequencing and assembly of the chloroplast genomes of two American elm genotypes (RV16 and Am57845). The complete chloroplast genome of U. americana ranged from 158,935 to 158,993 bp. The genome contains 127 genes, including 85 protein-coding genes, 34 tRNA genes and 8 rRNA genes. Between the two American elm chloroplasts we sequenced, we identified 240 sequence variants (SNPs and indels). To evaluate the phylogeny of American elm, we compared the chloroplast genomes of two American elms along with seven Asian elm species and twelve other chloroplast genomes available through the NCBI database. As expected, Ulmus was closely related to Morus and Cannabis, as all three genera are assigned to the Urticales. Comparison of American elm with Asian elms revealed that trnH was absent from the chloroplast of American elm but not most Asian elms; conversely, petB, petD, psbL, trnK, and rps16 are present in the American elm but absent from all Asian elms. The complete chloroplast genome of U. americana will provide useful genetic resources for characterizing the genetic diversity of U. americana and potentially help to conserve natural populations of American elm.


Forests ◽  
2021 ◽  
Vol 12 (5) ◽  
pp. 608
Author(s):  
Sang-Chul Kim ◽  
Jei-Wan Lee ◽  
Byoung-Ki Choi

In the present study, chloroplast genome sequences of four species of Symplocos (S. chinensis for. pilosa, S. prunifolia, S. coreana, and S. tanakana) from South Korea were obtained by Ion Torrent sequencing and compared with the sequences of three previously reported Symplocos chloroplast genomes from different species. The length of the Symplocos chloroplast genome ranged from 156,961 to 157,365 bp. Overall, 132 genes including 87 functional genes, 37 tRNA genes, and eight rRNA genes were identified in all Symplocos chloroplast genomes. The gene order and contents were highly similar across the seven species. The coding regions were more conserved than the non-coding regions, and the large single-copy and small single-copy regions were less conserved than the inverted repeat regions. We identified five new hotspot regions (rbcL, ycf4, psaJ, rpl22, and ycf1) that can be used as barcodes or species-specific Symplocos molecular markers. These four novel chloroplast genomes provide basic information on the plastid genome of Symplocos and enable better taxonomic characterization of this genus.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e9132
Author(s):  
Shuilian He ◽  
Yang Yang ◽  
Ziwei Li ◽  
Xuejiao Wang ◽  
Yanbing Guo ◽  
...  

The horticulturally important genus Zantedeschia (Araceae) comprises eight species of herbaceous perennials. We sequenced, assembled and analyzed the chloroplast (cp) genomes of four species of Zantedeschia (Z. aethiopica, Z. odorata, Z. elliottiana, and Z. rehmannii) to investigate the structure of the cp genome in the genus. According to our results, the cp genome of Zantedeschia ranges in size from 169,065 bp (Z. aethiopica) to 175,906 bp (Z. elliottiana). We identified a total of 112 unique genes, including 78 protein-coding genes, 30 transfer RNA (tRNA) genes and four ribosomal RNA (rRNA) genes. Comparison of our results with cp genomes from other species in the Araceae suggests that the relatively large sizes of the Zantedeschia cp genomes may result from expansion of the inverted repeat (IR) regions. The sampled Zantedeschia species formed a monophyletic clade in our phylogenetic analysis. Furthermore, the large single-copy (LSC) and small single-copy (SSC) regions in Zantedeschia are more divergent than the IR regions in the same genus, and non-coding regions showed generally higher divergence than coding regions. We identified a total of 410 cpSSR sites from the four Zantedeschia species studied. Genetic diversity analyses based on four polymorphic SSR markers from 134 cultivars of Zantedeschia suggested that high genetic diversity (I = 0.934; Ne = 2.371) is present in the Zantedeschia cultivars. High genetic polymorphism from the cpSSR region suggests that cpSSR could be an effective tool for genetic diversity assessment and identification of Zantedeschia varieties.
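The abstract reports diversity statistics as I and Ne without defining them. A minimal sketch follows, assuming I is Shannon's information index and Ne is the effective number of alleles, as such statistics are commonly defined for SSR markers; the function names and the allele frequencies are hypothetical and are not data from the study.

```python
import math


def _check(freqs: list[float]) -> None:
    # Allele frequencies at one locus must be non-negative and sum to 1.
    if any(p < 0 for p in freqs) or not math.isclose(sum(freqs), 1.0, abs_tol=1e-9):
        raise ValueError("allele frequencies must be non-negative and sum to 1")


def shannon_index(freqs: list[float]) -> float:
    """Shannon's information index: I = -sum(p_i * ln p_i)."""
    _check(freqs)
    return -sum(p * math.log(p) for p in freqs if p > 0)


def effective_alleles(freqs: list[float]) -> float:
    """Effective number of alleles: Ne = 1 / sum(p_i ** 2)."""
    _check(freqs)
    return 1.0 / sum(p * p for p in freqs)


# Hypothetical allele frequencies at a single cpSSR locus (illustration only).
locus = [0.5, 0.3, 0.2]
print(round(shannon_index(locus), 3), round(effective_alleles(locus), 3))
```

Under these definitions, a locus with k equally frequent alleles gives Ne = k and I = ln k, so values reported across several markers would typically be per-locus averages.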


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e9448
Author(s):  
Swati Tyagi ◽  
Jae-A Jung ◽  
Jung Sun Kim ◽  
So Youn Won

Background: Chrysanthemum boreale Makino (Anthemideae, Asteraceae) is a plant of economic, ornamental and medicinal importance. We characterized and compared the chloroplast genomes of three C. boreale strains. These were collected from different geographic regions of Korea and varied in floral morphology. Methods: The chloroplast genomes were obtained by next-generation sequencing techniques, assembled de novo, annotated, and compared with one another. Phylogenetic analysis placed them within the Anthemideae tribe. Results: The sizes of the complete chloroplast genomes of the C. boreale strains were 151,012 bp (strain 121002), 151,098 bp (strain IT232531) and 151,010 bp (strain IT301358). Each genome contained 80 unique protein-coding genes, 4 rRNA genes and 29 tRNA genes. Comparative analyses revealed a high degree of conservation in the overall sequence, gene content, gene order and GC content among the strains. We identified 298 single nucleotide polymorphisms (SNPs) and 106 insertions/deletions (indels) in the chloroplast genomes. These variations were more abundant in non-coding regions than in coding regions. Long dispersed repeats and simple sequence repeats were present in both coding and noncoding regions, with greater frequency in the latter. Regardless of their location, these repeats can be used for molecular marker development. Phylogenetic analysis revealed the evolutionary relationship of the species in the Anthemideae tribe. The three complete chloroplast genomes will be valuable genetic resources for studying the population genetics and evolutionary relationships of Asteraceae species.


2019 ◽  
Vol 20 (22) ◽  
pp. 5812
Author(s):  
Liping Nie ◽  
Yingxian Cui ◽  
Liwei Wu ◽  
Jianguo Zhou ◽  
Zhichao Xu ◽  
...  

Macrosolen plants are parasitic shrubs, several of which are important medicinal plants used as folk medicine in some provinces of China. However, reports on Macrosolen are limited. In this study, the complete chloroplast genome sequences of Macrosolen cochinchinensis, Macrosolen tricolor and Macrosolen bibracteolatus are reported. The chloroplast genomes were sequenced by Illumina HiSeq X. The length of the chloroplast genomes ranged from 126,621 bp (M. tricolor) to 129,570 bp (M. cochinchinensis), with a total of 113 genes, including 35 tRNA genes, eight rRNA genes, 68 protein-coding genes, and two pseudogenes (ycf1 and rpl2). The simple sequence repeats mainly comprise A/T mononucleotide repeats. Comparative genome analyses of the three species detected the most divergent regions in the non-coding spacers. Phylogenetic analyses using maximum parsimony and maximum likelihood strongly supported the idea that Loranthaceae and Viscaceae are monophyletic clades. The data obtained in this study are beneficial for further investigations of Macrosolen with respect to evolution and molecular identification.

