scholarly journals Comparative analysis of the complete plastid genomes of Mangifera species and gene transfer between plastid and mitochondrial genomes

PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e10774
Author(s):  
Yingfeng Niu ◽  
Chengwen Gao ◽  
Jin Liu

Mango is an important commercial fruit crop belonging to the genus Mangifera. In this study, we reported and compared four newly sequenced plastid genomes of the genus Mangifera, which showed high similarities in overall size (157,780–157,853 bp), genome structure, gene order, and gene content. Three mutation hotspots (trnG-psbZ, psbD-trnT, and ycf4-cemA) were identified as candidate DNA barcodes for Mangifera. These three DNA barcode candidate sequences have high species identification ability. We also identified 12 large fragments that were transferred from the plastid genome to the mitochondrial genome, and found that the similarity was more than 99%. The total size of the transferred fragment was 35,652 bp, accounting for 22.6% of the plastid genome. Fifteen intact chloroplast genes, four tRNAs and numerous partial genes and intergenic spacer regions were identified. There are many of these genes transferred from mitochondria to the chloroplast in other species genomes. Phylogenetic analysis based on whole plastid genome data provided a high support value, and the interspecies relationships within Mangifera were resolved well.

Database ◽  
2020 ◽  
Vol 2020 ◽  
Author(s):  
Tao Liu ◽  
Yutong Cui ◽  
Xuli Jia ◽  
Jing Zhang ◽  
Ruoran Li ◽  
...  

Abstract Algae are the oldest taxa on Earth, with an evolutionary relationship that spans prokaryotes (Cyanobacteria) and eukaryotes. A long evolutionary history has led to high algal diversity. Their organelle DNAs are characterized by uniparental inheritance and a compact genome structure compared with nuclear genomes; thus, they are efficient molecular tools for the analysis of gene structure, genome structure, organelle function and evolution. However, an integrated organelle genome database for algae, which could enable users to both examine and use relevant data, has not previously been developed. Therefore, to provide an organelle genome platform for algae, we have developed a user-friendly database named Organelle Genome Database for Algae (OGDA, http://ogda.ytu.edu.cn/). OGDA contains organelle genome data either retrieved from several public databases or sequenced in our laboratory (Laboratory of Genetics and Breeding of Marine Organism [MOGBL]), which are continuously updated. The first release of OGDA contains 1055 plastid genomes and 755 mitochondrial genomes. Additionally, a variety of applications have been integrated into this platform to analyze the structural characteristics, collinearity and phylogeny of organellar genomes for algae. This database represents a useful tool for users, enabling the rapid retrieval and analysis of information related to organellar genomes for biological discovery.


2015 ◽  
Author(s):  
Rob W Ness ◽  
Susanne A Kraemer ◽  
Nick Colegrave ◽  
Peter D Keightley

Plastids perform crucial cellular functions, including photosynthesis, across a wide variety of eukaryotes. Since endosymbiosis, plastids have maintained independent genomes that now display a wide diversity of gene content, genome structure, gene regulation mechanisms, and transmission modes. The evolution of plastid genomes depends on an input ofde novomutation, but our knowledge of mutation in the plastid is limited to indirect inference from patterns of DNA divergence between species. Here, we use a mutation accumulation experiment, where selection acting on mutations is rendered ineffective, combined with whole-plastid genome sequencing to directly characterize de novo mutation inChlamydomonas reinhardtii. We show that the mutation rates of the plastid and nuclear genomes are similar, but that the base spectra of mutations differ significantly. We integrate our measure of the mutation rate with a population genomic dataset of 20 individuals, and show that the plastid genome is subject to substantially stronger genetic drift than the nuclear genome. We also show that high levels of linkage disequilibrium in the plastid genome are not due to restricted recombination, but are instead a consequence of increased genetic drift. One likely explanation for increased drift in the plastid genome is that there are stronger effects of genetic hitchhiking. The presence of recombination in the plastid is consistent with laboratory studies inC. reinhardtiiand demonstrates that although the plastid genome is thought to be uniparentally inherited, it recombines in nature at a rate similar to the nuclear genome.


Plants ◽  
2020 ◽  
Vol 9 (8) ◽  
pp. 965 ◽  
Author(s):  
Xian-Lin Guo ◽  
Hong-Yi Zheng ◽  
Megan Price ◽  
Song-Dong Zhou ◽  
Xing-Jin He

Chamaesium H. Wolff (Apiaceae, Apioideae) is a small genus mainly distributed in the Hengduan Mountains and the Himalayas. Ten species of Chamaesium have been described and nine species are distributed in China. Recent advances in molecular phylogenetics have revolutionized our understanding of Chinese Chamaesium taxonomy and evolution. However, an accurate phylogenetic relationship in Chamaesium based on the second-generation sequencing technology remains poorly understood. Here, we newly assembled nine plastid genomes from the nine Chinese Chamaesium species and combined these genomes with eight other species from five genera to perform a phylogenic analysis by maximum likelihood (ML) using the complete plastid genome and analyzed genome structure, GC content, species pairwise Ka/Ks ratios and the simple sequence repeat (SSR) component. We found that the nine species’ plastid genomes ranged from 152,703 bp (C. thalictrifolium) to 155,712 bp (C. mallaeanum), and contained 133 genes, 34 SSR types and 585 SSR loci. We also found 20,953–21,115 codons from 53 coding sequence (CDS) regions, 38.4–38.7% GC content of the total genome and low Ka/Ks (0.27–0.43) ratios of 53 aligned CDS. These results will facilitate our further understanding of the evolution of the genus Chamaesium.


2006 ◽  
Vol 84 (9) ◽  
pp. 1434-1443 ◽  
Author(s):  
Gernot G. Presting

All oligonucleotides of the sugarcane chloroplast genome that are conserved in one or more of 36 other completed plastid genomes have been identified by computer-assisted sequence comparison. These regions are of interest because they (i) are indicative of strong selection pressures to maintain specific nucleotide sequences that may yield insights into plastid biology and (ii) can be used as priming sites for amplifying intervening sequences that represent potential DNA barcodes for species identification. The majority of conserved sites are located in the inverted repeat (IR) region, but several sites in the single copy region (predominantly in tRNA and psa/psb genes) are conserved among chloroplasts of all higher plants examined here. Of particular interest are protein coding regions that have been conserved at the nucleotide level, as these may be involved in transcript regulation. This analysis also provides the basis for rational design of a DNA barcode for plastids, and several potential barcode regions have been identified. In particular, two oligonucleotides of length 33 and 25, and separated by approximately 362 nucleotides, are found in all cyanobacteria, red, brown and green algae, as well as diatoms, euglenids, apicomplexans and land plants that have been examined to date. Their widespread occurrence makes the intervening sequence a universal marker for all photosynthetic lineages. Analysis of 160 GenBank accessions illustrates that this region discriminates many algae at the species level, but lacks sufficient variation among the more recently diverged land plants to serve as a single DNA barcode for this taxon. However, this marker should be particularly useful for the DNA barcoding of algal lineages and lichens, as well as for environmental sampling. More rapidly evolving regions of the plastid genome also identified here serve as a starting point to design and test barcodes for more narrowly defined lineages, including the more recently diverged angiosperms.


2021 ◽  
Author(s):  
Belen Escobari ◽  
Thomas Borsch ◽  
Taylor S. Quedensley ◽  
Michael Gruenstaeudl

ABSTRACTPREMISEThe genus Gynoxys and relatives form a species-rich lineage of Andean shrubs and trees with low genetic distances within the sunflower subtribe Tussilaginineae. Previous molecular phylogenetic investigations of the Tussilaginineae have included few, if any, representatives of this Gynoxoid group or reconstructed ambiguous patterns of relationships for it.METHODSWe sequenced complete plastid genomes of 21 species of the Gynoxoid group and related Tussilaginineae and conducted detailed comparisons of the phylogenetic relationships supported by the gene, intron, and intergenic spacer partitions of these genomes. We also evaluated the impact of manual, motif-based adjustments of automatic DNA sequence alignments on phylogenetic tree inference.RESULTSOur results indicate that the inclusion of all plastid genome partitions is needed to infer fully resolved phylogenetic trees of the Gynoxoid group. Whole plastome-based tree inference suggests that the genera Gynoxys and Nordenstamia are polyphyletic and form the core clade of the Gynoxoid group. This clade is sister to a clade of Aequatorium and Paragynoxys and also includes some but not all representatives of Paracalia.CONCLUSIONSThe concatenation and combined analysis of all plastid genome partitions and the construction of manually curated, motif-based DNA sequence alignments are found to be instrumental in the recovery of strongly supported relationships of the Gynoxoid group. We demonstrate that the correct assessment of homology in genome-level plastid sequence datasets is crucial for subsequent phylogeny reconstruction and that the manual post-processing of multiple sequence alignments improves the reliability of such reconstructions amid low genetic distances between taxa.


2020 ◽  
Author(s):  
Jingling Li ◽  
Jianmin Tang ◽  
Siyuan Zeng ◽  
Fang Han ◽  
Jing Yuan ◽  
...  

Abstract Background: Pilea is a genus of perennial herbs from the family Urticaceae, and some species are used as courtyard ornamentals or for medicinal purposes. At present, there is no information about the plastid genome of Pilea, which limits our understanding of this genus. Here, we report 4 plastid genomes of Pilea taxa (Pilea mollis, Pilea glauca 'Greizy', Pilea peperomioides and Pilea serpyllacea 'Globosa') and performed comprehensive comparative analysis.Results: The four plastid genomes all have a typical quartile structure. The lengths of the plastid genomes ranged from 150,398 bp to 152,327 bp, and each genome contained 113 unique genes, including 79 protein-coding genes, 4 rRNA genes, and 30 tRNA genes. Comparative analysis showed a rather high level of sequence divergence in the four genomes. Moreover, eight hypervariable regions were identified (petN-psbM, psbZ-trnG-GCC, trnT-UGU-trnL-UAA, accD-psbI, ndhF-rpl32, rpl32-trnL-UAG, ndhA-intron and ycf1), which are proposed for use as DNA barcode regions. Phylogenetic relationships based on the plastid genomes of 23 species of 14 genera of Urticaceae resulted in the placement of Pilea in the middle and lower part of the phylogenetic tree, with 100% bootstrap support within Urticaceae.Conclusion: Our results enrich the resources concerning plastid genomes. Comparative plastome analysis provides insight into the interspecific diversity of the plastid genome of Pilea. The identified hypervariable regions could be used for developing molecular markers applicable in various research areas.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Jingling Li ◽  
Jianmin Tang ◽  
Siyuan Zeng ◽  
Fang Han ◽  
Jing Yuan ◽  
...  

Abstract Background Pilea is a genus of perennial herbs from the family Urticaceae, and some species are used as courtyard ornamentals or for medicinal purposes. At present, there is no information about the plastid genome of Pilea, which limits our understanding of this genus. Here, we report 4 plastid genomes of Pilea taxa (Pilea mollis, Pilea glauca ‘Greizy’, Pilea peperomioides and Pilea serpyllacea ‘Globosa’) and performed comprehensive comparative analysis. Results The four plastid genomes all have a typical quartile structure. The lengths of the plastid genomes ranged from 150,398 bp to 152,327 bp, and each genome contained 113 unique genes, including 79 protein-coding genes, 4 rRNA genes, and 30 tRNA genes. Comparative analysis showed a rather high level of sequence divergence in the four genomes. Moreover, eight hypervariable regions were identified (petN-psbM, psbZ-trnG-GCC, trnT-UGU-trnL-UAA, accD-psbI, ndhF-rpl32, rpl32-trnL-UAG, ndhA-intron and ycf1), which are proposed for use as DNA barcode regions. Phylogenetic relationships based on the plastid genomes of 23 species of 14 genera of Urticaceae resulted in the placement of Pilea in the middle and lower part of the phylogenetic tree, with 100% bootstrap support within Urticaceae. Conclusion Our results enrich the resources concerning plastid genomes. Comparative plastome analysis provides insight into the interspecific diversity of the plastid genome of Pilea. The identified hypervariable regions could be used for developing molecular markers applicable in various research areas.


2019 ◽  
Author(s):  
Michael Gruenstaeudl ◽  
Nils Jenke

ABSTRACTBackgroundThe circular, quadripartite structure of plastid genomes which includes two inverted repeat regions renders the automatic assembly of plastid genomes challenging. The correct assembly of plastid genomes is a prerequisite for the validity of subsequent analyses on plastid genome structure and evolution. Plastome-based phylogenetic or population genetic investigations, for example, require the precise identification of DNA sequence and length to determine the location of nucleotide polymorphisms. The average coverage depth of a genome assembly is often used as an indicator for assembly quality. Visualizing coverage depth across a draft genome allows users to inspect the quality of the assembly and, where applicable, identify regions of reduced assembly confidence. Based on such visualizations, users can conduct a local re-assembly or other forms of targeted error correction. Few, if any, contemporary software tools can visualize the coverage depth of a plastid genome assembly while taking its quadripartite structure into account, despite the interplay between genome structure and assembly quality. A software tool is needed that visualizes the coverage depth of a plastid genome assembly on a circular, quadripartite map of the plastid genome.ResultsWe introduce ‘PACVr’, an R package that visualizes the coverage depth of a plastid genome assembly in relation to the circular, quadripartite structure of the genome as well as to the individual plastome genes. The tool allows visualizations on different scales using a variable window approach and also visualizes the equality of gene synteny in the inverted repeat regions of the plastid genome, thus providing an additional measure of assembly quality. As a tool for plastid genomics, PACVr provides the functionality to identify regions of coverage depth above or below user-defined threshold values and helps to identify non-identical IR regions. To allow easy integration into bioinformatic workflows, PACVr can be directly invoked from a Unix shell, thus facilitating its use in automated quality control. We illustrate the application of PACVr on two empirical datasets and compare the resulting visualizations with alternative software tools for displaying plastome sequencing coverage.ConclusionsPACVr provides a user-friendly tool to visualize (a) the coverage depth of a plastid genome assembly on a circular, quadripartite plastome map and in relation to individual plastome genes, and (b) the equality of gene synteny in the inverted repeat regions. It, thus, contributes to optimizing plastid genome assemblies and increasing the reliability of publicly available plastome sequences, especially in light of incongruence among the visualization results of alternative software tools. The software, example datasets, technical documentation, and a tutorial are available with the package at https://github.com/michaelgruenstaeudl/PACVr.


2021 ◽  
Vol 104 (4) ◽  
pp. 003685042110599
Author(s):  
Dhafer Alzahrani ◽  
Enas Albokhari ◽  
Abidina Abba ◽  
Samaila Yaradua

Caylusea hexagyna and Ochradenus baccatus are two species in the Resedaceae family. In this study, we analysed the complete plastid genomes of these two species using high-throughput sequencing technology and compared their genomic data. The length of the plastid genome of C. hexagyna was 154,390 bp while that of O. baccatus was 153,380 bp. The lengths of the inverted repeats (IR) regions were 26,526 bp and 26,558 bp, those of the large single copy (LSC) regions were 83,870 bp and 83,023 bp; and those of the small single copy (SSC) regions were 17,468 bp and 17,241 bp in C. hexagyna and O. baccatus, respectively. Both genomes consisted of 113 genes: 79 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Repeat analysis showed that the plastid genome included all types of repeats, with more frequent occurrences of palindromic sequences. Comparative studies of SSR markers showed that there were 256 markers in C. hexagyna and 255 in O. baccatus; the majority of the SSRs in these plastid genomes were mononucleotide repeats (A/T). All the clusters in the phylogenetic tree had high support. This study reported the first complete plastid genomes of the genera Caylusea and Ochradenus and the first for the Resedaceae family.


2020 ◽  
Vol 16 (4) ◽  
pp. 705-712
Author(s):  
Le Thi Thu Hien ◽  
Ha Hong Hanh

Cordyceps genus is a well-known traditional medicine worldwide. It contains abundant physiological active compounds that were demonstrated to perform benefit in reducing progression of cancer as well as protecting human health. Accurately classifying species in this genus is essential in order to prevent commercial counterfeit medicines. Nowadays, a taxonomic classification of species based on DNA sequences can overcome the existed limitation in identifying by using only morphological characteristics of this genus. DNA barcodes are standard short genomic regions that are universally present in target lineages and has sufficient sequence variation to discriminate species in the genus. A variety of loci has been suggested as DNA barcodes for plants, including genes and non-coding regions in the nuclear and plastid genomes such as psbA-trnH, matK, rbcL, and ITS. Thus, the objective of this study was to identify selected species of Cordyceps genus using DNA barcodes. Seven strains of Cordyceps were collected. Total DNA extraction and purification, PCR amplification and DNA sequencing were performed with standard chemicals and kits. The candidate ITS1-5.8S-ITS2 region was amplified and sequenced. Data were analyzed using Bioedit 7.2.6 and MEGA 7 softwares. Analysis of seven obtained DNA barcode sequences of collected samples revealed that the ITS1-5.8S-ITS2 region provided high species discriminating power for Cordyceps genus. Accordingly, phylogenetic trees based on this DNA barcode exhibited six samples had closed relationship to Cordyceps militaris, while another specimen was the nearest neighbor to Cordyceps sinensis with average similarities at 99.82% and 99.81%, respectively. Our results support the identification of valuable medicinal plant species within Cordyceps genus.


Sign in / Sign up

Export Citation Format

Share Document