scholarly journals Complex Analyses of Short Inverted Repeats in All Sequenced Chloroplast DNAs

2018 ◽  
Vol 2018 ◽  
pp. 1-10 ◽  
Author(s):  
Václav Brázda ◽  
Jiří Lýsek ◽  
Martin Bartas ◽  
Miroslav Fojta

Chloroplasts are key organelles in the management of oxygen in algae and plants and are therefore crucial for all living beings that consume oxygen. Chloroplasts typically contain a circular DNA molecule with nucleus-independent replication and heredity. Using “palindrome analyser” we performed complete analyses of short inverted repeats (S-IRs) in all chloroplast DNAs (cpDNAs) available from the NCBI genome database. Our results provide basic parameters of cpDNAs including comparative information on localization, frequency, and differences in S-IR presence. In a total of 2,565 cpDNA sequences available, the average frequency of S-IRs in cpDNA genomes is 45 S-IRs/per kbp, significantly higher than that found in mitochondrial DNA sequences. The frequency of S-IRs in cpDNAs generally decreased with S-IR length, but not for S-IRs 15, 22, 24, or 27 bp long, which are significantly more abundant than S-IRs with other lengths. These results point to the importance of specific S-IRs in cpDNA genomes. Moreover, comparison by Levenshtein distance of S-IR similarities showed that a limited number of S-IR sequences are shared in the majority of cpDNAs. S-IRs are not located randomly in cpDNAs, but are length-dependently enriched in specific locations, including the repeat region, stem, introns, and tRNA regions. The highest enrichment was found for 12 bp and longer S-IRs in the stem-loop region followed by 12 bp and longer S-IRs located before the repeat region. On the other hand, S-IRs are relatively rare in rRNA sequences and around introns. These data show nonrandom and conserved arrangements of S-IRs in chloroplast genomes.

Database ◽  
2019 ◽  
Vol 2019 ◽  
Author(s):  
Rui Zhang ◽  
Fangfang Ge ◽  
Huayang Li ◽  
Yudong Chen ◽  
Ying Zhao ◽  
...  

Abstract Inverted repeats (IRs) serve as potential biomarkers for genomic instability, DNA replication and other genetic processes. However, little information can be found in databases to help researchers recognize potential IR nucleotides, explore junction sites and annotate related functional genes. Plant Chloroplast Inverted Repeats (PCIR) is an interactive, web-based platform containing various sequenced chloroplast genomes that enables detection, searching and visualization of large-scale detailed information on IRs. PCIR contains many datasets, including 21 433 IRs, 113 plants chloroplast genomes, 16 948 functional genes and 21 659 visual maps. This database offers an online prediction tool for detecting IRs based on DNA sequences. PCIR can also analyze phylogenetic relationships using IR information among different species and provide users with high-quality marker maps. This database will be a valuable resource for IR distribution patterns, related genes and architectural features.


2020 ◽  
Vol 117 (38) ◽  
pp. 23982-23990 ◽  
Author(s):  
Shengjun Li ◽  
Mu Li ◽  
Kan Liu ◽  
Huimin Zhang ◽  
Shuxin Zhang ◽  
...  

MAC5 is a component of the conserved MOS4-associated complex. It plays critical roles in development and immunity. Here we report that MAC5 is required for microRNA (miRNA) biogenesis. MAC5 interacts with Serrate (SE), which is a core component of the microprocessor that processes primary miRNA transcripts (pri-miRNAs) into miRNAs and binds the stem-loop region of pri-miRNAs. MAC5 is essential for both the efficient processing and the stability of pri-miRNAs. Interestingly, the reduction of pri-miRNA levels inmac5is partially caused by XRN2/XRN3, the nuclear-localized 5′-to-3′ exoribonucleases, and depends on SE. These results reveal that MAC5 plays a dual role in promoting pri-miRNA processing and stability through its interaction with SE and/or pri-miRNAs. This study also uncovers that pri-miRNAs need to be protected from nuclear RNA decay machinery, which is connected to the microprocessor.


Genetics ◽  
2003 ◽  
Vol 163 (4) ◽  
pp. 1299-1313
Author(s):  
Zheng Xu ◽  
Britton Lance ◽  
Claudia Vargas ◽  
Budak Arpinar ◽  
Suchendra Bhandarkar ◽  
...  

Abstract A bioinformatics tool called ODS3 has been created for mapping by sequencing. The tool allows the creation of integrated genomic maps from genetic, physical mapping, and sequencing data and permits an integrated genome map to be stored, retrieved, viewed, and queried in a stand-alone capacity, in a client/server relationship with the Fungal Genome Database (FGDB), and as a web-browsing tool for the FGDB. In that ODS3 is programmed in Java, the tool promotes platform independence and supports export of integrated genome-mapping data in the extensible markup language (XML) for data interchange with other genome information systems. The tool ODS3 is used to create an initial integrated genome map of the AIDS-related fungal pathogen, Pneumocystis carinii. Contig dynamics would indicate that this physical map is ∼50% complete with ∼200 contigs. A total of 10 putative multigene families were found. Two of these putative families were previously characterized in P. carinii, namely the major surface glycoproteins (MSGs) and HSP70 proteins; three of these putative families (not previously characterized in P. carinii) were found to be similar to families encoding the HSP60 in Schizosaccharomyces pombe, the heat-shock Ψ protein in S. pombe, and the RNA synthetase family (i.e., MES1) in Saccharomyces cerevisiae. Physical mapping data are consistent with the 16S, 5.8S, and 26S rDNA genes being single copy in P. carinii. No other fungus outside this genus is known to have the rDNA genes in single copy.


1986 ◽  
Vol 6 (9) ◽  
pp. 3262-3267
Author(s):  
D D Chang ◽  
D A Clayton

Transcription of the heavy strand of mouse mitochondrial DNA starts from two closely spaced, distinct sites located in the displacement loop region of the genome. We report here an analysis of regulatory sequences required for faithful transcription from these two sites. Data obtained from in vitro assays demonstrated that a 51-base-pair region, encompassing nucleotides -40 to +11 of the downstream start site, contains sufficient information for accurate transcription from both start sites. Deletion of the 3' flanking sequences, including one or both start sites to -17, resulted in the initiation of transcription by the mitochondrial RNA polymerase from alternative sites within vector DNA sequences. This feature places the mouse heavy-strand promoter uniquely among other known mitochondrial promoters, all of which absolutely require cognate start sites for transcription. Comparison of the heavy-strand promoter with those of other vertebrate mitochondrial DNAs revealed a remarkably high rate of sequence divergence among species.


Genetics ◽  
1991 ◽  
Vol 129 (3) ◽  
pp. 873-884 ◽  
Author(s):  
Y Ogihara ◽  
T Terachi ◽  
T Sasakuma

Abstract The nucleotide divergence of chloroplast DNAs around the hot spot region related to length mutation in Triticum (wheat) and Aegilops was analyzed. DNA sequences (ca. 4.5 kbp) of three chloroplast genome types of wheat complex were compared with one another and with the corresponding region of other grasses. The sequences region contained rbcL and psaI, two open reading frames, and a pseudogene, rpl23' (pseudogene for ribosomal protein L23) disrupted by AT-rich intergic spacer regions. The evolution of these genes in the closely related wheat complex is characterized by nonbiased nucleotide substitutions in terms of being synonymous/nonsynonymous, having A-T pressure transitions over transversions, and frequent changes at the third codon position, in contrast with the gene evolution among more distant plant groups where biased nucleotide substitutions have frequently occurred. The sequences of these genes had diverged almost in proportion to taxonomic distance. The sequence of the pseudogene rpl23' changed approximately two times faster than that of the coding region. Sequence comparison between the pseudogene and its protein-coding counterpart revealed different degrees of nucleotide homology in wheat, rice and maize, suggesting that the transposition timing of the pseudogene differed and/or that different rates of gene conversion operated on the pseudogene in the cpDNA of the three plant groups in Gramineae. The intergenic spacer regions diverged approximately ten times faster than the genes. The divergence of wheat from barley, and that from rice are estimated based on the nucleotide similarity to be 1.5, 10 and 40 million years, respectively.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e6563
Author(s):  
Jianying Sun ◽  
Xiaofeng Dong ◽  
Qinghe Cao ◽  
Tao Xu ◽  
Mingku Zhu ◽  
...  

Background Ipomoea is the largest genus in the family Convolvulaceae. The species in this genus have been widely used in many fields, such as agriculture, nutrition, and medicine. With the development of next-generation sequencing, more than 50 chloroplast genomes of Ipomoea species have been sequenced. However, the repeats and divergence regions in Ipomoea have not been well investigated. In the present study, we sequenced and assembled eight chloroplast genomes from sweet potato’s close wild relatives. By combining these with 32 published chloroplast genomes, we conducted a detailed comparative analysis of a broad range of Ipomoea species. Methods Eight chloroplast genomes were assembled using short DNA sequences generated by next-generation sequencing technology. By combining these chloroplast genomes with 32 other published Ipomoea chloroplast genomes downloaded from GenBank and the Oxford Research Archive, we conducted a comparative analysis of the repeat sequences and divergence regions across the Ipomoea genus. In addition, separate analyses of the Batatas group and Quamoclit group were also performed. Results The eight newly sequenced chloroplast genomes ranged from 161,225 to 161,721 bp in length and displayed the typical circular quadripartite structure, consisting of a pair of inverted repeat (IR) regions (30,798–30,910 bp each) separated by a large single copy (LSC) region (87,575–88,004 bp) and a small single copy (SSC) region (12,018–12,051 bp). The average guanine-cytosine (GC) content was approximately 40.5% in the IR region, 36.1% in the LSC region, 32.2% in the SSC regions, and 37.5% in complete sequence for all the generated plastomes. The eight chloroplast genome sequences from this study included 80 protein-coding genes, four rRNAs (rrn23, rrn16, rrn5, and rrn4.5), and 37 tRNAs. The boundaries of single copy regions and IR regions were highly conserved in the eight chloroplast genomes. In Ipomoea, 57–89 pairs of repetitive sequences and 39–64 simple sequence repeats were found. By conducting a sliding window analysis, we found six relatively high variable regions (ndhA intron, ndhH-ndhF, ndhF-rpl32, rpl32-trnL, rps16-trnQ, and ndhF) in the Ipomoea genus, eight (trnG, rpl32-trnL, ndhA intron, ndhF-rpl32, ndhH-ndhF, ccsA-ndhD, trnG-trnR, and pasA-ycf3) in the Batatas group, and eight (ndhA intron, petN-psbM, rpl32-trnL, trnG-trnR, trnK-rps16, ndhC-trnV, rps16-trnQ, and trnG) in the Quamoclit group. Our maximum-likelihood tree based on whole chloroplast genomes confirmed the phylogenetic topology reported in previous studies. Conclusions The chloroplast genome sequence and structure were highly conserved in the eight newly-sequenced Ipomoea species. Our comparative analysis included a broad range of Ipomoea chloroplast genomes, providing valuable information for Ipomoea species identification and enhancing the understanding of Ipomoea genetic resources.


Blood ◽  
2011 ◽  
Vol 117 (22) ◽  
pp. 5931-5940 ◽  
Author(s):  
Simon Haefliger ◽  
Christiane Klebig ◽  
Kerstin Schaubitzer ◽  
Julian Schardt ◽  
Nikolai Timchenko ◽  
...  

Abstract Deregulation of the myeloid key transcription factor CEBPA is a common event in acute myeloid leukemia (AML). We previously reported that the chaperone calreticulin is activated in subgroups of AML patients and that calreticulin binds to the stem loop region of the CEBPA mRNA, thereby blocking CEBPA translation. In this study, we screened for additional CEBPA mRNA binding proteins and we identified protein disulfide isomerase (PDI), an endoplasmic reticulum (ER) resident protein, to bind to the CEBPA mRNA stem loop region. We found that forced PDI expression in myeloid leukemic cells in fact blocked CEBPA translation, but not transcription, whereas abolishing PDI function restored CEBPA protein. In addition, PDI protein displayed direct physical interaction with calreticulin. Induction of ER stress in leukemic HL60 and U937 cells activated PDI expression, thereby decreasing CEBPA protein levels. Finally, leukemic cells from 25.4% of all AML patients displayed activation of the unfolded protein response as a marker for ER stress, and these patients also expressed significantly higher PDI levels. Our results indicate a novel role of PDI as a member of the ER stress–associated complex mediating blocked CEBPA translation and thereby suppressing myeloid differentiation in AML patients with activated unfolded protein response (UPR).


Sign in / Sign up

Export Citation Format

Share Document