scholarly journals Trichoderma reesei complete genome sequence, repeat-induced point mutation and partitioning of CAZyme gene clusters

2017 ◽  
Author(s):  
Wan-Chen Li ◽  
Chien-Hao Huang ◽  
Chia-Ling Chen ◽  
Yu-Chien Chuang ◽  
Shu-Yun Tung ◽  
...  

AbstractTrichoderma reesei (Ascomycota, Pezizomycotina) QM6a is a model fungus for a broad spectrum of physiological phenomena, including plant cell wall degradation, industrial production of enzymes, light responses, conidiation, sexual development, polyketide biosynthesis and plant-fungal interactions. The genomes of QM6a and its high-enzyme producing mutants have been sequenced by second-generation-sequencing methods and are publicly available from the Joint Genome Institute (JGI). While these genome sequences have offered useful information for genomic and transcriptomic studies, their limitations and especially their short read lengths make them poorly suited for some particular biological problems, including assembly, genome-wide determination of chromosome architecture and genetic modification or engineering. We integrated Pacific Biosciences and Illumina sequencing platforms for the highest-quality genome assembly yet achieved, revealing seven telomere-to-telomere chromosomes (34,922,528 bp; 10877 genes) with 1630 newly-predicted genes and >1.5 Mb of new sequences. Most new sequences are located on AT-rich blocks, including 7 centromeres, 14 subtelomeres and 2329 interspersed AT-rich blocks. The seven QM6a centromeres separately consist of 24 conserved repeats and 37 putative centromere-encoded genes. These findings open up a new perspective for future centromere and chromosome architecture studies. Next, we demonstrate that sexual crossing readily induced cytosine-to-thymine point mutations on both tandem and unlinked duplicated sequences. We also show by bioinformatic analysis that Trichoderma reesei has evolved a robust repeat-induced point mutation (RIP) system to accumulate AT-rich sequences, with longer AT-rich blocks having more RIP mutations. The widespread distribution of AT-rich blocks correlates genome-wide partitions with gene clusters, explaining why clustering of genes has been reported to not influence gene expression in Trichoderma reesei. Compartmentation of ancestral gene clusters by AT-rich blocks might promote flexibilities that are evolutionarily advantageous in this fungus’ soil habitats and other natural environments. Our analyses, together with the complete genome sequence, provide a better blueprint for biotechnological and industrial applications.

2017 ◽  
Vol 5 (25) ◽  
Author(s):  
Yun Wang ◽  
Hu Liu ◽  
Kai Liu ◽  
Chengqiang Wang ◽  
Hailin Ma ◽  
...  

ABSTRACT Bacillus paralicheniformis MDJK30 was isolated from the rhizosphere of a peony. It could control the pathogen of peony root rot. Here, we report the complete genome sequence of B. paralicheniformis MDJK30. Eleven secondary metabolism gene clusters were predicted.


2019 ◽  
Vol 11 (12) ◽  
pp. 3529-3533
Author(s):  
Pavelas Sazinas ◽  
Morten Lindqvist Hansen ◽  
May Iren Aune ◽  
Marie Højmark Fischer ◽  
Lars Jelsbak

Abstract Many of the soil-dwelling Pseudomonas species are known to produce secondary metabolite compounds, which can have antagonistic activity against other microorganisms, including important plant pathogens. It is thus of importance to isolate new strains of Pseudomonas and discover novel or rare gene clusters encoding bioactive products. In an effort to accomplish this, we have isolated a bioactive Pseudomonas strain DTU12.1 from leaf-covered soil in Denmark. Following genome sequencing with Illumina and Oxford Nanopore technologies, we generated a complete genome sequence with the length of 5,943,629 base pairs. The DTU12.1 strain contained a complete gene cluster for a rare thioquinolobactin siderophore, which was previously described as possessing bioactivity against oomycetes and several fungal species. We placed the DTU12.1 strain within Pseudomonas gessardii subgroup of fluorescent pseudomonads, where it formed a distinct clade with other Pseudomonas strains, most of which also contained a complete thioquinolobactin gene cluster. Only two other Pseudomonas strains were found to contain the gene cluster, though they were present in a different phylogenetic clade and were missing a transcriptional regulator of the whole cluster. We show that having the complete genome sequence and establishing phylogenetic relationships with other strains can enable us to start evaluating the distribution and evolutionary origins of secondary metabolite clusters.


2016 ◽  
Vol 4 (2) ◽  
Author(s):  
Jun Kong ◽  
Hongshan Jiang ◽  
Baiyun Li ◽  
Wenjun Zhao ◽  
Zhihong Li ◽  
...  

Pseudomonas syringae pv. lapsa is a pathovar of Pseudomonas syringae that can infect wheat. The complete genome of P. syringae pv. lapsa strain ATCC 10859 contains a 5,918,899-bp circular chromosome with 4,973 coding sequences, 16 rRNAs, 69 tRNAs, and an average GC content of 59.13%. The analysis of this genome revealed several gene clusters that are related to pathogenesis and virulence.


2003 ◽  
Vol 185 (17) ◽  
pp. 5220-5233 ◽  
Author(s):  
Eric S. Miller ◽  
John F. Heidelberg ◽  
Jonathan A. Eisen ◽  
William C. Nelson ◽  
A. Scott Durkin ◽  
...  

ABSTRACT The complete genome sequence of the T4-like, broad-host-range vibriophage KVP40 has been determined. The genome sequence is 244,835 bp, with an overall G+C content of 42.6%. It encodes 386 putative protein-encoding open reading frames (CDSs), 30 tRNAs, 33 T4-like late promoters, and 57 potential rho-independent terminators. Overall, 92.1% of the KVP40 genome is coding, with an average CDS size of 587 bp. While 65% of the CDSs were unique to KVP40 and had no known function, the genome sequence and organization show specific regions of extensive conservation with phage T4. At least 99 KVP40 CDSs have homologs in the T4 genome (Blast alignments of 45 to 68% amino acid similarity). The shared CDSs represent 36% of all T4 CDSs but only 26% of those from KVP40. There is extensive representation of the DNA replication, recombination, and repair enzymes as well as the viral capsid and tail structural genes. KVP40 lacks several T4 enzymes involved in host DNA degradation, appears not to synthesize the modified cytosine (hydroxymethyl glucose) present in T-even phages, and lacks group I introns. KVP40 likely utilizes the T4-type sigma-55 late transcription apparatus, but features of early- or middle-mode transcription were not identified. There are 26 CDSs that have no viral homolog, and many did not necessarily originate from Vibrio spp., suggesting an even broader host range for KVP40. From these latter CDSs, an NAD salvage pathway was inferred that appears to be unique among bacteriophages. Features of the KVP40 genome that distinguish it from T4 are presented, as well as those, such as the replication and virion gene clusters, that are substantially conserved.


2017 ◽  
Vol 5 (21) ◽  
Author(s):  
Jianguo Xu ◽  
Min Xu ◽  
Kai Liu ◽  
Qinyin Peng ◽  
Meifeng Tao

ABSTRACT Streptomyces sp. Sge12 was isolated from forest soil and exhibited remarkable antimicrobial activities against selected fungi and Gram-positive bacteria. Here, we report the complete genome sequence of this strain, which contains 37 putative secondary metabolite gene clusters.


2017 ◽  
Vol 5 (11) ◽  
Author(s):  
Márcio Tadeu Godinho ◽  
Débora Pires Paula ◽  
Arvind Varsani ◽  
Simone Graça Ribeiro

ABSTRACT Here we report the first complete genome sequence of a cauliflower mosaic virus from Brazil, obtained from the gut content of the predator earwig (Doru luteipes). This virus has a genome of 8,030 nucleotides (nt) and shares 97% genome-wide identity with an isolate from Argentina.


2021 ◽  
Vol 10 (30) ◽  
Author(s):  
Tue K. Nielsen ◽  
Mengistu F. Mekureyaw ◽  
Lars H. Hansen ◽  
Mette H. Nicolaisen ◽  
Thomas G. Roitsch ◽  
...  

Here, we report the complete genome sequence of the cytokinin-producing plant growth-promoting strain Pseudomonas fluorescens G20-18. The complete genome assembly resulted in a single, circular chromosome of 6.48 Mbp and harbors several secondary metabolite biosynthesis gene clusters that are potentially involved in its plant growth-promoting function.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Pachi Pulusu Chanakya ◽  
Balaram Khamari ◽  
Manmath Lama ◽  
Arun Sai Kumar Peketi ◽  
Prakash Kumar ◽  
...  

Abstract Objective M. morganii is a gram-negative, non-lactose fermenting and an opportunistic pathogen frequently associated with nosocomial infections. Although first isolated in 1906 from a pediatric fecal sample, not many M. morganii isolates have been sequenced. The objective of this work is to determine the complete genome sequence of an XDR M. morganii strain (SMM01) isolated from the urine of a patient with urinary and fecal incontinence and to characterize its antimicrobial resistance profile. Data description Here, we report the complete genome sequence of M. morganii SMM01 generated from the hybrid assembly of Illumina HiSeq X and Nanopore MinION reads. The assembly is 100% complete with genome size of 39,30,130 bp and GC content of 51%. Genomic features include 3617 CDS, 18 rRNAs, 78 tRNAs, 4 ncRNAs and 60 pseudogenes. Antimicrobial resistance profile was characterized by the presence of genes conferring resistance to aminoglycosides, β-lactams, fluoroquinolones, chloramphenicol, and tetracyclines. Secondary metabolite biosynthetic gene clusters like NRPS, T1PKS, thiopeptide, beta-lactone, and bacteriocin were identified. The genome data described here would be the first complete genome of an Indian M. morganii isolate providing crucial information on antimicrobial resistance patterns, paving the way for further comparative genome analyses.


2018 ◽  
Vol 6 (14) ◽  
Author(s):  
M. Fatima C. Ilagan-Cruzada ◽  
Albert Remus R. Rosana ◽  
Andrew D. Montecillo ◽  
Noel G. Sabino ◽  
Ida F. Dalmacio

ABSTRACT Lactobacillus plantarum subsp. plantarum strain LB1-2, isolated from the hindgut of European honeybees in the Philippines, is active against Paenibacillus larvae and has broad activity against several Gram-positive and Gram-negative bacteria. The complete genome sequence reported herein contains gene clusters for multiple bacteriocins and extensive gene inventories for carbohydrate metabolism.


Sign in / Sign up

Export Citation Format

Share Document