scholarly journals Genomic analysis of Pseudomonas putida: genes in a genome island are crucial for nicotine degradation

2012 ◽  
Vol 2 (1) ◽  
Author(s):  
Hongzhi Tang ◽  
Yuxiang Yao ◽  
Lijuan Wang ◽  
Hao Yu ◽  
Yiling Ren ◽  
...  
2021 ◽  
Vol 9 (2) ◽  
pp. 348
Author(s):  
Florian Tagini ◽  
Trestan Pillonel ◽  
Claire Bertelli ◽  
Katia Jaton ◽  
Gilbert Greub

The Mycobacterium kansasii species comprises six subtypes that were recently classified into six closely related species; Mycobacterium kansasii (formerly M. kansasii subtype 1), Mycobacterium persicum (subtype 2), Mycobacterium pseudokansasii (subtype 3), Mycobacterium ostraviense (subtype 4), Mycobacterium innocens (subtype 5) and Mycobacterium attenuatum (subtype 6). Together with Mycobacterium gastri, they form the M. kansasii complex. M. kansasii is the most frequent and most pathogenic species of the complex. M. persicum is classically associated with diseases in immunosuppressed patients, and the other species are mostly colonizers, and are only very rarely reported in ill patients. Comparative genomics was used to assess the genetic determinants leading to the pathogenicity of members of the M. kansasii complex. The genomes of 51 isolates collected from patients with and without disease were sequenced and compared with 24 publicly available genomes. The pathogenicity of each isolate was determined based on the clinical records or public metadata. A comparative genomic analysis showed that all M. persicum, M. ostraviense, M innocens and M. gastri isolates lacked the ESX-1-associated EspACD locus that is thought to play a crucial role in the pathogenicity of M. tuberculosis and other non-tuberculous mycobacteria. Furthermore, M. kansasii was the only species exhibiting a 25-Kb-large genomic island encoding for 17 type-VII secretion system-associated proteins. Finally, a genome-wide association analysis revealed that two consecutive genes encoding a hemerythrin-like protein and a nitroreductase-like protein were significantly associated with pathogenicity. These two genes may be involved in the resistance to reactive oxygen and nitrogen species, a required mechanism for the intracellular survival of bacteria. Three non-pathogenic M. kansasii lacked these genes likely due to two distinct distributive conjugal transfers (DCTs) between M. attenuatum and M. kansasii, and one DCT between M. persicum and M. kansasii. To our knowledge, this is the first study linking DCT to reduced pathogenicity.


2021 ◽  
Author(s):  
Xinxin Yi ◽  
Jing Liu ◽  
Shengcai Chen ◽  
Hao Wu ◽  
Min Liu ◽  
...  

Cultivated soybean (Glycine max) is an important source for protein and oil. Many elite cultivars with different traits have been developed for different conditions. Each soybean strain has its own genetic diversity, and the availability of more high-quality soybean genomes can enhance comparative genomic analysis for identifying genetic underpinnings for its unique traits. In this study, we constructed a high-quality de novo assembly of an elite soybean cultivar Jidou 17 (JD17) with chromsome contiguity and high accuracy. We annotated 52,840 gene models and reconstructed 74,054 high-quality full-length transcripts. We performed a genome-wide comparative analysis based on the reference genome of JD17 with three published soybeans (WM82, ZH13 and W05) , which identified five large inversions and two large translocations specific to JD17, 20,984 - 46,912 PAVs spanning 13.1 - 46.9 Mb in size, and 5 - 53 large PAV clusters larger than 500kb. 1,695,741 - 3,664,629 SNPs and 446,689 - 800,489 Indels were identified and annotated between JD17 and them. Symbiotic nitrogen fixation (SNF) genes were identified and the effects from these variants were further evaluated. It was found that the coding sequences of 9 nitrogen fixation-related genes were greatly affected. The high-quality genome assembly of JD17 can serve as a valuable reference for soybean functional genomics research.


2011 ◽  
Vol 56 (2) ◽  
pp. 1001-1009 ◽  
Author(s):  
Matilde Fernández ◽  
Susana Conde ◽  
Jesús de la Torre ◽  
Carlos Molina-Santiago ◽  
Juan-Luis Ramos ◽  
...  

ABSTRACTPseudomonas putidaKT2440 is a chloramphenicol-resistant bacterium that is able to grow in the presence of this antibiotic at a concentration of up to 25 μg/ml. Transcriptomic analyses revealed that the expression profile of 102 genes changed in response to this concentration of chloramphenicol in the culture medium. The genes that showed altered expression include those involved in general metabolism, cellular stress response, gene regulation, efflux pump transporters, and protein biosynthesis. Analysis of a genome-wide collection of mutants showed that survival of a knockout mutant in the TtgABC resistance-nodulation-division (RND) efflux pump and mutants in the biosynthesis of pyrroloquinoline (PQQ) were compromised in the presence of chloramphenicol. The analysis also revealed that an ABC extrusion system (PP2669/PP2668/PP2667) and the AgmR regulator (PP2665) were needed for full resistance toward chloramphenicol. Transcriptional arrays revealed that AgmR controls the expression of thepqqgenes and the operon encoding the ABC extrusion pump from the promoter upstream of open reading frame (ORF) PP2669.


2019 ◽  
Author(s):  
Dhaivat Joshi ◽  
Shunfu Mao ◽  
Sreeram Kannan ◽  
Suhas Diggavi

AbstractMotivationEfficient and accurate alignment of DNA / RNA sequence reads to each other or to a reference genome / transcriptome is an important problem in genomic analysis. Nanopore sequencing has emerged as a major sequencing technology and many long-read aligners have been designed for aligning nanopore reads. However, the high error rate makes accurate and efficient alignment difficult. Utilizing the noise and error characteristics inherent in the sequencing process properly can play a vital role in constructing a robust aligner. In this paper, we design QAlign, a pre-processor that can be used with any long-read aligner for aligning long reads to a genome / transcriptome or to other long reads. The key idea in QAlign is to convert the nucleotide reads into discretized current levels that capture the error modes of the nanopore sequencer before running it through a sequence aligner.ResultsWe show that QAlign is able to improve alignment rates from around 80% up to 90% with nanopore reads when aligning to the genome. We also show that QAlign improves the average overlap quality by 9.2%, 2.5% and 10.8% in three real datasets for read-to-read alignment. Read-to-transcriptome alignment rates are improved from 51.6% to 75.4% and 82.6% to 90% in two real datasets.Availabilityhttps://github.com/joshidhaivat/QAlign.git


2020 ◽  
Vol 63 (2) ◽  
pp. 483-491
Author(s):  
Haobin Hou ◽  
Xiaoliang Wang ◽  
Caiyun Zhang ◽  
Yingying Tu ◽  
Wenwei Lv ◽  
...  

Abstract. Facial pigmentation is an important economic trait of chickens, especially for laying hens, which will affect the carcass appearance of eliminated layers. Therefore, identifying the genomic regions and exploring the function of this region that contributes to understanding the variation of skin color traits is significant for breeding. In the study, 291 pure-line Xinyang blue-shelled laying hens were selected, of which 75 were dark-faced chickens and 216 were white-faced chickens. The population was sequenced and typed by GBS genotyping technology. The obtained high-quality SNPs and pigmentation phenotypes were analyzed by a genome-wide association study (GWAS) and a FST scan. Based on the two analytical methods, we identified a same genomic region (10.70–11.60 Mb) on chromosome 20 with 68 significant SNPs (−log 10(P)>6), mapped to 10 known genes, including NPEPL1, EDN3, GNAS, C20orf85, VAPB, BMP7, TUBB1, ELMO2, DDX27, and NCOA5, which are associated with dermal hyperpigmentation.


2015 ◽  
Author(s):  
Liya Wang ◽  
Peter Van Buren ◽  
Doreen Ware

Over the past few years, cloud-based platforms have been proposed to address storage, management, and computation of large-scale data, especially in the field of genomics. However, for collaboration efforts involving multiple institutes, data transfer and management, interoperability and standardization among different platforms have imposed new challenges. This paper proposes a distributed bioinformatics platform that can leverage local clusters with remote computational clusters for genomic analysis using the unified bioinformatics workflow. The platform is built with a data server configured with iRODS, a computation cluster authenticated with iPlant Agave system, and web server to interact with the platform. A Genome-Wide Association Study workflow is integrated to validate the feasibility of the proposed approach.


2021 ◽  
Author(s):  
Guangyang Wang ◽  
Shenghui Li ◽  
Qiulong Yan ◽  
Ruochun Guo ◽  
Yue Zhang ◽  
...  

Abstract Background: Viruses in the human gut have been linked to health and disease. Deciphering of the gut virome is dependent on metagenomic sequencing of the virus-like particles purified from the fecal specimens. A major limitation of conventional viral metagenomic sequencing is the low recoverability of viral genomes from the metagenomic dataset. Results: Herein, we developed an optimal method for viral amplification and metagenomic sequencing to maximize the recovery of viral genomes. Using 5 fecal specimens with multiple repetitions, we revealed the optimal number of PCR cycles of high-fidelity enzyme-based amplification and the reliability of multiple displacement amplification in virome DNA preparation, verified the reproducibility of the optimally whole viral metagenomic experimental process, and tested the capability of long-read sequencing for improving viral metagenomic assembly. Based on our optimized results, we generated 151 high-quality viruses using the data combined from short-read (15 cycles for PCR amplification) and long-read sequencing. Genomic analysis of these viruses found that most (60.3%) of them were previously unknown and showed a remarkable diversity of viral functions, especially the existence of 206 viral auxiliary metabolic genes. Finally, we compared the viral metagenomic and bulk metagenomic sequencing approaches and revealed significant differences in the efficiency and coverage of viral identification between them. Conclusions: Our study demonstrates the potential of optimized experiment and sequencing strategies in uncovering viral genomes from fecal specimens, which will facilitate future research about genome-level characterization of complex viral communities.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Dayana E. Salas-Leiva ◽  
Eelco C. Tromer ◽  
Bruce A. Curtis ◽  
Jon Jerlström-Hultqvist ◽  
Martin Kolisko ◽  
...  

AbstractCells replicate and segregate their DNA with precision. Previous studies showed that these regulated cell-cycle processes were present in the last eukaryotic common ancestor and that their core molecular parts are conserved across eukaryotes. However, some metamonad parasites have secondarily lost components of the DNA processing and segregation apparatuses. To clarify the evolutionary history of these systems in these unusual eukaryotes, we generated a genome assembly for the free-living metamonad Carpediemonas membranifera and carried out a comparative genomics analysis. Here, we show that parasitic and free-living metamonads harbor an incomplete set of proteins for processing and segregating DNA. Unexpectedly, Carpediemonas species are further streamlined, lacking the origin recognition complex, Cdc6 and most structural kinetochore subunits. Carpediemonas species are thus the first known eukaryotes that appear to lack this suite of conserved complexes, suggesting that they likely rely on yet-to-be-discovered or alternative mechanisms to carry out these fundamental processes.


2021 ◽  
Author(s):  
Xiaoming Song ◽  
Yanping Wei ◽  
Dong Xiao ◽  
Ke Gong ◽  
Pengchuan Sun ◽  
...  

Abstract Ethiopian mustard (Brassica carinata) in the Brassicaceae family possesses many excellent agronomic traits. Here, the high-quality genome sequence of B. carinata is reported. Characterization revealed a genome anchored to 17 chromosomes with a total length of 1.087 Gb and an N50 scaffold length of 60 Mb. Repetitive sequences account for approximately 634 Mb or 58.34% of the B. carinata genome. Notably, 51.91% of 97,149 genes are confined to the terminal 20% of chromosomes as a result of the expansion of repeats in pericentromeric regions. Brassica carinata shares one whole-genome triplication event with the five other species in U’s triangle, a classic model of evolution and polyploidy in Brassica. Brassica carinata was deduced to have formed ∼0.047 Mya, which is slightly earlier than B. napus but later than B. juncea. Our analysis indicated that the relationship between the two subgenomes (BcaB and BcaC) is greater than that between other two tetraploid subgenomes (BjuB and BnaC) and their respective diploid parents. RNA-seq datasets and comparative genomic analysis were used to identify several key genes in pathways regulating disease resistance and glucosinolate metabolism. Further analyses revealed that genome triplication and tandem duplication played important roles in the expansion of those genes in Brassica species. With the genome sequencing of B. carinata completed, the genomes of all six Brassica species in U’s triangle are now resolved. The data obtained from genome sequencing, transcriptome analysis, and comparative genomic efforts in this study provide valuable insights into the genome evolution of the six Brassica species in U’s triangle.


Sign in / Sign up

Export Citation Format

Share Document