scholarly journals A genome-based species taxonomy of the Lactobacillus Genus Complex

2019 ◽  
Author(s):  
Stijn Wittouck ◽  
Sander Wuyts ◽  
Conor J Meehan ◽  
Vera van Noort ◽  
Sarah Lebeer

AbstractBackgroundThere are over 200 published species within the Lactobacillus Genus Complex (LGC), the majority of which have sequenced type strain genomes available. Although gold standard, genome-based species delimitation cutoffs are accepted by the community, they are seldom checked against currently available genome data. In addition, there are many species-level misclassification issues within the LGC. We constructed a de novo species taxonomy for the LGC based on 2,459 publicly available, decent-quality genomes and using a 94% core nucleotide identity threshold. We reconciled these de novo species with published species and subspecies names by (i) identifying genomes of type strains in our dataset and (ii) performing comparisons based on 16S rRNA sequence identity against type strains.ResultsWe found that genomes within the LGC could be divided into 239 clusters (de novo species) that were discontinuous and exclusive. Comparison of these de novo species to published species lead to the identification of ten sets of published species that can be merged and one species that can be split. Further, we found at least eight genome clusters that constitute new species. Finally, we were able to accurately classify 98 unclassified genomes and reclassify 74 wrongly classified genomes.ConclusionsThe current state of LGC species taxonomy is largely consistent with genome data, but there are some inconsistencies as well as genome misclassifications. These inconsistencies should be resolved to evolve towards a meaningful taxonomy where species have a consistent size in terms of sequence divergence.

mSystems ◽  
2019 ◽  
Vol 4 (5) ◽  
Author(s):  
Stijn Wittouck ◽  
Sander Wuyts ◽  
Conor J. Meehan ◽  
Vera van Noort ◽  
Sarah Lebeer

ABSTRACT There are more than 200 published species within the Lactobacillus genus complex (LGC), the majority of which have sequenced type strain genomes available. Although genome-based species delimitation cutoffs are accepted as the gold standard by the community, these are seldom actually checked for new or already published species. In addition, the availability of genome data is revealing inconsistencies in the species-level classification of many strains. We constructed a de novo species taxonomy for the LGC based on 2,459 publicly available genomes, using a 94% core nucleotide identity cutoff. We reconciled these de novo species with published species and subspecies names by (i) identifying genomes of type strains and (ii) comparing 16S rRNA genes of the genomes with 16S rRNA genes of type strains. We found that genomes within the LGC could be divided into 239 de novo species that were discontinuous and exclusive. Comparison of these de novo species to published species led to the identification of nine sets of published species that can be merged and one species that can be split. Further, we found at least eight de novo species that constitute new, unpublished species. Finally, we reclassified 74 genomes on the species level and identified for the first time the species of 98 genomes. Overall, the current state of LGC species taxonomy is largely consistent with genome-based species delimitation cutoffs. There are, however, exceptions that should be resolved to evolve toward a taxonomy where species share a consistent diversity in terms of sequence divergence. IMPORTANCE The Lactobacillus genus complex is a group of bacteria that constitutes an important source of strains with medical and food applications. The number of bacterial whole-genome sequences available for this taxon has been increasing rapidly in recent years. Despite this wealth of information, the species within this group are still largely defined by older techniques. Here, we constructed a completely new species-level taxonomy for the Lactobacillus genus complex based on ∼2,500 whole-genome sequences. As a result of this effort, we found that many genomes are not classified to their correct species, and we were able to correct these. In addition, we found that some published species are abnormally large, while others are too small. Finally, we discovered at least eight completely novel species that have not been published before. Our work will help the field to evolve toward a more meaningful and complete taxonomy, based on whole-genome sequences.


2019 ◽  
Vol 20 (1) ◽  
pp. 35-54
Author(s):  
K. Bouti ◽  
◽  
H. Boudjella ◽  
N. Bouras ◽  
A. Zitouni ◽  
...  

n a continuing search fornew antimicrobial products from actinobacteriacollected in Algerian Saharan soils, an isolateof actinobacteria,designated Sg163, was selected for its interesting antimicrobial activity.The isolate wasidentified to the genus Streptosporangium by phenotypicand molecular criteria. The cultural and physiological characterizations as well as phylogenetic analysis indicated that the isolate was different from known members of the genus Streptosporangium. Analysis of the 16S rRNA sequence showed 97.09to 98.27% similarity with those of Streptosporangium type strains. The strainSg163 produced antifungal and antibacterial activities on several culture media. The highest antimicrobial activities were obtained in ISP2 medium. Three active products C1, C2 and C3 with both antifungal and antibacterial activities were isolated and purified by chromatographic methodswith C2, as the major compound. The data of the infraredspectroscopy andthe chemical revelations, suggested that the active molecules were glycosylatedaromaticcompounds.


2005 ◽  
Vol 55 (1) ◽  
pp. 281-287 ◽  
Author(s):  
Virginia Gallego ◽  
María Teresa García ◽  
Antonio Ventosa

Members of the genus Methylobacterium are ubiquitous in nature and can be isolated from almost any freshwater environment where dissolved oxygen exists. This genus is composed of a variety of pink-pigmented, facultatively methylotrophic (PPFM) bacteria. During a screening programme to monitor the bacterial population present in the drinking water of a municipal water supply in Seville (Spain) during the year 2003, five strains of PPFM bacteria were isolated and characterized. Analysis of their complete 16S rRNA gene sequences revealed that they constituted two separate phylogenetic groups (strains GP34T and GR18, and strains GR16T, GP22 and GP32, respectively) showing highest similarity to members of the genus Methylobacterium. The highest 16S rRNA sequence similarities of strain GP34T were found with respect to the type strains of Methylobacterium radiotolerans (96·6 %) and Methylobacterium fujisawaense (96·4 %) and the highest 16S rRNA sequence similarities of strain GR16T were to the type strains of Methylobacterium extorquens (96·0 %) and Methylobacterium rhodesianum (95·8 %). The G+C content of their DNA ranged from 66·5 to 67·8 mol%. DNA–DNA hybridization studies confirmed that they constituted two separate genospecies. On the basis of this phenotypic, phylogenetic and genotypic study, two novel species of the genus Methylobacterium are proposed: Methylobacterium hispanicum sp. nov., with type strain GP34T (CECT 5997T=CCM 7219T=DSM 16372T=CIP 108332T), and Methylobacterium aquaticum sp. nov., with type strain GR16T (CECT 5998T=CCM 7218T=DSM 16371T=CIP 108333T).


2020 ◽  
Vol 15 ◽  
Author(s):  
Dicle Yalcin ◽  
Hasan H. Otu

Background: Epigenetic repression mechanisms play an important role in gene regulation, specifically in cancer development. In many cases, a CpG island’s (CGI) susceptibility or resistance to methylation are shown to be contributed by local DNA sequence features. Objective: To develop unbiased machine learning models–individually and combined for different biological features–that predict the methylation propensity of a CGI. Methods: We developed our model consisting of CGI sequence features on a dataset of 75 sequences (28 prone, 47 resistant) representing a genome-wide methylation structure. We tested our model on two independent datasets that are chromosome (132 sequences) and disease (70 sequences) specific. Results: We provided improvements in prediction accuracy over previous models. Our results indicate that combined features better predict the methylation propensity of a CGI (area under the curve (AUC) ~0.81). Our global methylation classifier performs well on independent datasets reaching an AUC of ~0.82 for the complete model and an AUC of ~0.88 for the model using select sequences that better represent their classes in the training set. We report certain de novo motifs and transcription factor binding site (TFBS) motifs that are consistently better in separating prone and resistant CGIs. Conclusion: Predictive models for the methylation propensity of CGIs lead to a better understanding of disease mechanisms and can be used to classify genes based on their tendency to contain methylation prone CGIs, which may lead to preventative treatment strategies. MATLAB and Python™ scripts used for model building, prediction, and downstream analyses are available at https://github.com/dicleyalcin/methylProp_predictor.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Yujing Liu ◽  
Zhang Song ◽  
Hualong Zeng ◽  
Meng Lu ◽  
Weiyao Zhu ◽  
...  

AbstractPseudomonas are ubiquitously occurring microorganisms and are known for their ability to produce antimicrobials. An endophytic bacterial strain NP-1 T, isolated from Eucalyptus dunnii leaves, exhibits antifungal properties against five tested phytopathogenic fungi. The strain is a Gram-negative rod-shaped bacterium containing a single polar flagellum. It is strictly aerobic, grows at 4–37 °C, 2–5% NaCl, and pH 3–7. The 16S rRNA sequence analysis showed that NP-1 T belongs to the Pseudomonas genus. Phylogenetic analysis based on four concatenated partial genes (16S rDNA, gyrB, rpoB and rpoD) and the phylogenomic tree indicated that NP-1 T belongs to Pseudomonas fluorescens lineage but is distinct from any known Pseudomonas species. The G + C mol % of NP-1 T genome is 63.96, and the differences between NP-1 T and related species are larger than 1. The digital DNA-DNA hybridization and tetranucleotide signatures are 23.8 and 0.97, which clearly separates strain NP-1 T from its closest neighbours, Pseudomonas coleopterorum and Pseudomonas rhizosphaerae. Its phenotypic and chemotaxonomic features confirmed its differentiation from related taxa. The results from this polyphasic approach support the classification of NP-1 T as a novel species of Pseudomonas, and the name of Pseudomonas eucalypticola is thus proposed for this strain, whose type is NP-1 T (= CCTCC M2018494T = JCM 33572 T).


Genes ◽  
2021 ◽  
Vol 12 (2) ◽  
pp. 246
Author(s):  
Xiaomeng Chen ◽  
Rui Li ◽  
Yonglin Wang ◽  
Aining Li

An emerging poplar canker caused by the gram-negative bacterium, Lonsdalea populi, has led to high mortality of hybrid poplars Populus × euramericana in China and Europe. The molecular bases of pathogenicity and bark adaptation of L. populi have become a focus of recent research. This study revealed the whole genome sequence and identified putative virulence factors of L. populi. A high-quality L. populi genome sequence was assembled de novo, with a genome size of 3,859,707 bp, containing approximately 3434 genes and 107 RNAs (75 tRNA, 22 rRNA, and 10 ncRNA). The L. populi genome contained 380 virulence-associated genes, mainly encoding for adhesion, extracellular enzymes, secretory systems, and two-component transduction systems. The genome had 110 carbohydrate-active enzyme (CAZy)-coding genes and putative secreted proteins. The antibiotic-resistance database annotation listed that L. populi was resistant to penicillin, fluoroquinolone, and kasugamycin. Analysis of comparative genomics found that L. populi exhibited the highest homology with the L. britannica genome and L. populi encompassed 1905 specific genes, 1769 dispensable genes, and 1381 conserved genes, suggesting high evolutionary diversity and genomic plasticity. Moreover, the pan genome analysis revealed that the N-5-1 genome is an open genome. These findings provide important resources for understanding the molecular basis of the pathogenicity and biology of L. populi and the poplar-bacterium interaction.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Huihui Li ◽  
Mingzhe Xie ◽  
Yan Wang ◽  
Ludong Yang ◽  
Zhi Xie ◽  
...  

AbstractriboCIRC is a translatome data-oriented circRNA database specifically designed for hosting, exploring, analyzing, and visualizing translatable circRNAs from multi-species. The database provides a comprehensive repository of computationally predicted ribosome-associated circRNAs; a manually curated collection of experimentally verified translated circRNAs; an evaluation of cross-species conservation of translatable circRNAs; a systematic de novo annotation of putative circRNA-encoded peptides, including sequence, structure, and function; and a genome browser to visualize the context-specific occupant footprints of circRNAs. It represents a valuable resource for the circRNA research community and is publicly available at http://www.ribocirc.com.


2020 ◽  
Vol 12 (6) ◽  
pp. 905-910 ◽  
Author(s):  
Ruoyu Liu ◽  
Kun Wang ◽  
Jun Liu ◽  
Wenjie Xu ◽  
Yang Zhou ◽  
...  

Abstract Cold seeps, characterized by the methane, hydrogen sulfide, and other hydrocarbon chemicals, foster one of the most widespread chemosynthetic ecosystems in deep sea that are densely populated by specialized benthos. However, scarce genomic resources severely limit our knowledge about the origin and adaptation of life in this unique ecosystem. Here, we present a genome of a deep-sea limpet Bathyacmaea lactea, a common species associated with the dominant mussel beds in cold seeps. We yielded 54.6 gigabases (Gb) of Nanopore reads and 77.9-Gb BGI-seq raw reads, respectively. Assembly harvested a 754.3-Mb genome for B. lactea, with 3,720 contigs and a contig N50 of 1.57 Mb, covering 94.3% of metazoan Benchmarking Universal Single-Copy Orthologs. In total, 23,574 protein-coding genes and 463.4 Mb of repetitive elements were identified. We analyzed the phylogenetic position, substitution rate, demographic history, and TE activity of B. lactea. We also identified 80 expanded gene families and 87 rapidly evolving Gene Ontology categories in the B. lactea genome. Many of these genes were associated with heterocyclic compound metabolism, membrane-bounded organelle, metal ion binding, and nitrogen and phosphorus metabolism. The high-quality assembly and in-depth characterization suggest the B. lactea genome will serve as an essential resource for understanding the origin and adaptation of life in the cold seeps.


Sign in / Sign up

Export Citation Format

Share Document