A genome-based species taxonomy of the Lactobacillus Genus Complex

Mapping Intimacies ◽

10.1101/537084 ◽

2019 ◽

Cited By ~ 1

Author(s):

Stijn Wittouck ◽

Sander Wuyts ◽

Conor J Meehan ◽

Vera van Noort ◽

Sarah Lebeer

Keyword(s):

De Novo ◽

Sequence Divergence ◽

Rrna Sequence ◽

16S Rrna Sequence ◽

Genome Data ◽

Current State ◽

A Genome ◽

Identity Threshold ◽

Species Taxonomy ◽

Type Strains

AbstractBackgroundThere are over 200 published species within the Lactobacillus Genus Complex (LGC), the majority of which have sequenced type strain genomes available. Although gold standard, genome-based species delimitation cutoffs are accepted by the community, they are seldom checked against currently available genome data. In addition, there are many species-level misclassification issues within the LGC. We constructed a de novo species taxonomy for the LGC based on 2,459 publicly available, decent-quality genomes and using a 94% core nucleotide identity threshold. We reconciled these de novo species with published species and subspecies names by (i) identifying genomes of type strains in our dataset and (ii) performing comparisons based on 16S rRNA sequence identity against type strains.ResultsWe found that genomes within the LGC could be divided into 239 clusters (de novo species) that were discontinuous and exclusive. Comparison of these de novo species to published species lead to the identification of ten sets of published species that can be merged and one species that can be split. Further, we found at least eight genome clusters that constitute new species. Finally, we were able to accurately classify 98 unclassified genomes and reclassify 74 wrongly classified genomes.ConclusionsThe current state of LGC species taxonomy is largely consistent with genome data, but there are some inconsistencies as well as genome misclassifications. These inconsistencies should be resolved to evolve towards a meaningful taxonomy where species have a consistent size in terms of sequence divergence.

Download Full-text

Geographic and bathymetric patterns of mitochondrial 16S rRNA sequence divergence among deep-sea amphipods, Eurythenes gryllus

Marine Biology ◽

10.1007/bf00351330 ◽

1996 ◽

Vol 126 (4) ◽

pp. 633-643 ◽

Cited By ~ 101

Author(s):

S. C. France ◽

T. D. Kocher

Keyword(s):

16S Rrna ◽

Deep Sea ◽

Sequence Divergence ◽

Rrna Sequence ◽

16S Rrna Sequence

Download Full-text

Phylogenetic relationships of filamentous cyanobacterial taxa inferred from 16S rRNA sequence divergence.

The Journal of General and Applied Microbiology ◽

10.2323/jgam.43.237 ◽

1997 ◽

Vol 43 (4) ◽

pp. 237-241 ◽

Cited By ~ 24

Author(s):

Tatsuya Ishida ◽

Akira Yokota ◽

Junta Sugiyama

Keyword(s):

16S Rrna ◽

Phylogenetic Relationships ◽

Sequence Divergence ◽

Rrna Sequence ◽

16S Rrna Sequence

Download Full-text

A Genome-Based Species Taxonomy of the Lactobacillus Genus Complex

mSystems ◽

10.1128/msystems.00264-19 ◽

2019 ◽

Vol 4 (5) ◽

Cited By ~ 16

Author(s):

Stijn Wittouck ◽

Sander Wuyts ◽

Conor J. Meehan ◽

Vera van Noort ◽

Sarah Lebeer

Keyword(s):

16S Rrna ◽

Species Delimitation ◽

De Novo ◽

Species Level ◽

16S Rrna Genes ◽

Rrna Genes ◽

Whole Genome ◽

Genome Sequences ◽

Species Taxonomy ◽

Type Strains

ABSTRACT There are more than 200 published species within the Lactobacillus genus complex (LGC), the majority of which have sequenced type strain genomes available. Although genome-based species delimitation cutoffs are accepted as the gold standard by the community, these are seldom actually checked for new or already published species. In addition, the availability of genome data is revealing inconsistencies in the species-level classification of many strains. We constructed a de novo species taxonomy for the LGC based on 2,459 publicly available genomes, using a 94% core nucleotide identity cutoff. We reconciled these de novo species with published species and subspecies names by (i) identifying genomes of type strains and (ii) comparing 16S rRNA genes of the genomes with 16S rRNA genes of type strains. We found that genomes within the LGC could be divided into 239 de novo species that were discontinuous and exclusive. Comparison of these de novo species to published species led to the identification of nine sets of published species that can be merged and one species that can be split. Further, we found at least eight de novo species that constitute new, unpublished species. Finally, we reclassified 74 genomes on the species level and identified for the first time the species of 98 genomes. Overall, the current state of LGC species taxonomy is largely consistent with genome-based species delimitation cutoffs. There are, however, exceptions that should be resolved to evolve toward a taxonomy where species share a consistent diversity in terms of sequence divergence. IMPORTANCE The Lactobacillus genus complex is a group of bacteria that constitutes an important source of strains with medical and food applications. The number of bacterial whole-genome sequences available for this taxon has been increasing rapidly in recent years. Despite this wealth of information, the species within this group are still largely defined by older techniques. Here, we constructed a completely new species-level taxonomy for the Lactobacillus genus complex based on ∼2,500 whole-genome sequences. As a result of this effort, we found that many genomes are not classified to their correct species, and we were able to correct these. In addition, we found that some published species are abnormally large, while others are too small. Finally, we discovered at least eight completely novel species that have not been published before. Our work will help the field to evolve toward a more meaningful and complete taxonomy, based on whole-genome sequences.

Download Full-text

Taxonomy and antimicrobial activity of streptosporangium strain SG163 isolated from a Saharan soil

Lebanese Science Journal ◽

10.22453/lsj-020.1.035-054 ◽

2019 ◽

Vol 20 (1) ◽

pp. 35-54

Author(s):

K. Bouti ◽

◽

H. Boudjella ◽

N. Bouras ◽

A. Zitouni ◽

...

Keyword(s):

Antimicrobial Activity ◽

Phylogenetic Analysis ◽

16S Rrna ◽

Culture Media ◽

Antibacterial Activities ◽

Antimicrobial Activities ◽

Rrna Sequence ◽

16S Rrna Sequence ◽

Saharan Soil ◽

Type Strains

n a continuing search fornew antimicrobial products from actinobacteriacollected in Algerian Saharan soils, an isolateof actinobacteria,designated Sg163, was selected for its interesting antimicrobial activity.The isolate wasidentified to the genus Streptosporangium by phenotypicand molecular criteria. The cultural and physiological characterizations as well as phylogenetic analysis indicated that the isolate was different from known members of the genus Streptosporangium. Analysis of the 16S rRNA sequence showed 97.09to 98.27% similarity with those of Streptosporangium type strains. The strainSg163 produced antifungal and antibacterial activities on several culture media. The highest antimicrobial activities were obtained in ISP2 medium. Three active products C1, C2 and C3 with both antifungal and antibacterial activities were isolated and purified by chromatographic methodswith C2, as the major compound. The data of the infraredspectroscopy andthe chemical revelations, suggested that the active molecules were glycosylatedaromaticcompounds.

Download Full-text

Methylobacterium hispanicum sp. nov. and Methylobacterium aquaticum sp. nov., isolated from drinking water

INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY ◽

10.1099/ijs.0.63319-0 ◽

2005 ◽

Vol 55 (1) ◽

pp. 281-287 ◽

Cited By ~ 66

Author(s):

Virginia Gallego ◽

María Teresa García ◽

Antonio Ventosa

Keyword(s):

Drinking Water ◽

16S Rrna ◽

Type Strain ◽

Rrna Gene ◽

Phylogenetic Groups ◽

Rrna Sequence ◽

16S Rrna Sequence ◽

Freshwater Environment ◽

Type Strains ◽

Sequence Similarities

Members of the genus Methylobacterium are ubiquitous in nature and can be isolated from almost any freshwater environment where dissolved oxygen exists. This genus is composed of a variety of pink-pigmented, facultatively methylotrophic (PPFM) bacteria. During a screening programme to monitor the bacterial population present in the drinking water of a municipal water supply in Seville (Spain) during the year 2003, five strains of PPFM bacteria were isolated and characterized. Analysis of their complete 16S rRNA gene sequences revealed that they constituted two separate phylogenetic groups (strains GP34T and GR18, and strains GR16T, GP22 and GP32, respectively) showing highest similarity to members of the genus Methylobacterium. The highest 16S rRNA sequence similarities of strain GP34T were found with respect to the type strains of Methylobacterium radiotolerans (96·6 %) and Methylobacterium fujisawaense (96·4 %) and the highest 16S rRNA sequence similarities of strain GR16T were to the type strains of Methylobacterium extorquens (96·0 %) and Methylobacterium rhodesianum (95·8 %). The G+C content of their DNA ranged from 66·5 to 67·8 mol%. DNA–DNA hybridization studies confirmed that they constituted two separate genospecies. On the basis of this phenotypic, phylogenetic and genotypic study, two novel species of the genus Methylobacterium are proposed: Methylobacterium hispanicum sp. nov., with type strain GP34T (CECT 5997T=CCM 7219T=DSM 16372T=CIP 108332T), and Methylobacterium aquaticum sp. nov., with type strain GR16T (CECT 5998T=CCM 7218T=DSM 16371T=CIP 108333T).

Download Full-text

An Unbiased Predictive Model to Detect DNA Methylation Propensity of CpG Islands in the Human Genome

Current Bioinformatics ◽

10.2174/1574893615999200724145835 ◽

2020 ◽

Vol 15 ◽

Author(s):

Dicle Yalcin ◽

Hasan H. Otu

Keyword(s):

Model Building ◽

De Novo ◽

Cpg Islands ◽

Treatment Strategies ◽

Area Under The Curve ◽

Global Methylation ◽

Sequence Features ◽

A Genome ◽

Combined Features ◽

Epigenetic Repression

Background: Epigenetic repression mechanisms play an important role in gene regulation, specifically in cancer development. In many cases, a CpG island’s (CGI) susceptibility or resistance to methylation are shown to be contributed by local DNA sequence features. Objective: To develop unbiased machine learning models–individually and combined for different biological features–that predict the methylation propensity of a CGI. Methods: We developed our model consisting of CGI sequence features on a dataset of 75 sequences (28 prone, 47 resistant) representing a genome-wide methylation structure. We tested our model on two independent datasets that are chromosome (132 sequences) and disease (70 sequences) specific. Results: We provided improvements in prediction accuracy over previous models. Our results indicate that combined features better predict the methylation propensity of a CGI (area under the curve (AUC) ~0.81). Our global methylation classifier performs well on independent datasets reaching an AUC of ~0.82 for the complete model and an AUC of ~0.88 for the model using select sequences that better represent their classes in the training set. We report certain de novo motifs and transcription factor binding site (TFBS) motifs that are consistently better in separating prone and resistant CGIs. Conclusion: Predictive models for the methylation propensity of CGIs lead to a better understanding of disease mechanisms and can be used to classify genes based on their tendency to contain methylation prone CGIs, which may lead to preventative treatment strategies. MATLAB and Python™ scripts used for model building, prediction, and downstream analyses are available at https://github.com/dicleyalcin/methylProp_predictor.

Download Full-text

Pseudomonas eucalypticola sp. nov., a producer of antifungal agents isolated from Eucalyptus dunnii leaves

Scientific Reports ◽

10.1038/s41598-021-82682-7 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Yujing Liu ◽

Zhang Song ◽

Hualong Zeng ◽

Meng Lu ◽

Weiyao Zhu ◽

...

Keyword(s):

Dna Hybridization ◽

Antifungal Agents ◽

Novel Species ◽

Phytopathogenic Fungi ◽

Strictly Aerobic ◽

Rrna Sequence ◽

16S Rrna Sequence ◽

Eucalyptus Dunnii ◽

16S Rrna Sequence Analysis

AbstractPseudomonas are ubiquitously occurring microorganisms and are known for their ability to produce antimicrobials. An endophytic bacterial strain NP-1 T, isolated from Eucalyptus dunnii leaves, exhibits antifungal properties against five tested phytopathogenic fungi. The strain is a Gram-negative rod-shaped bacterium containing a single polar flagellum. It is strictly aerobic, grows at 4–37 °C, 2–5% NaCl, and pH 3–7. The 16S rRNA sequence analysis showed that NP-1 T belongs to the Pseudomonas genus. Phylogenetic analysis based on four concatenated partial genes (16S rDNA, gyrB, rpoB and rpoD) and the phylogenomic tree indicated that NP-1 T belongs to Pseudomonas fluorescens lineage but is distinct from any known Pseudomonas species. The G + C mol % of NP-1 T genome is 63.96, and the differences between NP-1 T and related species are larger than 1. The digital DNA-DNA hybridization and tetranucleotide signatures are 23.8 and 0.97, which clearly separates strain NP-1 T from its closest neighbours, Pseudomonas coleopterorum and Pseudomonas rhizosphaerae. Its phenotypic and chemotaxonomic features confirmed its differentiation from related taxa. The results from this polyphasic approach support the classification of NP-1 T as a novel species of Pseudomonas, and the name of Pseudomonas eucalypticola is thus proposed for this strain, whose type is NP-1 T (= CCTCC M2018494T = JCM 33572 T).

Download Full-text

Genomic Characterization Provides an Insight into the Pathogenicity of the Poplar Canker Bacterium Lonsdalea populi

Genes ◽

10.3390/genes12020246 ◽

2021 ◽

Vol 12 (2) ◽

pp. 246

Author(s):

Xiaomeng Chen ◽

Rui Li ◽

Yonglin Wang ◽

Aining Li

Keyword(s):

Genome Sequence ◽

Extracellular Enzymes ◽

De Novo ◽

Whole Genome Sequence ◽

Hybrid Poplars ◽

A Genome ◽

Conserved Genes ◽

Genomic Characterization ◽

Molecular Bases ◽

Insight Into

An emerging poplar canker caused by the gram-negative bacterium, Lonsdalea populi, has led to high mortality of hybrid poplars Populus × euramericana in China and Europe. The molecular bases of pathogenicity and bark adaptation of L. populi have become a focus of recent research. This study revealed the whole genome sequence and identified putative virulence factors of L. populi. A high-quality L. populi genome sequence was assembled de novo, with a genome size of 3,859,707 bp, containing approximately 3434 genes and 107 RNAs (75 tRNA, 22 rRNA, and 10 ncRNA). The L. populi genome contained 380 virulence-associated genes, mainly encoding for adhesion, extracellular enzymes, secretory systems, and two-component transduction systems. The genome had 110 carbohydrate-active enzyme (CAZy)-coding genes and putative secreted proteins. The antibiotic-resistance database annotation listed that L. populi was resistant to penicillin, fluoroquinolone, and kasugamycin. Analysis of comparative genomics found that L. populi exhibited the highest homology with the L. britannica genome and L. populi encompassed 1905 specific genes, 1769 dispensable genes, and 1381 conserved genes, suggesting high evolutionary diversity and genomic plasticity. Moreover, the pan genome analysis revealed that the N-5-1 genome is an open genome. These findings provide important resources for understanding the molecular basis of the pathogenicity and biology of L. populi and the poplar-bacterium interaction.

Download Full-text

riboCIRC: a comprehensive database of translatable circRNAs

Genome Biology ◽

10.1186/s13059-021-02300-7 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Huihui Li ◽

Mingzhe Xie ◽

Yan Wang ◽

Ludong Yang ◽

Zhi Xie ◽

...

Keyword(s):

De Novo ◽

Species Conservation ◽

Structure And Function ◽

Research Community ◽

Genome Browser ◽

Valuable Resource ◽

Sequence Structure ◽

A Genome ◽

Context Specific ◽

And Function

AbstractriboCIRC is a translatome data-oriented circRNA database specifically designed for hosting, exploring, analyzing, and visualizing translatable circRNAs from multi-species. The database provides a comprehensive repository of computationally predicted ribosome-associated circRNAs; a manually curated collection of experimentally verified translated circRNAs; an evaluation of cross-species conservation of translatable circRNAs; a systematic de novo annotation of putative circRNA-encoded peptides, including sequence, structure, and function; and a genome browser to visualize the context-specific occupant footprints of circRNAs. It represents a valuable resource for the circRNA research community and is publicly available at http://www.ribocirc.com.

Download Full-text

De Novo Genome Assembly of Limpet Bathyacmaea lactea (Gastropoda: Pectinodontidae): The First Reference Genome of a Deep-Sea Gastropod Endemic to Cold Seeps

Genome Biology and Evolution ◽

10.1093/gbe/evaa100 ◽

2020 ◽

Vol 12 (6) ◽

pp. 905-910 ◽

Cited By ~ 2

Author(s):

Ruoyu Liu ◽

Kun Wang ◽

Jun Liu ◽

Wenjie Xu ◽

Yang Zhou ◽

...

Keyword(s):

Deep Sea ◽

Metal Ion ◽

De Novo ◽

Demographic History ◽

Gene Families ◽

Phylogenetic Position ◽

Cold Seeps ◽

Nitrogen And Phosphorus ◽

De Novo Genome Assembly ◽

A Genome

Abstract Cold seeps, characterized by the methane, hydrogen sulfide, and other hydrocarbon chemicals, foster one of the most widespread chemosynthetic ecosystems in deep sea that are densely populated by specialized benthos. However, scarce genomic resources severely limit our knowledge about the origin and adaptation of life in this unique ecosystem. Here, we present a genome of a deep-sea limpet Bathyacmaea lactea, a common species associated with the dominant mussel beds in cold seeps. We yielded 54.6 gigabases (Gb) of Nanopore reads and 77.9-Gb BGI-seq raw reads, respectively. Assembly harvested a 754.3-Mb genome for B. lactea, with 3,720 contigs and a contig N50 of 1.57 Mb, covering 94.3% of metazoan Benchmarking Universal Single-Copy Orthologs. In total, 23,574 protein-coding genes and 463.4 Mb of repetitive elements were identified. We analyzed the phylogenetic position, substitution rate, demographic history, and TE activity of B. lactea. We also identified 80 expanded gene families and 87 rapidly evolving Gene Ontology categories in the B. lactea genome. Many of these genes were associated with heterocyclic compound metabolism, membrane-bounded organelle, metal ion binding, and nitrogen and phosphorus metabolism. The high-quality assembly and in-depth characterization suggest the B. lactea genome will serve as an essential resource for understanding the origin and adaptation of life in the cold seeps.

Download Full-text