scholarly journals Chromatiblock: scalable whole-genome visualization of structural differences in prokaryotes

2019 ◽  
Author(s):  
Mitchell John Sullivan ◽  
Harm van Bakel

AbstractSummaryChromatiblock is a Python application for visualizing the presence, absence and arrangement of syntenic blocks across large numbers of complete bacterial genomes.AvailabilityChromatiblock is freely available under a GPL license, for macOS, GNU/Linux and and Microsoft Windows from https://github.com/mjsull/chromatiblock/[email protected]

BMC Genomics ◽  
2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Andrew P. Wilkey ◽  
Anne V. Brown ◽  
Steven B. Cannon ◽  
Ethalinda K. S. Cannon

Abstract Background Large genotyping datasets have become commonplace due to efficient, cheap methods for SNP identification. Typical genotyping datasets may have thousands to millions of data points per accession, across tens to thousands of accessions. There is a need for tools to help rapidly explore such datasets, to assess characteristics such as overall differences between accessions and regional anomalies across the genome. Results We present GCViT (Genotype Comparison Visualization Tool), for visualizing and exploring large genotyping datasets. GCViT can be used to identify introgressions, conserved or divergent genomic regions, pedigrees, and other features for more detailed exploration. The program can be used online or as a local instance for whole genome visualization of resequencing or SNP array data. The program performs comparisons of variants among user-selected accessions to identify allele differences and similarities between accessions and a user-selected reference, providing visualizations through histogram, heatmap, or haplotype views. The resulting analyses and images can be exported in various formats. Conclusions GCViT provides methods for interactively visualizing SNP data on a whole genome scale, and can produce publication-ready figures. It can be used in online or local installations. GCViT enables users to confirm or identify genomics regions of interest associated with particular traits. GCViT is freely available at https://github.com/LegumeFederation/gcvit. The 1.0 version described here is available at 10.5281/zenodo.4008713.


2018 ◽  
Author(s):  
Christopher Dean ◽  
Noelle Noyes ◽  
Steven Lakin ◽  
Pablo Rovira-Sanz ◽  
Xiang Yang ◽  
...  

AbstractSummaryTychus is a tool that allows researchers to perform massively parallel whole genome sequence (WGS) analysis with the goal of producing a high confidence and comprehensive description of the bacterial genome. Key features of the Tychus pipeline include the assembly, annotation, alignment, variant discovery and phylogenetic inference of large numbers of WGS isolates in parallel using open-source bioinformatics tools and virtualization technology. All prerequisite tools and dependencies come packaged together in a single suite that can be easily downloaded and installed on Linux and Mac operating systems.AvailabilityTychus is freely available as an open-source package under the MIT license, and can be downloaded via GitHub (https://github.com/Abdo-Lab/Tychus)[email protected]


2014 ◽  
Vol 7 (1) ◽  
pp. 484 ◽  
Author(s):  
Basil Xavier ◽  
Julia Sabirova ◽  
Moons Pieter ◽  
Jean-Pierre Hernalsteens ◽  
Henri de Greve ◽  
...  

2020 ◽  
Vol 16 (12) ◽  
pp. e1008439
Author(s):  
Jennifer Lu ◽  
Steven L. Salzberg

GC skew is a phenomenon observed in many bacterial genomes, wherein the two replication strands of the same chromosome contain different proportions of guanine and cytosine nucleotides. Here we demonstrate that this phenomenon, which was first discovered in the mid-1990s, can be used today as an analysis tool for the 15,000+ complete bacterial genomes in NCBI’s Refseq library. In order to analyze all 15,000+ genomes, we introduce a new method, SkewIT (Skew Index Test), that calculates a single metric representing the degree of GC skew for a genome. Using this metric, we demonstrate how GC skew patterns are conserved within certain bacterial phyla, e.g. Firmicutes, but show different patterns in other phylogenetic groups such as Actinobacteria. We also discovered that outlier values of SkewIT highlight potential bacterial mis-assemblies. Using our newly defined metric, we identify multiple mis-assembled chromosomal sequences in previously published complete bacterial genomes. We provide a SkewIT web app https://jenniferlu717.shinyapps.io/SkewIT/ that calculates SkewI for any user-provided bacterial sequence. The web app also provides an interactive interface for the data generated in this paper, allowing users to further investigate the SkewI values and thresholds of the Refseq-97 complete bacterial genomes. Individual scripts for analysis of bacterial genomes are provided in the following repository: https://github.com/jenniferlu717/SkewIT.


2017 ◽  
Author(s):  
Mickael Silva ◽  
Miguel Machado ◽  
Diogo N. Silva ◽  
Mirko Rossi ◽  
Jacob Moran-Gilad ◽  
...  

ABSTRACTGene-by-gene approaches are becoming increasingly popular in bacterial genomic epidemiology and outbreak detection. However, there is a lack of open-source scalable software for schema definition and allele calling for these methodologies. The chewBBACA suite was designed to assist users in the creation and evaluation of novel whole-genome or core-genome gene-by-gene typing schemas and subsequent allele calling in bacterial strains of interest. The software can run in a laptop or in high performance clusters making it useful for both small laboratories and large reference centers. ChewBBACA is available athttps://github.com/B-UMMI/chewBBACAor as a docker image athttps://hub.docker.com/r/ummidock/chewbbaca/.DATA SUMMARYAssembled genomes used for the tutorial were downloaded from NCBI in August 2016 by selecting those submitted asStreptococcus agalactiaetaxon or sub-taxa. All the assemblies have been deposited as a zip file in FigShare (https://figshare.com/s/9cbe1d422805db54cd52), where a file with the original ftp link for each NCBI directory is also available.Code for the chewBBACA suite is available athttps://github.com/B-UMMI/chewBBACAwhile the tutorial example is found athttps://github.com/B-UMMI/chewBBACA_tutorial.I/We confirm all supporting data, code and protocols have been provided within the article or through supplementary data files. ⊠IMPACT STATEMENTThe chewBBACA software offers a computational solution for the creation, evaluation and use of whole genome (wg) and core genome (cg) multilocus sequence typing (MLST) schemas. It allows researchers to develop wg/cgMLST schemes for any bacterial species from a set of genomes of interest. The alleles identified by chewBBACA correspond to potential coding sequences, possibly offering insights into the correspondence between the genetic variability identified and phenotypic variability. The software performs allele calling in a matter of seconds to minutes per strain in a laptop but is easily scalable for the analysis of large datasets of hundreds of thousands of strains using multiprocessing options. The chewBBACA software thus provides an efficient and freely available open source solution for gene-by-gene methods. Moreover, the ability to perform these tasks locally is desirable when the submission of raw data to a central repository or web services is hindered by data protection policies or ethical or legal concerns.


Author(s):  
Hisami Kobayashi ◽  
Yasuhiro Tanizawa ◽  
Mitsuo Sakamoto ◽  
Moriya Ohkuma ◽  
Masanori Tohno

The taxonomic status of the species Clostridium methoxybenzovorans was assessed. The 16S rRNA gene sequence, whole-genome sequence and phenotypic characterizations suggested that the type strain deposited in the American Type Culture Collection ( C. methoxybenzovorans ATCC 700855T) is a member of the species Eubacterium callanderi . Hence, C. methoxybenzovorans ATCC 700855T cannot be used as a reference for taxonomic study. The type strain deposited in the German Collection of Microorganism and Cell Cultures GmbH (DSM 12182T) is no longer listed in its online catalogue. Also, both the 16S rRNA gene and the whole-genome sequences of the original strain SR3T showed high sequence identity with those of Lacrimispora indolis (recently reclassified from Clostridium indolis ) as the most closely related species. Analysis of the two genomes showed average nucleotide identity based on blast and digital DNA–DNA hybridization values of 98.3 and 87.9 %, respectively. Based on these results, C. methoxybenzovorans SR3T was considered to be a member of L. indolis .


2015 ◽  
Vol 53 (4) ◽  
pp. 1054-1055 ◽  
Author(s):  
Elizabeth Robilotti ◽  
Mini Kamboj

Microbial whole-genome sequencing (WGS) is poised to transform many of the currently used approaches in medical microbiology. Recent reports on the application of WGS to understand genetic evolution and reconstruct transmission pathways have provided valuable information that will influence infection control practices. While this technology holds great promise, obstacles to full implementation remain. Two articles in this issue of the Journal of Clinical Microbiology (S. Octavia, Q. Wang, M. M. Tanaka, S. Kaur, V. Sintchenko, and R. Lan, J Clin Microbiol 53:1063–1071, 2015, doi:10.1128/JCM.03235-14, andS. J. Salipante, D. J. SenGupta, L. A. Cummings, T. A. Land, D. R. Hoogestraat, and B. T. Cookson, J Clin Microbiol 53:1072–1079, 2015, doi:10.1128/JCM.03385-14) describe the breadth of application of WGS to the field of clinical epidemiology.


2013 ◽  
Vol 63 (Pt_10) ◽  
pp. 3920-3926 ◽  
Author(s):  
Julia S. Bennett ◽  
Keith A. Jolley ◽  
Martin C. J. Maiden

Phylogenies generated from whole genome sequence (WGS) data provide definitive means of bacterial isolate characterization for typing and taxonomy. The species status of strains recently defined with conventional taxonomic approaches as representing Neisseria oralis was examined by the analysis of sequences derived from WGS data, specifically: (i) 53 Neisseria ribosomal protein subunit (rps) genes (ribosomal multi-locus sequence typing, rMLST); and (ii) 246 Neisseria core genes (core genome MLST, cgMLST). These data were compared with phylogenies derived from 16S and 23S rRNA gene sequences, demonstrating that the N. oralis strains were monophyletic with strains described previously as representing ‘ Neisseria mucosa var. heidelbergensis’ and that this group was of equivalent taxonomic status to other well-described species of the genus Neisseria . Phylogenetic analyses also indicated that Neisseria sicca and Neisseria macacae should be considered the same species as Neisseria mucosa and that Neisseria flavescens should be considered the same species as Neisseria subflava . Analyses using rMLST showed that some strains currently defined as belonging to the genus Neisseria were more closely related to species belonging to other genera within the family; however, whole genome analysis of a more comprehensive selection of strains from within the family Neisseriaceae would be necessary to confirm this. We suggest that strains previously identified as representing ‘ N. mucosa var. heidelbergensis’ and deposited in culture collections should be renamed N. oralis . Finally, one of the strains of N. oralis was able to ferment lactose, due to the presence of β-galactosidase and lactose permease genes, a characteristic previously thought to be unique to Neisseria lactamica , which therefore cannot be thought of as diagnostic for this species; however, the rMLST and cgMLST analyses confirm that N. oralis is most closely related to N. mucosa .


Sign in / Sign up

Export Citation Format

Share Document