scholarly journals Efficient inference of recent and ancestral recombination within bacterial populations

2016 ◽  
Author(s):  
Rafal Mostowy ◽  
Nicholas J. Croucher ◽  
Cheryl P. Andam ◽  
Jukka Corander ◽  
William P. Hanage ◽  
...  

AbstractProkaryotic evolution is affected by horizontal transfer of genetic material through recombination. Inference of an evolutionary tree of bacteria thus relies on accurate identification of the population genetic structure and recombination-derived mosaicism. Rapidly growing databases represent a challenge for computational methods to detect recombinations in bacterial genomes. We introduce a novel algorithm called fastGEAR which identifies lineages in diverse microbial alignments, and recombinations between them and from external origins. The algorithm detects both recent recombinations (affecting a few isolates) and ancestral recombinations between detected lineages (affecting entire lineages), thus providing insight into recombinations affecting deep branches of the phylogenetic tree. In sim-ulations, fastGEAR had comparable power to detect recent recombinations and outstanding power to detect the ancestral ones, compared to state-of-the-art methods, often with a fraction of computational cost. We demonstrate the utility of the method by analysing a collection of 616 whole-genomes of a recombinogenic pathogen Streptococcus pneumoniae, for which the method provided a high-resolution view of recombination across the genome. We examined in detail the penicillin-binding genes across the Streptococcus genus, demonstrating previously undetected genetic exchanges between different species at these three loci. Hence, fastGEAR can be readily applied to investigate mosaicism in bacterial genes across multiple species. Finally, fastGEAR correctly identified many known recombination hotspots and pointed to potential new ones. Matlab code and Linux/Windows executables are available at https://users.ics.aalto.fi/~pemartti/fastGEAR/

2020 ◽  
Author(s):  
Yu Wan ◽  
Ryan R. Wick ◽  
Justin Zobel ◽  
Danielle J. Ingle ◽  
Michael Inouye ◽  
...  

AbstractAntimicrobial resistance (AMR) in bacteria has been a global threat to public health for decades. A well-known driving force for the emergence, evolution and dissemination of genetic AMR determinants in bacterial populations is horizontal gene transfer, which is frequently mediated by mobile genetic elements (MGEs). Some MGEs can capture, maintain, and rearrange multiple AMR genes in a donor bacterium before moving them into recipients, giving rise to a phenomenon called horizontal gene co-transfer (HGcoT). This physical linkage or co-localisation between mobile AMR genes is of particular concern because it facilitates rapid dissemination of multidrug resistance within and across bacterial populations, providing opportunities for co-selection of AMR genes and limiting our therapeutic options. The study of HGcoT can be benefited from large-scale whole-genome sequencing (WGS) data, however, by far most published studies of HGcoT only consider simple co-occurrence measures, which can be confounded by strong bacterial population structure due to clonal reproduction, leading to spurious associations. To address this issue, we present GeneMates, an R package implementing a network approach to identification of HGcoT using WGS data. The package enables users to test for associations between presence-absence of bacterial genes using univariate linear mixed models controlling for population structure based on core-genome variation. Furthermore, when physical distances between genes of interest are measurable in bacterial genomes, users can evaluate distance consistency to further support their inference of putative horizontally co-transferred genes, whose co-occurrence cannot be completely explained by the population structure. We demonstrate how this package can be used to identify co-transferred AMR genes and recover known MGEs from Escherichia coli and Salmonella Typhimurium WGS data. GeneMates is accessible at github.com/wanyuac/GeneMates.


2021 ◽  
Author(s):  
Guilherme Marcelino Viana de Siqueira ◽  
Felipe Marcelo Pereira-dos-Santos ◽  
Rafael Silva-Rocha ◽  
Maria-Eugenia Guazzaroni

Fast and accurate identification of pathogens is an essential task in healthcare settings. Next generation sequencing platforms such as Illumina have greatly expanded the capacity with which different organisms can be detected in hospital samples, and third-generation nanopore-driven sequencing devices such as Oxford Nanopore's minION have recently emerged as ideal sequencing platforms for routine healthcare surveillance due to their long-read capacity and high portability. Despite its great potential, protocols and analysis pipelines for nanopore sequencing are still being extensively validated. In this work, we assess the ability of nanopore sequencing to provide reliable community profiles based on 16S rRNA sequencing in comparison to traditional Illumina platforms using samples collected from Intensive Care Units from a hospital in Brazil. While our results point that lower throughputs may be a shortcoming of the method in more complex samples, we show that the use of single-use Flongle flowcells in nanopore sequencing runs can provide insightful information on the community composition in healthcare settings.


F1000Research ◽  
2017 ◽  
Vol 6 ◽  
pp. 70 ◽  
Author(s):  
Espen Mikal Robertsen ◽  
Hubert Denise ◽  
Alex Mitchell ◽  
Robert D. Finn ◽  
Lars Ailo Bongo ◽  
...  

Metagenomics, the study of genetic material recovered directly from environmental samples, has the potential to provide insight into the structure and function of heterogeneous microbial communities.  There has been an increased use of metagenomics to discover and understand the diverse biosynthetic capacities of marine microbes, thereby allowing them to be exploited for industrial, food, and health care products. This ELIXIR pilot action was motivated by the need to establish dedicated data resources and harmonized metagenomics pipelines for the marine domain, in order to enhance the exploration and exploitation of marine genetic resources. In this paper, we summarize some of the results from the ELIXIR pilot action “Marine metagenomics – towards user centric services”.


Author(s):  
Alexey Zabelkin ◽  
Yulia Yakovleva ◽  
Olga Bochkareva ◽  
Nikita Alexeev

Abstract Motivation High plasticity of bacterial genomes is provided by numerous mechanisms including horizontal gene transfer and recombination via numerous flanking repeats. Genome rearrangements such as inversions, deletions, insertions, and duplications may independently occur in different strains, providing parallel adaptation or phenotypic diversity. Specifically, such rearrangements might be responsible for virulence, antibiotic resistance, and antigenic variation. However, identification of such events requires laborious manual inspection and verification of phyletic pattern consistency. Results Here we define the term “parallel rearrangements” as events that occur independently in phylogenetically distant bacterial strains and present a formalization of the problem of parallel rearrangements calling. We implement an algorithmic solution for the identification of parallel rearrangements in bacterial populations as a tool PaReBrick. The tool takes a collection of strains represented as a sequence of oriented synteny blocks and a phylogenetic tree as input data. It identifies rearrangements, tests them for consistency with a tree, and sorts the events by their parallelism score. The tool provides diagrams of the neighbors for each block of interest, allowing the detection of horizontally transferred blocks or their extra copies and the inversions in which copied blocks are involved.We demonstrated PaReBrick’s efficiency and accuracy and showed its potential to detect genome rearrangements responsible for pathogenicity and adaptation in bacterial genomes. Availability PaReBrick is written in Python and is available on GitHub https://github.com/ctlab/parallelrearrangements Supplementary information Supplementary data are available at Bioinformatics online.


Author(s):  
K H Groves ◽  
P Bonello ◽  
P M Hai

Essential to effective aeroengine design is the rapid simulation of the dynamic performance of a variety of engine and non-linear squeeze-film damper (SFD) bearing configurations. Using recently introduced non-linear solvers combined with non-parametric identification of high-accuracy bearing models it is possible to run full-engine rotordynamic simulations, in both the time and frequency domains, at a fraction of the previous computational cost. Using a novel reduced form of Chebyshev polynomial fits, efficient and accurate identification of the numerical solution to the two-dimensional Reynolds equation (RE) is achieved. The engine analysed is a twin-spool five-SFD engine model provided by a leading manufacturer. Whole-engine simulations obtained using Chebyshev-identified bearing models of the finite difference (FD) solution to the RE are compared with those obtained from the original FD bearing models. For both time and frequency domain analysis, the Chebyshev-identified bearing models are shown to mimic accurately and consistently the simulations obtained from the FD models in under 10 per cent of the computational time. An illustrative parameter study is performed to demonstrate the unparalleled capabilities of the combination of recently developed and novel techniques utilised in this paper.


2001 ◽  
Vol 41 (3) ◽  
pp. 299 ◽  
Author(s):  
J. E. Thies ◽  
E. M. Holmes ◽  
A. Vachot

The symbiosis between legumes and their specific root-nodule bacteria, rhizobia, has been employed to improve agricultural productivity for most of the 20th century. During this time, great advances have been made in our knowledge of both plant and bacterial genomes, the biochemistry of the symbiosis, plant and bacterial signaling and the measurement of nitrogen fixation. However, knowledge of the ecology of the bacterial symbiont has lagged behind, largely due to a lack of practical techniques that can be used to monitor and assess the performance of these bacteria in the field. Most techniques developed in the last few decades have relied on somehow ‘marking’ individual strains to allow us to follow their fate in the field environment. Such techniques, while providing knowledge of the success or failure of specific strains in a range of environments, have not allowed insight into the nature of the pre-existing rhizobial populations in these sites, nor the interaction between marked strains and the background population. The advent of molecular techniques has revolutionised the study of Rhizobium ecology by allowing us to follow the flux of a variety of ecotypes within a particular site and to examine how introduced rhizobia interact with a genetically diverse background. In addition, molecular techniques have increased our understanding of how individual strains and populations of root-nodule bacteria respond to changes in the environment and how genetic diversity evolves in field sites over time. This review focuses on recently developed molecular techniques that hold promise for continuing to develop our understanding of Rhizobium ecology and how these can be used to address a range of applied problems to yield new insights into rhizobial life in soil and as legume symbionts.


Geophysics ◽  
2016 ◽  
Vol 81 (3) ◽  
pp. Q27-Q40 ◽  
Author(s):  
Katrin Löer ◽  
Andrew Curtis ◽  
Giovanni Angelo Meles

We have evaluated an explicit relationship between the representations of internal multiples by source-receiver interferometry and an inverse-scattering series. This provides a new insight into the interaction of different terms in each of these internal multiple prediction equations and explains why amplitudes of estimated multiples are typically incorrect. A downside of the existing representations is that their computational cost is extremely high, which can be a precluding factor especially in 3D applications. Using our insight from source-receiver interferometry, we have developed an alternative, computationally more efficient way to predict internal multiples. The new formula is based on crosscorrelation and convolution: two operations that are computationally cheap and routinely used in interferometric methods. We have compared the results of the standard and the alternative formulas qualitatively in terms of the constructed wavefields and quantitatively in terms of the computational cost using examples from a synthetic data set.


2020 ◽  
Author(s):  
Yu Wang ◽  
ZAHEER ULLAH KHAN ◽  
Shaukat Ali ◽  
Maqsood Hayat

Abstract BackgroundBacteriophage or phage is a type of virus that replicates itself inside bacteria. It consist of genetic material surrounded by a protein structure. Bacteriophage plays a vital role in the domain of phage therapy and genetic engineering. Phage and hydrolases enzyme proteins have a significant impact on the cure of pathogenic bacterial infections and disease treatment. Accurate identification of bacteriophage proteins is important in the host subcellular localization for further understanding of the interaction between phage, hydrolases, and in designing antibacterial drugs. Looking at the significance of Bacteriophage proteins, besides wet laboratory-based methods several computational models have been developed so far. However, the performance was not considerable due to inefficient feature schemes, redundancy, noise, and lack of an intelligent learning engine. Therefore we have developed an anovative bi-layered model name DeepEnzyPred. A Hybrid feature vector was obtained via a novel Multi-Level Multi-Threshold subset feature selection (MLMT-SFS) algorithm. A two-dimensional convolutional neural network was adopted as a baseline classifier.ResultsA conductive hybrid feature was obtained via a serial combination of CTD and KSAACGP features. The optimum feature was selected via a Novel Multi-Level Multi-Threshold Subset Feature selection algorithm. Over 5-fold jackknife cross-validation, an accuracy of 91.6 %, Sensitivity of 63.39%, Specificity 95.72%, MCC of 0.6049, and ROC value of 0.8772 over Layer-1 were recorded respectively. Similarly, the underline model obtained an Accuracy of 96.05%, Sensitivity of 96.22%, Specificity of 95.91%, MCC of 0.9219, and ROC value of 0.9899 over layer-2 respectivily.ConclusionThis paper presents a robust and effective classification model was developed for bacteriophage and their types. Primitive features were extracted via CTD and KSAACGP. A novel method (MLMT-SFS ) was devised for yielding optimum hybrid feature space out of primitive features. The result drew over hybrid feature space and 2D-CNN shown an excellent classification. Based on the recorded results, we believe that the developed predictor will be a valuable resource for large scale discrimination of unknown Phage and hydrolase enzymes in particular and new antibacterial drug design in pharmaceutical companies in general.


2016 ◽  
Vol 2016 ◽  
pp. 1-4 ◽  
Author(s):  
Na Han ◽  
Weiwen Yu ◽  
Yujun Qiang ◽  
Wen Zhang

Type IV secretion system (T4SS) can mediate the passage of macromolecules across cellular membranes and is essential for virulent and genetic material exchange among bacterial species. The Type IV Secretion Project 2.0 (T4SP 2.0) database is an improved and extended version of the platform released in 2013 aimed at assisting with the detection of Type IV secretion systems (T4SS) in bacterial genomes. This advanced version provides users with web server tools for detecting the existence and variations of T4SS genes online. The new interface for the genome browser provides a user-friendly access to the most complete and accurate resource of T4SS gene information (e.g., gene number, name, type, position, sequence, related articles, and quick links to other webs). Currently, this online database includes T4SS information of 5239 bacterial strains.Conclusions. T4SS is one of the most versatile secretion systems necessary for the virulence and survival of bacteria and the secretion of protein and/or DNA substrates from a donor to a recipient cell. This database on virB/D genes of the T4SS system will help scientists worldwide to improve their knowledge on secretion systems and also identify potential pathogenic mechanisms of various microbial species.


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Sara J. Weaver ◽  
Davi R. Ortega ◽  
Matthew H. Sazinsky ◽  
Triana N. Dalia ◽  
Ankur B. Dalia ◽  
...  

Abstract Natural transformation is the process by which bacteria take up genetic material from their environment and integrate it into their genome by homologous recombination. It represents one mode of horizontal gene transfer and contributes to the spread of traits like antibiotic resistance. In Vibrio cholerae, a type IVa pilus (T4aP) is thought to facilitate natural transformation by extending from the cell surface, binding to exogenous DNA, and retracting to thread this DNA through the outer membrane secretin, PilQ. Here, we use a functional tagged allele of VcPilQ purified from native V. cholerae cells to determine the cryoEM structure of the VcPilQ secretin in amphipol to ~2.7 Å. We use bioinformatics to examine the domain architecture and gene neighborhood of T4aP secretins in Proteobacteria in comparison with VcPilQ. This structure highlights differences in the architecture of the T4aP secretin from the type II and type III secretion system secretins. Based on our cryoEM structure, we design a series of mutants to reversibly regulate VcPilQ gate dynamics. These experiments support the idea of VcPilQ as a potential druggable target and provide insight into the channel that DNA likely traverses to promote the spread of antibiotic resistance via horizontal gene transfer by natural transformation.


Sign in / Sign up

Export Citation Format

Share Document