scholarly journals Managing Uncertainty in Metabolic Network Structure and Improving Predictions Using EnsembleFBA

2016 ◽  
Author(s):  
Matthew B. Biggs ◽  
Jason A. Papin

AbstractGenome-scale metabolic network reconstructions (GENREs) are repositories of knowledge about the metabolic processes that occur in an organism. GENREs have been used to discover and interpret metabolic functions, and to engineer novel network structures. A major barrier preventing more widespread use of GENREs, particularly to study non-model organisms, is the extensive time required to produce a high-quality GENRE. Many automated approaches have been developed which reduce this time requirement, but automatically-reconstructed draft GENREs still require curation before useful predictions can be made. We present a novel ensemble approach to the analysis of GENREs which improves the predictive capabilities of draft GENREs and is compatible with many automated reconstruction approaches. We refer to this new approach as Ensemble Flux Balance Analysis (EnsembleFBA). We validate EnsembleFBA by predicting growth and gene essentiality in the model organism Pseudomonas aeruginosa UCBPP-PA14. We demonstrate how EnsembleFBA can be included in a systems biology workflow by predicting essential genes in six Streptococcus species and mapping the essential genes to small molecule ligands from DrugBank. We found that some metabolic subsystems contribute disproportionately to the set of predicted essential reactions in a way that is unique to each Streptococcus species. These species-specific network structures lead to species-specific outcomes from small molecule interactions. Through these analyses of P. aeruginosa and six Streptococci, we show that ensembles increase the quality of predictions without drastically increasing reconstruction time, thus making GENRE approaches more practical for applications which require predictions for many non-model organisms. All of our functions and accompanying example code are available in an open online repository.Author SummaryMetabolism is the driving force behind all biological activity. Genome-scale metabolic network reconstructions (GENREs) are representations of metabolic systems that can be analyzed mathematically to make predictions about how a biochemical system will behave as well as to design biochemical systems with new properties. GENREs have traditionally been reconstructed manually, which can require extensive time and effort. Recent software solutions automate the process (drastically reducing the required effort) but the resulting GENREs are of lower quality and produce less reliable predictions than the manually-curated versions. We present a novel method (“EnsembleFBA”) which overcomes uncertainties involved in automated reconstruction by pooling many different draft GENREs together into an ensemble. We tested EnsembleFBA by predicting the growth and essential genes of the common pathogen Pseudomonas aeruginosa. We found that when predicting growth or essential genes, ensembles of GENREs achieved much better precision or captured many more essential genes than any of the individual GENREs within the ensemble. By improving the predictions that can be made with automatically-generated GENREs, we open the door to studying systems which would otherwise be infeasible.

2021 ◽  
Vol 118 (30) ◽  
pp. e2102344118
Author(s):  
Hao Wang ◽  
Jonathan L. Robinson ◽  
Pinar Kocabas ◽  
Johan Gustafsson ◽  
Mihail Anton ◽  
...  

Genome-scale metabolic models (GEMs) are used extensively for analysis of mechanisms underlying human diseases and metabolic malfunctions. However, the lack of comprehensive and high-quality GEMs for model organisms restricts translational utilization of omics data accumulating from the use of various disease models. Here we present a unified platform of GEMs that covers five major model animals, including Mouse1 (Mus musculus), Rat1 (Rattus norvegicus), Zebrafish1 (Danio rerio), Fruitfly1 (Drosophila melanogaster), and Worm1 (Caenorhabditis elegans). These GEMs represent the most comprehensive coverage of the metabolic network by considering both orthology-based pathways and species-specific reactions. All GEMs can be interactively queried via the accompanying web portal Metabolic Atlas. Specifically, through integrative analysis of Mouse1 with RNA-sequencing data from brain tissues of transgenic mice we identified a coordinated up-regulation of lysosomal GM2 ganglioside and peptide degradation pathways which appears to be a signature metabolic alteration in Alzheimer’s disease (AD) mouse models with a phenotype of amyloid precursor protein overexpression. This metabolic shift was further validated with proteomics data from transgenic mice and cerebrospinal fluid samples from human patients. The elevated lysosomal enzymes thus hold potential to be used as a biomarker for early diagnosis of AD. Taken together, we foresee that this evolving open-source platform will serve as an important resource to facilitate the development of systems medicines and translational biomedical applications.


2018 ◽  
Author(s):  
Anna S. Blazier ◽  
Jason A. Papin

AbstractThe identification of genes essential for bacterial growth and survival represents a promising strategy for the discovery of antimicrobial targets. Essential genes can be identified on a genome-scale using transposon mutagenesis approaches; however, variability between screens and challenges with interpretation of essentiality data hinder the identification of both condition-independent and condition-dependent essential genes. To illustrate the scope of these challenges, we perform a large-scale comparison of multiple published Pseudomonas aeruginosa gene essentiality datasets, revealing substantial differences between the screens. We then contextualize essentiality using genome-scale metabolic network reconstructions and demonstrate the utility of this approach in providing functional explanations for essentiality and reconciling differences between screens. Genome-scale metabolic network reconstructions also enable a high-throughput, quantitative analysis to assess the impact of media conditions on the identification of condition-independent essential genes. Our computational model-driven analysis provides mechanistic insight into essentiality and contributes novel insights for design of future gene essentiality screens and the identification of core metabolic processes.Author SummaryWith the rise of antibiotic resistance, there is a growing need to discover new therapeutic targets to treat bacterial infections. One attractive strategy is to target genes that are essential for growth and survival. Essential genes can be identified with transposon mutagenesis approaches; however, variability between screens and challenges with interpretation of essentiality data hinder the identification and analysis of essential genes. We performed a large-scale comparison of multiple gene essentiality screens of the microbial pathogen Pseudomonas aeruginosa. We implemented a computational model-driven approach to provide functional explanations for essentiality and reconcile differences between screens. The integration of computational modeling with high-throughput experimental screens may enable the identification of drug targets with high-confidence and provide greater understanding for the development of novel therapeutic strategies.


2008 ◽  
Vol 190 (8) ◽  
pp. 2790-2803 ◽  
Author(s):  
Matthew A. Oberhardt ◽  
Jacek Puchałka ◽  
Kimberly E. Fryer ◽  
Vítor A. P. Martins dos Santos ◽  
Jason A. Papin

ABSTRACT Pseudomonas aeruginosa is a major life-threatening opportunistic pathogen that commonly infects immunocompromised patients. This bacterium owes its success as a pathogen largely to its metabolic versatility and flexibility. A thorough understanding of P. aeruginosa's metabolism is thus pivotal for the design of effective intervention strategies. Here we aim to provide, through systems analysis, a basis for the characterization of the genome-scale properties of this pathogen's versatile metabolic network. To this end, we reconstructed a genome-scale metabolic network of Pseudomonas aeruginosa PAO1. This reconstruction accounts for 1,056 genes (19% of the genome), 1,030 proteins, and 883 reactions. Flux balance analysis was used to identify key features of P. aeruginosa metabolism, such as growth yield, under defined conditions and with defined knowledge gaps within the network. BIOLOG substrate oxidation data were used in model expansion, and a genome-scale transposon knockout set was compared against in silico knockout predictions to validate the model. Ultimately, this genome-scale model provides a basic modeling framework with which to explore the metabolism of P. aeruginosa in the context of its environmental and genetic constraints, thereby contributing to a more thorough understanding of the genotype-phenotype relationships in this resourceful and dangerous pathogen.


2021 ◽  
Vol 7 (1) ◽  
Author(s):  
Dawson D. Payne ◽  
Alina Renz ◽  
Laura J. Dunphy ◽  
Taylor Lewis ◽  
Andreas Dräger ◽  
...  

AbstractMucins are present in mucosal membranes throughout the body and play a key role in the microbe clearance and infection prevention. Understanding the metabolic responses of pathogens to mucins will further enable the development of protective approaches against infections. We update the genome-scale metabolic network reconstruction (GENRE) of one such pathogen, Pseudomonas aeruginosa PA14, through metabolic coverage expansion, format update, extensive annotation addition, and literature-based curation to produce iPau21. We then validate iPau21 through MEMOTE, growth rate, carbon source utilization, and gene essentiality testing to demonstrate its improved quality and predictive capabilities. We then integrate the GENRE with transcriptomic data in order to generate context-specific models of P. aeruginosa metabolism. The contextualized models recapitulated known phenotypes of unaltered growth and a differential utilization of fumarate metabolism, while also revealing an increased utilization of propionate metabolism upon MUC5B exposure. This work serves to validate iPau21 and demonstrate its utility for providing biological insights.


2021 ◽  
Author(s):  
Charles Hawkins ◽  
Daniel Ginzburg ◽  
Kangmei Zhao ◽  
William Dwyer ◽  
Bo Xue ◽  
...  

AbstractPlant metabolism is a pillar of our ecosystem, food security, and economy. To understand and engineer plant metabolism, we first need a comprehensive and accurate annotation of all metabolic information across plant species. As a step towards this goal, we previously created the Plant Metabolic Network (PMN), an online resource of curated and computationally predicted information about the enzymes, compounds, reactions, and pathways that make up plant metabolism. Here we report PMN 15, which contains genome-scale metabolic pathway databases of 126 algal and plant genomes, ranging from model organisms to crops to medicinal plants, and new tools for analyzing and viewing metabolism information across species and integrating omics data in a metabolic context. We systematically evaluated the quality of the databases, which revealed that our semi-automated validation pipeline dramatically improves the quality. We then compared the metabolic content across the 126 organisms using multiple correspondence analysis and found that Brassicaceae, Poaceae, and Chlorophyta appeared as metabolically distinct groups. To demonstrate the utility of this resource, we used recently published sorghum transcriptomics data to discover previously unreported trends of metabolism underlying drought tolerance. We also used single-cell transcriptomics data from the Arabidopsis root to infer cell-type specific metabolic pathways. This work shows the continued growth and refinement of the PMN resource and demonstrates its wide-ranging utility in integrating metabolism with other areas of plant biology.One-sentence SummaryThe Plant Metabolic Network is a collection of databases containing experimentally-supported and predicted information about plant metabolism spanning many species.


2019 ◽  
Author(s):  
Maureen A. Carey ◽  
Gregory L. Medlock ◽  
Michał Stolarczyk ◽  
William A. Petri ◽  
Jennifer L. Guler ◽  
...  

AbstractProtozoan parasites cause diverse diseases with large global impacts. Research on the pathogenesis and biology of these organisms is limited by economic and experimental constraints. Accordingly, studies of one parasite are frequently extrapolated to infer knowledge about another parasite, across and within genera. Model in vitro or in vivo systems are frequently used to enhance experimental manipulability, but these systems generally use species related to, yet distinct from, the clinically relevant causal pathogen. Characterization of functional differences among parasite species is confined to post hoc or single target studies, limiting the utility of this extrapolation approach. To address this challenge and to accelerate parasitology research broadly, we present a functional comparative analysis of 192 genomes, representing every high-quality, publicly-available protozoan parasite genome including Plasmodium, Toxoplasma, Cryptosporidium, Entamoeba, Trypanosoma, Leishmania, Giardia, and other species. We generated an automated metabolic network reconstruction pipeline optimized for eukaryotic organisms. These metabolic network reconstructions serve as biochemical knowledgebases for each parasite, enabling qualitative and quantitative comparisons of metabolic behavior across parasites. We identified putative differences in gene essentiality and pathway utilization to facilitate the comparison of experimental findings. This knowledgebase represents the largest collection of genome-scale metabolic models for both pathogens and eukaryotes; with this resource, we can predict species-specific functions, contextualize experimental results, and optimize selection of experimental systems for fastidious species.


2021 ◽  
Author(s):  
Dawson D. Payne ◽  
Alina Renz ◽  
Laura J. Dunphy ◽  
Taylor Lewis ◽  
Andreas Dräger ◽  
...  

AbstractMucins are present in mucosal membranes throughout the body and play a key role in the microbe clearance and infection prevention. Understanding the metabolic responses of pathogens to mucins will further enable the development of protective approaches against infections. We update the genome-scale metabolic network reconstruction (GENRE) of one such pathogen, Pseudomonas aeruginosa PA14, through metabolic coverage expansion, format update, extensive annotation addition, and literature-based curation to produce iPau21. We then validate iPau21 through MEMOTE, growth rate, carbon source utilization, and gene essentiality testing to demonstrate its improved quality and predictive capabilities. We then integrate the GENRE with transcriptomic data in order to generate context-specific models of P. aeruginosa metabolism. The contextualized models recapitulated known phenotypes of unaltered growth and a differential utilization of fumarate metabolism, while also providing a novel insight about an increased utilization of propionate metabolism upon MUC5B exposure. This work serves to validate iPau21 and demonstrate its utility for providing novel biological insights.


Animals ◽  
2021 ◽  
Vol 11 (8) ◽  
pp. 2226
Author(s):  
Sazia Kunvar ◽  
Sylwia Czarnomska ◽  
Cino Pertoldi ◽  
Małgorzata Tokarska

The European bison is a non-model organism; thus, most of its genetic and genomic analyses have been performed using cattle-specific resources, such as BovineSNP50 BeadChip or Illumina Bovine 800 K HD Bead Chip. The problem with non-specific tools is the potential loss of evolutionary diversified information (ascertainment bias) and species-specific markers. Here, we have used a genotyping-by-sequencing (GBS) approach for genotyping 256 samples from the European bison population in Bialowieza Forest (Poland) and performed an analysis using two integrated pipelines of the STACKS software: one is de novo (without reference genome) and the other is a reference pipeline (with reference genome). Moreover, we used a reference pipeline with two different genomes, i.e., Bos taurus and European bison. Genotyping by sequencing (GBS) is a useful tool for SNP genotyping in non-model organisms due to its cost effectiveness. Our results support GBS with a reference pipeline without PCR duplicates as a powerful approach for studying the population structure and genotyping data of non-model organisms. We found more polymorphic markers in the reference pipeline in comparison to the de novo pipeline. The decreased number of SNPs from the de novo pipeline could be due to the extremely low level of heterozygosity in European bison. It has been confirmed that all the de novo/Bos taurus and Bos taurus reference pipeline obtained SNPs were unique and not included in 800 K BovineHD BeadChip.


2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Anastasiya Börsch ◽  
Daniel J. Ham ◽  
Nitish Mittal ◽  
Lionel A. Tintignac ◽  
Eugenia Migliavacca ◽  
...  

AbstractSarcopenia, the age-related loss of skeletal muscle mass and function, affects 5–13% of individuals aged over 60 years. While rodents are widely-used model organisms, which aspects of sarcopenia are recapitulated in different animal models is unknown. Here we generated a time series of phenotypic measurements and RNA sequencing data in mouse gastrocnemius muscle and analyzed them alongside analogous data from rats and humans. We found that rodents recapitulate mitochondrial changes observed in human sarcopenia, while inflammatory responses are conserved at pathway but not gene level. Perturbations in the extracellular matrix are shared by rats, while mice recapitulate changes in RNA processing and autophagy. We inferred transcription regulators of early and late transcriptome changes, which could be targeted therapeutically. Our study demonstrates that phenotypic measurements, such as muscle mass, are better indicators of muscle health than chronological age and should be considered when analyzing aging-related molecular data.


Sign in / Sign up

Export Citation Format

Share Document