scholarly journals Robust Moiety Model Selection Using Mass Spectrometry Measured Isotopologues

2019 ◽  
Author(s):  
Huan Jin ◽  
Hunter N.B. Moseley

AbstractStable isotope resolved metabolomics (SIRM) experiments uses stable isotope tracers to provide superior mass spectroscopy (MS) and nuclear magnetic resonance (NMR) metabolomics datasets for metabolic flux analysis and metabolic modeling. Several software packages exist for metabolic flux analysis when provided a metabolic model and appropriate isotopomer and/or isotopologue datasets, mostly from 13C tracer time series experiments. However, assumptions of model correctness can seriously compromise interpretation of metabolic flux results generated from these packages. Therefore, we have developed a metabolic modeling software package specifically designed for moiety model comparison and selection based on the metabolomics data provided. This moiety modeling framework facilitates analysis of time-series SIRM MS isotopologue profiles using a set of plausible moiety models and data in a JSONized representation. The moiety_modeling Python package is available on GitHub and the Python Package Index and provides facilities for model parameter optimization, analysis of optimization results, and model selection. Furthermore, this package is capable of analyzing multi-tracer datasets. Here, we tested the effectiveness of this moiety modeling framework in model selection with two sets of time-series MS isotopologue datasets for uridine diphosphate N-acetyl-D-glucosamine (UDP-GlcNAc) generated from different MS platforms: direct infusion nanoelectrospray Fourier transform MS and liquid chromatography MS. Results generated from the analyses of these datasets demonstrate the robustness of our model selection methods by the successful selection of the optimal model from over 40 models provided. Also, the effects of specific optimization methods, degree of optimization, selection criteria, and specific objective functions on model selection are illustrated. Furthermore, different types of error can exist in the datasets, and proper selection of the objective function can help reduce the optimization side effects caused by the specific types of uncertainty in these datasets. Overall, these results indicate that over-optimization can lead to failure in model selection, but combining multiple datasets can help prevent this overfitting effect. The implication is that SIRM datasets in public repositories of reasonable quality can be combined with newly acquired datasets to improve model selection. Furthermore, curation efforts of public metabolomics repositories to maintain high data quality could have huge impacts on future metabolic modeling efforts.

Metabolites ◽  
2020 ◽  
Vol 10 (3) ◽  
pp. 118 ◽  
Author(s):  
Huan Jin ◽  
Hunter N.B. Moseley

Stable isotope resolved metabolomics (SIRM) experiments use stable isotope tracers to provide superior metabolomics datasets for metabolic flux analysis and metabolic modeling. Since assumptions of model correctness can seriously compromise interpretation of metabolic flux results, we have developed a metabolic modeling software package specifically designed for moiety model comparison and selection based on the metabolomics data provided. Here, we tested the effectiveness of model selection with two time-series mass spectrometry (MS) isotopologue datasets for uridine diphosphate N-acetyl-d-glucosamine (UDP-GlcNAc) generated from different platforms utilizing direct infusion nanoelectrospray and liquid chromatography. Analysis results demonstrate the robustness of our model selection methods by the successful selection of the optimal model from over 40 models provided. Moreover, the effects of specific optimization methods, degree of optimization, selection criteria, and specific objective functions on model selection are illustrated. Overall, these results indicate that over-optimization can lead to model selection failure, but combining multiple datasets can help control this overfitting effect. The implication is that SIRM datasets in public repositories of reasonable quality can be combined with newly acquired datasets to improve model selection. Furthermore, curation efforts of public metabolomics repositories to maintain high data quality could have a huge impact on future metabolic modeling efforts.


2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Huan Jin ◽  
Hunter N. B. Moseley

Abstract Background Stable isotope tracing can follow individual atoms through metabolic transformations through the detection of the incorporation of stable isotope within metabolites. This resulting data can be interpreted in terms related to metabolic flux. However, detection of a stable isotope in metabolites by mass spectrometry produces a profile of isotopologue peaks that requires deconvolution to ascertain the localization of isotope incorporation. Results To aid the interpretation of the mass spectroscopy isotopologue profile, we have developed a moiety modeling framework for deconvoluting metabolite isotopologue profiles involving single and multiple isotope tracers. This moiety modeling framework provides facilities for moiety model representation, moiety model optimization, and moiety model selection. The moiety_modeling package was developed from the idea of metabolite decomposition into moiety units based on metabolic transformations, i.e. a moiety model. The SAGA-optimize package, solving a boundary-value inverse problem through a combined simulated annealing and genetic algorithm, was developed for model optimization. Additional optimization methods from the Python scipy library are utilized as well. Several forms of the Akaike information criterion and Bayesian information criterion are provided for selecting between moiety models. Moiety models and associated isotopologue data are defined in a JSONized format. By testing the moiety modeling framework on the timecourses of 13C isotopologue data for uridine diphosphate N-acetyl-D-glucosamine (UDP-GlcNAc) in human prostate cancer LnCaP-LN3 cells, we were able to confirm its robust performance in isotopologue deconvolution and moiety model selection. Conclusions SAGA-optimize is a useful Python package for solving boundary-value inverse problems, and the moiety_modeling package is an easy-to-use tool for mass spectroscopy isotopologue profile deconvolution involving single and multiple isotope tracers. Both packages are freely available on GitHub and via the Python Package Index.


Author(s):  
Fumio Matsuda ◽  
Kohsuke Maeda ◽  
Takeo Taniguchi ◽  
Yuya Kondo ◽  
Futa Yatabe ◽  
...  

2012 ◽  
Vol 13 (1) ◽  
pp. 295 ◽  
Author(s):  
C Hart Poskar ◽  
Jan Huege ◽  
Christian Krach ◽  
Mathias Franke ◽  
Yair Shachar-Hill ◽  
...  

2019 ◽  
Author(s):  
Huan Jin ◽  
Hunter N.B. Moseley

AbstractBackgroundStable isotope tracing can follow individual atoms through metabolic transformations through the detection of the incorporation of stable isotope within metabolites. This resulting data can be interpreted in terms related to metabolic flux. However, detection of a stable isotope in metabolites by mass spectrometry produces a profile of isotopologue peaks that requires deconvolution to ascertain the localization of isotope incorporation.ResultsTo aid the interpretation of the mass spectroscopy isotopologue profile, we have developed a moiety modeling framework for deconvoluting metabolite isotopologue profiles involving single and multiple isotope tracers. This moiety modeling framework provides facilities for moiety model representation, moiety model optimization, and moiety model selection. The moiety_modeling package was developed from the idea of metabolite decomposition into moiety units based on metabolic transformations, i.e. a moiety model. A SAGA-optimize package, solving a boundary-value inverse problem through a combined simulated annealing and genetic algorithm, was developed for model optimization. Additional optimization methods from the Python scipy library are utilized as well. Several forms of the Akaike information criterion and Bayesian information criterion are provided for selecting between moiety models. Moiety models and associated isotopologue data are defined in the JSON format.By testing the moiety modeling framework on the timecourses of 13C isotopologue data for UDP-N-acetyl-D-glucosamine (UDP-GlcNAc) in human prostate cancer LnCaP-LN3 cells, we were able to confirm its robust performance in isotopologue deconvolution and moiety model selection.ConclusionsSAGA-optimize is a useful Python package for solving boundary-value inverse problems, and the moiety_modeling package is an easy-to-use tool for mass spectroscopy isotopologue profile deconvolution involving single and multiple isotope tracers. Both packages are freely freely available on GitHub and via the Python Package Index.


2016 ◽  
Vol 4 (1) ◽  
Author(s):  
Daniel Weindl ◽  
Thekla Cordes ◽  
Nadia Battello ◽  
Sean C. Sapcariu ◽  
Xiangyi Dong ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document