scholarly journals MoMo: Discovery of statistically significant post-translational modification motifs

2018 ◽  
Author(s):  
Alice Cheng ◽  
Charles E. Grant ◽  
William S. Noble ◽  
Timothy L. Bailey

AbstractMotivationPost-translational modifications (PTMs) of proteins are associated with many significant biological functions and can be identified in high throughput using tandem mass spectrometry. Many PTMs are associated with short sequence patterns called “motifs” that help localize the modifying enzyme. Accordingly, many algorithms have been designed to identify these motifs from mass spectrometry data. Accurate statistical confidence estimates for discovered motifs are critically important for proper interpretation and in the design of downstream experimental validation.ResultsWe describe a method for assigning statistical confidence estimates to PTM motifs, and we demonstrate that this method provides accurate p-values on both simulated and real data. Our methods are implemented in MoMo, a software tool for discovering motifs among sets of PTMs that we make available as a web server and as downloadable source code. MoMo reimplements the two most widely used PTM motif discovery algorithms—motif-x and MoDL—while offering many enhancements. Relative to motif-x, MoMo offers improved statistical confidence estimates and more accurate calculation of motif scores. The MoMo web server offers more proteome databases, more input formats, larger inputs and longer running times than the motif-x web server. Finally, our study demonstrates that the confidence estimates produced by motif-x are inaccurate. This inaccuracy stems in part from the common practice of drawing “background” peptides from an unshuffled proteome database. Our results thus suggest that many of the hundreds of papers that use motif-x to find motifs may be reporting results that lack statistical support.Availabilityhttp://[email protected]

2018 ◽  
Vol 35 (16) ◽  
pp. 2774-2782 ◽  
Author(s):  
Alice Cheng ◽  
Charles E Grant ◽  
William S Noble ◽  
Timothy L Bailey

Abstract Motivation Post-translational modifications (PTMs) of proteins are associated with many significant biological functions and can be identified in high throughput using tandem mass spectrometry. Many PTMs are associated with short sequence patterns called ‘motifs’ that help localize the modifying enzyme. Accordingly, many algorithms have been designed to identify these motifs from mass spectrometry data. Accurate statistical confidence estimates for discovered motifs are critically important for proper interpretation and in the design of downstream experimental validation. Results We describe a method for assigning statistical confidence estimates to PTM motifs, and we demonstrate that this method provides accurate P-values on both simulated and real data. Our methods are implemented in MoMo, a software tool for discovering motifs among sets of PTMs that we make available as a web server and as downloadable source code. MoMo re-implements the two most widely used PTM motif discovery algorithms—motif-x and MoDL—while offering many enhancements. Relative to motif-x, MoMo offers improved statistical confidence estimates and more accurate calculation of motif scores. The MoMo web server offers more proteome databases, more input formats, larger inputs and longer running times than the motif-x web server. Finally, our study demonstrates that the confidence estimates produced by motif-x are inaccurate. This inaccuracy stems in part from the common practice of drawing ‘background’ peptides from an unshuffled proteome database. Our results thus suggest that many of the papers that use motif-x to find motifs may be reporting results that lack statistical support. Availability and implementation The MoMo web server and source code are provided at http://meme-suite.org. Supplementary information Supplementary data are available at Bioinformatics online.


2017 ◽  
Author(s):  
Alice Cheng ◽  
Charles E. Grant ◽  
Timothy L. Bailey ◽  
William Stafford Noble

AbstractMotivationPost-translational modifications (PTMs) of proteins are associated with many significant biological functions and can be identified in high throughput using tandem mass spectrometry. Many PTMs are associated with short sequence patterns called “motifs” that help localize the modifying enzyme. Accordingly, many algorithms have been designed to identify these motifs from mass spectrometry data.ResultsMoMo is a software tool for identifying motifs among sets of PTMs. The program re-implements two previously described algorithms, Motif-X and MoDL, packaging them in a web-accessible user interface. In addition to reading sequence files in FASTA format, MoMo is capable of directly parsing output files produced by commonly used mass spectrometry search engines. The resulting motifs are presented to the user in an HTML summary with motif logos and linked text files in MEME motif format.AvailabilitySource code and web server available at http://[email protected] and [email protected] informationSupplementary figures are available at Bioinformatics online.


2018 ◽  
Vol 46 (5) ◽  
pp. 1381-1392 ◽  
Author(s):  
Ivar W. Dilweg ◽  
Remus T. Dame

Post-translational modification (PTM) of histones has been investigated in eukaryotes for years, revealing its widespread occurrence and functional importance. Many PTMs affect chromatin folding and gene activity. Only recently the occurrence of such modifications has been recognized in bacteria. However, it is unclear whether PTM of the bacterial counterparts of eukaryotic histones, nucleoid-associated proteins (NAPs), bears a comparable significance. Here, we scrutinize proteome mass spectrometry data for PTMs of the four most abundantly present NAPs in Escherichia coli (H-NS, HU, IHF and FIS). This approach allowed us to identify a total of 101 unique PTMs in the 11 independent proteomic studies covered in this review. Combined with structural and genetic information on these proteins, we describe potential effects of these modifications (perturbed DNA-binding, structural integrity or interaction with other proteins) on their function.


Author(s):  
Anastasia Sarycheva ◽  
Anton Grigoryev ◽  
Evgeny N. Nikolaev ◽  
Yury Kostyukevich

Mass spectrometry imaging (MSI) with high resolution in mass and space is an analytical method that produces distributions of ions on a sample surface. The algorithms for preprocessing and analysis of the raw data acquired from a mass spectrometer should be evaluated. To do that, the ion composition at every point of the sample should be known. This is possible via the employment of a simulated MSI dataset. In this work, we suggest a pipeline for a robust simulation of MSI datasets that resemble real data with an option to simulate the spectra acquired from any mass spectrometry instrument through the use of the experimental MSI datasets to extract simulation parameters.


Sign in / Sign up

Export Citation Format

Share Document