proteomics data analysis Latest Research Papers

AbstractMotivationImputation is a prominent strategy when dealing with missing values (MVs) in proteomics data analysis pipelines. However, the performance of different imputation methods is difficult to assess and varies strongly depending on data characteristics. To overcome this issue, we present the concept of a data-driven selection of a suitable imputation algorithm (DIMA).ResultsThe performance and broad applicability of DIMA is demonstrated on 121 quantitative proteomics data sets from the PRIDE database and on simulated data consisting of 5 – 50% MVs with different proportions of missing not at random and missing completely at random values. DIMA reliably suggests a high-performing imputation algorithm which is always among the three best algorithms and results in a root mean square error difference (ΔRMSE) ≤ 10% in 84% of the cases.Availability and ImplementationSource code is freely available for download at github.com/clemenskreutz/OmicsData.

Download Full-text

PeptideMind – applying machine learning algorithms to assess replicate quality in shotgun proteomic data

10.1101/2020.08.20.260455 ◽

2020 ◽

Author(s):

D.C.L. Handler ◽

P.A. Haynes

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Shotgun Proteomics ◽

Machine Learning Algorithms ◽

Proteomics Data ◽

Proteomic Data ◽

Proteomics Experiment ◽

Significance Measures ◽

Proteomics Data Analysis

AbstractAssessment of replicate quality is an important process for any shotgun proteomics experiment. One fundamental question in proteomics data analysis is whether any specific replicates in a set of analyses are biasing the downstream comparative quantitation. In this paper, we present an experimental method to address such a concern. PeptideMind uses a series of clustering Machine Learning algorithms to assess outliers when comparing proteomics data from two states with six replicates each. The program is a JVM native application written in the Kotlin language with Python sub-process calls to scikit-learn. By permuting the six data replicates provided into four hundred triplet non redundant pairwise comparisons, PeptideMind determines if any one replicate is biasing the downstream quantitation of the states. In addition, PeptideMind generates useful visual representations of the spread of the significance measures, allowing researchers a rapid, effective way to monitor the quality of those identified proteins found to be differentially expressed between sample states.

Download Full-text

Philosopher: a versatile toolkit for shotgun proteomics data analysis

Nature Methods ◽

10.1038/s41592-020-0912-y ◽

2020 ◽

Vol 17 (9) ◽

pp. 869-870 ◽

Cited By ~ 6

Author(s):

Felipe da Veiga Leprevost ◽

Sarah E. Haynes ◽

Dmitry M. Avtonomov ◽

Hui-Yin Chang ◽

Avinash K. Shanmugam ◽

...

Keyword(s):

Data Analysis ◽

Shotgun Proteomics ◽

Proteomics Data ◽

Proteomics Data Analysis

Download Full-text

Using tRNA-seq, RNA-seq, and proteomics data analysis to investigate the importance of tRNA modification – A case study of tRNA Guanine and Inosine-N1-methyltransferase TRM5 in Arabidopsis thaliana.

10.26226/morressier.5ebd45acffea6f735881b074 ◽

2020 ◽

Author(s):

Pei Qin (Sabrina) Ng

Keyword(s):

Arabidopsis Thaliana ◽

Data Analysis ◽

Trna Modification ◽

Rna Seq ◽

Proteomics Data ◽

Proteomics Data Analysis

Download Full-text

Bioinformatics Methods for Mass Spectrometry-Based Proteomics Data Analysis

International Journal of Molecular Sciences ◽

10.3390/ijms21082873 ◽

2020 ◽

Vol 21 (8) ◽

pp. 2873 ◽

Cited By ~ 3

Author(s):

Chen Chen ◽

Jie Hou ◽

John J. Tanner ◽

Jianlin Cheng

Keyword(s):

Mass Spectrometry ◽

Protein Interactions ◽

Protein Identification ◽

Proteomics Data ◽

Cellular Mechanisms ◽

Tremendous Progress ◽

Analysis Strategies ◽

Bioinformatics Software ◽

The Relationship ◽

Proteomics Data Analysis

Recent advances in mass spectrometry (MS)-based proteomics have enabled tremendous progress in the understanding of cellular mechanisms, disease progression, and the relationship between genotype and phenotype. Though many popular bioinformatics methods in proteomics are derived from other omics studies, novel analysis strategies are required to deal with the unique characteristics of proteomics data. In this review, we discuss the current developments in the bioinformatics methods used in proteomics and how they facilitate the mechanistic understanding of biological processes. We first introduce bioinformatics software and tools designed for mass spectrometry-based protein identification and quantification, and then we review the different statistical and machine learning methods that have been developed to perform comprehensive analysis in proteomics studies. We conclude with a discussion of how quantitative protein data can be used to reconstruct protein interactions and signaling networks.

Download Full-text

Enhancing Top-Down Proteomics Data Analysis by Combining Deconvolution Results through a Machine Learning Strategy

Journal of the American Society for Mass Spectrometry ◽

10.1021/jasms.0c00035 ◽

2020 ◽

Vol 31 (5) ◽

pp. 1104-1113 ◽

Cited By ~ 3

Author(s):

Sean J. McIlwain ◽

Zhijie Wu ◽

Molly Wetzel ◽

Daniel Belongia ◽

Yutong Jin ◽

...

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Learning Strategy ◽

Proteomics Data ◽

Top Down ◽

Proteomics Data Analysis

Download Full-text

CHAPTER 14. Quantitative Proteomics Data Analysis with PANDA, LFAQ and PANDA-view

Processing Metabolomics and Proteomics Data with Open Software - New Developments in Mass Spectrometry ◽

10.1039/9781788019880-00345 ◽

2020 ◽

pp. 345-370

Author(s):

Kaikun Xu ◽

Cheng Chang

Keyword(s):

Data Analysis ◽

Quantitative Proteomics ◽

Proteomics Data ◽

Proteomics Data Analysis

Download Full-text

proteomics data analysis
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX