Comparison of species classification models of mass spectrometry data: Kernel Discriminant Analysis vs Random Forest; A case study of Afrormosia (Pericopsis elata(Harms) Meeuwen)

V. Deklerck; K. Finch; P. Gasson; J. Van den Bulcke; J. Van Acker; H. Beeckman; E. Espinoza

doi:10.1002/rcm.7939

Comparison of species classification models of mass spectrometry data: Kernel Discriminant Analysis vs Random Forest; A case study of Afrormosia (Pericopsis elata(Harms) Meeuwen)

Rapid Communications in Mass Spectrometry ◽

10.1002/rcm.7939 ◽

2017 ◽

Vol 31 (19) ◽

pp. 1582-1588 ◽

Cited By ~ 10

Author(s):

V. Deklerck ◽

K. Finch ◽

P. Gasson ◽

J. Van den Bulcke ◽

J. Van Acker ◽

...

Keyword(s):

Mass Spectrometry ◽

Discriminant Analysis ◽

Random Forest ◽

Mass Spectrometry Data ◽

Classification Models ◽

Species Classification ◽

Kernel Discriminant Analysis

Download Full-text

Classification of Breast Cancer versus Normal Samples from Mass Spectrometry Profiles Using Linear Discriminant Analysis of Important Features Selected by Random Forest

Statistical Applications in Genetics and Molecular Biology ◽

10.2202/1544-6115.1345 ◽

2008 ◽

Vol 7 (2) ◽

Cited By ~ 15

Author(s):

Somnath Datta

Keyword(s):

Breast Cancer ◽

Mass Spectrometry ◽

Discriminant Analysis ◽

Random Forest ◽

Linear Discriminant Analysis ◽

Linear Discriminant

Download Full-text

Coupling Proton Transfer Reaction−Mass Spectrometry with Linear Discriminant Analysis: a Case Study

Journal of Agricultural and Food Chemistry ◽

10.1021/jf030248i ◽

2003 ◽

Vol 51 (25) ◽

pp. 7227-7233 ◽

Cited By ~ 32

Author(s):

Franco Biasioli ◽

Flavia Gasperi ◽

Eugenio Aprea ◽

Daniela Mott ◽

Elena Boscaini ◽

...

Keyword(s):

Mass Spectrometry ◽

Discriminant Analysis ◽

Proton Transfer ◽

Linear Discriminant Analysis ◽

Transfer Reaction ◽

Proton Transfer Reaction ◽

Reaction Mass ◽

Linear Discriminant

Download Full-text

Comparison of Computational Algorithms for the Classification of Liver Cancer using SELDI Mass Spectrometry: A Case Study

Cancer Informatics ◽

10.1177/117693510700300021 ◽

2007 ◽

Vol 3 ◽

pp. 117693510700300 ◽

Cited By ~ 3

Author(s):

Changyu Shen ◽

Timothy E Breen ◽

Lacey E Dobrolecki ◽

C. Max Schmidt ◽

George W. Sledge ◽

...

Keyword(s):

Mass Spectrometry ◽

Hepatocellular Carcinoma ◽

Support Vector Machine ◽

Random Forest ◽

Radial Function ◽

Support Vector ◽

Laser Desorption Ionization ◽

Prediction Analysis

Introduction As an alternative to DNA microarrays, mass spectrometry based analysis of proteomic patterns has shown great potential in cancer diagnosis. The ultimate application of this technique in clinical settings relies on the advancement of the technology itself and the maturity of the computational tools used to analyze the data. A number of computational algorithms constructed on different principles are available for the classification of disease status based on proteomic patterns. Nevertheless, few studies have addressed the difference in the performance of these approaches. In this report, we describe a comparative case study on the classification accuracy of hepatocellular carcinoma based on the serum proteomic pattern generated from a Surface Enhanced Laser Desorption/Ionization (SELDI) mass spectrometer. Methods Nine supervised classification algorithms are implemented in R software and compared for the classification accuracy. Results We found that the support vector machine with radial function is preferable as a tool for classification of hepatocellular carcinoma using features in SELDI mass spectra. Among the rest of the methods, random forest and prediction analysis of microarrays have better performance. A permutation-based technique reveals that the support vector machine with a radial function seems intrinsically superior in learning from the training data since it has a lower prediction error than others when there is essentially no differential signal. On the other hand, the performance of the random forest and prediction analysis of microarrays rely on their capability of capturing the signals with substantial differentiation between groups. Conclusions Our finding is similar to a previous study, where classification methods based on the Matrix Assisted Laser Desorption/Ionization (MALDI) mass spectrometry are compared for the prediction accuracy of ovarian cancer. The support vector machine, random forest and prediction analysis of microarrays provide better prediction accuracy for hepatocellular carcinoma using SELDI proteomic data than six other approaches.

Download Full-text

Differential protein expression and peak selection in mass spectrometry data by binary discriminant analysis

Bioinformatics ◽

10.1093/bioinformatics/btv334 ◽

2015 ◽

Vol 31 (19) ◽

pp. 3156-3162 ◽

Cited By ~ 18

Author(s):

Sebastian Gibb ◽

Korbinian Strimmer

Keyword(s):

Mass Spectrometry ◽

Discriminant Analysis ◽

Protein Expression ◽

Mass Spectrometry Data ◽

Differential Protein Expression ◽

Differential Protein

Download Full-text

MSPolyCalc: A web‐based App for polymer mass spectrometry data interpretation. The case study of a pharmaceutical excipient

Rapid Communications in Mass Spectrometry ◽

10.1002/rcm.8652 ◽

2020 ◽

Vol 34 (S2) ◽

Cited By ~ 2

Author(s):

Jessica S. Desport ◽

Gilles Frache ◽

Luc Patiny

Keyword(s):

Mass Spectrometry ◽

Data Interpretation ◽

Mass Spectrometry Data ◽

Web Based ◽

Pharmaceutical Excipient

Download Full-text

154: Integration of TPSA and High-Throughput Mass Spectrometry Data Improves Prostate Cancer Prediction

The Journal of Urology ◽

10.1016/s0022-5347(18)30419-1 ◽

2007 ◽

Vol 177 (4S) ◽

pp. 52-53

Author(s):

Stefano Ongarello ◽

Eberhard Steiner ◽

Regina Achleitner ◽

Isabel Feuerstein ◽

Birgit Stenzel ◽

...

Keyword(s):

Prostate Cancer ◽

Mass Spectrometry ◽

High Throughput ◽

Mass Spectrometry Data ◽

Cancer Prediction

Download Full-text

Use of Posterior Probabilities to Evaluate Methods of Discriminant Analysis

Methods of Information in Medicine ◽

10.1055/s-0038-1635316 ◽

1981 ◽

Vol 20 (04) ◽

pp. 207-212 ◽

Cited By ~ 6

Author(s):

J. Hermans ◽

B. van Zomeren ◽

J. W. Raatgever ◽

P. J. Sterk ◽

J. D. F. Habbema

Keyword(s):

Experimental Data ◽

Discriminant Analysis ◽

Posterior Probabilities

By means of a case study the choice between several methods of discriminant analysis is presented. Experimental data of a two-groups problem with one or two variables is analysed. The different methods are compared according to posterior probabilities which can be computed for each subject and which are the basis of discriminant analysis. These posterior probabilities are analysed graphically as well as numerically.

Download Full-text

Algorithm of combining chromatography mass spectrometry-untargeted profiling and multivariate analysis for identification of marker-substances in samples of complex composition

Industrial laboratory Diagnostics of materials ◽

10.26896/1028-6861-2020-86-7-12-19 ◽

2020 ◽

Vol 86 (7) ◽

pp. 12-19

Author(s):

I. V. Plyushchenko ◽

D. G. Shakhmatov ◽

I. A. Rodin

Keyword(s):

Mass Spectrometry ◽

Multivariate Analysis ◽

Large Scale ◽

Complex Composition ◽

Unified Protocol ◽

Chromatography Mass Spectrometry ◽

Marker Substances ◽

Selection Testing ◽

Untargeted Profiling

A viral development of statistical data processing, computing capabilities, chromatography-mass spectrometry, and omics technologies (technologies based on the achievements of genomics, transcriptomics, proteomics, metabolomics) in recent decades has not led to formation of a unified protocol for untargeted profiling. Systematic errors reduce the reproducibility and reliability of the obtained results, and at the same time hinder consolidation and analysis of data gained in large-scale multi-day experiments. We propose an algorithm for conducting omics profiling to identify potential markers in the samples of complex composition and present the case study of urine samples obtained from different clinical groups of patients. Profiling was carried out by the method of liquid chromatography mass spectrometry. The markers were selected using methods of multivariate analysis including machine learning and feature selection. Testing of the approach was performed using an independent dataset by clustering and projection on principal components.

Download Full-text

Discriminant analysis and similarity evaluation of gas chromatography-mass spectrometry fingerprints of aroma components in green tea grading

Chinese Journal of Chromatography ◽

10.3724/sp.j.1123.2018.10017 ◽

2019 ◽

Vol 37 (3) ◽

pp. 325 ◽

Cited By ~ 1

Author(s):

Limei LONG ◽

Shasha SONG ◽

Xueli CAO

Keyword(s):

Mass Spectrometry ◽

Gas Chromatography ◽

Discriminant Analysis ◽

Green Tea ◽

Gas Chromatography Mass Spectrometry ◽

Chromatography Mass Spectrometry ◽

Aroma Components

Download Full-text

Nonparametric Pre-Processing Methods and Inference Tools for Analyzing Time-of-Flight Mass Spectrometry Data.

Current Analytical Chemistry ◽

10.2174/157341107780361718 ◽

2007 ◽

Vol 3 (2) ◽

pp. 127-147 ◽

Cited By ~ 8

Author(s):

Anestis Antoniadis ◽

Jeremie Bigot ◽

Sophie Lambert-Lacroix ◽

Frederique Letue

Keyword(s):

Mass Spectrometry ◽

Time Of Flight ◽

Mass Spectrometry Data ◽

Processing Methods ◽

Flight Mass Spectrometry

Download Full-text