Implications of measurement error structure on the visualization of multivariate chemical data: hazards and alternatives

Peter D. Wentzell; Chelsi C. Wicks; Jez W.B. Braga; Liz F. Soares; Tereza C.M. Pastore; Vera T.R. Coradin; Fabrice Davrieux

doi:10.1139/cjc-2017-0730

Implications of measurement error structure on the visualization of multivariate chemical data: hazards and alternatives

Canadian Journal of Chemistry ◽

10.1139/cjc-2017-0730 ◽

2018 ◽

Vol 96 (7) ◽

pp. 738-748 ◽

Cited By ~ 2

Author(s):

Peter D. Wentzell ◽

Chelsi C. Wicks ◽

Jez W.B. Braga ◽

Liz F. Soares ◽

Tereza C.M. Pastore ◽

...

Keyword(s):

Data Analysis ◽

Measurement Error ◽

Principal Components Analysis ◽

Principal Components ◽

Measurement Errors ◽

Alternative Methods ◽

Chemical Data ◽

Multidimensional Data ◽

Chemical Information ◽

Components Analysis

The analysis of multivariate chemical data is commonplace in fields ranging from metabolomics to forensic classification. Many of these studies rely on exploratory visualization methods that represent the multidimensional data in spaces of lower dimensionality, such as hierarchical cluster analysis (HCA) or principal components analysis (PCA). However, such methods rely on assumptions of independent measurement errors with uniform variance and can fail to reveal important information when these assumptions are violated, as they often are for chemical data. This work demonstrates how two alternative methods, maximum likelihood principal components analysis (MLPCA) and projection pursuit analysis (PPA), can reveal chemical information hidden from more traditional techniques. Experimental data to compare different methods consists of near-infrared (NIR) reflectance spectra from 108 samples of wood that are derived from four different species of Brazilian trees. The measurement error characteristics of the spectra are examined and it is shown that, by incorporating measurement error information into the data analysis (through MLPCA) or using alternative projection criteria (i.e., PPA), samples can be separated by species. These techniques are proposed as powerful tools for multivariate data analysis in chemistry.

Download Full-text

A pedogenic investigation of some soil chronosequences on neoglacial moraine ridges, southern Norway: Examination of soil chemical data using principal components analysis

CATENA ◽

10.1016/0341-8162(87)90010-5 ◽

1987 ◽

Vol 14 (5) ◽

pp. 369-381 ◽

Cited By ~ 21

Author(s):

A. Mellor

Keyword(s):

Principal Components Analysis ◽

Principal Components ◽

Chemical Data ◽

Components Analysis ◽

Southern Norway

Download Full-text

Principal Components Analysis of Environmental Chemical Data: Experience and Application

Environmental Forensics ◽

10.1039/9781849732062-00202 ◽

2010 ◽

pp. 202-209

Keyword(s):

Principal Components Analysis ◽

Principal Components ◽

Chemical Data ◽

Environmental Chemical ◽

Components Analysis

Download Full-text

Principal components analysis for the visualisation of multidimensional chemical data acquired by scanning Raman microspectroscopy

The Analyst ◽

10.1039/b203281c ◽

2002 ◽

Vol 127 (9) ◽

pp. 1261-1266 ◽

Cited By ~ 9

Author(s):

Michael Malecha ◽

Conrad Bessant ◽

Selwayan Saini

Keyword(s):

Principal Components Analysis ◽

Principal Components ◽

Raman Microspectroscopy ◽

Chemical Data ◽

Components Analysis

Download Full-text

A Comparison of Self-organising Maps and Principal Components Analysis

International Journal of Market Research ◽

10.2501/ijmr-2016-039 ◽

2016 ◽

Vol 58 (6) ◽

pp. 815-834 ◽

Cited By ~ 2

Author(s):

Gopal Das ◽

Manojit Chattopadhyay ◽

Sumeet Gupta

Keyword(s):

Data Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Analysis Data ◽

Retail Store ◽

Visual Clustering ◽

Components Analysis ◽

Personality Construct

This paper attempts to compare self-organising maps (SOM) and principal components analysis (CPA) by applying them to the marketing construct ‘retail store personality’. Data were collected for the retail store personality construct via a validated scale from previous studies that had used the mall intercept technique. A total of 367 people responded, of whom 353 were found to be valid for data analysis. Data were analysed using CPA and SOM; both methods gave comparable clustering results, although the results for SOM were quite conclusive. In addition, we found that SOM complemented PCA by providing visual clustering results far superior to those of PCA. SOM can be used to further analyse PCA data using visual clustering features; both could be used in tandem. Although SOM have been used in a number of studies in marketing, this is the first paper to compare PCA and SOM on terms of application to the marketing construct ‘retail store personality’.

Download Full-text

Multivariate data analysis A trial of establisment of normal ranges for multiple analyzed items by principal components analysis

Japanese journal of AMHTS ◽

10.7143/jhep1975.6.91 ◽

1979 ◽

Vol 6 (2) ◽

pp. 91-91

Keyword(s):

Data Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Multivariate Data Analysis ◽

Multivariate Data ◽

Normal Ranges ◽

Components Analysis

Download Full-text

Data analysis on sea water quality data in Jakarta Bay using Principal Components Analysis (PCA) method during transitional monsoon 2012

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/339/1/012023 ◽

2019 ◽

Vol 339 ◽

pp. 012023

Author(s):

A Martina ◽

I M Radjawane

Keyword(s):

Water Quality ◽

Data Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Sea Water ◽

Quality Data ◽

Water Quality Data ◽

Pca Method ◽

Components Analysis

Download Full-text

Principal components analysis: theory and application to gene expression data analysis

Genomics and Computational Biology ◽

10.18547/gcb.2018.vol4.iss2.e100041 ◽

2018 ◽

Vol 4 (2) ◽

pp. 100041 ◽

Cited By ~ 8

Author(s):

Hristo Todorov ◽

David Fournier ◽

Susanne Gerber

Keyword(s):

Gene Expression ◽

Data Analysis ◽

Principal Components Analysis ◽

Gene Expression Data ◽

Principal Components ◽

Expression Data ◽

Statistical Tool ◽

Theoretical Understanding ◽

Transgenic Mouse Line ◽

Components Analysis

Advances in computational power have enabled research to generate significant amounts of data related to complex biological problems. Consequently, applying appropriate data analysis techniques has become paramount to tackle this complexity. However, theoretical understanding of statistical methods is necessary to ensure that the correct method is used and that sound inferences are made based on the analysis. In this article, we elaborate on the theory behind principal components analysis (PCA), which has become a favoured multivariate statistical tool in the field of omics-data analysis. We discuss the necessary prerequisites and steps to produce statistically valid results and provide guidelines for interpreting the output. Using PCA on gene expression data from a mouse experiment, we demonstrate that the main distinctive pattern in the data is associated with the transgenic mouse line and is not related to the mouse gender. A weaker association of the pattern with the genotype was also identified.

Download Full-text

Multivariate analysis of stream water chemical data: The use of principal components analysis for the end-member mixing problem

Water Resources Research ◽

10.1029/91wr02518 ◽

1992 ◽

Vol 28 (1) ◽

pp. 99-107 ◽

Cited By ~ 283

Author(s):

Nils Christophersen ◽

Richard P. Hooper

Keyword(s):

Multivariate Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Stream Water ◽

Chemical Data ◽

Components Analysis ◽

Mixing Problem

Download Full-text

APL functions for interactive data analysis: Principal components analysis

Behavior Research Methods ◽

10.3758/bf03202083 ◽

1981 ◽

Vol 13 (5) ◽

pp. 657-666 ◽

Cited By ~ 5

Author(s):

Selby Evans ◽

Jerry D. Neideffer ◽

Fred H. Gage

Keyword(s):

Data Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Interactive Data Analysis ◽

Interactive Data ◽

Components Analysis

Download Full-text

Neural network based principal components analysis for EEG pre-processing and analysis

Electroencephalography and Clinical Neurophysiology ◽

10.1016/s0013-4694(97)88499-8 ◽

1997 ◽

Vol 103 (1) ◽

pp. 115

Author(s):

S Petránek

Keyword(s):

Neural Network ◽

Principal Components Analysis ◽

Principal Components ◽

Components Analysis

Download Full-text