scholarly journals Implications of measurement error structure on the visualization of multivariate chemical data: hazards and alternatives

2018 ◽  
Vol 96 (7) ◽  
pp. 738-748 ◽  
Author(s):  
Peter D. Wentzell ◽  
Chelsi C. Wicks ◽  
Jez W.B. Braga ◽  
Liz F. Soares ◽  
Tereza C.M. Pastore ◽  
...  

The analysis of multivariate chemical data is commonplace in fields ranging from metabolomics to forensic classification. Many of these studies rely on exploratory visualization methods that represent the multidimensional data in spaces of lower dimensionality, such as hierarchical cluster analysis (HCA) or principal components analysis (PCA). However, such methods rely on assumptions of independent measurement errors with uniform variance and can fail to reveal important information when these assumptions are violated, as they often are for chemical data. This work demonstrates how two alternative methods, maximum likelihood principal components analysis (MLPCA) and projection pursuit analysis (PPA), can reveal chemical information hidden from more traditional techniques. Experimental data to compare different methods consists of near-infrared (NIR) reflectance spectra from 108 samples of wood that are derived from four different species of Brazilian trees. The measurement error characteristics of the spectra are examined and it is shown that, by incorporating measurement error information into the data analysis (through MLPCA) or using alternative projection criteria (i.e., PPA), samples can be separated by species. These techniques are proposed as powerful tools for multivariate data analysis in chemistry.

2016 ◽  
Vol 58 (6) ◽  
pp. 815-834 ◽  
Author(s):  
Gopal Das ◽  
Manojit Chattopadhyay ◽  
Sumeet Gupta

This paper attempts to compare self-organising maps (SOM) and principal components analysis (CPA) by applying them to the marketing construct ‘retail store personality’. Data were collected for the retail store personality construct via a validated scale from previous studies that had used the mall intercept technique. A total of 367 people responded, of whom 353 were found to be valid for data analysis. Data were analysed using CPA and SOM; both methods gave comparable clustering results, although the results for SOM were quite conclusive. In addition, we found that SOM complemented PCA by providing visual clustering results far superior to those of PCA. SOM can be used to further analyse PCA data using visual clustering features; both could be used in tandem. Although SOM have been used in a number of studies in marketing, this is the first paper to compare PCA and SOM on terms of application to the marketing construct ‘retail store personality’.


2018 ◽  
Vol 4 (2) ◽  
pp. 100041 ◽  
Author(s):  
Hristo Todorov ◽  
David Fournier ◽  
Susanne Gerber

Advances in computational power have enabled research to generate significant amounts of data related to complex biological problems. Consequently, applying appropriate data analysis techniques has become paramount to tackle this complexity. However, theoretical understanding of statistical methods is necessary to ensure that the correct method is used and that sound inferences are made based on the analysis. In this article, we elaborate on the theory behind principal components analysis (PCA), which has become a favoured multivariate statistical tool in the field of omics-data analysis. We discuss the necessary prerequisites and steps to produce statistically valid results and provide guidelines for interpreting the output. Using PCA on gene expression data from a mouse experiment, we demonstrate that the main distinctive pattern in the data is associated with the transgenic mouse line and is not related to the mouse gender. A weaker association of the pattern with the genotype was also identified.


Sign in / Sign up

Export Citation Format

Share Document