A Comparison of Self-organising Maps and Principal Components Analysis

Gopal Das; Manojit Chattopadhyay; Sumeet Gupta

doi:10.2501/ijmr-2016-039

A Comparison of Self-organising Maps and Principal Components Analysis

International Journal of Market Research ◽

10.2501/ijmr-2016-039 ◽

2016 ◽

Vol 58 (6) ◽

pp. 815-834 ◽

Cited By ~ 2

Author(s):

Gopal Das ◽

Manojit Chattopadhyay ◽

Sumeet Gupta

Keyword(s):

Data Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Analysis Data ◽

Retail Store ◽

Visual Clustering ◽

Components Analysis ◽

Personality Construct

This paper attempts to compare self-organising maps (SOM) and principal components analysis (CPA) by applying them to the marketing construct ‘retail store personality’. Data were collected for the retail store personality construct via a validated scale from previous studies that had used the mall intercept technique. A total of 367 people responded, of whom 353 were found to be valid for data analysis. Data were analysed using CPA and SOM; both methods gave comparable clustering results, although the results for SOM were quite conclusive. In addition, we found that SOM complemented PCA by providing visual clustering results far superior to those of PCA. SOM can be used to further analyse PCA data using visual clustering features; both could be used in tandem. Although SOM have been used in a number of studies in marketing, this is the first paper to compare PCA and SOM on terms of application to the marketing construct ‘retail store personality’.

Download Full-text

Using multivariate factorial kriging for multiscale ordination: a case study

Canadian Journal of Forest Research ◽

10.1139/x05-211 ◽

2005 ◽

Vol 35 (12) ◽

pp. 2860-2874 ◽

Cited By ~ 16

Author(s):

Nikos Nanos ◽

Fernando Pardo ◽

Jesus Alonso Nager ◽

José Alberto Pardos ◽

Luis Gil

Keyword(s):

Principal Components Analysis ◽

Principal Components ◽

Multiple Scale ◽

Analysis Data ◽

Vegetation Pattern ◽

Soil Conditions ◽

Linear Model Of Coregionalization ◽

Factorial Kriging ◽

Components Analysis

Vegetation ordination is usually based on classical data reduction techniques such as principal components analysis, correspondence analysis, or multidimensional scaling. The usual methods do not account for multiscale correlations among species. In this paper, we use a geostatistical method, known as multivariate factorial kriging, for studying multiple-scale correlations. The case study was carried out in a mixed broadleaf forest of central Spain. Six tree species were included in the analysis. Data analysis included (i) experimental variogram calculation and modeling with the use of the linear model of coregionalization, (ii) principal components analysis, and (iii) cokriging. The results indicate that correlations among species are different depending on the spatial scale. We conclude that competition for light is the main factor controlling the spatial distribution of species at the plot-level scale of variation. At larger scales of variation, soil conditions and (or) human intervention are the key factors in determining the observed vegetation pattern. Based on the factor scores for the largest scale of variation, we conducted a cluster analysis to identify plots with similar characteristics. The resulting clusters have the remarkable property of being spatially continuous.

Download Full-text

Implications of measurement error structure on the visualization of multivariate chemical data: hazards and alternatives

Canadian Journal of Chemistry ◽

10.1139/cjc-2017-0730 ◽

2018 ◽

Vol 96 (7) ◽

pp. 738-748 ◽

Cited By ~ 2

Author(s):

Peter D. Wentzell ◽

Chelsi C. Wicks ◽

Jez W.B. Braga ◽

Liz F. Soares ◽

Tereza C.M. Pastore ◽

...

Keyword(s):

Data Analysis ◽

Measurement Error ◽

Principal Components Analysis ◽

Principal Components ◽

Measurement Errors ◽

Alternative Methods ◽

Chemical Data ◽

Multidimensional Data ◽

Chemical Information ◽

Components Analysis

The analysis of multivariate chemical data is commonplace in fields ranging from metabolomics to forensic classification. Many of these studies rely on exploratory visualization methods that represent the multidimensional data in spaces of lower dimensionality, such as hierarchical cluster analysis (HCA) or principal components analysis (PCA). However, such methods rely on assumptions of independent measurement errors with uniform variance and can fail to reveal important information when these assumptions are violated, as they often are for chemical data. This work demonstrates how two alternative methods, maximum likelihood principal components analysis (MLPCA) and projection pursuit analysis (PPA), can reveal chemical information hidden from more traditional techniques. Experimental data to compare different methods consists of near-infrared (NIR) reflectance spectra from 108 samples of wood that are derived from four different species of Brazilian trees. The measurement error characteristics of the spectra are examined and it is shown that, by incorporating measurement error information into the data analysis (through MLPCA) or using alternative projection criteria (i.e., PPA), samples can be separated by species. These techniques are proposed as powerful tools for multivariate data analysis in chemistry.

Download Full-text

REDUCTION OF INPUT VARIABLES IN ARTIFICIAL NEURAL NETWORKS AS FROM PRINCIPAL COMPONENTS ANALYSIS DATA IN THE MODELING OF DISSOLVED OXYGEN

Química Nova ◽

10.5935/0100-4042.20160024 ◽

2016 ◽

Author(s):

Saulo Rodrigues e Silva ◽

Fernando Schimidt

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Dissolved Oxygen ◽

Principal Components Analysis ◽

Principal Components ◽

Analysis Data ◽

Components Analysis ◽

Artificial Neural ◽

Input Variables

Download Full-text

Multivariate data analysis A trial of establisment of normal ranges for multiple analyzed items by principal components analysis

Japanese journal of AMHTS ◽

10.7143/jhep1975.6.91 ◽

1979 ◽

Vol 6 (2) ◽

pp. 91-91

Keyword(s):

Data Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Multivariate Data Analysis ◽

Multivariate Data ◽

Normal Ranges ◽

Components Analysis

Download Full-text

Data analysis on sea water quality data in Jakarta Bay using Principal Components Analysis (PCA) method during transitional monsoon 2012

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/339/1/012023 ◽

2019 ◽

Vol 339 ◽

pp. 012023

Author(s):

A Martina ◽

I M Radjawane

Keyword(s):

Water Quality ◽

Data Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Sea Water ◽

Quality Data ◽

Water Quality Data ◽

Pca Method ◽

Components Analysis

Download Full-text

Principal components analysis: theory and application to gene expression data analysis

Genomics and Computational Biology ◽

10.18547/gcb.2018.vol4.iss2.e100041 ◽

2018 ◽

Vol 4 (2) ◽

pp. 100041 ◽

Cited By ~ 8

Author(s):

Hristo Todorov ◽

David Fournier ◽

Susanne Gerber

Keyword(s):

Gene Expression ◽

Data Analysis ◽

Principal Components Analysis ◽

Gene Expression Data ◽

Principal Components ◽

Expression Data ◽

Statistical Tool ◽

Theoretical Understanding ◽

Transgenic Mouse Line ◽

Components Analysis

Advances in computational power have enabled research to generate significant amounts of data related to complex biological problems. Consequently, applying appropriate data analysis techniques has become paramount to tackle this complexity. However, theoretical understanding of statistical methods is necessary to ensure that the correct method is used and that sound inferences are made based on the analysis. In this article, we elaborate on the theory behind principal components analysis (PCA), which has become a favoured multivariate statistical tool in the field of omics-data analysis. We discuss the necessary prerequisites and steps to produce statistically valid results and provide guidelines for interpreting the output. Using PCA on gene expression data from a mouse experiment, we demonstrate that the main distinctive pattern in the data is associated with the transgenic mouse line and is not related to the mouse gender. A weaker association of the pattern with the genotype was also identified.

Download Full-text

APL functions for interactive data analysis: Principal components analysis

Behavior Research Methods ◽

10.3758/bf03202083 ◽

1981 ◽

Vol 13 (5) ◽

pp. 657-666 ◽

Cited By ~ 5

Author(s):

Selby Evans ◽

Jerry D. Neideffer ◽

Fred H. Gage

Keyword(s):

Data Analysis ◽

Principal Components Analysis ◽

Principal Components ◽

Interactive Data Analysis ◽

Interactive Data ◽

Components Analysis

Download Full-text

Neural network based principal components analysis for EEG pre-processing and analysis

Electroencephalography and Clinical Neurophysiology ◽

10.1016/s0013-4694(97)88499-8 ◽

1997 ◽

Vol 103 (1) ◽

pp. 115

Author(s):

S Petránek

Keyword(s):

Neural Network ◽

Principal Components Analysis ◽

Principal Components ◽

Components Analysis

Download Full-text

Principal Components Analysis of the Old and the New: The SAT vs. the CSUC English Placement Test

Measurement and Evaluation in Guidance ◽

10.1080/00256307.1978.12022165 ◽

1978 ◽

Vol 11 (3) ◽

pp. 162-168 ◽

Cited By ~ 2

Author(s):

Roger L. Bailey

Keyword(s):

Principal Components Analysis ◽

Principal Components ◽

Placement Test ◽

Components Analysis

Download Full-text

Diagnoses Generated by Numerical Taxonomic Methods Applied to Standard Blood Variables

Methods of Information in Medicine ◽

10.1055/s-0038-1635282 ◽

1980 ◽

Vol 19 (04) ◽

pp. 205-209

Author(s):

L. A. Abbott ◽

J. B. Mitton

Keyword(s):

Liver Cirrhosis ◽

Renal Disease ◽

Principal Components Analysis ◽

Discriminant Function ◽

Principal Components ◽

Discriminant Function Analysis ◽

Chronic Renal Disease ◽

Function Analysis ◽

Infectious Hepatitis ◽

Components Analysis

Data taken from the blood of 262 patients diagnosed for malabsorption, elective cholecystectomy, acute cholecystitis, infectious hepatitis, liver cirrhosis, or chronic renal disease were analyzed with three numerical taxonomy (NT) methods : cluster analysis, principal components analysis, and discriminant function analysis. Principal components analysis revealed discrete clusters of patients suffering from chronic renal disease, liver cirrhosis, and infectious hepatitis, which could be displayed by NT clustering as well as by plotting, but other disease groups were poorly defined. Sharper resolution of the same disease groups was attained by discriminant function analysis.

Download Full-text