scholarly journals Compositional Canonical Correlation Analysis

2017 ◽  
Author(s):  
Jan Graffelman ◽  
Vera Pawlowsky-Glahn ◽  
Juan José Egozcue ◽  
Antonella Buccianti

AbstractThe study of the relationships between two compositions by means of canonical correlation analysis is addressed A coimnositional version of canonical correlation analysis is developed. and called CODA-CCO. We consider two approaches, using the centred log-ratio transformation and the calculation of all possible pairwise log-ratios within sets. The relationships between both approaches are pointed out, and their merits are discussed. The related covariance matrices are structurally singular, and this is efficiently dealt with by using generalized inverses. We develop compositional canonical biplots and detail their properties. The canonical biplots are shown to be powerful tools for discovering the most salient relationships between two compositions. Some guidelines for compositional canonical biplots construction are discussed. A geological data set with X-ray fluorescence spectrometry measurements on major oxides and trace elements is used to illustrate the proposed method. The relationships between an analysis based on centred log-ratios and on isometric log-ratios are also shown.

2021 ◽  
Vol 12 ◽  
Author(s):  
Dabin Jeong ◽  
Sangsoo Lim ◽  
Sangseon Lee ◽  
Minsik Oh ◽  
Changyun Cho ◽  
...  

Gene expression profile or transcriptome can represent cellular states, thus understanding gene regulation mechanisms can help understand how cells respond to external stress. Interaction between transcription factor (TF) and target gene (TG) is one of the representative regulatory mechanisms in cells. In this paper, we present a novel computational method to construct condition-specific transcriptional networks from transcriptome data. Regulatory interaction between TFs and TGs is very complex, specifically multiple-to-multiple relations. Experimental data from TF Chromatin Immunoprecipitation sequencing is useful but produces one-to-multiple relations between TF and TGs. On the other hand, co-expression networks of genes can be useful for constructing condition transcriptional networks, but there are many false positive relations in co-expression networks. In this paper, we propose a novel method to construct a condition-specific and combinatorial transcriptional network, applying kernel canonical correlation analysis (kernel CCA) to identify multiple-to-multiple TF–TG relations in certain biological condition. Kernel CCA is a well-established statistical method for computing the correlation of a group of features vs. another group of features. We, therefore, employed kernel CCA to embed TFs and TGs into a new space where the correlation of TFs and TGs are reflected. To demonstrate the usefulness of our network construction method, we used the blood transcriptome data for the investigation on the response to high fat diet in a human and an arabidopsis data set for the investigation on the response to cold/heat stress. Our method detected not only important regulatory interactions reported in previous studies but also novel TF–TG relations where a module of TF is regulating a module of TGs upon specific stress.


2020 ◽  
Vol 35 (6) ◽  
pp. 951-951
Author(s):  
Gracian E ◽  
Mathew A ◽  
Jimenez T ◽  
Oleson S ◽  
Kaufman D ◽  
...  

Abstract Objective We used canonical correlation analysis (CCA) to examine the relationship between performance on cognitive neuroscience measures of sustained attention, deterministic reversal learning (DRLT), and visual task-shifting (VTS). We evaluated whether DRLT and VTS predicted performance on the Continuous Performance Test-II (CPT-II). Method Participants were 1011 adults from the Consortium for Neuropsychiatric Phenomics. The first CCA was conducted between four VST variables (set 1) and three CPT-II variables (set 2). The second CCA was conducted using eight Reversal Learning variables (set 1) and three CPT-II variables (set 2). Results Our first CCA suggests that accuracy of performance in VTS predicts CPT-II measures, Rc = 0.33, Wilks’s λ = 0.86, F(12, 2646) = 1.92, p < .001. The analysis revealed a positive relationship with Hits (=0.87) and a negative relationship with FA (= − 0.76), consistent with sustained attention. The second CCA revealed that acquisition trials and RT on reversal trials significantly predicted less FA and more hits on the CPT-II, Rc = 0.23, Wilks’s λ = 0.90, F(24, 1273) = 1.92, p = .005. Conclusion Our multivariate findings confirm that attention is significantly involved in executive and mnemonic processes. To our knowledge, we are the first neuroscientific group to report multivariate evidence from a large data set that confirms sustained attention plays a significant role in reversal learning and task-shifting. Our results show that the CPT-II FA and mean RT variables specifically are important predictors of reversal learning and task-shifting, strengthening the concurrent validity of our experimental measures.


2012 ◽  
Vol 12 (05) ◽  
pp. 1250091 ◽  
Author(s):  
LI ZHANG ◽  
YUDING WANG ◽  
CHUANHONG HE

Eye blink artifact, the main contamination in electroencephalography (EEG), brings serious problems for the analysis of EEG data. In this paper, an online method for eye blink artifact removal is presented. Canonical correlation analysis (CCA) is used to decompose the recorded signals containing several-channel EEG and one-channel vertical electrooculography (EOG). The identification of the artifactual component is fully automatically implemented based on evaluating the similarity between the reference EOG and decomposed CCA components. This method was compared with an independent component analysis based technique on a synthetic data set and achieved comparable performance for removing eye blink artifact. Moreover, the CCA based method is less time-consuming. The proposed method was finally implemented with Labview for removing eye blink artifact in online test. The online experiment results show that the proposed method could fulfill the identification and suppression of eye blink artifact from contaminated EEG in real-time.


2013 ◽  
Vol 2013 ◽  
pp. 1-11 ◽  
Author(s):  
Xun Chen ◽  
Aiping Liu ◽  
Z. Jane Wang ◽  
Hu Peng

Corticomuscular activity modeling based on multiple data sets such as electroencephalography (EEG) and electromyography (EMG) signals provides a useful tool for understanding human motor control systems. In this paper, we propose modeling corticomuscular activity by combining partial least squares (PLS) and canonical correlation analysis (CCA). The proposed method takes advantage of both PLS and CCA to ensure that the extracted components are maximally correlated across two data sets and meanwhile can well explain the information within each data set. This complementary combination generalizes the statistical assumptions beyond both PLS and CCA methods. Simulations were performed to illustrate the performance of the proposed method. We also applied the proposed method to concurrent EEG and EMG data collected in a Parkinson’s disease (PD) study. The results reveal several highly correlated temporal patterns between EEG and EMG signals and indicate meaningful corresponding spatial activation patterns. In PD subjects, enhanced connections between occipital region and other regions are noted, which is consistent with previous medical knowledge. The proposed framework is a promising technique for performing multisubject and bimodal data analysis.


1995 ◽  
Vol 76 (3) ◽  
pp. 959-962 ◽  
Author(s):  
Janette Jelinek ◽  
Martin E. Morf

Correlations were computed among the five personality scales of the NEO Personality Inventory, two measures derived from the Hassles Scale, and eight ways of dealing with stress measured by the Ways of Coping Questionnaire. Subjects were 66 undergraduate psychology students. Canonical correlation analysis suggests that multivariate procedures treating the data set as a whole can detect underlying patterns obscured by large sampling errors at lower levels of analysis.


2018 ◽  
Vol 15 (1) ◽  
pp. 172988141775282 ◽  
Author(s):  
Shiying Sun ◽  
Ning An ◽  
Xiaoguang Zhao ◽  
Min Tan

Object recognition is one of the essential issues in computer vision and robotics. Recently, deep learning methods have achieved excellent performance in red-green-blue (RGB) object recognition. However, the introduction of depth information presents a new challenge: How can we exploit this RGB-D data to characterize an object more adequately? In this article, we propose a principal component analysis–canonical correlation analysis network for RGB-D object recognition. In this new method, two stages of cascaded filter layers are constructed and followed by binary hashing and block histograms. In the first layer, the network separately learns principal component analysis filters for RGB and depth. Then, in the second layer, canonical correlation analysis filters are learned jointly using the two modalities. In this way, the different characteristics of the RGB and depth modalities are considered by our network as well as the characteristics of the correlation between the two modalities. Experimental results on the most widely used RGB-D object data set show that the proposed method achieves an accuracy which is comparable to state-of-the-art methods. Moreover, our method has a simpler structure and is efficient even without graphics processing unit acceleration.


2020 ◽  
Author(s):  
Lluís Revilla ◽  
Aida Mayorgas ◽  
Ana Maria Corraliza ◽  
Maria C. Masamunt ◽  
Amira Metwaly ◽  
...  

AbstractBackgroundPersonalized medicine requires finding relationships between variables that influence a patient’s phenotype and predicting an outcome. Sparse generalized canonical correlation analysis identifies relationships between different groups of variables. This method requires establishing a model of the expected interaction between those variables. Describing these interactions is challenging when the relationship is unknown or when there is no pre-established hypothesis.AimTo develop a method to find the relationships between microbiome and transcriptome data and the relevant clinical variables in a complex disease, such as Crohn’s disease.ResultsWe present here a method to identify interactions based on canonical correlation analysis. Our main contribution is to show that the model is the most important factor to identify relationships between blocks. Analysis were conducted on three independent datasets: a glioma, Crohn’s disease and a pouchitis data set. We describe how to select the optimum hyperparameters on the glioma dataset. Using such hyperparameters on the Crohn’s disease data set, our analysis revealed the best model for identifying relationships between transcriptome, gut microbiome and clinically relevant variables. With the pouchitis data set our analysis revealed that adding the clinically relevant variables improves the average variance explained by the model.ConclusionsThe methodology described herein provides a framework for identifying interactions between sets of (omic) data and clinically relevant variables. Following this method, we found genes and microorganisms that were related to each other independently of the model, while others were specific to the model used. Thus, model selection proved crucial to finding the existing relationships in multi-omics datasets.


2021 ◽  
Vol 19 (1) ◽  
pp. 624-642
Author(s):  
Hongming Liu ◽  
◽  
Yunyuan Gao ◽  
Jianhai Zhang ◽  
Juanjuan Zhang ◽  
...  

<abstract><p>Existing epileptic seizure automatic detection systems are often troubled by high-dimensional electroencephalogram (EEG) features. High-dimensional features will not only bring redundant information and noise, but also reduce the response speed of the system. In order to solve this problem, supervised locality preserving canonical correlation analysis (SLPCCA), which can effectively use both sample category information and nonlinear relationships between features, is introduced. And an epileptic signal classification method based on SLPCCA is proposed. Firstly, the power spectral density and the fluctuation index of the frequency slice wavelet transform are extracted as features from the EEG fragments. Next, SLPCCA obtains the optimal projection direction by maximizing the weight correlation between the paired samples in the class and their neighbors. And the projection combination of original features in the optimal direction is the fusion feature. The fusion features are then input into LS-SVM for training and testing. This method is verified on the Bonn dataset and the CHB-MIT dataset and gets good results. On various classification tasks of Bonn data set, the proposed method achieves an average classification accuracy of 99.16%. On the binary classification task of the inter-seizure and seizure epileptic EEG of the CHB-MIT dataset, the proposed method achieves an average accuracy of 97.18%. The experimental results show that the algorithm achieves excellent results compared with several state-of-the-art methods. In addition, the parameter sensitivity of SLPCCA and the relationship between the dimension of the fusion features and the classification results are discussed. Therefore, the stability and effectiveness of the method are further verified.</p></abstract>


2016 ◽  
Vol 13 (2) ◽  
Author(s):  
Glòria Mateu-Figueras ◽  
Josep Daunis-i-Estadella ◽  
Germà Coenders ◽  
Berta Ferrer-Rosell ◽  
Ricard Serlavós ◽  
...  

The aim of this article is to describe a method for relating two compositions which combines compositional data analysis and canonical correlation analysis (CCA), and to examine its main statistical properties. We use additive log-ratio (alr) transformation on both compositions and apply standard CCA to the transformed data. We show that canonical variates are themselves log-ratios and log-contrasts. The first pair of canonical variates can be interpreted as the log-contrast of a composition that has the maximum correlation with a log-contrast of the other composition. The second pair can be interpreted as the log-contrast of a composition that has the maximum correlation with a log-contrast of the other composition, under the restriction that they are uncorrelated with the first pair, and so on. Using properties from changes of basis, we prove that both canonical correlations and canonical variates are invariant to the choice of divisors in alr transformation. We show how to implement the analysis and interpret the results by means of an illustration from the social sciences field using data from Kolb's Learning Style Inventory and Boyatzis' Philosophical Orientation Questionnaire, which distribute a fixed total score among several learning modes and philosophical orientations.


Sign in / Sign up

Export Citation Format

Share Document