Radiomics Analysis Using Stability Selection Supervised Principal Component Analysis for Right-censored Survival Data

Mapping Intimacies ◽

10.1101/408831 ◽

2018 ◽

Author(s):

Kang K. Yan ◽

Xiaofei Wang ◽

Wendy Lam ◽

Varut Vardhanabhuti ◽

Anne W.M. Lee ◽

...

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Prognostic Markers ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Algorithms ◽

Survival Outcomes ◽

Imaging Features ◽

Biomedical Images ◽

Stability Selection

Radiomics is a newly emerging field that involves the extraction of a large number of quantitative features from biomedical images through the use of data-characterization algorithms. Radiomics provides a noninvasive approach to personalized therapy decisions by identifying distinctive imaging features for predicting prognosis and therapeutic response. So far, many published radiomics studies use existing out-of-the-box algorithms that are not specific to radiomics data to identify prognostic markers from biomedical images. To better utilize biomedical images, we propose a novel machine learning approach, stability selection supervised principal component analysis (SSSuperPCA), that identifies a set of stable features from radiomics big data and couples this with dimension reduction for right-censored survival outcomes. In this paper, we describe stability selection supervised principal component analysis for radiomics data with right-censored survival outcomes. The proposed approach allows us to identify a set of stable features that are highly associated with the survival outcomes, control the per-family error rate, and predict survival in a simple yet meaningful manner. We evaluate the performance of SSSuperPCA using simulations and real data sets for non-small cell lung cancer and head and neck cancer, and compare it with other machine learning algorithms. The results demonstrate that our method has a competitive edge over other existing methods in identifying prognostic markers from biomedical big imaging data for the prediction of right-censored survival outcomes. An R package, SSSuperPCA, is available at http://web.hku.hk/~herbpang/SSSuperPCA.html
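The stability selection idea mentioned in this abstract can be illustrated with a minimal Python sketch. It is not the SSSuperPCA R package and makes several assumptions: the function names are hypothetical, the per-feature score is an absolute Pearson correlation used as a stand-in for a censoring-aware score such as a univariate Cox statistic, and the error bound is the generic Meinshausen-Bühlmann bound on the expected number of false selections, which may differ from how the paper controls the per-family error rate.

```python
import random


def pearson_abs(x, y):
    """Absolute Pearson correlation (stand-in score; it ignores censoring)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy) / (sxx * syy) ** 0.5


def stability_selection(X, y, q, n_subsamples=100, threshold=0.8, seed=0):
    """Return (stable feature indices, per-feature selection frequencies).

    X is a list of rows, y a list of outcomes, q the number of
    top-scoring features kept in each subsample.
    """
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    counts = [0] * p
    for _ in range(n_subsamples):
        # Draw half of the observations without replacement.
        idx = rng.sample(range(n), n // 2)
        ys = [y[i] for i in idx]
        scores = [pearson_abs([X[i][j] for i in idx], ys) for j in range(p)]
        top = sorted(range(p), key=lambda j: scores[j], reverse=True)[:q]
        for j in top:
            counts[j] += 1
    freqs = [c / n_subsamples for c in counts]
    stable = [j for j, f in enumerate(freqs) if f >= threshold]
    return stable, freqs


def pfer_bound(q, p, threshold):
    """Upper bound on expected false selections: q^2 / ((2*threshold - 1) * p)."""
    if threshold <= 0.5:
        raise ValueError("the bound requires threshold > 0.5")
    return q * q / ((2 * threshold - 1) * p)
```

For example, with 10 candidate features, q = 2, and a threshold of 0.8, `pfer_bound(2, 10, 0.8)` gives a bound of about 0.67 expected false selections. In a supervised PCA workflow, the stable features would then be passed to PCA and the leading components used as predictors in a survival model.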

Comparative study of Principal Component Analysis based Intrusion Detection approach using machine learning algorithms

2015 3rd International Conference on Signal Processing, Communication and Networking (ICSCN) ◽

10.1109/icscn.2015.7219853 ◽

2015 ◽

Cited By ~ 5

Author(s):

Krupa Joel Chabathula ◽

C. D. Jaidhar ◽

M. A. Ajay Kumara

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Intrusion Detection ◽

Comparative Study ◽

Learning Algorithms ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Algorithms ◽

Detection Approach

Gap‐filling approaches for eddy covariance methane fluxes: A comparison of three machine learning algorithms and a traditional method with principal component analysis

Global Change Biology ◽

10.1111/gcb.14845 ◽

2019 ◽

Vol 26 (3) ◽

pp. 1499-1518 ◽

Cited By ~ 12

Author(s):

Yeonuk Kim ◽

Mark S. Johnson ◽

Sara H. Knox ◽

T. Andrew Black ◽

Higo J. Dalmagro ◽

...

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Eddy Covariance ◽

Traditional Method ◽

Learning Algorithms ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Algorithms ◽

Gap Filling ◽

Methane Fluxes

Data Mining and Principal Component Analysis on Coimbra Breast Cancer Dataset

Proceedings of Intelligent Computing and Technologies Conference ◽

10.21467/proceedings.115.5 ◽

2021 ◽

Author(s):

Anupam Sen

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Breast Cancer Dataset ◽

Analysis Tool ◽

Machine Learning Classification

Machine learning (ML) techniques play an important role in the medical field. Early diagnosis is required to improve the treatment of carcinoma. In this analysis, the Breast Cancer Coimbra dataset (BCCD), which has ten predictors, is analyzed to classify carcinoma. Feature selection methods and machine learning algorithms are applied to the dataset, which comes from the UCI repository. The WEKA (“Waikato Environment for Knowledge Analysis”) tool is used to run the machine learning techniques, and Principal Component Analysis (PCA) is used for feature extraction. Different machine learning classification algorithms are applied through WEKA, such as Glmnet, Gbm, ada Boosting, Adabag Boosting, C50, Cforest, DcSVM, fnn, Ksvm, and Node Harvest, and they are compared on accuracy as well as on the Kappa statistic, Mean Absolute Error (MAE), and Root Mean Square Error (RMSE). The 10-fold cross-validation method is used for training, testing, and validation.
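The evaluation measures named in this abstract follow standard textbook definitions, sketched below in Python. This is an illustration only: the function names are hypothetical, and WEKA's own reporting (for instance, computing MAE and RMSE over predicted class probabilities) may differ in detail.

```python
import math
from collections import Counter


def cohen_kappa(actual, predicted):
    """Cohen's kappa: agreement between two labelings, corrected for chance."""
    n = len(actual)
    observed = sum(a == p for a, p in zip(actual, predicted)) / n
    count_a, count_p = Counter(actual), Counter(predicted)
    # Chance agreement from the marginal label frequencies.
    expected = sum(count_a[k] * count_p[k] for k in count_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)


def mean_absolute_error(actual, predicted):
    """Average absolute difference between numeric targets and predictions."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def root_mean_square_error(actual, predicted):
    """Square root of the average squared difference."""
    mse = sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    return math.sqrt(mse)
```

With labels `[1, 1, 1, 0, 0, 0, 1, 0]` and predictions `[1, 1, 0, 0, 0, 1, 1, 0]`, accuracy is 0.75 and chance agreement is 0.5, so kappa is 0.5.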

Churn Prediction in Telecom Industry Using Machine Learning Algorithms with K-Best and Principal Component Analysis

Algorithms for Intelligent Systems - Applications of Artificial Intelligence in Engineering ◽

10.1007/978-981-33-4604-8_40 ◽

2021 ◽

pp. 499-507

Author(s):

K. V. Anjana ◽

Siddhaling Urolagin

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Learning Algorithms ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Algorithms ◽

Churn Prediction ◽

Telecom Industry

Analysis of Supervised Machine Learning Algorithms for Heart Disease Prediction with Reduced Number of Attributes using Principal Component Analysis

International Journal of Computer Applications ◽

10.5120/ijca2016909231 ◽

2016 ◽

Vol 140 (2) ◽

pp. 27-31 ◽

Cited By ~ 1

Author(s):

Ayon Dey ◽

Jyoti Singh ◽

Neeta Singh

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Heart Disease ◽

Learning Algorithms ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Disease Prediction

Classification of Observations through Combination of the Dimension Reduction and the Cluster Analysis

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v7i8.13 ◽

2017 ◽

Vol 7 (8) ◽

pp. 30

Author(s):

Hyeuk Kim

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Cluster Analysis ◽

Unsupervised Learning ◽

Principal Component ◽

Component Analysis ◽

Baseball Players ◽

Partitioning Around Medoids ◽

Different Characteristics

Unsupervised learning in machine learning divides data into several groups. Observations in the same group have similar characteristics, and observations in different groups have different characteristics. In this paper, we classify data by partitioning around medoids, which has some advantages over k-means clustering. We apply it to baseball players in the Korea Baseball League. We also apply principal component analysis to the data and draw a graph using two components as the axes. We interpret the meaning of the clustering graphically through this procedure. The combination of partitioning around medoids and principal component analysis can be applied to any other data, and the approach makes it easy to identify the characteristics of each group.
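As a rough illustration of partitioning around medoids, the Python sketch below runs a greedy build phase followed by a swap phase that accepts the first improving swap (a simplification of classic PAM, which evaluates all swaps and takes the best one). The function names and the Euclidean distance are assumptions made for the example and are not taken from the paper.

```python
def euclidean(a, b):
    """Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def total_cost(points, medoids):
    """Sum of distances from each point to its nearest medoid."""
    return sum(min(euclidean(p, points[m]) for m in medoids) for p in points)


def pam(points, k):
    """Return (medoid indices, cluster label per point)."""
    medoids = []
    # BUILD: greedily add the point that lowers the total cost the most.
    for _ in range(k):
        candidates = [i for i in range(len(points)) if i not in medoids]
        best = min(candidates, key=lambda i: total_cost(points, medoids + [i]))
        medoids.append(best)
    # SWAP: replace a medoid with a non-medoid whenever that lowers the cost.
    improved = True
    while improved:
        improved = False
        best_cost = total_cost(points, medoids)
        for mi in range(k):
            for h in range(len(points)):
                if h in medoids:
                    continue
                trial = medoids[:mi] + [h] + medoids[mi + 1:]
                cost = total_cost(points, trial)
                if cost < best_cost - 1e-12:
                    medoids, best_cost, improved = trial, cost, True
    # Label each point with the position of its nearest medoid.
    labels = [
        min(range(k), key=lambda j: euclidean(p, points[medoids[j]]))
        for p in points
    ]
    return medoids, labels
```

Unlike k-means centroids, the medoids are actual observations, so each cluster can be summarized by a representative member (for instance, a representative player). Projecting the points onto the first two principal components then gives a two-dimensional plot on which the cluster labels can be drawn.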

Analysis of the Bath Motion in the MM-SQC Dynamics Using Unsupervised Machine Learning Dimensionality Reduction Approaches: Principal Component Analysis

10.26434/chemrxiv.13332530 ◽

2020 ◽

Author(s):

Jiawei Peng ◽

Yu Xie ◽

Deping Hu ◽

Zhenggang Lan

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Collective Motion ◽

Principal Component ◽

Component Analysis ◽

Nonadiabatic Dynamics ◽

Trajectory Data ◽

Unsupervised Machine Learning ◽

Physical Knowledge ◽

Vibronic Couplings

The system-plus-bath model is an important tool for understanding nonadiabatic dynamics in large molecular systems. Understanding the collective motion of a huge number of bath modes is essential to reveal their key roles in the overall dynamics. We apply principal component analysis (PCA) to investigate the bath motion based on the massive data generated from MM-SQC (the symmetrical quasi-classical dynamics method based on the Meyer-Miller mapping Hamiltonian) nonadiabatic dynamics simulations of excited-state energy transfer in a Frenkel-exciton model. The PCA method clearly shows that two types of bath modes, those that display strong vibronic couplings and those whose frequencies are close to the electronic transition, are very important to the nonadiabatic dynamics. These observations are fully consistent with physical insight. This conclusion is obtained purely from the PCA of the trajectory data, without heavy involvement of pre-defined physical knowledge. The results show that PCA, one of the simplest unsupervised machine learning methods, is a powerful tool for analyzing complicated nonadiabatic dynamics in the condensed phase involving many degrees of freedom.
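To make the PCA step concrete, here is a small Python sketch that extracts the leading principal component from a set of rows (for example, snapshots of coordinates) using power iteration on the sample covariance matrix. It is a generic illustration with hypothetical function names, not the authors' analysis code, and power iteration is only one of several ways to obtain the leading eigenvector.

```python
def covariance_matrix(rows):
    """Sample covariance matrix of rows (each row is one observation)."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    return [
        [sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(d)]
        for a in range(d)
    ]


def leading_component(cov, iterations=500):
    """Return (eigenvalue, unit eigenvector) of the largest eigenvalue.

    The start vector is proportional to (1, 2, ..., d); the iteration
    fails to converge if this vector is orthogonal to the leading
    eigenvector, which a random start would make unlikely.
    """
    d = len(cov)
    v = [float(i + 1) for i in range(d)]
    norm = sum(x * x for x in v) ** 0.5
    v = [x / norm for x in v]
    for _ in range(iterations):
        w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = sum(x * x for x in w) ** 0.5
        if norm == 0.0:
            break
        v = [x / norm for x in w]
    # Rayleigh quotient gives the eigenvalue for the unit vector v.
    eigenvalue = sum(
        v[a] * sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)
    )
    return eigenvalue, v


def explained_ratio(cov, eigenvalue):
    """Share of the total variance (trace of cov) captured by eigenvalue."""
    trace = sum(cov[i][i] for i in range(len(cov)))
    return eigenvalue / trace
```

The entries of the returned eigenvector indicate how strongly each original coordinate contributes to the dominant collective motion, which is the kind of information used to single out important modes.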

Comparative Analysis of Machine Learning Techniques with Principal Component Analysis on Kidney and Heart Disease

10.1109/icesc51422.2021.9533011 ◽

2021 ◽

Author(s):

Reena Chandra ◽

Manoj Kapil ◽

Avinash Sharma

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Heart Disease ◽

Comparative Analysis ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Techniques ◽

Learning Techniques

Criteria for choosing the number of dimensions in a principal component analysis: An empirical assessment

10.5753/sbbd.2020.13632 ◽

2020 ◽

Author(s):

Renata Silva ◽

Daniel Oliveira ◽

Davi Pereira Santos ◽

Lucio F.D. Santos ◽

Rodrigo Erthal Wilson ◽

...

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Hypothesis Test ◽

Feature Learning ◽

Principal Component ◽

Component Analysis ◽

Scree Plot ◽

Open Issue ◽

Chained Tasks ◽

High Dimensional Datasets

Principal component analysis (PCA) is an efficient model for the optimization problem of finding d' axes of a subspace R^d' ⊆ R^d so that the mean squared distances from a given set R of points to the axes are minimal. Despite being steadily employed since 1901 in different scenarios, e.g., mechanics, PCA has become an important link in machine learning chained tasks, such as feature learning and AutoML designs. A frequent yet open issue that arises from supervised-based problems is how many PCA axes are required for the performance of machine learning constructs to be tuned. Accordingly, we investigate the behavior of six independent and uncoupled criteria for estimating the number of PCA axes, namely Scree-Plot %, Scree-Plot Gap, Kaiser-Guttman, Broken-Stick, p-Score, and 2D. In total, we evaluate the performance of those approaches in 20 high-dimensional datasets by using (i) four different classifiers, and (ii) a hypothesis test upon the reported F-Measures. Results indicate that the Broken-Stick and Scree-Plot % criteria consistently outperformed the competitors regarding supervised-based tasks, whereas the Kaiser-Guttman and Scree-Plot Gap estimators delivered poor performances in the same scenarios.
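Three of the criteria named above have common textbook formulations, sketched here in Python for a list of PCA eigenvalues. The function names are hypothetical, and the paper's exact definitions (in particular of "Scree-Plot %", treated here as a cumulative explained-variance threshold) are assumptions and may differ.

```python
def kaiser_guttman(eigenvalues):
    """Keep components whose eigenvalue exceeds the mean eigenvalue."""
    mean = sum(eigenvalues) / len(eigenvalues)
    return sum(1 for ev in eigenvalues if ev > mean)


def broken_stick(eigenvalues):
    """Keep leading components whose variance share beats the broken-stick share.

    The expected share of the k-th of p components is (1/p) * sum_{i=k..p} 1/i.
    Counting stops at the first component that fails the comparison.
    """
    evs = sorted(eigenvalues, reverse=True)
    p, total = len(evs), sum(evs)
    kept = 0
    for k in range(1, p + 1):
        expected = sum(1.0 / i for i in range(k, p + 1)) / p
        if evs[k - 1] / total > expected:
            kept += 1
        else:
            break
    return kept


def cumulative_variance(eigenvalues, target=0.9):
    """Smallest number of components whose cumulative share reaches target."""
    evs = sorted(eigenvalues, reverse=True)
    total = sum(evs)
    running = 0.0
    for k, ev in enumerate(evs, start=1):
        running += ev
        if running / total >= target:
            return k
    return len(evs)
```

For the eigenvalues `[4.0, 2.0, 1.0, 0.5, 0.5]`, these three rules keep 2, 1, and 4 components respectively, which shows how strongly the choice of criterion can change the resulting dimensionality.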

A machine learning approach to medical data identification through principal component analysis

Big Data III: Learning, Analytics, and Applications ◽

10.1117/12.2586038 ◽

2021 ◽

Author(s):

Lorenzo E. Jaques ◽

Arthur C. Depoian ◽

Dong Xie ◽

Colleen P. Bailey ◽

Parthasarathy Guturu

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Medical Data ◽

Learning Approach ◽

Machine Learning Approach
