Prediction of Placental Barrier Permeability: A Model Based on Partial Least Squares Variable Selection Procedure

Molecules
2015
Vol 20 (5)
pp. 8270-8286
Author(s):  
Yong-Hong Zhang ◽  
Zhi-Ning Xia ◽  
Li Yan ◽  
Shu-Shen Liu

Author(s):  
Long Han ◽  
Mark J. Embrechts ◽  
Boleslaw K. Szymanski ◽  
Karsten Sternickel ◽  
Alexander Ross

This chapter introduces a novel Levenberg-Marquardt-like second-order algorithm for tuning the Parzen window widths σ in a Radial Basis Function (Gaussian) kernel. In this case, each attribute has its own σ parameter associated with it. The values of the optimized σ are then used as a gauge for variable selection. In this study, the Kernel Partial Least Squares (K-PLS) model is applied to several benchmark data sets in order to estimate the effectiveness of the second-order sigma tuning procedure for an RBF kernel. The variable subset selection method based on these σ values is then compared with different feature selection procedures such as random forests and sensitivity analysis. The sigma-tuned RBF kernel model outperforms K-PLS and SVM models with a single σ value. K-PLS models also compare favorably with Least Squares Support Vector Machines (LS-SVM), epsilon-insensitive Support Vector Regression and traditional PLS. The sigma tuning and variable selection procedure introduced in this chapter is applied to industrial magnetocardiogram data for the detection of ischemic heart disease from measurement of the magnetic field around the heart.
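The per-attribute bandwidth idea from the abstract can be sketched as a small kernel function. This is an illustrative reconstruction, not the chapter's implementation: the function name and the NumPy formulation are assumptions, but the kernel formula (one σ per attribute) follows the description above.

```python
import numpy as np

def rbf_kernel_per_feature(X, Y, sigmas):
    """Gaussian (RBF) kernel with one bandwidth sigma_j per attribute:

        K(x, y) = exp(-sum_j (x_j - y_j)^2 / (2 * sigma_j^2))

    A small optimized sigma_j means the kernel is sensitive to feature j,
    while a very large sigma_j effectively removes feature j -- which is
    why the tuned sigmas can serve as a gauge for variable selection.
    """
    sigmas = np.asarray(sigmas, dtype=float)
    # Scale each attribute by its own bandwidth, then apply a unit RBF.
    Xs = X / sigmas
    Ys = Y / sigmas
    sq = (
        np.sum(Xs**2, axis=1)[:, None]
        + np.sum(Ys**2, axis=1)[None, :]
        - 2.0 * Xs @ Ys.T
    )
    return np.exp(-0.5 * sq)
```

Setting one σ to a huge value makes the corresponding attribute drop out of the kernel, which is the mechanism behind using optimized σ values for feature ranking.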


2002
Vol 56 (3)
pp. 337-345
Author(s):  
S. Kamaledin Setarehdan ◽  
John J. Soraghan ◽  
David Littlejohn ◽  
Daran A. Sadler

Circulation
2021
Vol 143 (Suppl_1)
Author(s):  
Natalie Gasca ◽  
Robyn McClelland

Most nutritional epidemiology studies investigating associations between diet and heart disease use outcome-independent dimension reduction methods, like principal component analysis, to create dietary patterns. While these methods construct patterns that describe important aspects of food consumption, these patterns are not inherently related to heart disease. Incorporating disease data into the pattern construction offers the possibility of more concisely summarizing the most disease-related foods. Sparse partial least squares (SPLS), one such method, was found to have favorable interpretation and prediction properties in the continuous outcome setting; while selecting a subset of relevant foods, it constructed a few dietary patterns that were correlated with BMI while also capturing variation in diet composition. These results were validated with simulated data. We propose incorporating SPLS into the Cox proportional hazards model to analyze a right-censored survival outcome. We hypothesized that this method would inherit the beneficial parsimony properties seen in the continuous setting, and we assessed whether this proposed method could use the most relevant covariates to create a few patterns that were associated with a survival outcome. While the proposed method targets covariate-level sparsity (i.e., variable selection), one competitor method exists that integrates pattern-level parsimony and partial least squares (PLS) in the Cox model, but it imposes more model parameters than the proposed method. We compared the variable selection, pattern selection, and predictive performance of four survival methods (Lasso, PLS, competitor sparse PLS, and proposed SPLS) via a simulation study.
Simulation settings were informed in part by the Multi-Ethnic Study of Atherosclerosis (MESA), which has detailed food frequency questionnaire data on a large multi-ethnic population-based sample (6814 participants aged 45-84), as well as subsequent cardiovascular disease follow-up for over 15 years. In most studied simulation settings, the proposed method selected all 9 relevant predictors and the fewest number of irrelevant predictors (of 15) while creating a similar number of patterns and maintaining predictive ability of the outcome. In the setting most comparable to MESA, PLS chose all 24 predictors (by default) and 3.4 patterns (C-statistic=0.90), the competitor SPLS selected 21.1 predictors and 4.4 patterns (C-statistic=0.91), Lasso chose 16.4 predictors (C-statistic=0.91), and the proposed SPLS selected 11.7 predictors and 4.3 patterns (C-statistic=0.91), on average. We will also present an analysis of a coronary event in MESA using these four survival methods. In conclusion, we propose that using methods like SPLS to summarize food intake can create more heart disease-tailored dietary patterns that can complement the current nutritional epidemiology literature.
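The core SPLS idea in the abstract — an outcome-supervised pattern whose weights are sparsified so that weakly related foods drop out — can be sketched for a single component. This is a minimal illustration, not the authors' estimator: the function name, the soft-threshold sparsity rule, and the single-component restriction are simplifying assumptions.

```python
import numpy as np

def sparse_pls_pattern(X, y, threshold):
    """One SPLS-style pattern (illustrative sketch, first component only).

    The first PLS weight vector is proportional to X^T y, i.e. the
    covariance of each food variable with the outcome.  SPLS adds an
    L1-style soft threshold so foods only weakly related to the outcome
    receive exactly zero weight and are excluded from the pattern.
    """
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    w = Xc.T @ yc                                            # outcome-driven weights
    w = np.sign(w) * np.maximum(np.abs(w) - threshold, 0.0)  # soft threshold
    norm = np.linalg.norm(w)
    if norm > 0:
        w = w / norm
    scores = Xc @ w          # the dietary-pattern score for each subject
    return w, scores
```

In the survival setting described above, such pattern scores would then enter a Cox proportional hazards model as covariates; only the sparsification of the weight vector is shown here.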


2014
Vol 70 (5)
Author(s):  
Nor Fazila Rasaruddin ◽  
Mas Ezatul Nadia Mohd Ruah ◽  
Mohamed Noor Hasan ◽  
Mohd Zuli Jaafar

This paper shows the determination of iodine value (IV) of pure and frying palm oils using Partial Least Squares (PLS) regression with variable selection. A total of 28 samples of pure and frying palm oils were acquired from markets; seven were considered high-priced palm oils while the remainder were low-priced. PLS regression models were developed for the determination of IV using Fourier Transform Infrared (FTIR) spectra in absorbance mode over the range 650 cm-1 to 4000 cm-1. A Savitzky-Golay derivative was applied before developing the prediction models. The models were constructed using wavelengths selected in the FTIR region by means of the selectivity ratio (SR) plot and the correlation coefficient with the IV parameter. Each model was validated through the root mean square error of cross-validation (RMSECV) and the cross-validation correlation coefficient (R2cv). The best model using the SR plot was the model with mean centering for the pure samples and the model with a combination of row scaling and standardization for the frying samples. The best model with correlation-coefficient variable selection was the model with a combination of row scaling and standardization for the pure samples and the model with mean-centering pre-processing for the frying samples. It is not necessary to row-scale the variables when developing the model, since the effect of row scaling on model quality is insignificant.


2018
Vol 8 (2)
pp. 313-341
Author(s):  
Jiajie Chen ◽  
Anthony Hou ◽  
Thomas Y Hou

Abstract In Barber & Candès (2015, Ann. Statist., 43, 2055–2085), the authors introduced a new variable selection procedure called the knockoff filter to control the false discovery rate (FDR) and proved that this method achieves exact FDR control. Inspired by the work of Barber & Candès (2015, Ann. Statist., 43, 2055–2085), we propose a pseudo knockoff filter that inherits some advantages of the original knockoff filter and has more flexibility in constructing its knockoff matrix. Moreover, we perform a number of numerical experiments that seem to suggest that the pseudo knockoff filter with the half Lasso statistic has FDR control and offers more power than the original knockoff filter with the Lasso Path or the half Lasso statistic for the numerical examples that we consider in this paper. Although we cannot establish rigorous FDR control for the pseudo knockoff filter, we provide some partial analysis of the pseudo knockoff filter with the half Lasso statistic and establish a uniform false discovery proportion bound and an expectation inequality.
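The selection step shared by the knockoff filter and the pseudo knockoff filter can be sketched from the Barber & Candès (2015) knockoff+ threshold. The construction of the feature statistics W_j (e.g. via the half Lasso) is not shown; only the thresholding rule is, and the function name is an assumption.

```python
import numpy as np

def knockoff_plus_select(W, q):
    """Knockoff+ selection step (Barber & Candes, 2015).

    W_j is an antisymmetric feature statistic: large positive values are
    evidence that variable j is relevant, and the sign of W_j flips under
    the null.  The data-dependent threshold is

        T = min{ t : (1 + #{j : W_j <= -t}) / max(#{j : W_j >= t}, 1) <= q },

    and the selected set is {j : W_j >= T}.
    """
    ts = np.sort(np.abs(W[W != 0]))          # candidate thresholds
    for t in ts:
        fdp_hat = (1 + np.sum(W <= -t)) / max(np.sum(W >= t), 1)
        if fdp_hat <= q:
            return np.flatnonzero(W >= t)
    return np.array([], dtype=int)           # no threshold meets the target q
```

The "+1" in the numerator is what distinguishes knockoff+ from the plain knockoff threshold and is the ingredient behind exact (rather than modified) FDR control in the original filter.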

