variable selection problem
Recently Published Documents


TOTAL DOCUMENTS

18
(FIVE YEARS 1)

H-INDEX

6
(FIVE YEARS 0)

Mathematics ◽  
2021 ◽  
Vol 9 (3) ◽  
pp. 218
Author(s):  
Gonzalo García-Donato ◽  
María Eugenia Castellanos ◽  
Alicia Quirós

In health sciences, identifying the leading causes that govern the behaviour of a response variable is a question of crucial interest. Formally, this can be formulated as a variable selection problem. In this paper, we introduce the basic concepts of the Bayesian approach for variable selection based on model choice, emphasizing the model space prior adoption and the algorithms for sampling from the model space and for posterior probabilities approximation; and show its application to two common problems in health sciences. The first concerns a problem in the field of genetics while the second is a longitudinal study in cardiology. In the context of these applications, considerations about control for multiplicity via the prior distribution over the model space, linear models in which the number of covariates exceed the sample size, variable selection with censored data, and computational aspects are discussed. The applications presented here also have an intrinsic statistical interest as the proposed models go beyond the standard general linear model. We believe this work will broaden the access of practitioners to Bayesian methods for variable selection.


Entropy ◽  
2020 ◽  
Vol 22 (9) ◽  
pp. 948
Author(s):  
Stefano Cabras

The variable selection problem in general, and specifically for the ordinary linear regression model, is considered in the setup in which the number of covariates is large enough to prevent the exploration of all possible models. In this context, Gibbs-sampling is needed to perform stochastic model exploration to estimate, for instance, the model inclusion probability. We show that under a Bayesian non-parametric prior model for analyzing Gibbs-sampling output, the usual empirical estimator is just the asymptotic version of the expected posterior inclusion probability given the simulation output from Gibbs-sampling. Other posterior conditional estimators of inclusion probabilities can also be considered as related to the latent probabilities distributions on the model space which can be sampled given the observed Gibbs-sampling output. This paper will also compare, in this large model space setup the conventional prior approach against the non-local prior approach used to define the Bayes Factors for model selection. The approach is exposed along with simulation samples and also an application of modeling the Travel and Tourism factors all over the world.


2020 ◽  
Author(s):  
Lu Wang ◽  
Feng Vankee Lin ◽  
Martin Cole ◽  
Zhengwu Zhang

AbstractStructural brain networks constructed from diffusion MRI are important biomarkers for understanding human brain structure and its relation to cognitive functioning. There is increasing interest in learning differences in structural brain networks between groups of subjects in neuroimaging studies, leading to a variable selection problem in network classification. Traditional methods often use independent edgewise tests or unstructured generalized linear model (GLM) with regularization on vectorized networks to select edges distinguishing the groups, which ignore the network structure and make the results hard to interpret. In this paper, we develop a symmetric bilinear logistic regression (SBLR) with elastic-net penalty to identify a set of small clique subgraphs in network classification. Clique subgraphs, consisting of all the interconnections among a subset of brain regions, have appealing neurological interpretations as they may correspond to some anatomical circuits in the brain related to the outcome. We apply this method to study differences in the structural connectome between adolescents with high and low crystallized cognitive ability, using the crystallized cognition composite score, picture vocabulary and oral reading recognition tests from NIH Toolbox. A few clique subgraphs containing several small sets of brain regions are identified between different levels of functioning, indicating their importance in crystallized cognition.


2019 ◽  
Vol 144 (2) ◽  
pp. 625-646 ◽  
Author(s):  
Erdem Yörük ◽  
İbrahim Öker ◽  
Kerem Yıldırım ◽  
Burcu Yakut-Çakar

Proceedings ◽  
2018 ◽  
Vol 2 (18) ◽  
pp. 1190
Author(s):  
Silvia Novo ◽  
Germán Aneiros ◽  
Philippe Vieu

The variable selection problem is studied in the sparse semi-functional partial linear model, with single-index type influence of the functional covariate in the response. The penalized least squares procedure is employed for this task. Some properties of the resultant estimators are derived: the existence (and rate of convergence) of a consistent estimator for the parameters in the linear part and an oracle property for the variable selection method. Finally, a real data application illustrates the good performance of our procedure.


2018 ◽  
Author(s):  
Colleen Molloy Farrelly

Paper overviews variable selection problem in high dimensionality, particularly focused on genetic psychiatry and genetic epidemiology in general. Genetic and quantum evolutionary algorithms, tree-based classification/regression models, random forest, and other approaches are detailed. Paper concludes with a roadmap for new algorithm and two-stage selection methodology.


2017 ◽  
Vol 13 (11) ◽  
pp. 659-666
Author(s):  
Lauro Cassio Martins de Paula ◽  
Anderson da Silva Soares ◽  
Telma Woerle Soares ◽  
Anselmo Elcana Pereira ◽  
Clarimar José Coelho

Sign in / Sign up

Export Citation Format

Share Document