scholarly journals Use of Wrapper Algorithms Coupled with a Random Forests Classifier for Variable Selection in Large-Scale Genomic Association Studies

2009 ◽  
Vol 16 (12) ◽  
pp. 1705-1718 ◽  
Author(s):  
Andrei S. Rodin ◽  
Anatoliy Litvinenko ◽  
Kathy Klos ◽  
Alanna C. Morrison ◽  
Trevor Woodage ◽  
...  
2021 ◽  
Vol 135 (24) ◽  
pp. 2691-2708
Author(s):  
Simon T. Bond ◽  
Anna C. Calkin ◽  
Brian G. Drew

Abstract The escalating prevalence of individuals becoming overweight and obese is a rapidly rising global health problem, placing an enormous burden on health and economic systems worldwide. Whilst obesity has well described lifestyle drivers, there is also a significant and poorly understood component that is regulated by genetics. Furthermore, there is clear evidence for sexual dimorphism in obesity, where overall risk, degree, subtype and potential complications arising from obesity all differ between males and females. The molecular mechanisms that dictate these sex differences remain mostly uncharacterised. Many studies have demonstrated that this dimorphism is unable to be solely explained by changes in hormones and their nuclear receptors alone, and instead manifests from coordinated and highly regulated gene networks, both during development and throughout life. As we acquire more knowledge in this area from approaches such as large-scale genomic association studies, the more we appreciate the true complexity and heterogeneity of obesity. Nevertheless, over the past two decades, researchers have made enormous progress in this field, and some consistent and robust mechanisms continue to be established. In this review, we will discuss some of the proposed mechanisms underlying sexual dimorphism in obesity, and discuss some of the key regulators that influence this phenomenon.


PLoS ONE ◽  
2008 ◽  
Vol 3 (10) ◽  
pp. e3583 ◽  
Author(s):  
Brendan J. Keating ◽  
Sam Tischfield ◽  
Sarah S. Murray ◽  
Tushar Bhangale ◽  
Thomas S. Price ◽  
...  

2016 ◽  
Vol 9 (1) ◽  
Author(s):  
Silke Szymczak ◽  
Emily Holzinger ◽  
Abhijit Dasgupta ◽  
James D. Malley ◽  
Anne M. Molloy ◽  
...  

2021 ◽  
Vol 22 (1) ◽  
Author(s):  
James M. Kunert-Graf ◽  
Nikita A. Sakhanenko ◽  
David J. Galas

Abstract Background Permutation testing is often considered the “gold standard” for multi-test significance analysis, as it is an exact test requiring few assumptions about the distribution being computed. However, it can be computationally very expensive, particularly in its naive form in which the full analysis pipeline is re-run after permuting the phenotype labels. This can become intractable in multi-locus genome-wide association studies (GWAS), in which the number of potential interactions to be tested is combinatorially large. Results In this paper, we develop an approach for permutation testing in multi-locus GWAS, specifically focusing on SNP–SNP-phenotype interactions using multivariable measures that can be computed from frequency count tables, such as those based in Information Theory. We find that the computational bottleneck in this process is the construction of the count tables themselves, and that this step can be eliminated at each iteration of the permutation testing by transforming the count tables directly. This leads to a speed-up by a factor of over 103 for a typical permutation test compared to the naive approach. Additionally, this approach is insensitive to the number of samples making it suitable for datasets with large number of samples. Conclusions The proliferation of large-scale datasets with genotype data for hundreds of thousands of individuals enables new and more powerful approaches for the detection of multi-locus genotype-phenotype interactions. Our approach significantly improves the computational tractability of permutation testing for these studies. Moreover, our approach is insensitive to the large number of samples in these modern datasets. The code for performing these computations and replicating the figures in this paper is freely available at https://github.com/kunert/permute-counts.


2016 ◽  
Vol 27 (9) ◽  
pp. 2657-2673 ◽  
Author(s):  
Mathieu Emily

The Cochran-Armitage trend test (CA) has become a standard procedure for association testing in large-scale genome-wide association studies (GWAS). However, when the disease model is unknown, there is no consensus on the most powerful test to be used between CA, allelic, and genotypic tests. In this article, we tackle the question of whether CA is best suited to single-locus scanning in GWAS and propose a power comparison of CA against allelic and genotypic tests. Our approach relies on the evaluation of the Taylor decompositions of non-centrality parameters, thus allowing an analytical comparison of the power functions of the tests. Compared to simulation-based comparison, our approach offers the advantage of simultaneously accounting for the multidimensionality of the set of features involved in power functions. Although power for CA depends on the sample size, the case-to-control ratio and the minor allelic frequency (MAF), our results first show that it is largely influenced by the mode of inheritance and a deviation from Hardy–Weinberg Equilibrium (HWE). Furthermore, when compared to other tests, CA is shown to be the most powerful test under a multiplicative disease model or when the single-nucleotide polymorphism largely deviates from HWE. In all other situations, CA lacks in power and differences can be substantial, especially for the recessive mode of inheritance. Finally, our results are illustrated by the comparison of the performances of the statistics in two genome scans.


2018 ◽  
Vol 8 (10) ◽  
pp. 3255-3267 ◽  
Author(s):  
Genevieve L. Wojcik ◽  
Christian Fuchsberger ◽  
Daniel Taliun ◽  
Ryan Welch ◽  
Alicia R Martin ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document