BLINK: A Package for Next Level of Genome Wide Association Studies with Both Individuals and Markers in Millions

bioRxiv ◽

10.1101/227249 ◽

2017 ◽

Cited By ~ 2

Author(s):

Meng Huang ◽

Xiaolei Liu ◽

Yao Zhou ◽

Ryan M. Summers ◽

Zhiwu Zhang

Keyword(s):

Big Data ◽

Linkage Disequilibrium ◽

Association Studies ◽

Information Criteria ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Effect Model ◽

Computationally Expensive ◽

Bin Method

Big data, accumulated from biomedical and agronomic studies, provides the potential to identify genes controlling complex human diseases and agriculturally important traits through genome-wide association studies (GWAS). However, big data also leads to extreme computational challenges, especially when sophisticated statistical models are employed to simultaneously reduce false positives and false negatives. The recently developed Fixed and random model Circulating Probability Unification (FarmCPU) method uses a bin method under the assumption that Quantitative Trait Nucleotides (QTNs) are evenly distributed throughout the genome. The estimated QTNs are used to separate a mixed linear model into a computationally efficient fixed effect model (FEM) and a computationally expensive random effect model (REM), which are then used iteratively. To eliminate the computationally expensive REM entirely, we replaced the REM with an FEM in which QTNs are selected using the Bayesian information criterion. To eliminate the requirement that QTNs be evenly distributed throughout the genome, we replaced the bin method with linkage disequilibrium information. The new method is called Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK). Analyses of both real and simulated data demonstrated that BLINK improves statistical power compared to FarmCPU, in addition to substantially reducing computing time. A dataset with half a million markers and one million individuals can now be analyzed within five hours, compared with one week using FarmCPU.
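The two ideas named in this abstract, pruning candidate markers by linkage disequilibrium and choosing among nested fixed-effect models by the Bayesian information criterion, can be illustrated with a minimal sketch. This is not the BLINK package's code: the function names, the r² threshold of 0.7, the plain least-squares fit, and the Gaussian BIC formula `n*ln(RSS/n) + k*ln(n)` are all assumptions made for illustration. Markers are assumed to be already ranked from most to least significant.

```python
import numpy as np

def bic_linear(y, X):
    """BIC of an ordinary least-squares fit of y on the columns of X
    (Gaussian likelihood up to a constant: n*ln(RSS/n) + k*ln(n))."""
    n, k = X.shape
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    rss = float(np.sum((y - X @ beta) ** 2))
    rss = max(rss, 1e-12)  # guard against log(0) on a perfect fit
    return n * np.log(rss / n) + k * np.log(n)

def ld_prune(G, ranked, r2_max=0.7):
    """Walk markers in ranked order and keep a marker only if its squared
    correlation with every already-kept marker is below r2_max.
    Hypothetical helper; a constant (monomorphic) column yields NaN and is dropped."""
    kept = []
    for j in ranked:
        if all(np.corrcoef(G[:, j], G[:, i])[0, 1] ** 2 < r2_max for i in kept):
            kept.append(j)
    return kept

def select_by_bic(y, G, ranked):
    """Among the nested models {intercept}, {intercept + top 1 marker}, ...,
    return the marker prefix whose fixed-effect model has the lowest BIC."""
    n = len(y)
    intercept = np.ones((n, 1))
    best_k, best_bic = 0, bic_linear(y, intercept)
    for k in range(1, len(ranked) + 1):
        X = np.hstack([intercept, G[:, ranked[:k]]])
        b = bic_linear(y, X)
        if b < best_bic:
            best_k, best_bic = k, b
    return list(ranked[:best_k])
```

A typical call would be `select_by_bic(y, G, ld_prune(G, ranked))`, where `G` is an individuals-by-markers array of 0/1/2 genotype codes and `y` is the phenotype vector. Because every candidate model is a fixed-effect least-squares fit, no variance-component estimation is needed at any step, which is the source of the speed-up the abstract describes.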


The Impact of Incomplete Linkage Disequilibrium and Genetic Model Choice on the Analysis and Interpretation of Genome-wide Association Studies

Annals of Human Genetics ◽

10.1111/j.1469-1809.2010.00579.x ◽

2010 ◽

Vol 74 (4) ◽

pp. 375-379 ◽

Cited By ~ 6

Author(s):

Mark M. Iles

Keyword(s):

Linkage Disequilibrium ◽

Genetic Model ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Model Choice ◽

Genome Wide ◽

The Impact


Addressing Provenance Issues in Big Data Genome Wide Association Studies (GWAS)

2016 IEEE First International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE) ◽

10.1109/chase.2016.79 ◽

2016 ◽

Author(s):

David Lauzon ◽

Beatriz Kanzki ◽

Victor Dupuy ◽

Alain April ◽

Michael S. Phillips ◽

...

Keyword(s):

Big Data ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide


A New Diversity Panel for Winter Rapeseed (Brassica napus L.) Genome-Wide Association Studies

Agronomy ◽

10.3390/agronomy10122006 ◽

2020 ◽

Vol 10 (12) ◽

pp. 2006

Author(s):

David P. Horvath ◽

Michael Stamm ◽

Zahirul I. Talukder ◽

Jason Fiedler ◽

Aidan P. Horvath ◽

...

Keyword(s):

Linkage Disequilibrium ◽

Brassica Napus ◽

Association Studies ◽

Decay Rates ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

High Quality ◽

Brassica Napus L ◽

Genome Wide ◽

Quality Markers

A diverse population (429 members) of canola (Brassica napus L.) consisting primarily of winter biotypes was assembled and used in genome-wide association studies. Genotyping-by-sequencing analysis of the population identified and mapped 290,972 high-quality markers, with the proportion of missing markers per line ranging from 18.5 to 82.4% and averaging 36.8%. After interpolation, 251,575 high-quality markers remained. After removing markers with low minor allele counts (retaining those with a count > 5), we were left with 190,375 markers. The average distance between these markers is 4463 bases, with a median of 69 and a range from 1 to 281,248 bases. The heterozygosity among the imputed population ranges from 0.9 to 11.0%, with an average of 5.4%. The filtered and imputed dataset was used to determine population structure and kinship, which indicated that the population had minimal structure, with a best K value of 2–3. These results also indicated that the majority of the population has substantial sequence from a single population, with sub-clusters of, and admixtures with, a very small number of other populations. Chromosomal linkage disequilibrium decay ranged from ~7 kb for chromosome A01 to ~68 kb for chromosome C01. Local linkage decay rates determined for all 500 kb windows with a 10 kb sliding step showed a wide range of linkage disequilibrium decay rates, indicating numerous crossover hotspots within this population, and provide a resource for determining the likely limits of linkage disequilibrium from any given marker within which to identify candidate genes. This population and the resources provided here should serve as helpful tools for investigating genetics in winter canola.
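The minor-allele-count filter and the marker-spacing summary described in this abstract can be sketched as follows. This is an illustration, not the authors' pipeline: the function names, the 0/1/2 diploid dosage coding with `None` for missing calls, and the dictionary layout of a marker are all assumptions.

```python
from statistics import mean, median

def minor_allele_count(genotypes):
    """Count copies of the rarer allele at one marker.
    Assumes 0/1/2 alternate-allele dosages for diploid calls; None marks a missing call."""
    called = [g for g in genotypes if g is not None]
    alt = sum(called)
    ref = 2 * len(called) - alt
    return min(alt, ref)

def filter_markers(markers, min_count=5):
    """Keep markers whose minor allele count is strictly greater than min_count.
    Each marker is a hypothetical dict with a 'genotypes' list."""
    return [m for m in markers if minor_allele_count(m["genotypes"]) > min_count]

def spacing_summary(positions):
    """Mean and median gap, in bases, between adjacent markers on one chromosome."""
    pos = sorted(positions)
    gaps = [b - a for a, b in zip(pos, pos[1:])]
    return mean(gaps), median(gaps)
```

A mean gap far larger than the median, as reported above (4463 versus 69 bases), is what such a summary returns when most markers sit in tight clusters separated by a few long marker-free stretches.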


A hierarchical Bayesian network approach for linkage disequilibrium modeling and data-dimensionality reduction prior to genome-wide association studies

BMC Bioinformatics ◽

10.1186/1471-2105-12-16 ◽

2011 ◽

Vol 12 (1) ◽

Cited By ~ 26

Author(s):

Raphaël Mourad ◽

Christine Sinoquet ◽

Philippe Leray

Keyword(s):

Linkage Disequilibrium ◽

Dimensionality Reduction ◽

Bayesian Network ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Hierarchical Bayesian ◽

Network Approach ◽

Genome Wide ◽

Data Dimensionality Reduction


An Academic Clinician’s Road Map to Hypertension Genomics

Hypertension ◽

10.1161/hypertensionaha.120.14535 ◽

2021 ◽

Author(s):

Emma F. Magavern ◽

Helen R. Warren ◽

Fu L. Ng ◽

Claudia P. Cabrera ◽

Patricia B. Munroe ◽

...

Keyword(s):

Big Data ◽

Essential Hypertension ◽

Clinical Benefit ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Polygenic Risk ◽

Road Map ◽

Genome Wide ◽

Cardiovascular Morbidity And Mortality

At the dawn of the new decade, it is judicious to reflect on the boom of knowledge about polygenic risk for essential hypertension supplied by the wealth of genome-wide association studies. Hypertension continues to account for significant cardiovascular morbidity and mortality, with increasing prevalence anticipated. Here, we review recent advances in the use of big data to understand polygenic hypertension, as well as opportunities for future innovation to translate this windfall of knowledge into clinical benefit.


Faculty Opinions recommendation of Magnitude and distribution of linkage disequilibrium in population isolates and implications for genome-wide association studies.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1032179.497533 ◽

2006 ◽

Author(s):

Tony Long

Keyword(s):

Linkage Disequilibrium ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide


Faculty Opinions recommendation of Magnitude and distribution of linkage disequilibrium in population isolates and implications for genome-wide association studies.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1032179.373886 ◽

2006 ◽

Author(s):

Karin Schmitt

Keyword(s):

Linkage Disequilibrium ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide


The Molecular Revolution in Cutaneous Biology: The Era of Genome-Wide Association Studies and Statistical, Big Data, and Computational Topics

Journal of Investigative Dermatology ◽

10.1016/j.jid.2016.03.047 ◽

2017 ◽

Vol 137 (5) ◽

pp. e113-e118 ◽

Cited By ~ 11

Author(s):

Hima Anbunathan ◽

Anne M. Bowcock

Keyword(s):

Big Data ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide


Improving the detection of pathways in genome-wide association studies by combined effects of SNPs from Linkage Disequilibrium blocks

Scientific Reports ◽

10.1038/s41598-017-03826-2 ◽

2017 ◽

Vol 7 (1) ◽

Cited By ~ 4

Author(s):

Huiying Zhao ◽

Dale R. Nyholt ◽

Yuanhao Yang ◽

Jihua Wang ◽

Yuedong Yang

Keyword(s):

Linkage Disequilibrium ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Combined Effects ◽

Genome Wide


A method combining a random forest-based technique with the modeling of linkage disequilibrium through latent variables, to run multilocus genome-wide association studies

BMC Bioinformatics ◽

10.1186/s12859-018-2054-0 ◽

2018 ◽

Vol 19 (1) ◽

Cited By ~ 3

Author(s):

Christine Sinoquet

Keyword(s):

Linkage Disequilibrium ◽

Random Forest ◽

Latent Variables ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide
