scholarly journals LDBlockShow: a fast and convenient tool for visualizing linkage disequilibrium and haplotype blocks based on variant call format files

Author(s):  
Shan-Shan Dong ◽  
Wei-Ming He ◽  
Jing-Jing Ji ◽  
Chi Zhang ◽  
Yan Guo ◽  
...  

Abstract The triangular correlation heatmap aiming to visualize the linkage disequilibrium (LD) pattern and haplotype block structure of SNPs is ubiquitous component of population-based genetic studies. However, current tools suffered from the problem of time and memory consuming. Here, we developed LDBlockShow, an open source software, for visualizing LD and haplotype blocks from variant call format files. It is time and memory saving. In a test dataset with 100 SNPs from 60 000 subjects, it was at least 10.60 times faster and used only 0.03–13.33% of physical memory as compared with other tools. In addition, it could generate figures that simultaneously display additional statistical context (e.g. association P-values) and genomic region annotations. It can also compress the SVG files with a large number of SNPs and support subgroup analysis. This fast and convenient tool will facilitate the visualization of LD and haplotype blocks for geneticists.

2020 ◽  
Author(s):  
Shan-Shan Dong ◽  
Wei-Ming He ◽  
Jing-Jing Ji ◽  
Chi Zhang ◽  
Yan Guo ◽  
...  

AbstractThe triangular correlation heatmap aiming to visualize the linkage disequilibrium (LD) pattern and haplotype block structure of SNPs is ubiquitous component of population-based genetic studies. However, current tools suffered from the problem of time and memory consuming, and direct calculation from variant call format (VCF) files is not supported. Here we developed LDBlockShow, an open source software, for visualizing LD and haplotype blocks from VCF files. It is time and memory saving. In a test dataset with 100 SNPs from 60,000 subjects, it was at least 429.03 times faster and used only 0.04% – 20.00% of physical memory as compared to other tools. In addition, it could generate figures that simultaneously display additional statistical context (e.g., association P values) and genomic region annotations. It can also compress the SVG files with large number of SNPs and support subgroup analysis. This fast and convenient tool would facilitate the visualization of LD and haplotype blocks for geneticists.


2018 ◽  
Vol 35 (10) ◽  
pp. 1786-1788 ◽  
Author(s):  
Chi Zhang ◽  
Shan-Shan Dong ◽  
Jun-Yang Xu ◽  
Wei-Ming He ◽  
Tie-Lin Yang

Genetics ◽  
2002 ◽  
Vol 161 (3) ◽  
pp. 1269-1278 ◽  
Author(s):  
Bernhard Haubold ◽  
Jürgen Kroymann ◽  
Andreas Ratzka ◽  
Thomas Mitchell-Olds ◽  
Thomas Wiehe

Abstract Arabidopsis thaliana is a highly selfing plant that nevertheless appears to undergo substantial recombination. To reconcile its selfing habit with the observations of recombination, we have sampled the genetic diversity of A. thaliana at 14 loci of ~500 bp each, spread across 170 kb of genomic sequence centered on a QTL for resistance to herbivory. A total of 170 of the 6321 nucleotides surveyed were polymorphic, with 169 being biallelic. The mean silent genetic diversity (πs) varied between 0.001 and 0.03. Pairwise linkage disequilibria between the polymorphisms were negatively correlated with distance, although this effect vanished when only pairs of polymorphisms with four haplotypes were included in the analysis. The absence of a consistent negative correlation between distance and linkage disequilibrium indicated that gene conversion might have played an important role in distributing genetic diversity throughout the region. We tested this by coalescent simulations and estimate that up to 90% of recombination is due to gene conversion.


2021 ◽  
Vol 4 ◽  
Author(s):  
Joeselle Serrana ◽  
Kozo Watanabe

DNA metabarcoding is a robust method for environmental impact assessments of freshwater ecosystems that enables the simultaneous multi-species identification of complex mixed community samples from different origins using extracellular and total genomic DNA. The development and evaluation of DNA metabarcoding protocols for haplotype level resolution require attention, specifically for basic population genetic applications, i.e., analysis to allow genetic diversity estimations and dispersal abilities of the species present in the bulk community samples. Various literature has proposed using DNA metabarcoding for population genetics, and few studies have provided preliminary applications and proof of concepts that always refer to particular taxa. However, further exploration and assessment of the laboratory and bioinformatics strategies are required to unlock the potential of metabarcoding-based population-level ecological assessments. Here, we assessed the ability to infer haplotype information of freshwater macroinvertebrate species from DNA metabarcoding community sequence. Using mock samples with known Sanger-sequenced haplotypes, we also assayed the effects of PCR cycle for the detection and reduction of spurious haplotypes obtained from DNA metabarcoding. We tested our haplotyping strategy on a mock sample containing 20 specimens from four species with known haplotypes based on the 658-bp Folmer region of the mitochondrial cytochrome c oxidase (mtCOI) gene. The read processing and denoising-step resulted in 14 zero-radius operational taxonomic units (ZOTUs) of 421-bp length, with 12 ZOTUs having 100% match with 12 of the mock haplotype sequences. The remaining eight haplotypes that were not detected from the DNA metabarcoding dataset were all the A. decemseta samples (0.01, 0.05, 0.10 ng/μL DNA template concentrations), two E. bulba (0.01 and 0.05 ng/μL), E. latifolium (0.01 ng/μL), and two K. tibialis (0.01 and 0.10 ng/μL). Given that most of the undetected samples had low concentrations, we report the influence of initial DNA template concentration on the amplification from a mock community sample. Our observation is in accordance with previous studies that reported that samples or taxa with low DNA template concentrations have lower detection probability. Accordingly, abundant taxa or samples with high biomass tend to have higher detection probabilities than those rare, smaller or have low biomass from mixed-community samples. The difference in biomass affects haplotypes' detection since most of the large specimens would be retained after read processing. Hence, these factors need to be addressed when metabarcoding-based haplotyping is to be used to infer abundance-based analysis for population genetics applications. The phylogenetic-based analysis (Fig. 1) revealed that the two ZOTUs without taxonomic matches clustered with one of the species from the mock sample. This supports our observation that only the samples with low concentration were unrepresented from the DNA metabarcoding data. Although we still reported false positive detections because two of the 14 ZOTUs failed to have a 100% match with the mock reference sequences, we could at least identify them as A. decemseta sequences based on the phylogenetic approach. Quality passing reads relatively increased with increasing cycle number, and the relative abundance of each ZOTUs was consistent for each cycle number. This suggests that increasing the cycle number, from 24 to 64, did not affect the relative abundance of quality passing filter reads. Our study demonstrated that DNA metabarcoding data could be used to infer intraspecific variability, showing promise for possible applications in population-based genetic studies. As DNA metabarcoding becomes more established and laboratory protocols and bioinformatics pipelines are continuously being developed, our proof of concept study demonstrated that the method could be used to infer intraspecific variability, showing promise for possible applications on population-based genetic studies.


2013 ◽  
Vol 280 (1772) ◽  
pp. 20132400 ◽  
Author(s):  
M. H. D. Larmuseau ◽  
J. Vanoverbeke ◽  
A. Van Geystelen ◽  
G. Defraene ◽  
N. Vanderheyden ◽  
...  

Recent evidence suggests that seeking out extra-pair paternity (EPP) can be a viable alternative reproductive strategy for both males and females in many pair-bonded species, including humans. Accurate data on EPP rates in humans, however, are scant and mostly restricted to extant populations. Here, we provide the first large-scale, unbiased genetic study of historical EPP rates in a Western European human population based on combining Y-chromosomal data to infer genetic patrilineages with genealogical and surname data, which reflect known historical presumed paternity. Using two independent methods, we estimate that over the last few centuries, EPP rates in Flanders (Belgium) were only around 1–2% per generation. This figure is substantially lower than the 8–30% per generation reported in some behavioural studies on historical EPP rates, but comparable with the rates reported by other genetic studies of contemporary Western European populations. These results suggest that human EPP rates have not changed substantially during the last 400 years in Flanders and imply that legal genealogies rarely differ from the biological ones. This result has significant implications for a diverse set of fields, including human population genetics, historical demography, forensic science and human sociobiology.


Gut ◽  
2020 ◽  
pp. gutjnl-2020-322429 ◽  
Author(s):  
Ho-Su Lee ◽  
Evy Lobbestael ◽  
Séverine Vermeire ◽  
João Sabino ◽  
Isabelle Cleynen

Inflammatory bowel disease and Parkinson’s disease are chronic progressive disorders that mainly affect different organs: the gut and brain, respectively. Accumulating evidence has suggested a bidirectional link between gastrointestinal inflammation and neurodegeneration, in accordance with the concept of the ‘gut–brain axis’. Moreover, recent population-based studies have shown that inflammatory bowel disease might increase the risk of Parkinson's disease. Although the precise mechanisms underlying gut–brain interactions remain elusive, some of the latest findings have begun to explain the link. Several genetic loci are shared between both disorders with a similar direction of effect on the risk of both diseases. The most interesting example is LRRK2 (leucine-rich repeat kinase 2), initially identified as a causal gene in Parkinson's disease, and recently also implicated in Crohn’s disease. In this review, we highlight recent findings on the link between these seemingly unrelated diseases with shared genetic susceptibility. We discuss supporting and conflicting data obtained from epidemiological and genetic studies along with remaining questions and concerns. In addition, we discuss possible biological links including the gut–brain axis, microbiota, autoimmunity, mitochondrial function and autophagy.


2006 ◽  
Vol 2 (3) ◽  
pp. 435-438 ◽  
Author(s):  
Niclas Backström ◽  
Anna Qvarnström ◽  
Lars Gustafsson ◽  
Hans Ellegren

Population-based mapping approaches are attractive for tracing the genetic background to phenotypic traits in wild species, given that it is often difficult to gather extensive and well-defined pedigrees needed for quantitative trait locus analysis. However, the feasibility of association or hitch-hiking mapping is dependent on the degree of linkage disequilibrium (LD) in the population, on which there is yet limited information for wild species. Here we use single nucleotide polymorphism (SNP) markers from 23 genes in a recently established linkage map of the Z chromosome of the collared flycatcher, to study the extent of LD in a natural bird population. In most but not all cases we find SNPs within the same intron (less than 500 bp) to be in perfect LD. However, LD then decays to background level at a distance 1 cM or 400–500 kb. Although LD seems more extensive than in other species, if the observed pattern is representative for other regions of the genome and turns out to be a general feature of natural bird populations, dense marker maps might be needed for genome scans aimed at identifying association between marker and trait loci.


2006 ◽  
Vol 7 (2) ◽  
pp. 130-140 ◽  
Author(s):  
A Blomhoff ◽  
M Olsson ◽  
S Johansson ◽  
H E Akselsen ◽  
F Pociot ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document