scholarly journals Atlas of Transcription Factor Binding Sites from ENCODE DNase Hypersensitivity Data Across 27 Tissue Types

2018 ◽  
Author(s):  
Cory C. Funk ◽  
Alex M. Casella ◽  
Segun Jung ◽  
Matthew A. Richards ◽  
Alex Rodriguez ◽  
...  

AbstractThere is intense interest in mapping the tissue-specific binding sites of transcription factors in the human genome to reconstruct gene regulatory networks and predict functions for non-coding genetic variation. DNase-seq footprinting provides a means to predict genome-wide binding sites for hundreds of transcription factors (TFs) simultaneously. However, despite the public availability of DNase-seq data for hundreds of samples, there is neither a unified analytical workflow nor a publicly accessible database providing the locations of footprints across all available samples. Here, we implemented a workflow for uniform processing of footprints using two state-of-the-art footprinting algorithms: Wellington and HINT. Our workflow scans the footprints generated by these algorithms for 1,530 sequence motifs to predict binding sites for 1,515 human transcription factors. We applied our workflow to detect footprints in 192 DNase-seq experiments from ENCODE spanning 27 human tissues. This collection of footprints describes an expansive landscape of potential TF occupancy. At thresholds optimized through machine learning, we report high-quality footprints covering 9.8% of the human genome. These footprints were enriched for true positive TF binding sites as defined by ChIP-seq peaks, as well as for genetic variants associated with changes in gene expression. Integrating our footprint atlas with summary statistics from genome-wide association studies revealed that risk for neuropsychiatric traits was enriched specifically at highly-scoring footprints in human brain, while risk for immune traits was enriched specifically at highly-scoring footprints in human lymphoblasts. Our cloud-based workflow is available at github.com/globusgenomics/genomics-footprint and a database with all footprints and TF binding site predictions are publicly available at http://data.nemoarchive.org/other/grant/sament/sament/footprint_atlas.

2021 ◽  
Vol 8 (6) ◽  
pp. 70
Author(s):  
Mathilde R. Rivaud ◽  
Michiel Blok ◽  
Monique R. M. Jongbloed ◽  
Bastiaan J. Boukens

The electrophysiological signatures of the myocardium in cardiac structures, such as the atrioventricular node, pulmonary veins or the right ventricular outflow tract, are established during development by the spatial and temporal expression of transcription factors that guide expression of specific ion channels. Genome-wide association studies have shown that small variations in genetic regions are key to the expression of these transcription factors and thereby modulate the electrical function of the heart. Moreover, mutations in these factors are found in arrhythmogenic pathologies such as congenital atrioventricular block, as well as in specific forms of atrial fibrillation and ventricular tachycardia. In this review, we discuss the developmental origin of distinct electrophysiological structures in the heart and their involvement in cardiac arrhythmias.


2020 ◽  
Vol 21 (16) ◽  
pp. 5717 ◽  
Author(s):  
Estefanía Lozano-Velasco ◽  
Diego Franco ◽  
Amelia Aranega ◽  
Houria Daimi

Atrial fibrillation (AF) is known to be the most common supraventricular arrhythmia affecting up to 1% of the general population. Its prevalence exponentially increases with age and could reach up to 8% in the elderly population. The management of AF is a complex issue that is addressed by extensive ongoing basic and clinical research. AF centers around different types of disturbances, including ion channel dysfunction, Ca2+-handling abnormalities, and structural remodeling. Genome-wide association studies (GWAS) have uncovered over 100 genetic loci associated with AF. Most of these loci point to ion channels, distinct cardiac-enriched transcription factors, as well as to other regulatory genes. Recently, the discovery of post-transcriptional regulatory mechanisms, involving non-coding RNAs (especially microRNAs), DNA methylation, and histone modification, has allowed to decipher how a normal heart develops and which modifications are involved in reshaping the processes leading to arrhythmias. This review aims to provide a current state of the field regarding the identification and functional characterization of AF-related epigenetic regulatory networks


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Matthias Munz ◽  
Inken Wohlers ◽  
Eric Simon ◽  
Tobias Reinberger ◽  
Hauke Busch ◽  
...  

AbstractExploration of genetic variant-to-gene relationships by quantitative trait loci such as expression QTLs is a frequently used tool in genome-wide association studies. However, the wide range of public QTL databases and the lack of batch annotation features complicate a comprehensive annotation of GWAS results. In this work, we introduce the tool “Qtlizer” for annotating lists of variants in human with associated changes in gene expression and protein abundance using an integrated database of published QTLs. Features include incorporation of variants in linkage disequilibrium and reverse search by gene names. Analyzing the database for base pair distances between best significant eQTLs and their affected genes suggests that the commonly used cis-distance limit of 1,000,000 base pairs might be too restrictive, implicating a substantial amount of wrongly and yet undetected eQTLs. We also ranked genes with respect to the maximum number of tissue-specific eQTL studies in which a most significant eQTL signal was consistent. For the top 100 genes we observed the strongest enrichment with housekeeping genes (P = 2 × 10–6) and with the 10% highest expressed genes (P = 0.005) after grouping eQTLs by r2 > 0.95, underlining the relevance of LD information in eQTL analyses. Qtlizer can be accessed via https://genehopper.de/qtlizer or by using the respective Bioconductor R-package (https://doi.org/10.18129/B9.bioc.Qtlizer).


PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e3618 ◽  
Author(s):  
Rana Dajani ◽  
Jin Li ◽  
Zhi Wei ◽  
Michael E. March ◽  
Qianghua Xia ◽  
...  

The prevalence of Type II Diabetes (T2D) has been increasing and has become a disease of significant public health burden in Jordan. None of the previous genome-wide association studies (GWAS) have specifically investigated the Middle East populations. The Circassian and Chechen communities in Jordan represent unique populations that are genetically distinct from the Arab population and other populations in the Caucasus. Prevalence of T2D is very high in both the Circassian and Chechen communities in Jordan despite low obesity prevalence. We conducted GWAS on T2D in these two populations and further performed meta-analysis of the results. We identified a novel T2D locus at chr20p12.2 at genome-wide significance (rs6134031, P = 1.12 × 10−8) and we replicated the results in the Wellcome Trust Case Control Consortium (WTCCC) dataset. Another locus at chr12q24.31 is associated with T2D at suggestive significance level (top SNP rs4758690, P = 4.20 × 10−5) and it is a robust eQTL for the gene, MLXIP (P = 1.10 × 10−14), and is significantly associated with methylation level in MLXIP, the functions of which involves cellular glucose response. Therefore, in this first GWAS of T2D in Jordan subpopulations, we identified novel and unique susceptibility loci which may help inform the genetic underpinnings of T2D in other populations.


2018 ◽  
Author(s):  
John A Lees ◽  
Marco Galardini ◽  
Stephen D Bentley ◽  
Jeffrey N Weiser ◽  
Jukka Corander

AbstractSummaryGenome-wide association studies (GWAS) in microbes face different challenges to eukaryotes and have been addressed by a number of different methods. pyseer brings these techniques together in one package tailored to microbial GWAS, allows greater flexibility of the input data used, and adds new methods to interpret the association results.Availability and Implementationpyseer is written in python and is freely available at https://github.com/mgalardini/pyseer, or can be installed through pip. Documentation and a tutorial are available at http://[email protected] and [email protected] informationSupplementary data are available online.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Fabricio Almeida-Silva ◽  
Thiago M. Venancio

AbstractSoybean is one of the most important legume crops worldwide. However, soybean yield is dramatically affected by fungal diseases, leading to economic losses of billions of dollars yearly. Here, we integrated publicly available genome-wide association studies and transcriptomic data to prioritize candidate genes associated with resistance to Cadophora gregata, Fusarium graminearum, Fusarium virguliforme, Macrophomina phaseolina, and Phakopsora pachyrhizi. We identified 188, 56, 11, 8, and 3 high-confidence candidates for resistance to F. virguliforme, F. graminearum, C. gregata, M. phaseolina and P. pachyrhizi, respectively. The prioritized candidate genes are highly conserved in the pangenome of cultivated soybeans and are heavily biased towards fungal species-specific defense responses. The vast majority of the prioritized candidate resistance genes are related to plant immunity processes, such as recognition, signaling, oxidative stress, systemic acquired resistance, and physical defense. Based on the number of resistance alleles, we selected the five most resistant accessions against each fungal species in the soybean USDA germplasm. Interestingly, the most resistant accessions do not reach the maximum theoretical resistance potential. Hence, they can be further improved to increase resistance in breeding programs or through genetic engineering. Finally, the coexpression network generated here is available in a user-friendly web application (https://soyfungigcn.venanciogroup.uenf.br/) and an R/Shiny package (https://github.com/almeidasilvaf/SoyFungiGCN) that serve as a public resource to explore soybean-pathogenic fungi interactions at the transcriptional level.


2010 ◽  
Vol 3 (5) ◽  
pp. 513-526 ◽  
Author(s):  
Richard Cowper-Sal·lari ◽  
Michael D. Cole ◽  
Margaret R. Karagas ◽  
Mathieu Lupien ◽  
Jason H. Moore

2011 ◽  
Vol 52 (6) ◽  
pp. 1139-1149 ◽  
Author(s):  
Magalie S. Leduc ◽  
Malcolm Lyons ◽  
Katayoon Darvishi ◽  
Kenneth Walsh ◽  
Susan Sheehan ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document