scholarly journals Genome-wide identification of directed gene networks using large-scale population genomics data

2017 ◽  
Author(s):  
René Luijk ◽  
Koen F. Dekkers ◽  
Maarten van Iterson ◽  
Wibowo Arindrarto ◽  
Annique Claringbould ◽  
...  

ABSTRACTIdentification of causal drivers behind regulatory gene networks is crucial in understanding gene function. We developed a method for the large-scale inference of gene-gene interactions in observational population genomics data that are both directed (using local genetic instruments as causal anchors, akin to Mendelian Randomization) and specific (by controlling for linkage disequilibrium and pleiotropy). The analysis of genotype and whole-blood RNA-sequencing data from 3,072 individuals identified 49 genes as drivers of downstream transcriptional changes (P < 7 × 10−10), among which transcription factors were overrepresented (P = 3.3 × 10−7). Our analysis suggests new gene functions and targets including for SENP7 (zinc-finger genes involved in retroviral repression) and BCL2A1 (novel target genes possibly involved in auditory dysfunction). Our work highlights the utility of population genomics data in deriving directed gene expression networks. A resource of trans-effects for all 6,600 genes with a genetic instrument can be explored individually using a web-based browser.

2018 ◽  
Vol 9 (1) ◽  
Author(s):  
René Luijk ◽  
◽  
Koen F. Dekkers ◽  
Maarten van Iterson ◽  
Wibowo Arindrarto ◽  
...  

BMC Genomics ◽  
2015 ◽  
Vol 16 (1) ◽  
Author(s):  
Julia Ulrich ◽  
Van Anh Dao ◽  
Upalparna Majumdar ◽  
Christian Schmitt-Engel ◽  
Jonas Schwirz ◽  
...  

Genes ◽  
2019 ◽  
Vol 10 (7) ◽  
pp. 536 ◽  
Author(s):  
Xiaobo Zhao ◽  
Liming Gan ◽  
Caixia Yan ◽  
Chunjuan Li ◽  
Quanxi Sun ◽  
...  

Long non-coding RNAs (lncRNAs) are involved in various regulatory processes although they do not encode protein. Presently, there is little information regarding the identification of lncRNAs in peanut (Arachis hypogaea Linn.). In this study, 50,873 lncRNAs of peanut were identified from large-scale published RNA sequencing data that belonged to 124 samples involving 15 different tissues. The average lengths of lncRNA and mRNA were 4335 bp and 954 bp, respectively. Compared to the mRNAs, the lncRNAs were shorter, with fewer exons and lower expression levels. The 4713 co-expression lncRNAs (expressed in all samples) were used to construct co-expression networks by using the weighted correlation network analysis (WGCNA). LncRNAs correlating with the growth and development of different peanut tissues were obtained, and target genes for 386 hub lncRNAs of all lncRNAs co-expressions were predicted. Taken together, these findings can provide a comprehensive identification of lncRNAs in peanut.


PLoS ONE ◽  
2020 ◽  
Vol 15 (12) ◽  
pp. e0244480
Author(s):  
Xinghuo Ye ◽  
Zhihong Yang ◽  
Yeqin Jiang ◽  
Lan Yu ◽  
Rongkai Guo ◽  
...  

Identification of the target genes of microRNAs (miRNAs), trans-acting small interfering RNAs (ta-siRNAs), and small interfering RNAs (siRNAs) is an important step for understanding their regulatory roles in plants. In recent years, many bioinformatics software packages based on small RNA (sRNA) high-throughput sequencing (HTS) and degradome sequencing data analysis have provided strong technical support for large-scale mining of sRNA-target pairs. However, sRNA-target regulation is achieved using a complex network of interactions since one transcript might be co-regulated by multiple sRNAs and one sRNA may also affect multiple targets. Currently used mining software can realize the mining of multiple unknown targets using known sRNA, but it cannot rule out the possibility of co-regulation of the same target by other unknown sRNAs. Hence, the obtained regulatory network may be incomplete. We have developed a new mining software, sRNATargetDigger, that includes two function modules, “Forward Digger” and “Reverse Digger”, which can identify regulatory sRNA-target pairs bidirectionally. Moreover, it has the ability to identify unknown sRNAs co-regulating the same target, in order to obtain a more authentic and reliable sRNA-target regulatory network. Upon re-examination of the published sRNA-target pairs in Arabidopsis thaliana, sRNATargetDigger found 170 novel co-regulatory sRNA-target pairs. This software can be downloaded from http://www.bioinfolab.cn/sRNATD.html.


eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
MGP van der Wijst ◽  
DH de Vries ◽  
HE Groot ◽  
G Trynka ◽  
CC Hon ◽  
...  

In recent years, functional genomics approaches combining genetic information with bulk RNA-sequencing data have identified the downstream expression effects of disease-associated genetic risk factors through so-called expression quantitative trait locus (eQTL) analysis. Single-cell RNA-sequencing creates enormous opportunities for mapping eQTLs across different cell types and in dynamic processes, many of which are obscured when using bulk methods. Rapid increase in throughput and reduction in cost per cell now allow this technology to be applied to large-scale population genetics studies. To fully leverage these emerging data resources, we have founded the single-cell eQTLGen consortium (sc-eQTLGen), aimed at pinpointing the cellular contexts in which disease-causing genetic variants affect gene expression. Here, we outline the goals, approach and potential utility of the sc-eQTLGen consortium. We also provide a set of study design considerations for future single-cell eQTL studies.


2017 ◽  
Author(s):  
Anika Gupta ◽  
Heiko Horn ◽  
Parisa Razaz ◽  
April Kim ◽  
Michael Lawrence ◽  
...  

ABSTRACTLarge-scale cancer sequencing studies have uncovered dozens of mutations critical to cancer initiation and progression. However, a significant proportion of genes linked to tumor propagation remain hidden, often due to noise in sequencing data confounding low frequency alterations. Further, genes in networks under purifying selection (NPS), or those that are mutated in cancers less frequently than would be expected by chance, may play crucial roles in sustaining cancers but have largely been overlooked. We describe here a statistical framework that identifies genes that have a first order protein interaction network significantly depleted for mutations, to elucidate key genetic contributors to cancers. Not reliant on and thus, unbiased by, the gene of interest’s mutation rate, our approach has identified 685 putative genes linked to cancer development. Comparative analysis indicates statistically significant enrichment of NPS genes in previously validated cancer vulnerability gene sets, while further identifying novel cancer-specific candidate gene targets. As more tumor genomes are sequenced, integrating systems level mutation data through this network approach should become increasingly useful in pinpointing gene targets for cancer diagnosis and treatment.


2019 ◽  
Author(s):  
Paul J. Hop ◽  
René Luijk ◽  
Lucia Daxinger ◽  
Maarten van Iterson ◽  
Koen F. Dekkers ◽  
...  

SUMMARYDNA methylation is a key epigenetic modification in human development and disease, yet there is limited understanding of its highly coordinated regulation. Here, we identified 818 genes that influence DNA methylation patterns in blood using large-scale population genomics data. By employing genetic instruments as causal anchors, we identified directed associations between gene expression and distant DNA methylation levels, whilst ensuring specificity of the associations by correcting for linkage disequilibrium and pleiotropy among neighboring genes. We found that DNA methylation patterns are commonly shaped by transcription factors that consistently increase or decrease DNA methylation levels. However, we also observed genes encoding proteins without DNA binding activity with widespread effects on DNA methylation (e.g. NFKBIE, CDCA7(L) and NLRC5) and we suggest plausible mechanisms underlying these findings. Many of the reported genes were unknown to influence DNA methylation, resulting in a comprehensive resource providing insights in the principles underlying epigenetic regulation.


2021 ◽  
Vol 67 (3) ◽  
pp. 222-230
Author(s):  
Y.L. Orlov ◽  
A.G. Galieva ◽  
N.G. Orlova ◽  
E.N. Ivanova ◽  
Y.A. Mozyleva ◽  
...  

Accumulation of genetic data in the field of Parkinson's disease research culminated in identifying risk factors and confident prediction of the disease occurrence. To find new gene-targets for diagnostics and therapy we have to reconstruct gene network of the disease, to cluster genes in the network, to reveal key (hub) genes with largest number of interactions in the network. Using the on-line bioinformatics tools OMIM, PANTHER, g:Profiler, GeneMANIA, and STRING-DB, we have analyzed the current array of data related to Parkinson's disease, calculated the categories of gene ontologies for a large list of genes, visualized them, and built gene networks containing the identified key objects and their relationships. However, translating the results into biological understanding is still a promising major challenge. The analysis of the genes associated with the disease, the assessment of their place in the gene network (connectivity) allows us to evaluate them as target genes for medicinal effects.


2011 ◽  
Vol 39 (4) ◽  
pp. 577-582 ◽  
Author(s):  
Bartha Maria Knoppers ◽  
Amy Dam

The last few years have witnessed the growth of large-scale, population genomics biobanks, which serve as longitudinal, gene-environment databases for future yet unspecified research. An international consortium, the Public Population Project in Genomics (P3G), builds harmonization tools for such biobanks and has catalogued numerous studies — at least 139 with over 10,000 banked participants and 34 with over 100,000. As their potential use for translational, clinical research draws near, it is opportune to clarify the duties of such biobanks to communicate results to participants. To identify the potential obligations, some demystification of the terminology surrounding the return of results as found in international and national norms on biobanking generally is essential. On the whole, our proposed lexicon is based on a study of norms as found in national and international policies but excludes debates found in the literature.


2015 ◽  
Vol 32 (11) ◽  
pp. 1686-1696 ◽  
Author(s):  
Lin Huang ◽  
Bo Wang ◽  
Ruitang Chen ◽  
Sivan Bercovici ◽  
Serafim Batzoglou

Sign in / Sign up

Export Citation Format

Share Document