scholarly journals TopicNet: a framework for measuring transcriptional regulatory network change

2019 ◽  
Author(s):  
Shaoke Lou ◽  
Tianxiao Li ◽  
Xiangmeng Kong ◽  
Jing Zhang ◽  
Jason Liu ◽  
...  

SummaryNext generation sequencing data highlights comprehensive and dynamic changes in the human gene regulatory network. Moreover, changes in regulatory network connectivity (network “rewiring”) manifest different regulatory programs in multiple cellular states. However, due to the dense and noisy nature of the connectivity in regulatory networks, directly comparing the gains and losses of targets of key TFs is not that informative. Thus, here, we seek a abstracted lower-dimensional representation to understand the main features of network change. In particular, we propose a method called TopicNet that applies latent Dirichlet allocation (LDA) to extract meaningful functional topics for a collection of genes regulated by a TF. We then define a rewiring score to quantify the large-scale changes in the regulatory network in terms of topic change for a TF. Using this framework, we can pinpoint particular TFs that change greatly in network connectivity between different cellular states. This is particularly relevant in oncogenesis. Also, incorporating gene-expression data, we define a topic activity score that gives the degree that a topic is active in a particular cellular state. Furthermore, we show how activity differences can highlight differential survival in certain cancers.

2020 ◽  
Vol 36 (Supplement_1) ◽  
pp. i474-i481
Author(s):  
Shaoke Lou ◽  
Tianxiao Li ◽  
Xiangmeng Kong ◽  
Jing Zhang ◽  
Jason Liu ◽  
...  

Abstract Motivation Recently, many chromatin immunoprecipitation sequencing experiments have been carried out for a diverse group of transcription factors (TFs) in many different types of human cells. These experiments manifest large-scale and dynamic changes in regulatory network connectivity (i.e. network ‘rewiring’), highlighting the different regulatory programs operating in disparate cellular states. However, due to the dense and noisy nature of current regulatory networks, directly comparing the gains and losses of targets of key TFs across cell states is often not informative. Thus, here, we seek an abstracted, low-dimensional representation to understand the main features of network change. Results We propose a method called TopicNet that applies latent Dirichlet allocation to extract functional topics for a collection of genes regulated by a given TF. We then define a rewiring score to quantify regulatory-network changes in terms of the topic changes for this TF. Using this framework, we can pinpoint particular TFs that change greatly in network connectivity between different cellular states (such as observed in oncogenesis). Also, incorporating gene expression data, we define a topic activity score that measures the degree to which a given topic is active in a particular cellular state. And we show how activity differences can indicate differential survival in various cancers. Availability and Implementation The TopicNet framework and related analysis were implemented using R and all codes are available at https://github.com/gersteinlab/topicnet. Supplementary information Supplementary data are available at Bioinformatics online.


2022 ◽  
Vol 54 (8) ◽  
pp. 1-36
Author(s):  
Satyaki Roy ◽  
Preetam Ghosh ◽  
Nirnay Ghosh ◽  
Sajal K. Das

The advent of the edge computing network paradigm places the computational and storage resources away from the data centers and closer to the edge of the network largely comprising the heterogeneous IoT devices collecting huge volumes of data. This paradigm has led to considerable improvement in network latency and bandwidth usage over the traditional cloud-centric paradigm. However, the next generation networks continue to be stymied by their inability to achieve adaptive, energy-efficient, timely data transfer in a dynamic and failure-prone environment—the very optimization challenges that are dealt with by biological networks as a consequence of millions of years of evolution. The transcriptional regulatory network (TRN) is a biological network whose innate topological robustness is a function of its underlying graph topology. In this article, we survey these properties of TRN and the metrics derived therefrom that lend themselves to the design of smart networking protocols and architectures. We then review a body of literature on bio-inspired networking solutions that leverage the stated properties of TRN. Finally, we present a vision for specific aspects of TRNs that may inspire future research directions in the fields of large-scale social and communication networks.


2020 ◽  
Vol 48 (W1) ◽  
pp. W200-W207
Author(s):  
Simone Puccio ◽  
Giorgio Grillo ◽  
Arianna Consiglio ◽  
Maria Felicia Soluri ◽  
Daniele Sblattero ◽  
...  

Abstract High-Throughput Sequencing technologies are transforming many research fields, including the analysis of phage display libraries. The phage display technology coupled with deep sequencing was introduced more than a decade ago and holds the potential to circumvent the traditional laborious picking and testing of individual phage rescued clones. However, from a bioinformatics point of view, the analysis of this kind of data was always performed by adapting tools designed for other purposes, thus not considering the noise background typical of the ‘interactome sequencing’ approach and the heterogeneity of the data. InteractomeSeq is a web server allowing data analysis of protein domains (‘domainome’) or epitopes (‘epitome’) from either Eukaryotic or Prokaryotic genomic phage libraries generated and selected by following an Interactome sequencing approach. InteractomeSeq allows users to upload raw sequencing data and to obtain an accurate characterization of domainome/epitome profiles after setting the parameters required to tune the analysis. The release of this tool is relevant for the scientific and clinical community, because InteractomeSeq will fill an existing gap in the field of large-scale biomarkers profiling, reverse vaccinology, and structural/functional studies, thus contributing essential information for gene annotation or antigen identification. InteractomeSeq is freely available at https://InteractomeSeq.ba.itb.cnr.it/


2017 ◽  
Vol 2017 ◽  
pp. 1-8 ◽  
Author(s):  
Yuming Zhao ◽  
Fang Wang ◽  
Su Chen ◽  
Jun Wan ◽  
Guohua Wang

MicroRNAs (miRNAs) are short (~22 nucleotides) noncoding RNAs and disseminated throughout the genome, either in the intergenic regions or in the intronic sequences of protein-coding genes. MiRNAs have been proved to play important roles in regulating gene expression. Hence, understanding the transcriptional mechanism of miRNA genes is a very critical step to uncover the whole regulatory network. A number of miRNA promoter prediction models have been proposed in the past decade. This review summarized several most popular miRNA promoter prediction models which used genome sequence features, or other features, for example, histone markers, RNA Pol II binding sites, and nucleosome-free regions, achieved by high-throughput sequencing data. Some databases were described as resources for miRNA promoter information. We then performed comprehensive discussion on prediction and identification of transcription factor mediated microRNA regulatory networks.


2020 ◽  
Vol 19 ◽  
pp. 153303382090911
Author(s):  
Qi-en He ◽  
Yi-fan Tong ◽  
Zhou Ye ◽  
Li-xia Gao ◽  
Yi-zhi Zhang ◽  
...  

Radiotherapy is one of the most important cancer treatments, but its response varies greatly among individual patients. Therefore, the prediction of radiosensitivity, identification of potential signature genes, and inference of their regulatory networks are important for clinical and oncological reasons. Here, we proposed a novel multiple genomic fused partial least squares deep regression method to simultaneously analyze multi-genomic data. Using 60 National Cancer Institute cell lines as examples, we aimed to identify signature genes by optimizing the radiosensitivity prediction model and uncovering regulatory relationships. A total of 113 signature genes were selected from more than 20,000 genes. The root mean square error of the model was only 0.0025, which was much lower than previously published results, suggesting that our method can predict radiosensitivity with the highest accuracy. Additionally, our regulatory network analysis identified 24 highly important ‘hub’ genes. The data analysis workflow we propose provides a unified and computational framework to harness the full potential of large-scale integrated cancer genomic data for integrative signature discovery. Furthermore, the regression model, signature genes, and their regulatory network should provide a reliable quantitative reference for optimizing personalized treatment options, and may aid our understanding of cancer progress mechanisms.


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Jonathan P. Ling ◽  
Christopher Wilks ◽  
Rone Charles ◽  
Patrick J. Leavey ◽  
Devlina Ghosh ◽  
...  

AbstractPublic archives of next-generation sequencing data are growing exponentially, but the difficulty of marshaling this data has led to its underutilization by scientists. Here, we present ASCOT, a resource that uses annotation-free methods to rapidly analyze and visualize splice variants across tens of thousands of bulk and single-cell data sets in the public archive. To demonstrate the utility of ASCOT, we identify novel cell type-specific alternative exons across the nervous system and leverage ENCODE and GTEx data sets to study the unique splicing of photoreceptors. We find that PTBP1 knockdown and MSI1 and PCBP2 overexpression are sufficient to activate many photoreceptor-specific exons in HepG2 liver cancer cells. This work demonstrates how large-scale analysis of public RNA-Seq data sets can yield key insights into cell type-specific control of RNA splicing and underscores the importance of considering both annotated and unannotated splicing events.


PLoS ONE ◽  
2020 ◽  
Vol 15 (12) ◽  
pp. e0244480
Author(s):  
Xinghuo Ye ◽  
Zhihong Yang ◽  
Yeqin Jiang ◽  
Lan Yu ◽  
Rongkai Guo ◽  
...  

Identification of the target genes of microRNAs (miRNAs), trans-acting small interfering RNAs (ta-siRNAs), and small interfering RNAs (siRNAs) is an important step for understanding their regulatory roles in plants. In recent years, many bioinformatics software packages based on small RNA (sRNA) high-throughput sequencing (HTS) and degradome sequencing data analysis have provided strong technical support for large-scale mining of sRNA-target pairs. However, sRNA-target regulation is achieved using a complex network of interactions since one transcript might be co-regulated by multiple sRNAs and one sRNA may also affect multiple targets. Currently used mining software can realize the mining of multiple unknown targets using known sRNA, but it cannot rule out the possibility of co-regulation of the same target by other unknown sRNAs. Hence, the obtained regulatory network may be incomplete. We have developed a new mining software, sRNATargetDigger, that includes two function modules, “Forward Digger” and “Reverse Digger”, which can identify regulatory sRNA-target pairs bidirectionally. Moreover, it has the ability to identify unknown sRNAs co-regulating the same target, in order to obtain a more authentic and reliable sRNA-target regulatory network. Upon re-examination of the published sRNA-target pairs in Arabidopsis thaliana, sRNATargetDigger found 170 novel co-regulatory sRNA-target pairs. This software can be downloaded from http://www.bioinfolab.cn/sRNATD.html.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8966 ◽  
Author(s):  
Kexue Li ◽  
Yakang Lu ◽  
Li Deng ◽  
Lili Wang ◽  
Lizhen Shi ◽  
...  

Metagenome assembly from short next-generation sequencing data is a challenging process due to its large scale and computational complexity. Clustering short reads by species before assembly offers a unique opportunity for parallel downstream assembly of genomes with individualized optimization. However, current read clustering methods suffer either false negative (under-clustering) or false positive (over-clustering) problems. Here we extended our previous read clustering software, SpaRC, by exploiting statistics derived from multiple samples in a dataset to reduce the under-clustering problem. Using synthetic and real-world datasets we demonstrated that this method has the potential to cluster almost all of the short reads from genomes with sufficient sequencing coverage. The improved read clustering in turn leads to improved downstream genome assembly quality.


2020 ◽  
Author(s):  
Liliang Yang ◽  
Kaizhen Wang ◽  
Wenjing Guo ◽  
Xian Chen ◽  
Qinglong Guo ◽  
...  

Abstract Background:RNA polymerase II subunit K (POLR2K) belongs to one of the multiple subunits of RNA polymerase II (Pol II), whose biological function is to synthesize mRNA. Aberrant POLR2K expression is related to carcinogenesis. However, POLR2K’s underlying role in bladder cancer has not been explored. In the current study, we intend to analyze the function of POLR2K and its regulatory network within bladder cancer.Methods: Public sequencing data was obtained from GEO and TCGA to investigate POLR2K expression and regulatory network within bladder cancer (BLCA) by using GEPIA and Oncomine as well as cBioPortal online tool. LinkedOmics was employed to identify genes displaying significantly differential expression patterns and to perform GO and KEGG analyses. After differential genes was assigned and ranked, GSEA analyses was performed to obtain target networks for transcription factors, miRNAs, and kinases that could regulate POLR2K–associated gene network. Subsequent functional webwork analyses were used to identify cancer-relevant pathways Moreover, POLR2K gene is verified, by ChIP-seq in MCF-7 cell line , with transcription factor binding evidence in the ENCODE Transcription Factor Binding Site Profiles dataset.Conclusions: The current study implies that POLR2K gene is overexpressed and often amplified in BLCA, providing the first evidence that POLR2K deregulation, in particular increased transcription, may promote BLCA. These findings uncover a unique expression patterns of POLR2K and its potential regulatory networks in BLCA, contributing greatly to study of the role of POLR2K in cancer development.


Sign in / Sign up

Export Citation Format

Share Document