scholarly journals Inferring biosynthetic and gene regulatory networks from Artemisia annua RNA sequencing data on a credit card-sized ARM computer

2019 ◽  
Author(s):  
Qiao Wen Tan ◽  
Marek Mutwil

0.ABSTRACTPrediction of gene function and gene regulatory networks is one of the most active topics in bioinformatics. The accumulation of publicly available gene expression data for hundreds of plant species, together with advances in bioinformatical methods and affordable computing, sets ingenuity as the major bottleneck in understanding gene function and regulation. Here, we show how a credit card-sized computer retailing for less than 50 USD can be used to rapidly predict gene function and infer regulatory networks from RNA sequencing data. To achieve this, we constructed a bioinformatical pipeline that downloads and allows quality-control of RNA sequencing data; and generates a gene co-expression network that can reveal enzymes and transcription factors participating and controlling a given biosynthetic pathway. We exemplify this by first identifying genes and transcription factors involved in the biosynthesis of secondary cell wall in the plant Artemisia annua, the main natural source of the anti-malarial drug artemisinin. Networks were then used to dissect the artemisinin biosynthesis pathway, which suggest potential transcription factors regulating artemisinin biosynthesis. We provide the source code of our pipeline and envision that the ubiquity of affordable computing, availability of biological data and increased bioinformatical training of biologists will transform the field of bioinformatics.HighlightsProcessing of large scale transcriptomic data with affordable single-board computersTranscription factors can be found in the same network as their targetsCo-expression of transcription factors and genes in secondary cell wall biosynthesisCo-expression of transcription factors and genes involved in artemisinin biosynthesis

Author(s):  
Rui-Qi Wang ◽  
Wei Zhao ◽  
Hai-Kui Yang ◽  
Jia-Mei Dong ◽  
Wei-Jie Lin ◽  
...  

Colorectal cancer (CRC) manifests as gastrointestinal tumors with high intratumoral heterogeneity. Recent studies have demonstrated that CRC may consist of tumor cells with different consensus molecular subtypes (CMS). The advancements in single-cell RNA sequencing have facilitated the development of gene regulatory networks to decode key regulators for specific cell types. Herein, we comprehensively analyzed the CMS of CRC patients by using single-cell RNA-sequencing data. CMS for all malignant cells were assigned using CMScaller. Gene set variation analysis showed pathway activity differences consistent with those reported in previous studies. Cell–cell communication analysis confirmed that CMS1 was more closely related to immune cells, and that monocytes and macrophages play dominant roles in the CRC tumor microenvironment. On the basis of the constructed gene regulation networks (GRNs) for each subtype, we identified that the critical transcription factor ERG is universally activated and upregulated in all CMS in comparison with normal cells, and that it performed diverse roles by regulating the expression of different downstream genes. In summary, molecular subtyping of single-cell RNA-sequencing data for colorectal cancer could elucidate the heterogeneity in gene regulatory networks and identify critical regulators of CRC.


2019 ◽  
Vol 101 (3) ◽  
pp. 716-730 ◽  
Author(s):  
Ryan J. Spurney ◽  
Lisa Van den Broeck ◽  
Natalie M. Clark ◽  
Adam P. Fisher ◽  
Maria A. de Luis Balaguer ◽  
...  

2021 ◽  
Author(s):  
Boris M. Brenerman ◽  
Benjamin D. Shapiro ◽  
Michael C. Schatz ◽  
Alexis Battle

AbstractSingle-cell RNA sequencing data contain patterns of correlation that are poorly captured by techniques that rely on linear estimation or assumptions of Gaussian behavior. We apply random forest regression to scRNAseq data from mouse brains, which identifies the co-regulation of genes within specific cellular contexts. By analyzing the estimators of the random forest, we identify several novel candidate gene regulatory networks and compare these networks in aged and young mice. We demonstrate that cell populations have cell-type specific phenotypes of aging that are not detected by other methods, including the collapse of differentiating oligodendrocytes but not precursors or mature oligodendrocytes.


2018 ◽  
Author(s):  
Maria Angels de Luis Balaguer ◽  
Ryan J. Spurney ◽  
Natalie M. Clark ◽  
Adam P. Fisher ◽  
Rosangela Sozzani

ABSTRACTPredicting gene regulatory networks (GRNs) from gene expression profiles has become a common approach for identifying important biological regulators. Despite the increase in the use of inference methods, existing computational approaches do not integrate RNA-sequencing data analysis, are often not automated, and are restricted to users with bioinformatics and programming backgrounds. To address these limitations, we have developed TuxNet, an integrated user-friendly platform, which, with just a few selections, allows to process raw RNA-sequencing data (using the Tuxedo pipeline) and infer GRNs from these processed data. TuxNet is implemented as a graphical user interface and, using expression data from any organism with an existing reference genome, can mine the regulations among genes either by applying a dynamic Bayesian network inference algorithm, GENIST, or a regression tree-based pipeline that uses spatiotemporal data, RTP-STAR. To illustrate the use of TuxNet while getting insight into the regulatory cascade downstream of the Arabidopsis root stem cell regulator PERIANTHIA (PAN), we obtained time course gene expression data of a PAN inducible line and inferred a GRN using GENIST. Using RTP-STAR, we then inferred the network of a PAN secondary downstream gene, ATHB13, for which we obtained wildtype and mutant expression profiles. Our case studies feature the versatility of TuxNet to infer networks using different types of gene expression data (i.e time course and steady-state data) as well as how inference networks are used to identify important regulators.SUMMARYTuxNet offers a simple interface for non-computational biologists to infer GRNs from raw RNA-seq data.


2016 ◽  
Vol 113 (13) ◽  
pp. E1835-E1843 ◽  
Author(s):  
Mina Fazlollahi ◽  
Ivor Muroff ◽  
Eunjee Lee ◽  
Helen C. Causton ◽  
Harmen J. Bussemaker

Regulation of gene expression by transcription factors (TFs) is highly dependent on genetic background and interactions with cofactors. Identifying specific context factors is a major challenge that requires new approaches. Here we show that exploiting natural variation is a potent strategy for probing functional interactions within gene regulatory networks. We developed an algorithm to identify genetic polymorphisms that modulate the regulatory connectivity between specific transcription factors and their target genes in vivo. As a proof of principle, we mapped connectivity quantitative trait loci (cQTLs) using parallel genotype and gene expression data for segregants from a cross between two strains of the yeast Saccharomyces cerevisiae. We identified a nonsynonymous mutation in the DIG2 gene as a cQTL for the transcription factor Ste12p and confirmed this prediction empirically. We also identified three polymorphisms in TAF13 as putative modulators of regulation by Gcn4p. Our method has potential for revealing how genetic differences among individuals influence gene regulatory networks in any organism for which gene expression and genotype data are available along with information on binding preferences for transcription factors.


2020 ◽  
Author(s):  
Pallavi Singh ◽  
Sean R. Stevenson ◽  
Ivan Reyna-Llorens ◽  
Gregory Reeves ◽  
Tina B. Schreier ◽  
...  

ABSTRACTThe efficient C4 pathway is based on strong up-regulation of genes found in C3 plants, but also compartmentation of their expression into distinct cell-types such as the mesophyll and bundle sheath. Transcription factors associated with these phenomena have not been identified. To address this, we undertook genome-wide analysis of transcript accumulation, chromatin accessibility and transcription factor binding in C4Gynandropsis gynandra. From these data, two models relating to the molecular evolution of C4 photosynthesis are proposed. First, increased expression of C4 genes is associated with increased binding by MYB-related transcription factors. Second, mesophyll specific expression is associated with binding of homeodomain transcription factors. Overall, we conclude that during evolution of the complex C4 trait, C4 cycle genes gain cis-elements that operate in the C3 leaf such that they become integrated into existing gene regulatory networks associated with cell specificity and photosynthesis.


2020 ◽  
Author(s):  
Lotte Vanheer ◽  
Andrea Alex Schiavo ◽  
Matthias Van Haele ◽  
Tine Haesen ◽  
Adrian Janiszewski ◽  
...  

SUMMARYCellular identity during development is under the control of transcription factors that form gene regulatory networks. However, the transcription factors and gene regulatory networks underlying cellular identity in the human adult pancreas remain largely unexplored. Here, we integrate multiple single-cell RNA sequencing datasets of the human adult pancreas, totaling 7393 cells, and comprehensively reconstruct gene regulatory networks. We show that a network of 142 transcription factors forms distinct regulatory modules that characterize pancreatic cell types. We present evidence that our approach identifies key regulators of cell identity in the human adult pancreas. We predict that HEYL and JUND are active in acinar and alpha cells, respectively, and show that these proteins are present in the human adult pancreas as well as in human induced pluripotent stem cell-derived pancreatic cells. The comprehensive gene regulatory network atlas can be explored interactively online. We anticipate our analysis to be the starting point for a more sophisticated dissection of how transcription factors regulate cell identity in the human adult pancreas. Furthermore, given that transcription factors are major regulators of embryo development and are often perturbed in diseases, a comprehensive understanding of how transcription factors work will be relevant in development and disease biology.HIGHLIGHTS-Reconstruction of gene regulatory networks for human adult pancreatic cell types-An interactive resource to explore and visualize gene expression and regulatory states-Predicting putative transcription factors driving pancreatic cell identity-HEYL and JUND as candidate regulators of acinar and alpha cell identity, respectively


2019 ◽  
Vol 116 (13) ◽  
pp. 5892-5901 ◽  
Author(s):  
Zoe Swank ◽  
Nadanai Laohakunakorn ◽  
Sebastian J. Maerkl

Gene-regulatory networks are ubiquitous in nature and critical for bottom-up engineering of synthetic networks. Transcriptional repression is a fundamental function that can be tuned at the level of DNA, protein, and cooperative protein–protein interactions, necessitating high-throughput experimental approaches for in-depth characterization. Here, we used a cell-free system in combination with a high-throughput microfluidic device to comprehensively study the different tuning mechanisms of a synthetic zinc-finger repressor library, whose affinity and cooperativity can be rationally engineered. The device is integrated into a comprehensive workflow that includes determination of transcription-factor binding-energy landscapes and mechanistic modeling, enabling us to generate a library of well-characterized synthetic transcription factors and corresponding promoters, which we then used to build gene-regulatory networks de novo. The well-characterized synthetic parts and insights gained should be useful for rationally engineering gene-regulatory networks and for studying the biophysics of transcriptional regulation.


Sign in / Sign up

Export Citation Format

Share Document