Machine learning methods to reverse engineer dynamic gene regulatory networks governing cell state transitions

Mapping Intimacies ◽

10.1101/264671 ◽

2018 ◽

Cited By ~ 2

Author(s):

P. Tsakanikas ◽

D. Manatakis ◽

E. S. Manolakos

Keyword(s):

Machine Learning ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Most Probable Number ◽

State Transitions ◽

Gene Expressions ◽

Dynamic Interactions ◽

Probable Number ◽

Cell State ◽

Gene Regulatory

Abstract: Deciphering the dynamic gene regulatory mechanisms driving cells to make fate decisions remains elusive. We present a novel unsupervised machine learning methodology that can be used to analyze a dataset of heterogeneous single-cell gene expression profiles, determine the most probable number of states (major cellular phenotypes) represented, and extract the corresponding cell sub-populations. Most importantly, for any transition of interest from a source to a destination state, our methodology can zoom in, identify the cells most specific for studying the dynamics of this transition, order them along a trajectory of biological progression in posterior probabilities space, determine the "key-player" genes governing the transition dynamics, partition the trajectory into consecutive phases (transition "micro-states"), and finally reconstruct causal gene regulatory networks for each phase. Application of the end-to-end methodology provides new insights into key-player genes and their dynamic interactions during the important HSC-to-LMPP cell state transition involved in hematopoiesis. Moreover, it allows us to reconstruct a probabilistic representation of the "epigenetic landscape" of transitions and correctly identify the major ones in the hematopoiesis hierarchy of states.
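The idea of ordering cells along a trajectory in posterior probability space can be illustrated with a minimal sketch. It is not the authors' method: it assumes, purely for illustration, that each state is a univariate Gaussian over a single marker gene, and the state parameters, cell names, and expression values are all hypothetical.

```python
import math


def gaussian_pdf(x, mean, sd):
    """Density of a univariate normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def posterior_destination(x, source, destination, prior_destination=0.5):
    """Posterior probability (Bayes' rule) that a cell belongs to the destination state."""
    weight_source = gaussian_pdf(x, *source) * (1.0 - prior_destination)
    weight_destination = gaussian_pdf(x, *destination) * prior_destination
    return weight_destination / (weight_source + weight_destination)


# Hypothetical (mean, sd) of one marker gene in each state.
SOURCE_STATE = (0.0, 1.0)
DESTINATION_STATE = (3.0, 1.0)

# Hypothetical cells and their expression of the marker gene.
cells = {"c1": 0.1, "c2": 2.9, "c3": 1.5, "c4": 0.9}

posteriors = {
    name: posterior_destination(x, SOURCE_STATE, DESTINATION_STATE)
    for name, x in cells.items()
}

# Order cells from "most source-like" to "most destination-like".
trajectory = sorted(cells, key=posteriors.get)
print(trajectory)
```

A cell halfway between the two state means (here `c3`) receives a posterior of 0.5, so it sits in the middle of the ordering.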


A Machine Learning Approach to Predict Gene Regulatory Networks in Seed Development in Arabidopsis

Frontiers in Plant Science ◽

10.3389/fpls.2016.01936 ◽

2016 ◽

Vol 7 ◽

Cited By ~ 19

Author(s):

Ying Ni ◽

Delasa Aghamirzaie ◽

Haitham Elmarakeby ◽

Eva Collakova ◽

Song Li ◽

...

Keyword(s):

Machine Learning ◽

Seed Development ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Learning Approach ◽

Machine Learning Approach ◽

Gene Regulatory


Hitoshi Iba: Evolutionary approach to machine learning and deep neural networks: neuro-evolution and gene regulatory networks

Genetic Programming and Evolvable Machines ◽

10.1007/s10710-019-09350-8 ◽

2019 ◽

Vol 20 (2) ◽

pp. 151-153

Author(s):

Petra Vidnerová

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Deep Neural Networks ◽

Evolutionary Approach ◽

Gene Regulatory


Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks

BMC Systems Biology ◽

10.1186/s12918-019-0695-x ◽

2019 ◽

Vol 13 (S2) ◽

Author(s):

Wenting Liu ◽

Jagath C. Rajapakse

Keyword(s):

Gene Regulatory Networks ◽

Protein Interactions ◽

Regulatory Networks ◽

Protein Protein Interactions ◽

Gene Expressions ◽

Gene Regulatory


Single Cell RNA-Seq and Machine Learning Reveal Novel Subpopulations in Low-Grade Inflammatory Monocytes With Unique Regulatory Circuits

Frontiers in Immunology ◽

10.3389/fimmu.2021.627036 ◽

2021 ◽

Vol 12 ◽

Author(s):

Jiyoung Lee ◽

Shuo Geng ◽

Song Li ◽

Liwu Li

Keyword(s):

Machine Learning ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Regulatory Genes ◽

Low Grade ◽

Machine Learning Method ◽

Learning Method ◽

Cell Clusters ◽

Inflammatory Monocytes ◽

Gene Regulatory

Subclinical doses of LPS (SD-LPS) are known to cause low-grade inflammatory activation of monocytes, which could lead to inflammatory diseases including atherosclerosis and metabolic syndrome. Sodium 4-phenylbutyrate is a potential therapeutic compound that can reduce the inflammation caused by SD-LPS. To understand the gene regulatory networks of these processes, we have generated scRNA-seq data from mouse monocytes treated with these compounds and identified 11 novel cell clusters. We have developed a machine learning method to integrate scRNA-seq, ATAC-seq, and binding motifs to characterize gene regulatory networks underlying these cell clusters. Using guided regularized random forest and feature selection, our method achieved high performance and outperformed a traditional enrichment-based method in selecting candidate regulatory genes. Our method is particularly efficient in selecting a few candidate genes to explain the observed expression patterns. In particular, among 531 candidate TFs, our method achieves an auROC of 0.961 with only 10 motifs. Finally, we found two novel subpopulations of monocyte cells in response to SD-LPS and we confirmed our analysis using independent flow cytometry experiments. Our results suggest that our new machine learning method can select candidate regulatory genes as potential targets for developing new therapeutics against low-grade inflammation.
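For readers unfamiliar with the auROC figure quoted above, the following minimal sketch shows how the metric can be computed from its pairwise-ranking definition. The labels and scores are hypothetical toy values, not data from the study, and the function name `auroc` is an illustrative choice.

```python
def auroc(labels, scores):
    """Area under the ROC curve as the probability that a randomly chosen
    positive scores higher than a randomly chosen negative (ties count 0.5)."""
    positives = [s for label, s in zip(labels, scores) if label == 1]
    negatives = [s for label, s in zip(labels, scores) if label == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical example: 1 = true regulator, 0 = non-regulator.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]
print(auroc(labels, scores))
```

In this toy example three of the four positive/negative pairs are ranked correctly, so the value is 0.75; a perfect ranking gives 1.0 and a random one about 0.5.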


scTenifoldNet: A Machine Learning Workflow for Constructing and Comparing Transcriptome-wide Gene Regulatory Networks from Single-Cell Data

Patterns ◽

10.1016/j.patter.2020.100139 ◽

2020 ◽

Vol 1 (9) ◽

pp. 100139

Author(s):

Daniel Osorio ◽

Yan Zhong ◽

Guanxun Li ◽

Jianhua Z. Huang ◽

James J. Cai

Keyword(s):

Machine Learning ◽

Single Cell ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Gene Regulatory ◽

Cell Data


Gene network simulations provide testable predictions for the molecular domestication syndrome

10.1101/2021.03.19.436202 ◽

2021 ◽

Author(s):

Ewen Burban ◽

Maud Irene Tenaillon ◽

Arnaud Le Rouzic

Keyword(s):

Gene Expression ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Environmental Stability ◽

Interaction Matrix ◽

Gene Expressions ◽

Domestication Syndrome ◽

Molecular Domestication ◽

Gene Regulatory ◽

The Impact

The domestication of plant and animal species led to repeatable morphological evolution, often referred to as the phenotypic domestication syndrome. Domestication is also associated with important genomic changes, such as the loss of genetic diversity and modifications of gene expression patterns. Here, we explored theoretically the effect of domestication at the genomic level by characterizing the impact of a domestication-like scenario on gene regulatory networks. We ran population genetics simulations in which individuals were characterized by their genotype (an interaction matrix encoding a gene regulatory network) and their gene expressions, representing the phenotypic level. Our domestication scenario included a population bottleneck and a selection switch (change in the optimal gene expression level) mimicking canalizing selection, i.e. evolution towards more stable expression to parallel enhanced environmental stability in man-made habitats. We showed that domestication profoundly alters genetic architectures. Based on the well-documented example of maize (Zea mays ssp. mays) domestication, our simulations predicted (i) a drop in neutral allelic diversity, (ii) a change in gene expression variance that depended upon the domestication scenario, (iii) transient maladaptive plasticity, (iv) a deep rewiring of the gene regulatory networks, with a trend towards gain of regulatory interactions between genes, and (v) a global increase in the genetic correlations among gene expressions, with a loss of modularity in the resulting coexpression patterns and in the underlying networks. Extending the range of parameters, we provide empirically testable predictions on the differences in genetic architecture between wild and domesticated forms. The characterization of such systematic evolutionary changes in the genetic architecture of traits contributes to defining a molecular domestication syndrome.
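To make concrete how an interaction matrix can encode a gene regulatory network and give rise to expression levels, here is a minimal sketch. The abstract does not specify the update rule, so the sigmoid dynamics below are an assumption for illustration only, and the matrix values, function names, and step count are hypothetical.

```python
import math


def sigmoid(x):
    """Squash a regulatory input into an expression level between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-x))


def develop(interaction_matrix, initial_expression, steps=20):
    """Iterate expression levels under the regulatory interactions.

    interaction_matrix[i][j] is the (assumed) effect of gene j on gene i.
    """
    expression = list(initial_expression)
    for _ in range(steps):
        expression = [
            sigmoid(sum(w * e for w, e in zip(row, expression)))
            for row in interaction_matrix
        ]
    return expression


# Hypothetical two-gene genotype: gene 0 is unregulated, gene 0 activates gene 1.
interaction_matrix = [
    [0.0, 0.0],
    [4.0, 0.0],
]

final_expression = develop(interaction_matrix, [0.5, 0.5])
print(final_expression)
```

In a population-level simulation, each individual would carry its own matrix, and selection would act on the resulting expression levels; that layer is omitted here.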


Learning gene regulatory networks using gaussian process emulator and graphical LASSO

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720021500074 ◽

2021 ◽

pp. 2150007

Author(s):

H. Chatrabgoun ◽

A. R. Soltanian ◽

H. Mahjub ◽

F. Bahreini

Keyword(s):

Gaussian Process ◽

Normal Distribution ◽

Gene Regulatory Networks ◽

Multivariate Normal Distribution ◽

Regulatory Networks ◽

Multivariate Normal ◽

Gene Expressions ◽

Precision Matrix ◽

Gene Regulatory ◽

Gp Model

Considerable research effort has focused on learning gene regulatory networks (GRNs) from gene expression data to understand the functional basis of a living organism. Under the assumption that the joint distribution of the gene expressions of interest is a multivariate normal distribution, such networks can be constructed by assessing the nonzero elements of the inverse covariance matrix, the so-called precision matrix or concentration matrix. Because this approach considers only pairwise linear correlations, it may not reflect the true connectivity between genes. To relax this limiting constraint, we employ a Gaussian process (GP) model, which is well known as a computationally efficient non-parametric Bayesian machine learning technique. GPs belong to a class of methods known as kernel machines, which can be used to approximate complex problems by tuning their hyperparameters. In fact, the GP makes it possible to exploit the capacity of different kernels in constructing the precision matrix and GRNs. In this paper, we first choose a GP with an appropriate kernel to learn the considered GRNs from the observed genetic data, and then estimate the kernel hyperparameters using a rule-of-thumb technique. Using these hyperparameters, we can also control the degree of sparseness in the precision matrix. We then obtain a kernel-based precision matrix, similar to GLASSO, to construct a kernel-based GRN. We use these findings to construct high-performing GRNs for different species of Drosophila fly rather than relying solely on the assumption of a multivariate normal distribution; the GP-based approach, which draws on the capacity of kernels, performs much better than the approach based on the multivariate Gaussian assumption.
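The multivariate normal baseline described above, reading network edges off the nonzero elements of the precision matrix, can be sketched as follows. This is only the plain baseline, not the GP kernel method or GLASSO; the covariance values, the threshold, and the function names are hypothetical.

```python
import math


def invert(matrix):
    """Invert a small square matrix by Gauss-Jordan elimination with pivoting."""
    n = len(matrix)
    aug = [
        [float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
        for i, row in enumerate(matrix)
    ]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def partial_correlation_edges(covariance, threshold=0.05):
    """Return gene pairs whose partial correlation exceeds the threshold."""
    precision = invert(covariance)
    n = len(precision)
    edges = []
    for i in range(n):
        for j in range(i + 1, n):
            partial = -precision[i][j] / math.sqrt(precision[i][i] * precision[j][j])
            if abs(partial) > threshold:
                edges.append((i, j))
    return edges


# Hypothetical covariance of a three-gene chain 0 - 1 - 2: genes 0 and 2 are
# correlated (0.25) only through gene 1.
covariance = [
    [1.0, 0.5, 0.25],
    [0.5, 1.0, 0.5],
    [0.25, 0.5, 1.0],
]

edges = partial_correlation_edges(covariance)
print(edges)
```

Although genes 0 and 2 have a nonzero marginal correlation, the corresponding precision matrix entry is zero, so no direct edge is drawn between them.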


Identification of Gene Regulatory Networks, Machine Learning

Encyclopedia of Systems Biology ◽

10.1007/978-1-4419-9863-7_399 ◽

2013 ◽

pp. 938-941

Author(s):

Zhong-Yuan Zhang

Keyword(s):

Machine Learning ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Gene Regulatory


scTenifoldNet: a machine learning workflow for constructing and comparing transcriptome-wide gene regulatory networks from single-cell data

10.1101/2020.02.12.931469 ◽

2020 ◽

Author(s):

Daniel Osorio ◽

Yan Zhong ◽

Guanxun Li ◽

Jianhua Z. Huang ◽

James J. Cai

Keyword(s):

Gene Expression ◽

Machine Learning ◽

Single Cell ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Principal Component Regression ◽

Principal Component ◽

Real Data ◽

Low Rank ◽

Gene Regulatory

Abstract: Constructing and comparing gene regulatory networks (GRNs) from single-cell RNA sequencing (scRNAseq) data has the potential to reveal critical components in the underlying regulatory networks regulating different cellular transcriptional activities. Here, we present a robust and powerful machine learning workflow, scTenifoldNet, for comparative GRN analysis of single cells. The scTenifoldNet workflow, consisting of principal component regression, low-rank tensor approximation, and manifold alignment, constructs and compares transcriptome-wide single-cell GRNs (scGRNs) from different samples to identify gene expression signatures shifting with cellular activity changes such as those associated with pathophysiological processes and responses to environmental perturbations. We used simulated data to benchmark scTenifoldNet’s performance, and then applied scTenifoldNet to several real data sets. In real-data applications, scTenifoldNet identified highly specific changes in gene regulation in response to acute morphine treatment, an antibody anticancer drug, gene knockout, double-stranded RNA stimulus, and amyloid-beta plaques in various types of mouse and human cells. We anticipate that scTenifoldNet can help achieve breakthroughs through constructing and comparing scGRNs in poorly characterized biological systems, by deciphering the full cellular and molecular complexity of the data.

Highlights:
scTenifoldNet is a machine learning workflow built upon principal component regression, low-rank tensor approximation, and manifold alignment.
scTenifoldNet uses single-cell RNA sequencing (scRNAseq) data to construct single-cell gene regulatory networks (scGRNs).
scTenifoldNet compares scGRNs of different samples to identify differentially regulated genes.
Real-data applications demonstrate that scTenifoldNet accurately detects specific signatures of gene expression relevant to the cellular systems tested.

Short abstract: We present scTenifoldNet, a machine learning workflow built upon principal component regression, low-rank tensor approximation, and manifold alignment, for constructing and comparing single-cell gene regulatory networks (scGRNs) using data from single-cell RNA sequencing (scRNAseq). scTenifoldNet reveals regulatory changes in gene expression between samples by comparing the constructed scGRNs. With real data, scTenifoldNet identifies specific gene expression programs associated with different biological processes, providing critical insights into the underlying mechanism of regulatory networks governing cellular transcriptional activities.
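Principal component regression, one of the building blocks named above, can be illustrated with a generic single-component sketch. It is not scTenifoldNet's implementation: it assumes, for illustration, that one target gene is regressed on the leading principal component of two predictor genes, and all data values and function names are hypothetical.

```python
import math


def center(values):
    """Subtract the mean from a list of values."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def leading_component(columns, iterations=100):
    """Top principal axis of centered columns, found by power iteration.

    Assumes the starting vector is not orthogonal to that axis.
    """
    p = len(columns)
    n = len(columns[0])
    cov = [
        [sum(a * b for a, b in zip(columns[i], columns[j])) / (n - 1) for j in range(p)]
        for i in range(p)
    ]
    v = [1.0 / math.sqrt(p)] * p
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v


def pcr_coefficients(predictors, response):
    """Regress the response on the first principal component of the predictors
    and map the fit back to one coefficient per predictor gene."""
    columns = [center(col) for col in predictors]
    y = center(response)
    v = leading_component(columns)
    scores = [
        sum(v[j] * columns[j][i] for j in range(len(columns)))
        for i in range(len(y))
    ]
    gamma = sum(s * t for s, t in zip(scores, y)) / sum(s * s for s in scores)
    return [gamma * vj for vj in v]


# Hypothetical expression of two predictor genes driven by one latent factor.
latent = [-2.0, -1.0, 0.0, 1.0, 2.0]
predictors = [[t for t in latent], [2.0 * t for t in latent]]
target = [3.0 * t for t in latent]

coefficients = pcr_coefficients(predictors, target)
print(coefficients)
```

In a network-construction setting, coefficients of this kind could serve as edge weights from the predictor genes to the target gene, with the regression repeated for each gene in turn.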


Inference of gene regulatory networks by integrating gene expressions and genetic perturbations

2013 IEEE International Conference on Bioinformatics and Biomedicine ◽

10.1109/bibm.2013.6732484 ◽

2013 ◽

Author(s):

Dong-Chul Kim ◽

Chunyu Liu ◽

Xiaoyong Wu ◽

Baoju Zhang ◽

Jean Gao

Keyword(s):

Gene Regulatory Networks ◽

Regulatory Networks ◽

Gene Expressions ◽

Genetic Perturbations ◽

Gene Regulatory
