scholarly journals Common gene expression signatures in Parkinson’s disease are driven by changes in cell composition

2019 ◽  
Author(s):  
Gonzalo S. Nido ◽  
Fiona Dick ◽  
Lilah Toker ◽  
Kjell Petersen ◽  
Guido Alves ◽  
...  

AbstractBackgroundThe etiology of Parkinson’s disease (PD) is largely unknown. Genome-wide transcriptomic studies in bulk brain tissue have identified several molecular signatures associated with the disease. While these studies have the potential to shed light into the pathogenesis of PD, they are also limited by two major confounders: RNA post mortem degradation and heterogeneous cell type composition of bulk tissue samples. We performed RNA sequencing following ribosomal RNA depletion in the prefrontal cortex of 49 individuals from two independent case-control cohorts. Using cell-type specific markers, we estimated the cell-type composition for each sample and included this in our analysis models to compensate for the variation in cell-type proportions.ResultsRibosomal RNA depletion results in substantially more even transcript coverage, compared to poly(A) capture, in post mortem tissue. Moreover, we show that cell-type composition is a major confounder of differential gene expression analysis in the PD brain. Correcting for cell-type proportions attenuates numerous transcriptomic signatures that have been previously associated with PD, including vesicle trafficking, synaptic transmission, immune and mitochondrial function. Conversely, pathways related to endoplasmic reticulum, lipid oxidation and unfolded protein response are strengthened and surface as the top differential gene expression signatures in the PD prefrontal cortex.ConclusionsDifferential gene expression signatures in PD bulk brain tissue are significantly confounded by underlying differences in cell-type composition. Modeling cell-type heterogeneity is crucial in order to unveil transcriptomic signatures that represent regulatory changes in the PD brain and are, therefore, more likely to be associated with underlying disease mechanisms.

2016 ◽  
Author(s):  
Megan Hastings Hagenauer ◽  
Anton Schulmann ◽  
Jun Z. Li ◽  
Marquis P. Vawter ◽  
David M. Walsh ◽  
...  

AbstractPsychiatric illness is unlikely to arise from pathology occurring uniformly across all cell types in affected brain regions. Despite this, transcriptomic analyses of the human brain have typically been conducted using macro-dissected tissue due to the difficulty of performing single-cell type analyses with donated post-mortem brains. To address this issue statistically, we compiled a database of several thousand transcripts that were specifically-enriched in one of 10 primary cortical cell types in previous publications. Using this database, we predicted the relative cell type composition for 833 human cortical samples using microarray or RNA-Seq data from the Pritzker Consortium (GSE92538) or publicly-available databases (GSE53987, GSE21935, GSE21138, CommonMind Consortium). These predictions were generated by averaging normalized expression levels across transcripts specific to each cell type using our R-packageBrainInABlender(validated and publicly-released:https://github.com/hagenaue/BrainInABlender). Using this method, we found that the principal components of variation in the datasets strongly correlated with the neuron to glia ratio of the samples.This variability was not simply due to dissection – the relative balance of brain cell types appeared to be influenced by a variety of demographic, pre- and post-mortem variables. Prolonged hypoxia around the time of death predicted increased astrocytic and endothelial gene expression, illustrating vascular upregulation. Aging was associated with decreased neuronal gene expression. Red blood cell gene expression was reduced in individuals who died following systemic blood loss. Subjects with Major Depressive Disorder had decreased astrocytic gene expression, mirroring previous morphometric observations. Subjects with Schizophrenia had reduced red blood cell gene expression, resembling the hypofrontality detected in fMRI experiments. Finally, in datasets containing samples with especially variable cell content, we found that controlling for predicted sample cell content while evaluating differential expression improved the detection of previously-identified psychiatric effects. We conclude that accounting for cell type can greatly improve the interpretability of transcriptomic data.


2014 ◽  
Vol 23 (10) ◽  
pp. 2721-2728 ◽  
Author(s):  
S. De Jong ◽  
M. Neeleman ◽  
J. J. Luykx ◽  
M. J. Ten Berg ◽  
E. Strengman ◽  
...  

2021 ◽  
Vol 8 ◽  
Author(s):  
Marianthi Kalafati ◽  
Michael Lenz ◽  
Gökhan Ertaylan ◽  
Ilja C. W. Arts ◽  
Chris T. Evelo ◽  
...  

Background: Macrophages play an important role in regulating adipose tissue function, while their frequencies in adipose tissue vary between individuals. Adipose tissue infiltration by high frequencies of macrophages has been linked to changes in adipokine levels and low-grade inflammation, frequently associated with the progression of obesity. The objective of this project was to assess the contribution of relative macrophage frequencies to the overall subcutaneous adipose tissue gene expression using publicly available datasets.Methods: Seven publicly available microarray gene expression datasets from human subcutaneous adipose tissue biopsies (n = 519) were used together with TissueDecoder to determine the adipose tissue cell-type composition of each sample. We divided the subjects in four groups based on their relative macrophage frequencies. Differential gene expression analysis between the high and low relative macrophage frequencies groups was performed, adjusting for sex and study. Finally, biological processes were identified using pathway enrichment and network analysis.Results: We observed lower frequencies of adipocytes and higher frequencies of adipose stem cells in individuals characterized by high macrophage frequencies. We additionally studied whether, within subcutaneous adipose tissue, interindividual differences in the relative frequencies of macrophages were reflected in transcriptional differences in metabolic and inflammatory pathways. Adipose tissue of individuals with high macrophage frequencies had a higher expression of genes involved in complement activation, chemotaxis, focal adhesion, and oxidative stress. Similarly, we observed a lower expression of genes involved in lipid metabolism, fatty acid synthesis, and oxidation and mitochondrial respiration.Conclusion: We present an approach that combines publicly available subcutaneous adipose tissue gene expression datasets with a deconvolution algorithm to calculate subcutaneous adipose tissue cell-type composition. The results showed the expected increased inflammation gene expression profile accompanied by decreased gene expression in pathways related to lipid metabolism and mitochondrial respiration in subcutaneous adipose tissue in individuals characterized by high macrophage frequencies. This approach demonstrates the hidden strength of reusing publicly available data to gain cell-type-specific insights into adipose tissue function.


2021 ◽  
Author(s):  
Wenjing Ma ◽  
Sumeet Sharma ◽  
Peng Jin ◽  
Shannon L Gourley ◽  
Zhaohui Qin

The rapid proliferation of single-cell RNA-sequencing (scRNA-seq) datasets have revealed cell heterogeneity at unprecedented scales. Several deconvolution methods have been developed to decompose bulk experiments to reveal cell type contributions. However, these methods lack power in identifying the accurate cell type composition when having a considerable amount of sub-cell types in the reference dataset. Here, we present LRcell, a R Bioconductor package (http://bioconductor.org/packages/release/bioc/html/LRcell.html) aiming to identify specific sub-cell type(s) that drives the changes observed in a bulk RNA-seq differential gene expression experiment. In addition, LRcell provides pre-embedded marker genes computed from putative single-cell RNA-seq experiments as options to execute the analyses.


GigaScience ◽  
2021 ◽  
Vol 10 (2) ◽  
Author(s):  
Brian B Nadel ◽  
David Lopez ◽  
Dennis J Montoya ◽  
Feiyang Ma ◽  
Hannah Waddel ◽  
...  

Abstract Background The cell type composition of heterogeneous tissue samples can be a critical variable in both clinical and laboratory settings. However, current experimental methods of cell type quantification (e.g., cell flow cytometry) are costly, time consuming and have potential to introduce bias. Computational approaches that use expression data to infer cell type abundance offer an alternative solution. While these methods have gained popularity, most fail to produce accurate predictions for the full range of platforms currently used by researchers or for the wide variety of tissue types often studied. Results We present the Gene Expression Deconvolution Interactive Tool (GEDIT), a flexible tool that utilizes gene expression data to accurately predict cell type abundances. Using both simulated and experimental data, we extensively evaluate the performance of GEDIT and demonstrate that it returns robust results under a wide variety of conditions. These conditions include multiple platforms (microarray and RNA-seq), tissue types (blood and stromal), and species (human and mouse). Finally, we provide reference data from 8 sources spanning a broad range of stromal and hematopoietic types in both human and mouse. GEDIT also accepts user-submitted reference data, thus allowing the estimation of any cell type or subtype, provided that reference data are available. Conclusions GEDIT is a powerful method for evaluating the cell type composition of tissue samples and provides excellent accuracy and versatility compared to similar tools. The reference database provided here also allows users to obtain estimates for a wide variety of tissue samples without having to provide their own data.


2019 ◽  
Author(s):  
Gregory J. Hunt ◽  
Johann A. Gagnon-Bartsch

ABSTRACTComplex tissues are composed of a large number of different types of cells, each involved in a multitude of biological processes. Consequently, an important component to understanding such processes is understanding the cell-type composition of the tissues. Estimating cell type composition using high-throughput gene expression data is known as cell-type deconvolution. In this paper, we first summarize the extensive deconvolution literature by identifying a common regression-like approach to deconvolution. We call this approach the Unified Deconvolution-as-Regression (UDAR) framework. While methods that fall under this framework all use a similar model, they fit using data on different scales. Two popular scales for gene expression data are logarithmic and linear. Unfortunately, each of these scales has problems in the UDAR framework. Using log-scale gene expressions proposes a biologically implausible model and using linear-scale gene expressions will lead to statistically inefficient estimators. To overcome these problems, we propose a new approach for cell-type deconvolution that works on a hybrid of the two scales. This new approach is biologically plausible and improves statistical efficiency. We compare the hybrid approach to other methods on simulations as well as a collection of eleven real benchmark datasets. Here, we find the hybrid approach to be accurate and robust.deconvolution, gene expression, microarray, RNA-seq


Sign in / Sign up

Export Citation Format

Share Document