scholarly journals Gene co-expression network connectivity is an important determinant of selective constraint

2016 ◽  
Author(s):  
Niklas Mähler ◽  
Jing Wang ◽  
Barbara K Terebieniec ◽  
Pär K Ingvarsson ◽  
Nathaniel R Street ◽  
...  

AbstractWhile several studies have investigated general properties of the genetic architecture of natural variation in gene expression, few of these have considered natural, outbreeding populations. In parallel, systems biology has established that a general feature of biological networks is that they are scale-free, rendering them buffered against random mutations. To date, few studies have attempted examine the relationship between the selective processes acting to maintain natural variation of gene expression and the associated co-expression network structure. Here we utilised RNA-Sequencing to assay gene expression in winter buds undergoing bud flush in a natural population of Populus tremula, and outbreeding forest tree species. We performed expression Quantitative Trait Locus (eQTL) mapping and identified 164,290 significant eQTLs associating 6,241 unique genes (eGenes) with 147,419 unique SNPs (eSNPs). We found approximately four times as many local as distant eQTLs, with local eQTLs having significantly higher effect sizes. eQTLs were primarily located in regulatory regions of genes (UTRs or flanking regions), regardless of whether they were local or distant. We used the gene expression data to infer a co-expression network and investigated the relationship between network topology, the genetic architecture of gene expression and signatures of selection. Within the co-expression network, eGenes were underrepresented in network module cores (hubs) and overrepresented in the periphery of the network, with a negative correlation between eQTL effect size and network connectivity. We additionally found that module core genes have experienced stronger selective constraint on coding and non-coding sequence, with connectivity associated with signatures of selection. Our integrated genetics and genomics results suggest that purifying selection is the primary mechanism underlying the genetic architecture of natural variation in gene expression assayed in flushing leaf buds of P. tremula and that connectivity within the co-expression network is linked to the strength of purifying selection.Author summaryNumerous studies have shown that many genomic polymorphisms contributing to phenotypic variation are located outside of protein coding regions, suggesting that they act by modulating gene expression. Furthermore, phenotypes are seldom explained by individual genes, but rather emerge from networks of interacting genes. The effect of regulatory variants and the interaction of genes can be described by co-expression networks, which are known to contain a small number of highly connected nodes and many more lowly connected nodes, making them robust to random mutation. While previous studies have examined the genetic architecture of gene expression variation, few were performed in natural populations with fewer still integrating the co-expression network.We undertook a study using a natural population of European aspen (Populus tremula), showing that highly connected genes within the co-expression network had lower levels of polymorphism, had polymorphisms segregating at lower frequencies and with lower than average effect sizes, suggesting purifying selection acts on central components of the network. Furthermore, the most highly connected genes within co-expression network hubs were underrepresented for identified expression quantitative trait loci, suggesting that purifying selection on individual SNPs is driven by stabilising selecting on gene expression. In contrast, genes in the periphery of the network displayed signatures of relaxed selective constraint. Highly connected genes are therefore buffered against large expression modulation, providing a mechanistic link between selective pressures and network toplogy, which act in cohort to maintain the robustness at the population level of the co-expression network derived from flushing buds in P. tremula.

2018 ◽  
Author(s):  
Eilis Hannon ◽  
Tyler J Gorrie-Stone ◽  
Melissa C Smart ◽  
Joe Burrage ◽  
Amanda Hughes ◽  
...  

ABSTRACTCharacterizing the complex relationship between genetic, epigenetic and transcriptomic variation has the potential to increase understanding about the mechanisms underpinning health and disease phenotypes. In this study, we describe the most comprehensive analysis of common genetic variation on DNA methylation (DNAm) to date, using the Illumina EPIC array to profile samples from the UK Household Longitudinal study. We identified 12,689,548 significant DNA methylation quantitative trait loci (mQTL) associations (P < 6.52x10-14) occurring between 2,907,234 genetic variants and 93,268 DNAm sites, including a large number not identified using previous DNAm-profiling methods. We demonstrate the utility of these data for interpreting the functional consequences of common genetic variation associated with > 60 human traits, using Summary data–based Mendelian Randomization (SMR) to identify 1,662 pleiotropic associations between 36 complex traits and 1,246 DNAm sites. We also use SMR to characterize the relationship between DNAm and gene expression, identifying 6,798 pleiotropic associations between 5,420 DNAm sites and the transcription of 1,702 genes. Our mQTL database and SMR results are available via a searchable online database (http://www.epigenomicslab.com/online-data-resources/) as a resource to the research community.


2020 ◽  
Author(s):  
Wen Huang ◽  
Mary Anna Carbone ◽  
Richard F. Lyman ◽  
Robert H. H. Anholt ◽  
Trudy F. C. Mackay

AbstractThe genetics of phenotypic responses to changing environments remains elusive. Using whole genome quantitative gene expression as a model, we studied how the genetic architecture of regulatory variation in gene expression changed in a population of fully sequenced inbred Drosophila melanogaster strains when flies developed at different environments (25 °C and 18 °C). We found a substantial fraction of the transcriptome exhibited genotype by environment interaction, implicating environmentally plastic genetic architecture of gene expression. Genetic variance in expression increased at 18 °C relative to 25 °C for most genes that had a change in genetic variance. Although the majority of expression quantitative trait loci (eQTLs) for the gene expression traits in the two environments were shared and had similar effects, analysis of the environment-specific eQTLs revealed enrichment of binding sites for two transcription factors. Finally, although genotype by environment interaction in gene expression could potentially disrupt genetic networks, the co-expression networks were highly conserved across environments. Genes with higher network connectivity were under stronger stabilizing selection, suggesting that stabilizing selection on expression plays an important role in promoting network robustness.


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Wen Huang ◽  
Mary Anna Carbone ◽  
Richard F. Lyman ◽  
Robert R. H. Anholt ◽  
Trudy F. C. Mackay

Abstract The genetics of phenotypic responses to changing environments remains elusive. Using whole-genome quantitative gene expression as a model, here we study how the genetic architecture of regulatory variation in gene expression changed in a population of fully sequenced inbred Drosophila melanogaster strains when flies developed in different environments (25 °C and 18 °C). We find a substantial fraction of the transcriptome exhibited genotype by environment interaction, implicating environmentally plastic genetic architecture of gene expression. Genetic variance in expression increases at 18 °C relative to 25 °C for most genes that have a change in genetic variance. Although the majority of expression quantitative trait loci (eQTLs) for the gene expression traits in the two environments are shared and have similar effects, analysis of the environment-specific eQTLs reveals enrichment of binding sites for two transcription factors. Finally, although genotype by environment interaction in gene expression could potentially disrupt genetic networks, the co-expression networks are highly conserved across environments. Genes with higher network connectivity are under stronger stabilizing selection, suggesting that stabilizing selection on expression plays an important role in promoting network robustness.


2019 ◽  
Author(s):  
Suraj Sharma ◽  
Ovidiu Popa ◽  
Stanislav Kopriva ◽  
Oliver Ebenhoeh

AbstractGlucosinolates are a fascinating class of specialised metabolites found in the plants of Brassicacea family. The variation in glucosinolate composition across different Arabidopsis ecotypes could be a result of allelic compositions at different biosynthetic loci. The contribution of methylthioalkylmalate synthase (MAM) genes to diversity of glucosinolate profiles across different Arabidopsis ecotypes has been confirmed by genetic analyses. Different MAM isoforms utilise different chain-elongated substrates for glucosinolate biosynthesis causing thus a variation in chain lengths across different Arabidopsis ecotypes. To further investigate the relationship between the genotype and the associated metabolic phenotype, we studied the diversity of genes and enzymes of glucosinolate biosynthesis. Using Shannon entropy as a measure we revealed that several genes of the pathway show a clear derivation from the expected behaviour, either accumulating non-synonymous SNPs or showing signs of purifying selection. We found that the genotype-phenotype relationship is much more complicated than inferred from the diversity of MAM synthases. We conclude therefore, that the ON/OFF feature of key QTLs is not enough to elucidate the diversity of glucosinolates across different Arabidopsis thaliana ecotypes and that glucosinolate profiles are determined also through the polymorphic residues along the coding regions of multiple metabolic genes.


2020 ◽  
Author(s):  
Carlos Ruiz-Arenas ◽  
Carles Hernandez-Ferrer ◽  
Marta Vives-Usano ◽  
Sergi Marí ◽  
Inés Quintela ◽  
...  

AbstractBackgroundThe identification of expression quantitative trait methylation (eQTMs), defined as correlations between gene expression and DNA methylation levels, might help the biological interpretation of epigenome-wide association studies (EWAS). We aimed to identify autosomal cis-eQTMs in child blood, using data from 832 children of the Human Early Life Exposome (HELIX) project.MethodsBlood DNA methylation and gene expression were measured with the Illumina 450K and the Affymetrix HTA v2 arrays, respectively. The relationship between methylation levels and expression of nearby genes (transcription start site (TSS) within a window of 1 Mb) was assessed by fitting 13.6 M linear regressions adjusting for sex, age, and cohort.ResultsWe identified 63,831 autosomal cis-eQTMs, representing 35,228 unique CpGs and 11,071 unique transcript clusters (TCs, genes). 74.3% of these cis-eQTMs were located at <250 kb, 60.0% showed an inverse relationship and 23.9% had at least one genetic variant associated with the methylation and expression levels. They were enriched for active blood regulatory regions. Adjusting for cellular composition decreased the number of cis-eQTMs to 37.7%, suggesting that some of them were cell type-specific. The overlap of child blood cis-eQTMs with those described in adults was small, and child and adult shared cis-eQTMs tended to be proximal to the TSS, enriched for genetic variants and with lower cell type specificity. Only half of the cis-eQTMs could be captured through annotation to the closest gene.ConclusionsThis catalogue of blood autosomal cis-eQTMs in children can help the biological interpretation of EWAS findings, and is publicly available at https://helixomics.isglobal.org/.


Sign in / Sign up

Export Citation Format

Share Document