scholarly journals Genomic analysis of allele-specific expression in the mouse liver

2015 ◽  
Author(s):  
Ashutosh K Pandey ◽  
Robert W Williams

Genetic differences in gene expression contribute significantly to phenotypic diversity and differences in disease susceptibility. In fact, the great majority of causal variants highlighted by genome-wide association are in non-coding regions that modulate expression. In order to quantify the extent of allelic differences in expression, we analyzed liver transcriptomes of isogenic F1 hybrid mice. Allele-specific expression (ASE) effects are pervasive and are detected in over 50% of assayed genes. Genes with strong ASE do not differ from those with no ASE with respect to their length or promoter complexity. However, they have a higher density of sequence variants, higher functional redundancy, and lower evolutionary conservation compared to genes with no ASE. Fifty percent of genes with no ASE are categorized as house-keeping genes. In contrast, the high ASE set may be critical in phenotype canalization. There is significant overlap between genes that exhibit ASE and those that exhibit strong cis expression quantitative trait loci (cis eQTLs) identified using large genetic expression data sets. Eighty percent of genes with cis eQTLs also have strong ASE effects. Conversely, 40% of genes with ASE effects are associated with strong cis eQTLs. Cis-acting variation detected at the protein level is also detected at the transcript level, but the converse is not true. ASE is a highly sensitive and direct method to quantify cis-acting variation in gene expression and complements and extends classic cis eQTL analysis. ASE differences can be combined with coding variants to produce a key resource of functional variants for precision medicine and genome-to-phenome mapping.

Blood ◽  
2017 ◽  
Vol 130 (Suppl_1) ◽  
pp. 926-926
Author(s):  
Xu Zhang ◽  
Jihyun Song ◽  
Binal N. Shah ◽  
Galina Miasnikova ◽  
Adelina Sergueeva ◽  
...  

Abstract Homozygosity for the VHLR200W mutation in Chuvash polycythemia (CP) leads to decreased degradation of the α subunits of hypoxia inducible factor (HIF)-1 and HIF-2 by the hypomorphic variant of VHL, the principal negative regulator of HIFs. The constitutively activated HIFs directly regulate the transcription of a suite of hypoxic responsible genes, including the principal regulators of erythropoiesis, vessel development, and glycolytic metabolism, which further trigger a downstream cascade of gene expression. Besides these transcriptional factors, cis acting elements play an important role in the hypoxic gene regulatory network. To assess the extent of cis regulatory variation in hypoxic gene expression, we compared allele-specific expression (ASE) in purified reticulocytes between VHLR200W homozygote individuals and age- and gender-matched wild type control individuals living at the same altitude of ~200 meters from the Chuvash population. Cell fractions of reticulocytes were purified from 17 VHLR200W homozygotes and 13 wild type individuals. Total RNA was extracted, depleted of ribosomal RNA and hemoglobin transcripts, and reverse transcribed. Strand-specific libraries were constructed for 125 bp paired-end sequencing to 30-45 million read pairs per sample using Illumina HiSeq 2500 platform. The samples were collected and processed in three batches across two years, with VHL genotype randomized in each batch. The sequencing data were mapped to human reference genome and analyzed for differential expression and differential ASE between VHLR200W homozygotes and wild type individuals. At 5% false discovery rate (FDR, i.e., <5 false positives in 100 detected genes), 1,267 genes were differentially expressed with more than 1.2-fold change in CP patients, 703 elevated and 564 decreased. Genes up-regulated in CP were enriched (fold enrichment >5, FDR <0.05) in REACTOME pathways of epigenetic remodeling (Packaging of telomere ends, DNA methylation, HDACs deacetylate histones, PRC2 methylates histones and DNA, Deposition of new CENPA-containing nucleosomes at the centromere, HATs acetylate histones) and oxidative stress induced senescence (DNA damage/telomere stress induced senescence, Senescence-associated secretory phenotype, Oxidative stress induced senescence). Genes decreased in CP were enriched in REACTOME pathways of cell cycle (E2F-enabled inhibition of pre-replication complex formation, Nuclear pore complex disassembly, SUMOylation of DNA replication proteins) and DNA damage repair (Activation of ATR in response to replication stress, SUMOylation of DNA damage response and repair proteins). ASE was analyzed between CP and wild type individuals to assess hypoxic response-dependent genetic effects on gene expression. For the 1,267 genes differentially expressed in the CP, we selected genes containing exonic SNPs with heterozygous alleles for ASE analysis. With a null hypothesis of no cis acting regulation on the gene expression, both alleles are expected to be expressed at the same level, whereas allelic imbalance indicates linked cis regulation. At a given bi-allelic SNP, individuals with ≥2 read counts covering each of the reference and alternative alleles and with ≥20 total counts were included in the analysis. Exonic SNPs with at least one individual in each of the CP and wild type group were further selected to test for differential ASE between the CP and wild type groups, using a generalized linear model. A total of 147 genes passed the filtering and were analyzed, among which 32 were detected to have significant CP-dependent ASE at 5% FDR. Some of these genes may have important roles in hypoxic responses in CP reticulocytes, for example NEIL3, encoding a DNA glycosylase that initiates the first step in base excision repair by cleaving bases damaged by reactive oxygen species, and STOM, encoding an integral membrane protein that localizes to the cell membrane of red blood cells, loss of which is associated with hereditary stomatocytosis. Our study reveals plethora of gene expression changes in CP reticulocytes compared to wild type controls, among which 22% could be regulated by hypoxic response-specific cis genetic variations. These observations indicate the prominence of cis elements in hypoxic response, for which substantial inter-individual differences exist even among a relatively isolated population. Disclosures Gordeuk: Emmaus Life Sciences: Consultancy.


2017 ◽  
Author(s):  
Narayanan Raghupathy ◽  
Kwangbom Choi ◽  
Matthew J. Vincent ◽  
Glen L. Beane ◽  
Keith Sheppard ◽  
...  

AbstractAllele-specific expression (ASE) refers to the differential abundance of the allelic copies of a transcript. Direct RNA sequencing (RNA-Seq) can provide quantitative estimates of ASE for genes with transcribed polymorphisms. However, estimating ASE is challenging due to ambiguities in read alignment. Current approaches do not account for the hierarchy of multiple read alignments to genes, isoforms, and alleles. We have developed EMASE (Expectation-Maximization for Allele Specific Expression), an integrated approach to estimate total gene expression, ASE, and isoform usage based on hierarchical allocation of multi-mapping reads. In simulations, EMASE outperforms standard ASE estimation methods. We apply EMASE to RNA-Seq data from F1 hybrid mice where we observe widespread ASE associated with cis-acting polymorphisms and a small number of parent-of-origin effects at known imprinted genes. The EMASE software is freely available under GNU license at https://github.com/churchill-lab/emase and it can be adapted to other sequencing applications.


2015 ◽  
Vol 112 (15) ◽  
pp. 4618-4623 ◽  
Author(s):  
Zhiyuan Chen ◽  
Darren E. Hagen ◽  
Christine G. Elsik ◽  
Tieming Ji ◽  
Collin James Morris ◽  
...  

Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith–Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS.


BMC Genomics ◽  
2014 ◽  
Vol 15 (1) ◽  
pp. 471 ◽  
Author(s):  
Yehudit Hasin-Brumshtein ◽  
Farhad Hormozdiari ◽  
Lisa Martin ◽  
Atila van Nas ◽  
Eleazar Eskin ◽  
...  

2019 ◽  
Author(s):  
Xi Rao ◽  
Kriti S. Thapa ◽  
Andy B Chen ◽  
Hai Lin ◽  
Hongyu Gao ◽  
...  

AbstractTranscriptome studies can identify genes whose expression differs between alcoholics and controls. To test which variants associated with alcohol use disorder (AUDs) may cause expression differences, we integrated deep RNA-seq and genome-wide association studies (GWAS) data from four postmortem brain regions of 30 AUDs subjects and 30 controls (social/non-drinkers) and analyzed allele-specific expression (ASE). We identified 90 genes with differential ASE in subjects with AUDs compared to controls. Of these, 61 genes contained 437 single nucleotide polymorphisms (SNPs) in the 3’ untranslated regions (3’UTR) with at least one heterozygote among the subjects studied. Using a modified PASSPORT-seq (parallel assessment of polymorphisms in miRNA target-sites by sequencing) assay, we identified 25 SNPs that showed affected RNA levels in a consistent manner in two neuroblastoma cell lines, SH-SY5Y and SK-N-BE(2). Many of these are in binding sites of miRNAs and RNA binding proteins, indicating that these SNPs are likely causal variants of AUD-associated differential ASE.


2019 ◽  
Author(s):  
Anna Ullastres ◽  
Miriam Merenciano ◽  
Josefa González

ABSTRACTVariation in gene expression underlies inter-individual variability in immune response. However, the mutations responsible for gene expression changes remain largely unknown. In this work, we searched for transposable element insertions present at high population frequencies and located nearby immune-related genes in Drosophila melanogaster. We identified 12 insertions associated with allele-specific expression changes in immune-related genes. We showed that transgenically induced expression changes in most of these genes are associated with differences in survival to infection with the gram-negative bacteria Pseudomonas entomophila. We provide experimental evidence suggesting a causal role for five insertions in the allele-specific expression changes observed. Furthermore, for two insertions we found a significant association with increased tolerance to bacterial infection. Our results showed for the first time that polymorphic transposable element insertions from different families drive expression changes in genes that are relevant for inter-individual differences in immune response.


2021 ◽  
Vol 12 ◽  
Author(s):  
Frédéric Jehl ◽  
Fabien Degalez ◽  
Maria Bernard ◽  
Frédéric Lecerf ◽  
Laetitia Lagoutte ◽  
...  

In addition to their common usages to study gene expression, RNA-seq data accumulated over the last 10 years are a yet-unexploited resource of SNPs in numerous individuals from different populations. SNP detection by RNA-seq is particularly interesting for livestock species since whole genome sequencing is expensive and exome sequencing tools are unavailable. These SNPs detected in expressed regions can be used to characterize variants affecting protein functions, and to study cis-regulated genes by analyzing allele-specific expression (ASE) in the tissue of interest. However, gene expression can be highly variable, and filters for SNP detection using the popular GATK toolkit are not yet standardized, making SNP detection and genotype calling by RNA-seq a challenging endeavor. We compared SNP calling results using GATK suggested filters, on two chicken populations for which both RNA-seq and DNA-seq data were available for the same samples of the same tissue. We showed, in expressed regions, a RNA-seq precision of 91% (SNPs detected by RNA-seq and shared by DNA-seq) and we characterized the remaining 9% of SNPs. We then studied the genotype (GT) obtained by RNA-seq and the impact of two factors (GT call-rate and read number per GT) on the concordance of GT with DNA-seq; we proposed thresholds for them leading to a 95% concordance. Applying these thresholds to 767 multi-tissue RNA-seq of 382 birds of 11 chicken populations, we found 9.5 M SNPs in total, of which ∼550,000 SNPs per tissue and population with a reliable GT (call rate ≥ 50%) and among them, ∼340,000 with a MAF ≥ 10%. We showed that such RNA-seq data from one tissue can be used to (i) detect SNPs with a strong predicted impact on proteins, despite their scarcity in each population (16,307 SIFT deleterious missenses and 590 stop-gained), (ii) study, on a large scale, cis-regulations of gene expression, with ∼81% of protein-coding and 68% of long non-coding genes (TPM ≥ 1) that can be analyzed for ASE, and with ∼29% of them that were cis-regulated, and (iii) analyze population genetic using such SNPs located in expressed regions. This work shows that RNA-seq data can be used with good confidence to detect SNPs and associated GT within various populations and used them for different analyses as GTEx studies.


Sign in / Sign up

Export Citation Format

Share Document