scholarly journals Predicting gene expression in massively parallel reporter assays: A comparative study

2017 ◽  
Vol 38 (9) ◽  
pp. 1240-1250 ◽  
Author(s):  
Anat Kreimer ◽  
Haoyang Zeng ◽  
Matthew D. Edwards ◽  
Yuchun Guo ◽  
Kevin Tian ◽  
...  
2018 ◽  
Author(s):  
Talitha Forcier ◽  
Andalus Ayaz ◽  
Manraj S. Gill ◽  
Daniel Jones ◽  
Rob Phillips ◽  
...  

AbstractGene expression in all organisms is controlled by cooperative interactions between DNA-bound transcription factors (TFs), but quantitatively measuring TF-DNA and TF-TF interactions remains difficult. Here we introduce a strategy for precisely measuring the Gibbs free energy of such interactions in living cells. This strategy centers on the measurement and modeling of “allelic manifolds”, a multidimensional generalization of the classical genetics concept of allelic series. Allelic manifolds are measured using reporter assays performed on strategically designed cis-regulatory sequences. Quantitative biophysical models are then fit to the resulting data. We used this strategy to study regulation by twoEscherichia coliTFs, CRP and σ70RNA polymerase. Doing so, we consistently obtained energetic measurements precise to ~ 0.1 kcal/mol. We also obtained multiple results that deviate from the prior literature. Our strategy is compatible with massively parallel reporter assays in both prokaryotes and eukaryotes, and should therefore be highly scalable and broadly applicable.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Alexander Schmitz ◽  
Fuzhong Zhang

Abstract Background Cell-to-cell variation in gene expression strongly affects population behavior and is key to multiple biological processes. While codon usage is known to affect ensemble gene expression, how codon usage influences variation in gene expression between single cells is not well understood. Results Here, we used a Sort-seq based massively parallel strategy to quantify gene expression variation from a green fluorescent protein (GFP) library containing synonymous codons in Escherichia coli. We found that sequences containing codons with higher tRNA Adaptation Index (TAI) scores, and higher codon adaptation index (CAI) scores, have higher GFP variance. This trend is not observed for codons with high Normalized Translation Efficiency Index (nTE) scores nor from the free energy of folding of the mRNA secondary structure. GFP noise, or squared coefficient of variance (CV2), scales with mean protein abundance for low-abundant proteins but does not change at high mean protein abundance. Conclusions Our results suggest that the main source of noise for high-abundance proteins is likely not originating at translation elongation. Additionally, the drastic change in mean protein abundance with small changes in protein noise seen from our library implies that codon optimization can be performed without concerning gene expression noise for biotechnology applications.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Julius Judd ◽  
Hayley Sanderson ◽  
Cédric Feschotte

Abstract Background Transposable elements are increasingly recognized as a source of cis-regulatory variation. Previous studies have revealed that transposons are often bound by transcription factors and some have been co-opted into functional enhancers regulating host gene expression. However, the process by which transposons mature into complex regulatory elements, like enhancers, remains poorly understood. To investigate this process, we examined the contribution of transposons to the cis-regulatory network controlling circadian gene expression in the mouse liver, a well-characterized network serving an important physiological function. Results ChIP-seq analyses reveal that transposons and other repeats contribute ~ 14% of the binding sites for core circadian regulators (CRs) including BMAL1, CLOCK, PER1/2, and CRY1/2, in the mouse liver. RSINE1, an abundant murine-specific SINE, is the only transposon family enriched for CR binding sites across all datasets. Sequence analyses and reporter assays reveal that the circadian regulatory activity of RSINE1 stems from the presence of imperfect CR binding motifs in the ancestral RSINE1 sequence. These motifs matured into canonical motifs through point mutations after transposition. Furthermore, maturation occurred preferentially within elements inserted in the proximity of ancestral CR binding sites. RSINE1 also acquired motifs that recruit nuclear receptors known to cooperate with CRs to regulate circadian gene expression specifically in the liver. Conclusions Our results suggest that the birth of enhancers from transposons is predicated both by the sequence of the transposon and by the cis-regulatory landscape surrounding their genomic integration site.


2021 ◽  
Author(s):  
Subramaniyam Ravichandran ◽  
Maria Razzaq ◽  
Nazia Parveen ◽  
Ambarnil Ghosh ◽  
Kyeong Kyu Kim

Abstract G-quadruplex (G4), a four-stranded DNA or RNA structure containing stacks of guanine tetrads, plays regulatory roles in many cellular functions. So far, conventional G4s containing loops of 1–7 nucleotides have been widely studied. Increasing experimental evidence suggests that unconventional G4s, such as G4s containing long loops (long-loop G4s), play a regulatory role in the genome by forming a stable structure. Other secondary structures such as hairpins in the loop might thus contribute to the stability of long-loop G4s. Therefore, investigation of the effect of the hairpin-loops on the structure and function of G4s is required. In this study, we performed a systematic biochemical investigation of model G4s containing long loops with various sizes and structures. We found that the long-loop G4s are less stable than conventional G4s, but their stability increased when the loop forms a hairpin (hairpin-G4). We also verified the biological significance of hairpin-G4s by showing that hairpin-G4s present in the genome also form stable G4s and regulate gene expression as confirmed by in cellulo reporter assays. This study contributes to expanding the scope and diversity of G4s, thus facilitating future studies on the role of G4s in the human genome.


Sign in / Sign up

Export Citation Format

Share Document