scholarly journals Co-SELECT reveals sequence non-specific contribution of DNA shape to transcription factor binding in vitro

2018 ◽  
Author(s):  
Soumitra Pal ◽  
Jan Hoinka ◽  
Teresa M. Przytycka

AbstractUnderstanding the principles of DNA binding by transcription factors (TFs) is of primary importance for studying gene regulation. Recently, several lines of evidence suggested that both DNA sequence and shape contribute to TF binding. However, the question if in the absence of any sequence similarity to the binding motif, DNA shape can still increase probability of binding was yet to be addressed.To address this challenge, we developed Co-SELECT, a computational approach to analyze the results of in vitro HT-SELEX experiments for TF-DNA binding. Specifically, the presence of motif-free sequences in late HT-SELEX rounds and their enrichment in weak binders allowed us to detect evidence for the role of DNA shape features in TF binding.Our approach revealed that, even in the absence of the sequence motif, TFs have propensity to weakly bind to DNA molecules enriched in specific shape features. Surprisingly, we also found that some properties of DNA shape contribute to promiscuous binding of all tested TF families. Strikingly, such promiscuously bound shapes correspond to the most frequent shape formed by the DNA. We propose that this promiscuous binding facilitates diffusing of TFs along the DNA molecule before it is locked in its binding site.

2019 ◽  
Vol 47 (13) ◽  
pp. 6632-6641 ◽  
Author(s):  
Soumitra Pal ◽  
Jan Hoinka ◽  
Teresa M Przytycka

Abstract Understanding the principles of DNA binding by transcription factors (TFs) is of primary importance for studying gene regulation. Recently, several lines of evidence suggested that both DNA sequence and shape contribute to TF binding. However, the following compelling question is yet to be considered: in the absence of any sequence similarity to the binding motif, can DNA shape still increase binding probability? To address this challenge, we developed Co-SELECT, a computational approach to analyze the results of in vitro HT-SELEX experiments for TF–DNA binding. Specifically, Co-SELECT leverages the presence of motif-free sequences in late HT-SELEX rounds and their enrichment in weak binders allows Co-SELECT to detect an evidence for the role of DNA shape features in TF binding. Our approach revealed that, even in the absence of the sequence motif, TFs have propensity to bind to DNA molecules of the shape consistent with the motif specific binding. This provides the first direct evidence that shape features that accompany the preferred sequence motifs also bestow an advantage for weak, sequence non-specific binding.


2015 ◽  
Vol 112 (15) ◽  
pp. 4654-4659 ◽  
Author(s):  
Tianyin Zhou ◽  
Ning Shen ◽  
Lin Yang ◽  
Namiko Abe ◽  
John Horton ◽  
...  

DNA binding specificities of transcription factors (TFs) are a key component of gene regulatory processes. Underlying mechanisms that explain the highly specific binding of TFs to their genomic target sites are poorly understood. A better understanding of TF−DNA binding requires the ability to quantitatively model TF binding to accessible DNA as its basic step, before additional in vivo components can be considered. Traditionally, these models were built based on nucleotide sequence. Here, we integrated 3D DNA shape information derived with a high-throughput approach into the modeling of TF binding specificities. Using support vector regression, we trained quantitative models of TF binding specificity based on protein binding microarray (PBM) data for 68 mammalian TFs. The evaluation of our models included cross-validation on specific PBM array designs, testing across different PBM array designs, and using PBM-trained models to predict relative binding affinities derived from in vitro selection combined with deep sequencing (SELEX-seq). Our results showed that shape-augmented models compared favorably to sequence-based models. Although both k-mer and DNA shape features can encode interdependencies between nucleotide positions of the binding site, using DNA shape features reduced the dimensionality of the feature space. In addition, analyzing the feature weights of DNA shape-augmented models uncovered TF family-specific structural readout mechanisms that were not revealed by the DNA sequence. As such, this work combines knowledge from structural biology and genomics, and suggests a new path toward understanding TF binding and genome function.


Author(s):  
Jun-Xian Du ◽  
Yi-Hong Luo ◽  
Si-Jia Zhang ◽  
Biao Wang ◽  
Cong Chen ◽  
...  

Abstract Background Intensive evidence has highlighted the effect of aberrant alternative splicing (AS) events on cancer progression when triggered by dysregulation of the SR protein family. Nonetheless, the underlying mechanism in breast cancer (BRCA) remains elusive. Here we sought to explore the molecular function of SRSF1 and identify the key AS events regulated by SRSF1 in BRCA. Methods We conducted a comprehensive analysis of the expression and clinical correlation of SRSF1 in BRCA based on the TCGA dataset, Metabric database and clinical tissue samples. Functional analysis of SRSF1 in BRCA was conducted in vitro and in vivo. SRSF1-mediated AS events and their binding motifs were identified by RNA-seq, RNA immunoprecipitation-PCR (RIP-PCR) and in vivo crosslinking followed by immunoprecipitation (CLIP), which was further validated by the minigene reporter assay. PTPMT1 exon 3 (E3) AS was identified to partially mediate the oncogenic role of SRSF1 by the P-AKT/C-MYC axis. Finally, the expression and clinical significance of these AS events were validated in clinical samples and using the TCGA database. Results SRSF1 expression was consistently upregulated in BRCA samples, positively associated with tumor grade and the Ki-67 index, and correlated with poor prognosis in a hormone receptor-positive (HR+) cohort, which facilitated proliferation, cell migration and inhibited apoptosis in vitro and in vivo. We identified SRSF1-mediated AS events and discovered the SRSF1 binding motif in the regulation of splice switching of PTPMT1. Furthermore, PTPMT1 splice switching was regulated by SRSF1 by binding directly to its motif in E3 which partially mediated the oncogenic role of SRSF1 by the AKT/C-MYC axis. Additionally, PTPMT1 splice switching was validated in tissue samples of BRCA patients and using the TCGA database. The high-risk group, identified by AS of PTPMT1 and expression of SRSF1, possessed poorer prognosis in the stage I/II TCGA BRCA cohort. Conclusions SRSF1 exerts oncogenic roles in BRCA partially by regulating the AS of PTPMT1, which could be a therapeutic target candidate in BRCA and a prognostic factor in HR+ BRCA patient.


2021 ◽  
Vol 9 (1) ◽  
pp. 6
Author(s):  
Narendra Pratap Singh ◽  
Bony De Kumar ◽  
Ariel Paulson ◽  
Mark E. Parrish ◽  
Carrie Scott ◽  
...  

Knowledge of the diverse DNA binding specificities of transcription factors is important for understanding their specific regulatory functions in animal development and evolution. We have examined the genome-wide binding properties of the mouse HOXB1 protein in embryonic stem cells differentiated into neural fates. Unexpectedly, only a small number of HOXB1 bound regions (7%) correlate with binding of the known HOX cofactors PBX and MEIS. In contrast, 22% of the HOXB1 binding peaks display co-occupancy with the transcriptional repressor REST. Analyses revealed that co-binding of HOXB1 with PBX correlates with active histone marks and high levels of expression, while co-occupancy with REST correlates with repressive histone marks and repression of the target genes. Analysis of HOXB1 bound regions uncovered enrichment of a novel 15 base pair HOXB1 binding motif HB1RE (HOXB1 response element). In vitro template binding assays showed that HOXB1, PBX1, and MEIS can bind to this motif. In vivo, this motif is sufficient for direct expression of a reporter gene and over-expression of HOXB1 selectively represses this activity. Our analyses suggest that HOXB1 has evolved an association with REST in gene regulation and the novel HB1RE motif contributes to HOXB1 function in part through a repressive role in gene expression.


2005 ◽  
Vol 79 (13) ◽  
pp. 8661-8664 ◽  
Author(s):  
Stephen Schuck ◽  
Arne Stenlund

ABSTRACT Viral initiator proteins are polypeptides that form oligomeric complexes on the origin of DNA replication (ori). These complexes carry out a multitude of functions related to initiation of DNA replication, and although many of these functions have been characterized biochemically, little is understood about how the complexes are assembled. Here we demonstrate that loss of one particular interaction, the dimerization between E1 DNA binding domains, has a severe effect on DNA replication in vivo but has surprisingly modest effects on most individual biochemical activities in vitro. We conclude that the dimer interaction is primarily required for initial recognition of ori.


1993 ◽  
Vol 13 (2) ◽  
pp. 852-860
Author(s):  
M B Toledano ◽  
D Ghosh ◽  
F Trinh ◽  
W J Leonard

We previously reported that either oxidation or alkylation of NF-kappa B in vitro abrogates DNA binding. We used this phenomenon to help elucidate structural determinants of NF-kappa B binding. We now demonstrate that Cys-62 of NF-kappa B p50 mediates the redox effect and lies within an N-terminal region required for DNA binding but not for dimerization. Several point mutations in this region confer a transdominant negative binding phenotype to p50. The region is highly conserved in all Rel family proteins, and we have determined that it is also critical for DNA binding of NF-kappa B p65. Replacement of the N-terminal region of p65 with the corresponding region from p50 changes its DNA-binding specificity towards that of p50. These data suggest that the N-terminal regions of p50 and p65 are critical for DNA binding and help determine the DNA-binding specificities of p50 and p65. We have defined within the N-terminal region a sequence motif, R(F/G)(R/K)YXCE, which is present in Rel family proteins and also in zinc finger proteins capable of binding to kappa B sites. The potential significance of this finding is discussed.


2004 ◽  
Vol 24 (5) ◽  
pp. 2091-2102 ◽  
Author(s):  
Chao Wei ◽  
Carolyn M. Price

ABSTRACT Pot1 is a single-stranded-DNA-binding protein that recognizes telomeric G-strand DNA. It is essential for telomere capping in Saccharomyces pombe and regulates telomere length in humans. Human Pot1 also interacts with proteins that bind the duplex region of the telomeric tract. Thus, like Cdc13 from S. cerevisiae, Pot 1 may have multiple roles at the telomere. We show here that endogenous chicken Pot1 (cPot1) is present at telomeres during periods of the cell cycle when t loops are thought to be present. Since cPot1 can bind internal loops and directly adjacent DNA-binding sites, it is likely to fully coat and protect both G-strand overhangs and the displaced G strand of a t loop. The minimum binding site of cPot1 is double that of the S. pombe DNA-binding domain. Although cPot can self associate, dimerization is not required for DNA binding and hence does not explain the binding-site duplication. Instead, the DNA-binding domain appears to be extended to contain a second binding motif in addition to the conserved oligonucleotide-oligosaccharide (OB) fold present in other G-strand-binding proteins. This second motif could be another OB fold. Although dimerization is inefficient in vitro, it may be regulated in vivo and could promote association with other telomere proteins and/or telomere compaction.


Genetics ◽  
1997 ◽  
Vol 145 (2) ◽  
pp. 467-478 ◽  
Author(s):  
Rodger Voelker ◽  
Janet Mendel-Hartvig ◽  
Alice Barkan

A nuclear mutant of maize, tha1, which exhibited defects in the translocation of proteins across the thylakoid membrane, was described previously. A transposon insertion at the tha1 locus facilitated the cloning of portions of the tha1 gene. Strong sequence similarity with secA genes from bacteria, pea and spinach indicates that tha1 encodes a SecA homologue (cp-SecA). The tha1-ref allele is either null or nearly so, in that tha1 mRNA is undetectable in mutant leaves and cp-SecA accumulation is reduced ≥40-fold. These results, in conjunction with the mutant phenotype described previously, demonstrate that cp-SecA functions in vivo to facilitate the translocation of OEC33, PSI-F and plastocyanin but does not function in the translocation of OEC23 and OEC16. Our results confirm predictions for cp-Sed function made from the results of in vitro experiments and establish several new functions for cp-SecA, including roles in the targeting of a chloroplast-encoded protein, cytochrome f, and in protein targeting in the etioplast, a nonphotosynthetic plastid type. Our finding that the accumulation of properly targeted plastocyanin and cytochrome f in tha1-ref thylakoid membranes is reduced only a few-fold despite the near or complete absence of cp-SecA suggests that cp-SecA facilitates but is not essential in vivo for their translocation across the membrane.


2003 ◽  
Vol 185 (22) ◽  
pp. 6728-6731 ◽  
Author(s):  
Pekka Rappu ◽  
Terhi Pullinen ◽  
Pekka Mäntsälä

ABSTRACT The Bacillus subtilis PurR mediates adenine repression and guanosine induction of purA. PRPP inhibits binding of PurR to DNA in vitro. Mutations in the PRPP binding motif of PurR caused strong repression regardless of purine exclusions or additions, establishing the role of PRPP as regulator of PurR.


1994 ◽  
Vol 300 (1) ◽  
pp. 7-13 ◽  
Author(s):  
J M Candy ◽  
R G Duggleby

Several enzymes require thiamin diphosphate (ThDP) as an essential cofactor, and we have used one of these, pyruvate decarboxylase (PDC; EC 4.1.1.1) from Zymomonas mobilis, as a model for this group of enzymes. It is well suited for this purpose because of its stability, ease of purification and its simple kinetic properties. A sequence motif of approx. 30 residues, beginning with a glycine-aspartate-glycine (-GDG-) triplet and ending with a double asparagine (-NN-) sequence, has been identified in many of these enzymes [Hawkins, Borges and Perham (1989) FEBS Lett. 255, 77-82]. Other residues within this putative ThDP-binding motif are conserved, but to a lesser extent, including a glutamate and a proline residue. The role of the elements of this motif has been clarified by the determination of the three-dimensional structure of three of these enzymes [Muller, Lindqvist, Furey, Schulz, Jordan and Schneider (1993) Structure 1, 95-103]. Four of the residues within this motif were modified by site-directed mutagenesis of the cloned PDC gene to evaluate their role in cofactor binding. The mutant proteins were expressed in Escherichia coli and found to purify normally, indicating that the tertiary structure of these enzymes had not been grossly perturbed by the amino acid substitutions. We have shown previously [Diefenbach, Candy, Mattick and Duggleby (1992) FEBS Lett. 296, 95-98] that changing the aspartate in the -GDG- sequence to glycine, threonine or asparagine yields an inactive enzyme that is unable to bind ThDP, therefore verifying the role of the ThDP-binding motif. Here we demonstrate that substitution with glutamate yields an active enzyme with a greatly reduced affinity for both ThDP and Mg2+, but with normal kinetics for pyruvate. Unlike the wild-type tetrameric enzyme, this mutant protein usually exists as a dimer. Replacement of the second asparagine of the -NN- sequence by glutamine also yields an inactive enzyme which is unable to bind ThDP, whereas replacement with an aspartate residue results in an active enzyme with a reduced affinity for ThDP but which displays normal kinetics for both Mg2+ and pyruvate. Replacing the conserved glutamate with aspartate did not alter the properties of the enzyme, while the conserved proline, thought to be required for structural reasons, could be substituted with glycine or alanine without inactivating the enzyme, but these changes did reduce its stability.


Sign in / Sign up

Export Citation Format

Share Document