scholarly journals Genetic drivers of repeat expansion disorders localize to 3-D chromatin domain boundaries

2017 ◽  
Author(s):  
James Sun ◽  
Linda Zhou ◽  
Daniel J. Emerson ◽  
Thomas G. Gilgenast ◽  
Katelyn Titus ◽  
...  

AbstractMore than 25 inherited neurological disorders are caused by the unstable expansion of repetitive DNA sequences termed short tandem repeats (STRs). A fundamental unresolved question is why specific STRs are susceptible to unstable expansion leading to severe pathology, whereas tens of thousands of normal-length repeat tracts across the human genome are relatively stable. Here, we unexpectedly discover that nearly all STRs associated with repeat expansion diseases are located at boundaries demarcating 3-D chromatin domains. We find that boundaries exhibit markedly higher CpG island density compared to loci internal to domains. Importantly, disease-associated STRs are specifically localized to ultra-dense CpG island-rich boundaries, suggesting that these loci might be hotspots for epigenetic instability and topological disruption upon unstable expansion. In Fragile X Syndrome, mutation-length expansion at the Fmr1 gene results in severe disruption of the boundary between TADs. Our data uncover higher-order chromatin architecture as a new dimension in understanding the mechanistic basis of repeat expansion disorders.

2021 ◽  
Author(s):  
Linda Zhou ◽  
Chunmin Ge ◽  
Thomas Malachowski ◽  
Ji Hun Kim ◽  
Keerthivasan Raanin Chandradoss ◽  
...  

AbstractShort tandem repeat (STR) instability is causally linked to pathologic transcriptional silencing in a subset of repeat expansion disorders. In fragile X syndrome (FXS), instability of a single CGG STR tract is thought to repress FMR1 via local DNA methylation. Here, we report the acquisition of more than ten Megabase-sized H3K9me3 domains in FXS, including a 5-8 Megabase block around FMR1. Distal H3K9me3 domains encompass synaptic genes with STR instability, and spatially co-localize in trans concurrently with FMR1 CGG expansion and the dissolution of TADs. CRISPR engineering of mutation-length FMR1 CGG to normal-length preserves heterochromatin, whereas cut-out to pre-mutation-length attenuates a subset of H3K9me3 domains. Overexpression of a pre-mutation-length CGG de-represses both FMR1 and distal heterochromatinized genes, indicating that long-range H3K9me3-mediated silencing is exquisitely sensitive to STR length. Together, our data uncover a genome-wide surveillance mechanism by which STR tracts spatially communicate over vast distances to heterochromatinize the pathologically unstable genome in FXS.One-Sentence SummaryHeterochromatinization of distal synaptic genes with repeat instability in fragile X is reversible by overexpression of a pre-mutation length CGG tract.


2021 ◽  
Author(s):  
Igor Stevanovski ◽  
Sanjog R. Chintalaphani ◽  
Hasindu Gamaarachchi ◽  
James M. Ferguson ◽  
Sandy S. Pineda ◽  
...  

ABSTRACTShort-tandem repeat (STR) expansions are an important class of pathogenic genetic variants. Over forty neurological and neuromuscular diseases are caused by STR expansions, with 37 different genes implicated to date. Here we describe the use of programmable targeted long-read sequencing with Oxford Nanopore’s ReadUntil function for parallel genotyping of all known neuropathogenic STRs in a single, simple assay. Our approach enables accurate, haplotype-resolved assembly and DNA methylation profiling of expanded and non-expanded STR sites. In doing so, the assay correctly diagnoses all individuals in a cohort of patients (n = 27) with various neurogenetic diseases, including Huntington’s disease, fragile X syndrome and cerebellar ataxia (CANVAS) and others. Targeted long-read sequencing solves large and complex STR expansions that confound established molecular tests and short-read sequencing, and identifies non-canonical STR motif conformations and internal sequence interruptions. Even in our relatively small cohort, we observe a wide diversity of STR alleles of known and unknown pathogenicity, suggesting that long-read sequencing will redefine the genetic landscape of STR expansion disorders. Finally, we show how the flexible inclusion of pharmacogenomics (PGx) genes as secondary ReadUntil targets can identify clinically actionable PGx genotypes to further inform patient care, at no extra cost. Our study addresses the need for improved techniques for genetic diagnosis of STR expansion disorders and illustrates the broad utility of programmable long-read sequencing for clinical genomics.One sentence summaryThis study describes the development and validation of a programmable targeted nanopore sequencing assay for parallel genetic diagnosis of all known pathogenic short-tandem repeats (STRs) in a single, simple test.


1998 ◽  
Vol 201 (8) ◽  
pp. 1099-1117 ◽  
Author(s):  
R Hardison

The discovery of hemoglobins in virtually all kingdoms of organisms has shown (1) that the ancestral gene for hemoglobin is ancient, and (2) that hemoglobins can serve additional functions besides transport of oxygen between tissues, ranging from intracellular oxygen transport to catalysis of redox reactions. These different functions of the hemoglobins illustrate the acquisition of new roles by a pre-existing structural gene, which requires changes not only in the coding regions but also in the regulatory elements of the genes. The evolution of different regulated functions within an ancient gene family allows an examination of the types of biosequence data that are informative for various types of issues. Alignment of amino acid sequences is informative for the phylogenetic relationships among the hemoglobins in bacteria, fungi, protists, plants and animals. Although many of these diverse hemoglobins are induced by low oxygen concentrations, to date none of the molecular mechanisms for their hypoxic induction shows common regulatory proteins; hence, a search for matches in non-coding DNA sequences would not be expected to be fruitful. Indeed, alignments of non-coding DNA sequences do not reveal significant matches even between mammalian alpha- and beta-globin gene clusters, which diverged approximately 450 million years ago and are still expressed in a coordinated and balanced manner. They are in very different genomic contexts that show pronounced differences in regulatory mechanisms. The alpha-globin gene is in constitutively active chromatin and is encompassed by a CpG island, which is a dominant determinant of its regulation, whereas the beta-globin gene is in A+T-rich genomic DNA. Non-coding sequence matches are not seen between avian and mammalian beta-globin gene clusters, which diverged approximately 250 million years ago, despite the fact that regulation of both gene clusters requires tissue-specific activation of a chromatin domain regulated by a locus control region. The cis-regulatory sequences needed for domain opening and enhancement do show common binding sites for transcription factors. In contrast, alignments of non-coding sequences from species representing multiple eutherian mammalian orders, some of which diverged as long as 135 million years ago, are reliable predictors of novel cis-regulatory elements, both proximal and distal to the genes. Examples include a potential target for the hematopoietic transcription factor TAL1.


2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Sanjog R. Chintalaphani ◽  
Sandy S. Pineda ◽  
Ira W. Deveson ◽  
Kishore R. Kumar

Abstract Background Short tandem repeat (STR) expansion disorders are an important cause of human neurological disease. They have an established role in more than 40 different phenotypes including the myotonic dystrophies, Fragile X syndrome, Huntington’s disease, the hereditary cerebellar ataxias, amyotrophic lateral sclerosis and frontotemporal dementia. Main body STR expansions are difficult to detect and may explain unsolved diseases, as highlighted by recent findings including: the discovery of a biallelic intronic ‘AAGGG’ repeat in RFC1 as the cause of cerebellar ataxia, neuropathy, and vestibular areflexia syndrome (CANVAS); and the finding of ‘CGG’ repeat expansions in NOTCH2NLC as the cause of neuronal intranuclear inclusion disease and a range of clinical phenotypes. However, established laboratory techniques for diagnosis of repeat expansions (repeat-primed PCR and Southern blot) are cumbersome, low-throughput and poorly suited to parallel analysis of multiple gene regions. While next generation sequencing (NGS) has been increasingly used, established short-read NGS platforms (e.g., Illumina) are unable to genotype large and/or complex repeat expansions. Long-read sequencing platforms recently developed by Oxford Nanopore Technology and Pacific Biosciences promise to overcome these limitations to deliver enhanced diagnosis of repeat expansion disorders in a rapid and cost-effective fashion. Conclusion We anticipate that long-read sequencing will rapidly transform the detection of short tandem repeat expansion disorders for both clinical diagnosis and gene discovery.


2013 ◽  
Vol 2013 ◽  
pp. 1-9 ◽  
Author(s):  
Mohammadmersad Ghorbani ◽  
Simon J. E. Taylor ◽  
Mark A. Pook ◽  
Annette Payne

Previous studies have examined DNA methylation in different trinucleotide repeat diseases. We have combined this data and used a pattern searching algorithm to identify motifs in the DNA surrounding aberrantly methylated CpGs found in the DNA of patients with one of the three trinucleotide repeat (TNR) expansion diseases: fragile X syndrome (FRAXA), myotonic dystrophy type I (DM1), or Friedreich’s ataxia (FRDA). We examined sequences surrounding both the variably methylated (VM) CpGs, which are hypermethylated in patients compared with unaffected controls, and the nonvariably methylated CpGs which remain either always methylated (AM) or never methylated (NM) in both patients and controls. Using the J48 algorithm of WEKA analysis, we identified that two patterns are all that is necessary to classify our three regions CCGG* which is found in VM and not in AM regions and AATT* which distinguished between NM and VM + AM using proportional frequency. Furthermore, comparing our software with MEME software, we have demonstrated that our software identifies more patterns than MEME in these short DNA sequences. Thus, we present evidence that the DNA sequence surrounding CpG can influence its susceptibility to bede novomethylated in a disease state associated with a trinucleotide repeat.


Cell ◽  
2018 ◽  
Vol 175 (1) ◽  
pp. 224-238.e15 ◽  
Author(s):  
James H. Sun ◽  
Linda Zhou ◽  
Daniel J. Emerson ◽  
Sai A. Phyo ◽  
Katelyn R. Titus ◽  
...  

Author(s):  
Pierre Murat ◽  
Guillaume Guilbaud ◽  
Julian E. Sale

AbstractBackgroundShort tandem repeats (STRs) contribute significantly to de novo mutagenesis, driving phenotypic diversity and genetic disease. Although highly diverse, their repetitive sequences induce DNA polymerase slippage and stalling, leading to length and sequence variation. However, current studies of DNA synthesis through STRs are restricted to a handful of selected sequences, limiting our broader understanding of their evolutionary behaviour and hampering the characterisation of the determinants of their abundance and stability in eukaryotic genomes.ResultsWe perform a comprehensive analysis of DNA synthesis at all STR permutations and interrogate the impact of STR sequence and secondary structure on their genomic representation and mutability. To do so, we developed a high-throughput primer extension assay that allows monitoring of the kinetics and fidelity of DNA synthesis through 20,000 sequences comprising all STR permutations in different lengths. By combining these measurements with population-scale genomic data, we show that the response of a model replicative DNA polymerase to variously structured DNA is sufficient to predict the complex genomic behaviour of STRs, including abundance and mutational constraints. We demonstrate that DNA polymerase stalling at DNA structures induces error-prone DNA synthesis, which constrains STR expansion.ConclusionsOur data support a model in which STR length in eukaryotic genomes results from a balance between expansion due to polymerase slippage at repeated DNA sequences and point mutations caused by error-prone DNA synthesis at DNA structures.


2021 ◽  
Author(s):  
Harriet Dashnow ◽  
Brent S. Pedersen ◽  
Laurel Hiatt ◽  
Joe Brown ◽  
Sarah J. Beecroft ◽  
...  

Expansions of short tandem repeats (STRs) cause dozens of rare Mendelian diseases. However, STR expansions, especially those arising from repeats not present in the reference genome, are challenging to detect from short-read sequencing data. Such "novel" STRs include new repeat units occurring at known STR loci, or entirely new STR loci where the sequence is absent from the reference genome. A primary cause of difficulty detecting STR expansions is that reads arising from STR expansions are frequently mismapped or unmapped. To address this challenge, we have developed STRling, a new STR detection algorithm that counts k-mers (short DNA sequences of length k) in DNA sequencing reads, to efficiently recover reads that inform the presence and size of STR expansions. As a result, STRling can call expansions at both known and novel STR loci. STRling has a sensitivity of 83% for 14 known STR disease loci, including the novel STRs that cause CANVAS and DBQD2. It is the first method to resolve the position of novel STR expansions to base pair accuracy. Such accuracy is essential to interpreting the consequence of each expansion. STRling has an estimated 0.078 false discovery rate for known pathogenic loci in unaffected individuals and a 0.20 false discovery rate for genome-wide loci in unaffected individuals when using variants called from long-read data as truth. STRling is fast, scalable on cloud computing, open-source, and freely available at https://github.com/quinlan-lab/STRling.


2020 ◽  
Vol 51 (6) ◽  
pp. 635-641 ◽  
Author(s):  
Kristina Gvozdjan ◽  
Heather Casey ◽  
Carrie Mowery ◽  
Lorie Kumer ◽  
Carolyn Fisher ◽  
...  

Abstract Chimerism testing by short tandem repeats (STRs) is used to monitor engraftment after allogeneic hematopoietic stem cell transplantation (HSCT). Generally, STR alleles are stable and transferred from parent to child or from donor to recipient. However, 3 cases did not follow this norm. Additional work-up with help from forensic literature solved these mysteries. In case 1, the patient received HSCT from his son. The son shared STR alleles in 22/23 loci except Penta E, which was explained by repeat expansion in the son. In case 2, the patient had been in remission for 14 years after HSCT for lymphoma and developed repeat expansion in CSF1PO in granulocytes. In case 3, a pre-HSCT patient demonstrated 3 alleles, with 2 peaks taller than the third, in the FGA locus (chromosome 4). A combination of a triallelic variant and leukemia-associated trisomy 4 explained the finding. STR number variants are rare and clinically inconsequential but can overlap malignancy-associated, clinically significant changes.


Sign in / Sign up

Export Citation Format

Share Document