scholarly journals Refined detection and phasing of structural aberrations in pediatric acute lymphoblastic leukemia by linked-read whole genome sequencing

2018 ◽  
Author(s):  
Jessica Nordlund ◽  
Yanara Marincevic-Zuniga ◽  
Lucia Cavelier ◽  
Amanda Raine ◽  
Tom Martin ◽  
...  

ABSTRACTStructural chromosomal rearrangements that may lead to in-frame gene-fusions represent a leading source of information for diagnosis, risk stratification, and prognosis in pediatric acute lymphoblastic leukemia (ALL). However, short-read whole genome sequencing (WGS) technologies struggle to accurately identify and phase such large-scale chromosomal aberrations in cancer genomes. We therefore evaluated linked-read WGS for detection of chromosomal rearrangements in an ALL cell line (REH) and primary samples of varying DNA quality from 12 patients diagnosed with ALL. We assessed the effect of input DNA quality on phased haplotype block size and the detectability of copy number aberrations (CNAs) and structural variants (SVs). Biobanked DNA isolated by standard column-based extraction methods was sufficient to detect chromosomal rearrangements even at low 10x sequencing coverage. Linked-read WGS enabled precise, allele-specific, digital karyotyping at a base-pair resolution for a wide range of structural variants including complex rearrangements and aneuploidy assessment. With use of haplotype information from the linked-reads, we also identified additional structural variants, such as a compound heterozygous deletion of ERG in a patient with the DUX4-IGH fusion gene. Thus, linked-read WGS allows detection of important pathogenic variants in ALL genomes at a resolution beyond that of traditional karyotyping or short-read WGS.

Blood ◽  
2011 ◽  
Vol 118 (21) ◽  
pp. 68-68
Author(s):  
Jinghui Zhang ◽  
Li Ding ◽  
Linda Holmfeldt ◽  
Gang Wu ◽  
Susan L. Heatley ◽  
...  

Abstract Abstract 68 Early T-cell precursor acute lymphoblastic leukemia (ETP ALL) is characterized by an immature T-lineage immunophenotype (cCD3+, CD1a-, CD8- and CD5dim) aberrant expression of myeloid and stem cell markers, a distinct gene expression profile and very poor outcome. The underlying genetic basis of this form of leukemia is unknown. Here we report results of whole genome sequencing (WGS) of tumor and normal DNA from 12 children with ETP ALL. Genomes were sequenced to 30-fold haploid coverage using the Illumina GAIIx platform, and all putative somatic sequence and structural variants were validated. The frequency of mutations in 43 genes was assessed in a recurrence cohort of 52 ETP and 42 non-ETP T-ALL samples from patients enrolled in St Jude, Children's Oncology Group and AEIOP trials. Transcriptomic resequencing was performed for two WGS cases, and whole exome sequencing for three ETP ALL cases in the recurrence cohort. We identified 44 interchromosomal translocations (mean 4 per patient, range 0–12), 32 intrachromosomal translocations (mean 3, 0–7), 53 deletions (mean 4, 0–10) and 16 insertions (mean 1, 0–5). Three cases exhibited a pattern of complex rearrangements suggestive of a single cellular catastrophe (“chromothripsis”), two of which had mutations targeting mismatch and DNA repair (MLH3 and DCLRE1C). While no single chromosomal alteration was present in all cases, 10 of 12 ETP ALLs harbored chromosomal rearrangements, several of which involved complex multichromosomal translocations and resulted in the expression of chimeric in-frame novel fusion genes disrupting hematopoietic regulators, including ETV6-INO80D, NAP1L1-MLLT10, RUNX1-EVX1 and NUP214-SQSTM1, each occurring in a single case. An additional ETP case with the ETV6-INO80D fusion was identified in the recurrence cohort. Additionally, 51% of structural variants had breakpoints in genes, including those with roles in hematopoiesis and leukemogenesis, and genes also targeted by mutation in other cases (MLH3, SUZ12, RUNX1). We identified a high frequency of activating mutations in genes regulating cytokine receptor and Ras signalling in ETP ALL (67.2% of ETP compared to 19% of non-ETP T-ALL) including NRAS (17%), FLT3 (14%), JAK3 (9%), SH2B3 (or LNK; 9%), IL7R (8%), JAK1 (8%), KRAS (3%), and BRAF (2%). Seven cases (5 ETP, 2 non-ETP) harbored in frame insertion mutations in the transmembrane domain of IL7R, which were transforming when expressed in the murine cell lines, and resulted in enhanced colony formation when expressed in primary murine hematopoietic cells. The IL7R mutations resulted in constitutive Jak-Stat activation in these cell lines and primary leukemic cells expressing these mutations. Fifty-eight percent of ETP cases (compared to 17% of non-ETP cases) harbored mutations known or predicted to disrupt hematopoietic and lymphoid development, including ETV6 (33%), RUNX1 (16%), IKZF1 (14%), GATA3 (10%), EP300 (5%) and GATA2 (2%). GATA3 regulates early T cell development, and mutations in this gene were observed exclusively in ETP ALL. The mutations were commonly biallelic, and were clustered at R276, a residue critical for binding of GATA3 to DNA. Strikingly, mutations disrupting chromatin modifying genes were also highly enriched in ETP ALL. Genes encoding the the polycomb repressor complex 2 (EZH2, SUZ12 and EED), that mediates histone 3 lysine 27 (H3K27) trimethylation were deleted or mutated in 42% of ETP ALL compared to 12% of non-ETP T-ALL. In addition, alterations of the H3K36 trimethylase SETD2 were observed in 5 ETP cases, but not in non-ETP ALL. We also identified recurrent mutations in genes that have not previously been implicated in hematopoietic malignancies including RELN, DNM2, ECT2L, HNRNPA1 and HNRNPR. Using gene set enrichment analysis we demonstrate that the gene expression profile of ETP ALL shares features not only with normal human hematopoietic stem cells, but also with leukemic initiating cells (LIC) purified from patients with acute myeloid leukemia (AML). These results indicate that mutations that drive proliferation, impair differentiation and disrupt histone modification cooperate to induce an aggressive leukemia with an aberrant immature phenotype. The similarity of the gene expression pattern with that observed in the LIC of AML raises the possibility that myeloid-directed therapies might improve the outcome of ETP ALL. Disclosures: Evans: St. Jude Children's research Hospital: Employment, Patents & Royalties; NIH & NCI: Research Funding; Aldagen: Membership on an entity's Board of Directors or advisory committees.


2014 ◽  
Vol 36 (1) ◽  
pp. 118-128 ◽  
Author(s):  
Carl Mårten Lindqvist ◽  
Jessica Nordlund ◽  
Diana Ekman ◽  
Anna Johansson ◽  
Behrooz Torabi Moghadam ◽  
...  

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Jessica Nordlund ◽  
Yanara Marincevic-Zuniga ◽  
Lucia Cavelier ◽  
Amanda Raine ◽  
Tom Martin ◽  
...  

AbstractStructural chromosomal rearrangements that can lead to in-frame gene-fusions are a leading source of information for diagnosis, risk stratification, and prognosis in pediatric acute lymphoblastic leukemia (ALL). Traditional methods such as karyotyping and FISH struggle to accurately identify and phase such large-scale chromosomal aberrations in ALL genomes. We therefore evaluated linked-read WGS for detecting chromosomal rearrangements in primary samples of from 12 patients diagnosed with ALL. We assessed the effect of input DNA quality on phased haplotype block size and the detectability of copy number aberrations and structural variants in the ALL genomes. We found that biobanked DNA isolated by standard column-based extraction methods was sufficient to detect chromosomal rearrangements even at low 10x sequencing coverage. Linked-read WGS enabled precise, allele-specific, digital karyotyping at a base-pair resolution for a wide range of structural variants including complex rearrangements and aneuploidy assessment. With use of haplotype information from the linked-reads, we also identified previously unknown structural variants, such as a compound heterozygous deletion of ERG in a patient with the DUX4-IGH fusion gene. We conclude that linked-read WGS allows detection of important pathogenic variants in ALL genomes at a resolution beyond that of traditional karyotyping and FISH.


2018 ◽  
Author(s):  
Alba Sanchis-Juan ◽  
Jonathan Stephens ◽  
Courtney E French ◽  
Nicholas Gleadall ◽  
Karyn Mégy ◽  
...  

AbstractComplex structural variants (cxSVs) are genomic rearrangements comprising multiple structural variants, typically involving three or more breakpoint junctions. They contribute to human genomic variation and can cause Mendelian disease, however they are not typically considered during genetic testing. Here, we investigate the role of cxSVs in Mendelian disease using short-read whole genome sequencing (WGS) data from 1,324 individuals with neurodevelopmental or retinal disorders from the NIHR BioResource project. We present four cases of individuals with a cxSV affecting Mendelian disease-associated genes. Three of the cxSVs are pathogenic: a de novo duplication-inversion-inversion-deletion affecting ARID1B in an individual with Coffin-Siris syndrome, a deletion-inversion-duplication affecting HNRNPU in an individual with intellectual disability and seizures, and a homozygous deletion-inversion-deletion affecting CEP78 in an individual with cone-rod dystrophy. Additionally, we identified a de novo duplication-inversion-duplication overlapping CDKL5 in an individual with neonatal hypoxic-ischaemic encephalopathy. Long-read sequencing technology used to resolve the breakpoints demonstrated the presence of both a disrupted and an intact copy of CDKL5 on the same allele; therefore, it was classified as a variant of uncertain significance. Analysis of sequence flanking all breakpoint junctions in all the cxSVs revealed both microhomology and longer repetitive sequences, suggesting both replication and homology based processes. Accurate resolution of cxSVs is essential for clinical interpretation, and here we demonstrate that long-read WGS is a powerful technology by which to achieve this. Our results show cxSVs are an important although rare cause of Mendelian disease, and we therefore recommend their consideration during research and clinical investigations.


Author(s):  
Yongzhuang Liu ◽  
Yalin Huang ◽  
Guohua Wang ◽  
Yadong Wang

Abstract Short read whole genome sequencing has become widely used to detect structural variants in human genetic studies and clinical practices. However, accurate detection of structural variants is a challenging task. Especially existing structural variant detection approaches produce a large proportion of incorrect calls, so effective structural variant filtering approaches are urgently needed. In this study, we propose a novel deep learning-based approach, DeepSVFilter, for filtering structural variants in short read whole genome sequencing data. DeepSVFilter encodes structural variant signals in the read alignments as images and adopts the transfer learning with pre-trained convolutional neural networks as the classification models, which are trained on the well-characterized samples with known high confidence structural variants. We use two well-characterized samples to demonstrate DeepSVFilter’s performance and its filtering effect coupled with commonly used structural variant detection approaches. The software DeepSVFilter is implemented using Python and freely available from the website at https://github.com/yongzhuang/DeepSVFilter.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Tatiana Maroilley ◽  
Xiao Li ◽  
Matthew Oldach ◽  
Francesca Jean ◽  
Susan J. Stasiuk ◽  
...  

AbstractGenomic rearrangements cause congenital disorders, cancer, and complex diseases in human. Yet, they are still understudied in rare diseases because their detection is challenging, despite the advent of whole genome sequencing (WGS) technologies. Short-read (srWGS) and long-read WGS approaches are regularly compared, and the latter is commonly recommended in studies focusing on genomic rearrangements. However, srWGS is currently the most economical, accurate, and widely supported technology. In Caenorhabditis elegans (C. elegans), such variants, induced by various mutagenesis processes, have been used for decades to balance large genomic regions by preventing chromosomal crossover events and allowing the maintenance of lethal mutations. Interestingly, those chromosomal rearrangements have rarely been characterized on a molecular level. To evaluate the ability of srWGS to detect various types of complex genomic rearrangements, we sequenced three balancer strains using short-read Illumina technology. As we experimentally validated the breakpoints uncovered by srWGS, we showed that, by combining several types of analyses, srWGS enables the detection of a reciprocal translocation (eT1), a free duplication (sDp3), a large deletion (sC4), and chromoanagenesis events. Thus, applying srWGS to decipher real complex genomic rearrangements in model organisms may help designing efficient bioinformatics pipelines with systematic detection of complex rearrangements in human genomes.


2015 ◽  
Vol 117 (suppl_1) ◽  
Author(s):  
Matthew Wheeler ◽  
Daryl Waggott ◽  
Megan Grove ◽  
Frederick Dewey ◽  
Cuiping Pan ◽  
...  

Background: Technological advances have greatly reduced the cost of whole genome sequencing. For single individuals clinical application is apparent, while exome sequencing in tens of thousands of people has allowed a more global view of genetic variation that can inform interpretation of specific variants in individuals. We hypothesized that genome sequencing of patients with monogenic cardiomyopathy would facilitate discovery of genetic modifiers of phenotype. Methods and Results: We identified 48 individuals diagnosed with cardiomyopathy and with putative mutations in MYH7, the gene encoding beta myosin heavy chain. We carried out whole genome sequencing and applied a newly developed analytical pipeline optimized for discovery of genes modifying severity of clinical presentation and outcomes. Using a combination of external priors and rare variant burden tests we scored genes as potential modifiers. There were 96 genes that reached a modifier score of 6 out of 12 or better (9=2, 8=8, 7=17, 6=69). We identified NCKAP1, a gene that regulates actin filament dynamics, and CAMSAP1, a calmodulin regulate gene that regulates microtubule dynamics, as top scoring modifiers of hypertrophic cardiomyopathy phenotypes (score=9) while LDB2, RYR2, FBN1 and ATP1A2 had modifier scores of 8. Of the top scoring genes, 21 out of 96 were identified as candidates a priori. Our candidate prioritization scheme identified the previously described modifiers of cardiomyopathy phenotype, FHOD3 and MYBPC3, as top scoring genes. We identified structural variants in 21 clinically sequenced cardiomyopathy associated genes, 13 of which were at less than 10% frequency. Copy number variants in ILK and CSRP3 were nominally associated with ejection fraction (p=0.03), while 8 genes showed copy gains (GLA, FKTN, SGCD, TTN, SOS1, ANKRD1, VCL and NEBL). Structural variants were found in CSRP3, MYL3 and TNNC1, all of which have been implicated as causative for HCM. Conclusion: Evaluation of the whole genome sequence, even in the case of putatively monogenic disease, leads to important diagnostic and scientific insights not revealed by panel-based sequencing.


Sign in / Sign up

Export Citation Format

Share Document