scholarly journals Single-cell whole-genome sequencing reveals the functional landscape of somatic mutations in B lymphocytes across the human lifespan

2019 ◽  
Author(s):  
Lei Zhang ◽  
Xiao Dong ◽  
Moonsook Lee ◽  
Alexander Y. Maslov ◽  
Tao Wang ◽  
...  

Introductory paragraphThe accumulation of mutations in somatic cells have been implicated as a cause of ageing since the 1950s1,2. Yet, attempts to establish a causal relationship between somatic mutations and ageing have been constrained by the lack of methods to directly identify mutational events in primary human tissues. Here we provide detailed, genome-wide mutation frequencies and spectra of human B lymphocytes from healthy individuals across the entire human lifespan, from newborns to centenarians, using a recently developed, highly accurate single-cell whole-genome sequencing method3. We found that the number of somatic mutations increases from <500 per cell in newborns to >3,000 per cell in centenarians. We discovered mutational hotspot regions, some of which, as expected, located at immunoglobulin genes associated with somatic hypermutation. B cell-specific mutation signatures were observed associated with development, ageing or somatic hypermutation (SHM). The SHM signature strongly correlated with the signature found in human chronic lymphocytic leukemia and malignant B-cell lymphomas4, indicating that even in B cells of healthy individuals the potential cancer-causing events are already present. We also identified multiple mutations in sequence features relevant to cellular function, i.e., transcribed genes and gene regulatory regions. Such mutations increased significantly during ageing, but only at approximately half the rate of the genome average, indicating selection against mutations that impact B cell function. This first full characterization of the landscape of somatic mutations in human B lymphocytes indicates that spontaneous somatic mutations accumulating with age can be deleterious and may contribute to both the increased risk for leukemia and the functional decline of B lymphocytes in the elderly.

2019 ◽  
Vol 116 (18) ◽  
pp. 9014-9019 ◽  
Author(s):  
Lei Zhang ◽  
Xiao Dong ◽  
Moonsook Lee ◽  
Alexander Y. Maslov ◽  
Tao Wang ◽  
...  

Accumulation of mutations in somatic cells has been implicated as a cause of aging since the 1950s. However, attempts to establish a causal relationship between somatic mutations and aging have been constrained by the lack of methods to directly identify mutational events in primary human tissues. Here we provide genome-wide mutation frequencies and spectra of human B lymphocytes from healthy individuals across the entire human lifespan using a highly accurate single-cell whole-genome sequencing method. We found that the number of somatic mutations increases from <500 per cell in newborns to >3,000 per cell in centenarians. We discovered mutational hotspot regions, some of which, as expected, were located at Ig genes associated with somatic hypermutation (SHM). B cell–specific mutation signatures associated with development, aging, or SHM were found. The SHM signature strongly correlated with the signature found in human B cell tumors, indicating that potential cancer-causing events are already present even in B cells of healthy individuals. We also identified multiple mutations in sequence features relevant to cellular function (i.e., transcribed genes and gene regulatory regions). Such mutations increased significantly during aging, but only at approximately one-half the rate of the genome average, indicating selection against mutations that impact B cell function. This full characterization of the landscape of somatic mutations in human B lymphocytes indicates that spontaneous somatic mutations accumulating with age can be deleterious and may contribute to both the increased risk for leukemia and the functional decline of B lymphocytes in the elderly.


2019 ◽  
Author(s):  
Lei Zhang ◽  
Xiao Dong ◽  
Xiaoxiao Hao ◽  
Moonsook Lee ◽  
Zhongxuan Chi ◽  
...  

AbstractDNA Mismatch repair (MMR) deficiency is a major cause of hereditary non-polyposis colorectal cancer, and is also associated with increased risk of several other cancers. This is generally ascribed to the role of MMR in avoiding mutations by correcting DNA replication errors. In MMR knockout mice very high frequencies of somatic mutations, up until 100-fold of background, have been reported. However, these results have been obtained using bacterial reporter transgenes, which are not representative for the genome overall, and mutational patterns of MMR deficiency remain largely unknown. To fill this knowledge gap, we performed single-cell whole-genome sequencing of lung fibroblasts of Msh2−/− and wild-type mice. We observed a 4-fold increase of somatic single nucleotide variants (SNVs) in the fibroblasts of Msh2−/− mice compared to those of wild-type mice. The SNV signature of Msh2 deficiency was found to be driven by C>T and T>C transitions. By comparing it to human cancer signatures, we not only confirmed the inferred MMR-deficiency-related etiology of several cancer signatures but also suggested that MMR deficiency is likely the cause of a cancer signature with its etiology previously unknown. We also observed a 7-fold increase of somatic small insertions and deletions (INDELs) in the Msh2−/− mice. An elevated INDEL frequency has also been found in human MMR-related cancers. INDELs and SNVs distributed differently across genomic features in the Msh2−/− and control cells, with evidence of selection pressure and repair preference. These results provide insights into the landscape of somatic mutations in normal somatic cells caused by MMR deficiency.SignificanceOur results show that MMR deficiency in the mouse is associated with a much lower elevation of somatic mutation rates than previously reported and provides the first MMR whole-genome mutational landscapes in normal somatic cells in vivo.


2020 ◽  
Vol 22 (Supplement_3) ◽  
pp. iii408-iii408
Author(s):  
Marina Danilenko ◽  
Masood Zaka ◽  
Claire Keeling ◽  
Stephen Crosier ◽  
Rafiqul Hussain ◽  
...  

Abstract Medulloblastomas harbor clinically-significant intra-tumoral heterogeneity for key biomarkers (e.g. MYC/MYCN, β-catenin). Recent studies have characterized transcriptional heterogeneity at the single-cell level, however the underlying genomic copy number and mutational architecture remains to be resolved. We therefore sought to establish the intra-tumoural genomic heterogeneity of medulloblastoma at single-cell resolution. Copy number patterns were dissected by whole-genome sequencing in 1024 single cells isolated from multiple distinct tumour regions within 16 snap-frozen medulloblastomas, representing the major molecular subgroups (WNT, SHH, Group3, Group4) and genotypes (i.e. MYC amplification, TP53 mutation). Common copy number driver and subclonal events were identified, providing clear evidence of copy number evolution in medulloblastoma development. Moreover, subclonal whole-arm and focal copy number alterations covering important genomic loci (e.g. on chr10 of SHH patients) were detected in single tumour cells, yet undetectable at the bulk-tumor level. Spatial copy number heterogeneity was also common, with differences between clonal and subclonal events detected in distinct regions of individual tumours. Mutational analysis of the cells allowed dissection of spatial and clonal heterogeneity patterns for key medulloblastoma mutations (e.g. CTNNB1, TP53, SMARCA4, PTCH1) within our cohort. Integrated copy number and mutational analysis is underway to establish their inter-relationships and relative contributions to clonal evolution during tumourigenesis. In summary, single-cell analysis has enabled the resolution of common mutational and copy number drivers, alongside sub-clonal events and distinct patterns of clonal and spatial evolution, in medulloblastoma development. We anticipate these findings will provide a critical foundation for future improved biomarker selection, and the development of targeted therapies.


PLoS ONE ◽  
2020 ◽  
Vol 15 (8) ◽  
pp. e0238183
Author(s):  
Alana Sparks ◽  
J. Paul Woods ◽  
Dorothee Bienzle ◽  
Geoffrey A. Wood ◽  
Brenda Lynn Coomber

2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Lydia Y. Liu ◽  
Vinayak Bhandari ◽  
Adriana Salcedo ◽  
Shadrielle M. G. Espiritu ◽  
Quaid D. Morris ◽  
...  

AbstractWhole-genome sequencing can be used to estimate subclonal populations in tumours and this intra-tumoural heterogeneity is linked to clinical outcomes. Many algorithms have been developed for subclonal reconstruction, but their variabilities and consistencies are largely unknown. We evaluate sixteen pipelines for reconstructing the evolutionary histories of 293 localized prostate cancers from single samples, and eighteen pipelines for the reconstruction of 10 tumours with multi-region sampling. We show that predictions of subclonal architecture and timing of somatic mutations vary extensively across pipelines. Pipelines show consistent types of biases, with those incorporating SomaticSniper and Battenberg preferentially predicting homogenous cancer cell populations and those using MuTect tending to predict multiple populations of cancer cells. Subclonal reconstructions using multi-region sampling confirm that single-sample reconstructions systematically underestimate intra-tumoural heterogeneity, predicting on average fewer than half of the cancer cell populations identified by multi-region sequencing. Overall, these biases suggest caution in interpreting specific architectures and subclonal variants.


Blood ◽  
2009 ◽  
Vol 114 (22) ◽  
pp. 3965-3965
Author(s):  
Lukas D. Wartman ◽  
Li Ding ◽  
David E. Larson ◽  
Michael D. McLellan ◽  
Heather Schmidt ◽  
...  

Abstract Abstract 3965 Poster Board III-901 We have recently established that whole genome sequencing is a valid, unbiased approach that can identify novel candidate mutations that may be important for AML pathogenesis (Ley et al Nature 2008, Mardis et al NEJM 2009). Acute promyelocytic leukemia (APL, FAB M3 AML) is a subtype of AML characterized by the t(15;17)(q22;q11.2) translocation that creates an oncogenic fusion gene, PML-RARA. Our laboratory has previously modeled APL in a mouse in an effort to understand the genetic events that lead to the disease. In our knockin mouse model, a human PML-RARA cDNA was targeted to the 5' untranslated region of the mouse cathepsin G gene on chromosome 14 (mCG-PR). The targeting vector was transfected into the RW-4 embryonic stem cell line, derived from a 129/SvJ mouse. The transfected RW-4 cells were injected into C57Bl/6 blastocysts, and chimeric offspring were bred to C57Bl/6 mice. F1 129/SvJ x C57Bl/6 mice were subsequently backcrossed onto the B6/Taconic background for 10 generations before establishing a tumor watch. About 60% of the mCG-PR mice in the Bl/6 background develop a disease that closely resembles APL only after a latent period of 7-18 months, suggesting that additional progression mutations are required for APL development. Array-based genomic techniques (expression array studies and high resolution CGH) have revealed some recurring genetic alterations that may be relevant for progression (i.e. an interstitial deletion of chromosome 2, trisomy 15, etc.), but gene-specific progression mutations have not yet been identified. To begin to identify these mutations in an unbiased fashion, we sequenced a cytogenetically normal, diploid mouse APL genome using massively parallel DNA sequencing via the Illumina platform. Since the tumor arose in a highly inbred mouse strain, we predicted that 15x coverage of the genome (approximately 40 billion base pairs of sequence) would be necessary to identify >90% of the heterozygous somatic mutations. We generated 2 Illumina paired-end libraries (insert sizes of 300-350 bp and 550-600 bp) and generated 59.64 billion base pairs of sequence with 3 full sequencing runs; the reads that successfully mapped generated 15.6x coverage. The sequence data predicted 87,778 heterozygous Single Nucleotide Variants (SNVs) compared to the mouse C57Bl6/J reference sequence, and 23,439 homozygous SNVs. Of the predicted heterozygous SNVs, 695 were non-synonymous (missense or nonsense, or altering a canonical splice site). Thus far, 80 of these putative non-synonymous SNVs have been further analyzed using Sanger sequencing of the original tumor DNA vs. pooled B6/Taconic spleen DNA and pooled129/SvJ spleen DNA as controls. 37/80 were shown to be false positive calls, and 37 were inherited SNPs from residual regions of the129/SvJ genome. 6/80 were present only in the tumor genome, and were candidate somatic mutations. These 6 were screened in 89 additional murine APL tumor samples derived from the same mouse model. Mutations in the Jarid2 (L915I) and Capns2 (N149S) genes occurred only in the proband, and are therefore of uncertain significance. 4/6 mutations were found in additional samples; 3 of these mutations were derived from a common ancestor of the proband and the other affected mice, and were therefore not relevant for pathogenesis. The other recurring mutation was in the pseudokinase domain of JAK1 (V657F), and was identified in one other mouse that was not closely related to the proband. This mutation is orthologous to the known activating mutation V617F in human JAK2, and is identical to a recently described JAK1 pseudokinase domain mutation (V658F) found in human APL and T-ALL samples (EG Jeong et al, Clin Can Res 14: 3716, 2008). We are currently testing the functional significance of this mutation by expressing it in bone marrow cells derived from young WT vs. mCG-PR mice. In summary, unbiased whole genome sequencing of a mouse APL genome has identified a recurring mutation of JAK1 found in both human and mouse APL samples. This approach may allow us to rapidly identify progression mutations that are common to human and murine AML, and provides an important proof-of-concept that this mouse model of AML is functionally related to its human counterpart. Disclosures: No relevant conflicts of interest to declare.


Sign in / Sign up

Export Citation Format

Share Document