complex structural variation
Recently Published Documents


TOTAL DOCUMENTS

14
(FIVE YEARS 0)

H-INDEX

6
(FIVE YEARS 0)

Author(s):  
Gonzalo Lopez ◽  
Laura E Egolf ◽  
Federico M Giorgi ◽  
Sharon J Diskin ◽  
Adam A Margolin

Abstract Motivation Despite widespread prevalence of somatic structural variations (SVs) across most tumor types, understanding of their molecular implications often remains poor. SVs are extremely heterogeneous in size and complexity, hindering the interpretation of their pathogenic role. Tools integrating large SV datasets across platforms are required to fully characterize the cancer’s somatic landscape. Results svpluscnv R package is a swiss army knife for the integration and interpretation of orthogonal datasets including copy number variant segmentation profiles and sequencing-based structural variant calls. The package implements analysis and visualization tools to evaluate chromosomal instability and ploidy, identify genes harboring recurrent SVs and detects complex rearrangements such as chromothripsis and chromoplexia. Further, it allows systematic identification of hot-spot shattered genomic regions, showing reproducibility across alternative detection methods and datasets. Availability and implementation https://github.com/ccbiolab/svpluscnv. Contact [email protected] Supplementary information Supplementary data are available at Bioinformatics online.


Cell ◽  
2020 ◽  
Vol 183 (1) ◽  
pp. 197-210.e32 ◽  
Author(s):  
Kevin Hadi ◽  
Xiaotong Yao ◽  
Julie M. Behr ◽  
Aditya Deshpande ◽  
Charalampos Xanthopoulakis ◽  
...  

2020 ◽  
Vol 85 (1) ◽  
pp. 7-17
Author(s):  
Walid Algady ◽  
Eleanor Weyell ◽  
Daria Mateja ◽  
André Garcia ◽  
David Courtin ◽  
...  

2019 ◽  
Author(s):  
Kevin Hadi ◽  
Xiaotong Yao ◽  
Julie M. Behr ◽  
Aditya Deshpande ◽  
Charalampos Xanthopoulakis ◽  
...  

SummaryCancer genomes often harbor hundreds of somatic DNA rearrangement junctions, many of which cannot be easily classified into simple (e.g. deletion, translocation) or complex (e.g. chromothripsis, chromoplexy) structural variant classes. Applying a novel genome graph computational paradigm to analyze the topology of junction copy number (JCN) across 2,833 tumor whole genome sequences (WGS), we introduce three complex rearrangement phenomena: pyrgo, rigma, and tyfonas. Pyrgo are “towers” of low-JCN duplications associated with early replicating regions and superenhancers, and are enriched in breast and ovarian cancers. Rigma comprise “chasms” of low-JCN deletions at late-replicating fragile sites in esophageal and other gastrointestinal (GI) adenocarcinomas. Tyfonas are “typhoons” of high-JCN junctions and fold back inversions that are enriched in acral but not cutaneous melanoma and associated with a previously uncharacterized mutational process of non-APOBEC kataegis. Clustering of tumors according to genome graph-derived features identifies subgroups associated with DNA repair defects and poor prognosis.


2019 ◽  
Author(s):  
Walid Algady ◽  
Eleanor Weyell ◽  
Daria Mateja ◽  
André Garcia ◽  
David Courtin ◽  
...  

AbstractStructural variation in the human genome can affect risk of disease. An example is a complex structural variant of the human glycophorin gene cluster, called DUP4, which is associated with a clinically-significant level of protection against severe malaria. The human glycophorin gene cluster harbours at least 23 distinct structural variants and accurate genotyping of this complex structural variation remains a challenge. Here, we use a PCR-based strategy to genotype structural variation at the human glycophorin gene cluster. We validate our approach, based on a triplex paralogue ratio test (PRT) combined with junction-fragment specific PCR, on publically-available samples from the 1000 Genomes project. We then genotype a longitudinal birth cohort using small amounts of DNA at low cost. Our approach readily identifies known deletions and duplications, and can potentially identify novel variants for further analysis. It will allow exploration of genetic variation at the glycophorin locus, and investigation of its relationship with malaria, in large sample sets at minimal cost, using standard molecular biology equipment.


2018 ◽  
Author(s):  
István Prazsák ◽  
Norbert Moldován ◽  
Dóra Tombácz ◽  
Klára Megyeri ◽  
Attila Szűcs ◽  
...  

AbstractBackgroundVaricella zoster virus (VZV) is a human pathogenic alphaherpesvirus harboring a relatively large DNA molecule. The VZV transcriptome has already been analyzed by microarray and short-read sequencing analyses. However, both approaches have substantial limitations when used for structural characterization of transcript isoforms, even if supplemented with primer extension or other techniques. Among others, they are inefficient in distinguishing between embedded RNA molecules, transcript isoforms, including splice and length variants, as well as between alternative polycistronic transcripts. It has been demonstrated in several studies that long-read sequencing is able to circumvent these problems.ResultsIn this work, we report the analysis of VZV lytic transcriptome using the Oxford Nanopore Technologies sequencing platform. These investigations have led to the identification of 114 novel transcripts, including mRNAs, non-coding RNAs, polycistronic RNAs and complex transcripts, as well as 10 novel spliced transcripts and 27 novel transcription start site isoforms and transcription end site isoforms. A novel class of transcripts, the nroRNAs are described in this study. These transcripts are encoded by the genomic region located in close vicinity to the viral replication origin. We also show that the VZV latency transcript (VLT) exhibits a more complex structural variation than formerly believed. Additionally, we have detected RNA editing in a novel non-coding RNA molecule.ConclusionsOur investigations disclosed a composite transcriptomic architecture of VZV, including the discovery of novel RNA molecules and transcript isoforms, as well as a complex meshwork of transcriptional read-throughs and overlaps. The results represent a substantial advance in the annotation VZV transcriptome and in understanding the molecular biology of the herpesviruses in general.


2018 ◽  
Author(s):  
Paul Griffith ◽  
Castle Raley ◽  
David Sun ◽  
Yongmei Zhao ◽  
Zhonghe Sun ◽  
...  

AbstractPacific Biosciences’ (PacBio) RS II sequencer, utilizing Single-Molecule, Real-Time (SMRT) technology, has revolutionized next-generation sequencing by providing an accurate long-read platform. PacBio single-molecule long reads have been used to delineate complex spliceoforms, detect mutations in highly homologous sequences, identify mRNA chimeras and chromosomal translocations, accurately haplotype phasing over multiple kilobase distances and aid in assembly of genomes with complex structural variation. The PacBio protocol for preparation of sequencing templates employs blunt-end hairpin adapter ligation, which enables a short turnaround time for sequence production. However, we have found a significant portion of sequencing yield contains chimeric reads resulting from blunt-end ligation of multiple template molecules to each other prior to adapter ligation. These artefactual fusion DNA sequences pose a major challenge to analysis and can lead to false-positive detection of fusion events. We assessed the frequency of artefactual fusion when using blunt-end adapter ligation and compared it to an alternative method using A/T overhang adapter ligation. The A/T overhang adapter ligation method showed a vast improvement in limiting artefactual fusion events and is now our recommended procedure for adapter ligation during PacBio library preparation.


2017 ◽  
Author(s):  
Joseph G. Arthur ◽  
Xi Chen ◽  
Bo Zhou ◽  
Alexander E. Urban ◽  
Wing Hung Wong

AbstractDetecting structural variants (SVs) from sequencing data is key to genome analysis, but methods using standard whole-genome sequencing (WGS) data are typically incapable of resolving complex SVs with multiple co-located breakpoints. We introduce the ARC-SV method, which uses a probabilistic model to detect arbitrary local rearrangements from WGS data. Our method performs well on simple SVs while surpassing state-of-the-art methods in complex SV detection.


2017 ◽  
Author(s):  
Chuan Cao ◽  
Rodrigo Cogni ◽  
Vincent Barbier ◽  
Francis M. Jiggins

AbstractIt is common to find that major-effect genes are an important cause of variation in susceptibility to infection. Here we have characterised natural variation in a gene called pastrel that explains over half of the genetic variance in susceptibility to the virus DCV in populations of Drosophila melanogaster. We found extensive allelic heterogeneity, with a sample of seven alleles of pastrel from around the world conferring four phenotypically distinct levels of resistance. By modifying candidate SNPs in transgenic flies, we show that the largest effect is caused by an amino acid polymorphism that arose when an ancestral threonine was mutated to alanine, greatly increasing resistance to DCV. Overexpression of the ancestral susceptible allele provides strong protection against DCV, indicating that this mutation acted to improve an existing restriction factor. The pastrel locus also contains complex structural variation and cis-regulatory polymorphisms altering gene expression. We find that higher expression of pastrel is associated with increased survival after DCV infection. To understand why this variation is maintained in populations, we investigated genetic variation surrounding the amino acid variant that is causing flies to be resistant. We found no evidence of natural selection causing either recent changes in allele frequency or geographical variation in frequency, suggesting that this is an old polymorphism that has been maintained at a stable frequency. Overall, our data demonstrate how complex genetic variation at a single locus can control susceptibility to a virulent natural pathogen.


2017 ◽  
Vol 18 (1) ◽  
Author(s):  
Ryan L. Collins ◽  
Harrison Brand ◽  
Claire E. Redin ◽  
Carrie Hanscom ◽  
Caroline Antolik ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document