scholarly journals annoFuse: an R Package to annotate, prioritize, and interactively explore putative oncogenic RNA fusions

2019 ◽  
Author(s):  
Krutika S. Gaonkar ◽  
Federico Marini ◽  
Komal S. Rathi ◽  
Payal Jain ◽  
Yuankun Zhu ◽  
...  

AbstractBackgroundGene fusion events are a significant source of somatic variation across adult and pediatric cancers and are some of the most clinically-effective therapeutic targets, yet low consensus of RNA-Seq fusion prediction algorithms makes therapeutic prioritization difficult. In addition, events such as polymerase read-throughs, mis-mapping due to gene homology, and fusions occurring in healthy normal tissue require informed filtering, making it difficult for researchers and clinicians to rapidly discern gene fusions that might be true underlying oncogenic drivers of a tumor and in some cases, appropriate targets for therapy.ResultsWe developed annoFuse, an R package, and shinyFuse, a companion web application, to annotate, prioritize, and explore biologically-relevant expressed gene fusions, downstream of fusion calling. We validated annoFuse using a random cohort of TCGA RNA-Seq samples (N = 160) and achieved a 96% sensitivity for retention of high-confidence fusions (N = 603). annoFuse uses FusionAnnotator annotations to filter non-oncogenic and/or artifactual fusions. Then, fusions are prioritized if previously reported in TCGA and/or fusions containing gene partners that are known oncogenes, tumor suppressor genes, COSMIC genes, and/or transcription factors. We applied annoFuse to fusion calls from pediatric brain tumor RNA-Seq samples (N = 1,028) provided as part of the Open Pediatric Brain Tumor Atlas (OpenPBTA) Project to determine recurrent fusions and recurrently-fused genes within different brain tumor histologies. annoFuse annotates protein domains using the PFAM database, assesses reciprocality, and annotates gene partners for kinase domain retention. As a standard function, reportFuse enables generation of a reproducible R Markdown report to summarize filtered fusions, visualize breakpoints and protein domains by transcript, and plot recurrent fusions within cohorts. Finally, we created shinyFuse for algorithm-agnostic interactive exploration and plotting of gene fusions.ConclusionsannoFuse provides standardized filtering and annotation for gene fusion calls from STARFusion and Arriba by merging, filtering, and prioritizing putative oncogenic fusions across large cancer datasets, as demonstrated here with data from the OpenPBTA project. We are expanding the package to be widely-applicable to other fusion algorithms and expect annoFuse to provide researchers a method for rapidly evaluating, prioritizing, and translating fusion findings in patient tumors.

2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Krutika S. Gaonkar ◽  
Federico Marini ◽  
Komal S. Rathi ◽  
Payal Jain ◽  
Yuankun Zhu ◽  
...  

Abstract Background Gene fusion events are significant sources of somatic variation across adult and pediatric cancers and are some of the most clinically-effective therapeutic targets, yet low consensus of RNA-Seq fusion prediction algorithms makes therapeutic prioritization difficult. In addition, events such as polymerase read-throughs, mis-mapping due to gene homology, and fusions occurring in healthy normal tissue require informed filtering, making it difficult for researchers and clinicians to rapidly discern gene fusions that might be true underlying oncogenic drivers of a tumor and in some cases, appropriate targets for therapy. Results We developed annoFuse, an R package, and shinyFuse, a companion web application, to annotate, prioritize, and explore biologically-relevant expressed gene fusions, downstream of fusion calling. We validated annoFuse using a random cohort of TCGA RNA-Seq samples (N = 160) and achieved a 96% sensitivity for retention of high-confidence fusions (N = 603). annoFuse uses FusionAnnotator annotations to filter non-oncogenic and/or artifactual fusions. Then, fusions are prioritized if previously reported in TCGA and/or fusions containing gene partners that are known oncogenes, tumor suppressor genes, COSMIC genes, and/or transcription factors. We applied annoFuse to fusion calls from pediatric brain tumor RNA-Seq samples (N = 1028) provided as part of the Open Pediatric Brain Tumor Atlas (OpenPBTA) Project to determine recurrent fusions and recurrently-fused genes within different brain tumor histologies. annoFuse annotates protein domains using the PFAM database, assesses reciprocality, and annotates gene partners for kinase domain retention. As a standard function, reportFuse enables generation of a reproducible R Markdown report to summarize filtered fusions, visualize breakpoints and protein domains by transcript, and plot recurrent fusions within cohorts. Finally, we created shinyFuse for algorithm-agnostic interactive exploration and plotting of gene fusions. Conclusions annoFuse provides standardized filtering and annotation for gene fusion calls from STAR-Fusion and Arriba by merging, filtering, and prioritizing putative oncogenic fusions across large cancer datasets, as demonstrated here with data from the OpenPBTA project. We are expanding the package to be widely-applicable to other fusion algorithms and expect annoFuse to provide researchers a method for rapidly evaluating, prioritizing, and translating fusion findings in patient tumors.


2016 ◽  
Vol 17 (1) ◽  
Author(s):  
Mohammad Tanvir Ahamed ◽  
Anna Danielsson ◽  
Szilárd Nemes ◽  
Helena Carén

2012 ◽  
Vol 224 (06) ◽  
Author(s):  
T Milde ◽  
M Zucknick ◽  
M Kool ◽  
A Korshunov ◽  
H Witt ◽  
...  

2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Yiqun Zhang ◽  
Fengju Chen ◽  
Lawrence A. Donehower ◽  
Michael E. Scheurer ◽  
Chad J. Creighton

AbstractThe global impact of somatic structural variants (SSVs) on gene expression in pediatric brain tumors has not been thoroughly characterised. Here, using whole-genome and RNA sequencing from 854 tumors of more than 30 different types from the Children’s Brain Tumor Tissue Consortium, we report the altered expression of hundreds of genes in association with the presence of nearby SSV breakpoints. SSV-mediated expression changes involve gene fusions, altered cis-regulation, or gene disruption. SSVs considerably extend the numbers of patients with tumors somatically altered for critical pathways, including receptor tyrosine kinases (KRAS, MET, EGFR, NF1), Rb pathway (CDK4), TERT, MYC family (MYC, MYCN, MYB), and HIPPO (NF2). Compared to initial tumors, progressive or recurrent tumors involve a distinct set of SSV-gene associations. High overall SSV burden associates with TP53 mutations, histone H3.3 gene H3F3C mutations, and the transcription of DNA damage response genes. Compared to adult cancers, pediatric brain tumors would involve a different set of genes with SSV-altered cis-regulation. Our comprehensive and pan-histology genomic analyses reveal SSVs to play a major role in shaping the transcriptome of pediatric brain tumors.


2021 ◽  
Vol 37 (3) ◽  
pp. 204-206
Author(s):  
Carolina Nör ◽  
Vijay Ramaswamy

Sign in / Sign up

Export Citation Format

Share Document