High-throughput sequencing reveals extensive variation in human-specific L1 content in individual human genomes

2010 ◽ Vol 20 (9) ◽ pp. 1262-1270
Author(s): A. D. Ewing ◽ H. H. Kazazian

2021
Author(s): Ramesh Rajaby ◽ Yi Zhou ◽ Yifan Meng ◽ Xi Zeng ◽ Guoliang Li ◽ ...

Abstract A significant portion of human cancers is caused by viruses integrating into the human genome. Accurately predicting virus integrations can therefore help uncover the mechanisms that lead to many devastating diseases. Virus integrations can be called by analysing second-generation high-throughput sequencing datasets. Unfortunately, existing methods fail to report a significant portion of integrations while also predicting a large number of false positives. We observe that this inaccuracy is caused by incorrect alignment of reads in repetitive regions: false alignments create false positives, while missing alignments create false negatives. This paper proposes SurVirus, an improved virus integration caller that corrects the alignment of the reads that are crucial for discovering integrations. We use publicly available datasets to show that existing methods predict hundreds of thousands of false positives; SurVirus, on the other hand, is significantly more precise and also detects many novel integrations missed by other tools, most of which are in repetitive regions. We validate a subset of these novel integrations and find that the majority are correct. Using SurVirus, we find that HPV and HBV integrations are enriched in LINE and Satellite regions, which had been overlooked, and we discover recurrent HBV and HPV breakpoints in human genome-virus fusion transcripts.


Author(s):  
E.V. Korneenko ◽  
A.E. Samoilov ◽  
I.V. Artyushin ◽  
M.V. Safonova ◽  
...  

In this study we analyzed viral RNA in bat fecal samples collected in 2015 in the Moscow region (Zvenigorod district). To detect various virus families and genera, we used PCR amplification of viral genome fragments followed by high-throughput sequencing. A blastn search of unassembled reads revealed the presence of viruses from the families Astroviridae, Coronaviridae and Herpesviridae. Assembly with SPAdes 3.14 yielded contigs of 460–530 bp corresponding to genome fragments of Coronaviridae and Astroviridae. The taxonomy of the coronaviruses was determined to the genus level. We also showed that a single bat can be a reservoir of several virus genera. Thus, bats in the Moscow region were confirmed as reservoir hosts for potentially zoonotic viruses.

