scholarly journals Using Growing Self-Organising Maps to Improve the Binning Process in Environmental Whole-Genome Shotgun Sequencing

2008 ◽  
Vol 2008 ◽  
pp. 1-10 ◽  
Author(s):  
Chon-Kit Kenneth Chan ◽  
Arthur L. Hsu ◽  
Sen-Lin Tang ◽  
Saman K. Halgamuge

Metagenomic projects using whole-genome shotgun (WGS) sequencing produces many unassembled DNA sequences and small contigs. The step of clustering these sequences, based on biological and molecular features, is called binning. A reported strategy for binning that combines oligonucleotide frequency and self-organising maps (SOM) shows high potential. We improve this strategy by identifying suitable training features, implementing a better clustering algorithm, and defining quantitative measures for assessing results. We investigated the suitability of each of di-, tri-, tetra-, and pentanucleotide frequencies. The results show that dinucleotide frequency is not a sufficiently strong signature for binning 10 kb long DNA sequences, compared to the other three. Furthermore, we observed that increased order of oligonucleotide frequency may deteriorate the assignment result in some cases, which indicates the possible existence of optimal species-specific oligonucleotide frequency. We replaced SOM with growing self-organising map (GSOM) where comparable results are obtained while gaining7%–15%speed improvement.

2021 ◽  
Vol 160 (6) ◽  
pp. S-569
Author(s):  
Manoj Dadlani ◽  
Kelly Moffat ◽  
Huai Li ◽  
Xin Zhou ◽  
Rita Colwell

2011 ◽  
Vol 193 (19) ◽  
pp. 5553-5554 ◽  
Author(s):  
W. Ghosh ◽  
A. George ◽  
A. Agarwal ◽  
P. Raj ◽  
M. Alam ◽  
...  

PLoS ONE ◽  
2016 ◽  
Vol 11 (5) ◽  
pp. e0155362 ◽  
Author(s):  
Emily Vogtmann ◽  
Xing Hua ◽  
Georg Zeller ◽  
Shinichi Sunagawa ◽  
Anita Y. Voigt ◽  
...  

2012 ◽  
Vol 194 (5) ◽  
pp. 1264-1265 ◽  
Author(s):  
J. B. Prajapati ◽  
C. D. Khedkar ◽  
J. Chitra ◽  
S. Suja ◽  
V. Mishra ◽  
...  

F1000Research ◽  
2015 ◽  
Vol 4 ◽  
pp. 170 ◽  
Author(s):  
Ted Kalbfleisch ◽  
Pamela Brock ◽  
Angela Snow ◽  
Deborah Neklason ◽  
Gordon Gowans ◽  
...  

Recently, deletions have been identified and published as causal for Familial Adenomatous Polyposis in the 1B promoter region of the APC gene.  Those deletions were measured using multiplex ligation-dependent probe amplification.  Here, we present and characterize an ~11kb deletion identified by whole genome shotgun sequencing.  The deletion occurred in a patient diagnosed with Familial Adenomatous Polyposis, and was located on chr5, between bases 112,034,824 and 112,045,845, fully encompassing the 1B promoter region of the APC gene.   Results are presented here that include the sequence evidence supporting the presence of the deletion as well as base level characterization of the deletion site.  These results demonstrate the capacity of whole genome sequencing for the detection of large structural variants in single individuals.


Sign in / Sign up

Export Citation Format

Share Document