Under-representation of repetitive sequences in whole-genome shotgun sequence databases: an illustration using a recently acquired transposable element

Akihiko Koga

doi:10.1139/g11-088

Genome ◽

10.1139/g11-088 ◽

2012 ◽

Vol 55 (2) ◽

pp. 172-175 ◽

Cited By ~ 3

Author(s):

Akihiko Koga

Keyword(s):

Southern Blot Analysis ◽

Repetitive Sequences ◽

Whole Genome Shotgun ◽

Whole Genome ◽

Sequence Database ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

A Genome ◽

Genome Shotgun Sequence ◽

Sequence Databases

It is widely accepted, at a conceptual level, that repetitive sequences, especially those with high sequence homogeneity among copies, tend to be under-represented in whole-genome shotgun sequence databases because sequence reads from them are difficult to assemble into contigs. Although this is easily inferred, no quantitative illustration of the phenomenon has been available. An example drawn from a database in current use should help convey how serious the under-representation is. The present study provides the first quantitative example (16 copies of virtually identical 4.7-kb sequences in a genome of 7 × 10^8 bp) by comparing the results of BLAST searches of a sequence database (contig N50: 9.8 kb) with those of Southern blot analysis of genomic DNA. The comparison revealed that the internal regions of the repetitive sequences are under-represented to a striking extent.

BLAST searches of the NCBI whole genome shotgun sequence database identify complement factor H-like sequences in lower primates

Molecular Immunology ◽

10.1016/j.molimm.2010.05.066 ◽

2010 ◽

Vol 47 (13) ◽

pp. 2217-2217

Author(s):

Lisa A. Kuttner-Kondo

Keyword(s):

Whole Genome Shotgun ◽

Factor H ◽

Complement Factor H ◽

Whole Genome ◽

Complement Factor ◽

Sequence Database ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Genome Shotgun Sequence

Whole-Genome Shotgun Sequence of Arthrospira platensis Strain Paraca, a Cultivated and Edible Cyanobacterium

Genome Announcements ◽

10.1128/genomea.00751-14 ◽

2014 ◽

Vol 2 (4) ◽

Cited By ~ 8

Author(s):

F. Lefort ◽

G. Calmin ◽

J. Crovadore ◽

J. Falquet ◽

J.-P. Hurni ◽

...

Keyword(s):

Arthrospira Platensis ◽

Whole Genome Shotgun ◽

Whole Genome ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Genome Shotgun Sequence

Whole-Genome Shotgun Sequence of Halomonas sp. Strain SBS 10, Isolated from a Hypersaline Lake in India

Microbiology Resource Announcements ◽

10.1128/mra.01270-19 ◽

2020 ◽

Vol 9 (1) ◽

Cited By ~ 1

Author(s):

Bijayendra Kushwaha ◽

Guru Prasad Sharma ◽

Anshul Sharma ◽

Prem Shankar ◽

Anjali Geethadevi ◽

...

Keyword(s):

Genome Size ◽

Gene Clusters ◽

Halophilic Bacterium ◽

Whole Genome Shotgun ◽

Hypersaline Lake ◽

Whole Genome ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Moderately Halophilic Bacterium ◽

Genome Shotgun Sequence

The whole-genome shotgun sequence of a moderately halophilic bacterium, Halomonas sp. strain SBS 10, was assembled and studied. The assembled genome size was 1.5 Mb, with a G+C content of 63.6%. The genome sequence of this Halomonas sp. SBS 10 isolate will be valuable in understanding gene clusters and functions involved in the adaptability of this bacterium to hypersaline conditions.

Whole-Genome Shotgun Sequence of Salmonella bongori, First Isolated in Northwestern Italy

Genome Announcements ◽

10.1128/genomea.00560-17 ◽

2017 ◽

Vol 5 (27) ◽

Author(s):

Angelo Romano ◽

Alberto Bellio ◽

Guerrino Macori ◽

Paul D. Cotter ◽

Daniela Manila Bianchi ◽

...

Keyword(s):

Genome Sequence ◽

Draft Genome ◽

Symptomatic Patient ◽

Whole Genome Shotgun ◽

Draft Genome Sequence ◽

Whole Genome ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Genome Shotgun Sequence

This study describes the whole-genome shotgun sequence of Salmonella bongori 48:z35:–, originally isolated from a 1-year-old symptomatic patient in northwest Italy, a typically nonendemic area. The draft genome sequence contained 4.56 Mbp and the G+C content was 51.27%.

Long-read, whole-genome shotgun sequence data for five model organisms

Scientific Data ◽

10.1038/sdata.2014.45 ◽

2014 ◽

Vol 1 (1) ◽

Cited By ~ 89

Author(s):

Kristi E Kim ◽

Paul Peluso ◽

Primo Babayan ◽

P. Jane Yeadon ◽

Charles Yu ◽

...

Keyword(s):

Sequence Data ◽

Whole Genome Shotgun ◽

Model Organisms ◽

Whole Genome ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Long Read ◽

Genome Shotgun Sequence

Whole-genome shotgun sequence assembly enables rapid gene characterization in the tropical fish barramundi, Lates calcarifer

Animal Genetics ◽

10.1111/age.12312 ◽

2015 ◽

Vol 46 (4) ◽

pp. 468-469 ◽

Cited By ~ 8

Author(s):

Jose A. Domingos ◽

Kyall R. Zenger ◽

Dean R. Jerry

Keyword(s):

Sequence Assembly ◽

Whole Genome Shotgun ◽

Tropical Fish ◽

Whole Genome ◽

Lates Calcarifer ◽

Gene Characterization ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Genome Shotgun Sequence

Applications of the double-barreled data in whole-genome shotgun sequence assembly and analysis

Science in China Series C Life Sciences ◽

10.1360/03yc0248 ◽

2005 ◽

Vol 48 (3) ◽

pp. 300

Author(s):

Yujun HAN

Keyword(s):

Sequence Assembly ◽

Whole Genome Shotgun ◽

Whole Genome ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Genome Shotgun Sequence

Annotated Whole-Genome Shotgun Sequence of Multidrug-Resistant Mycobacterium tuberculosis MTB13_M Isolated from Morocco

Genome Announcements ◽

10.1128/genomea.01756-16 ◽

2017 ◽

Vol 5 (9) ◽

Cited By ~ 1

Author(s):

L. Lahlou ◽

N. El Mrimar ◽

T. Alouane ◽

M. Laamarti ◽

S. Karti ◽

...

Keyword(s):

Mycobacterium Tuberculosis ◽

Genome Sequence ◽

Sputum Sample ◽

Multidrug Resistant ◽

Whole Genome Shotgun ◽

Whole Genome ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Genome Shotgun Sequence

Here, we describe the annotated genome sequence of Mycobacterium tuberculosis MTB13_M. The organism was isolated from a sputum sample in Morocco.

Organization and Evolution of Primate Centromeric DNA from Whole-Genome Shotgun Sequence Data

PLoS Computational Biology ◽

10.1371/journal.pcbi.0030181 ◽

2007 ◽

Vol 3 (9) ◽

pp. e181 ◽

Cited By ~ 57

Author(s):

Can Alkan ◽

Mario Ventura ◽

Nicoletta Archidiacono ◽

Mariano Rocchi ◽

S. Cenk Sahinalp ◽

...

Keyword(s):

Sequence Data ◽

Whole Genome Shotgun ◽

Whole Genome ◽

Centromeric DNA ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Genome Shotgun Sequence

First Whole-Genome Shotgun Sequence of a Promising Cellulase Secretor, Trichoderma koningiopsis Strain POS7

Genome Announcements ◽

10.1128/genomea.00823-17 ◽

2017 ◽

Vol 5 (37) ◽

Cited By ~ 2

Author(s):

María Lorena Castrillo ◽

Gustavo Ángel Bich ◽

Carlos Modenutti ◽

Adrián Turjanski ◽

Pedro Darío Zapata ◽

...

Keyword(s):

Solid State ◽

Sequence Analysis ◽

Ab Initio ◽

Genome Size ◽

Solid State Fermentation ◽

Whole Genome Shotgun ◽

Whole Genome ◽

Shotgun Sequence ◽

Whole Genome Shotgun Sequence ◽

Genome Shotgun Sequence

Trichoderma koningiopsis strain POS7 produces large amounts of cellulase enzymes in solid-state fermentation. Illumina-based sequence analysis reveals an approximate genome size of 36.6 Mbp, with a G+C content of 48.82%, for T. koningiopsis POS7. Based on ab initio prediction, 12,661 coding genes were annotated.
