scholarly journals Draft versus finished sequence data for DNA and protein diagnostic signature development

2005 ◽  
Vol 33 (18) ◽  
pp. 5838-5850 ◽  
Author(s):  
S. N. Gardner
1999 ◽  
Vol 9 (2) ◽  
pp. 189-194
Author(s):  
Peter M. Kuehl ◽  
Jane M. Weisemann ◽  
Jeffrey W. Touchman ◽  
Eric D. Green ◽  
Mark S. Boguski

Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs (“prefinished” sequence) and more slowly produces the final, completely assembled sequence (“finished” sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.[Our software system is available via an extensive web supplement to this article at http://www.ncbi.nlm.nih.gov/Kuehl/prefinished.]


2019 ◽  
Vol 44 (4) ◽  
pp. 930-942
Author(s):  
Geraldine A. Allen ◽  
Luc Brouillet ◽  
John C. Semple ◽  
Heidi J. Guest ◽  
Robert Underhill

Abstract—Doellingeria and Eucephalus form the earliest-diverging clade of the North American Astereae lineage. Phylogenetic analyses of both nuclear and plastid sequence data show that the Doellingeria-Eucephalus clade consists of two main subclades that differ from current circumscriptions of the two genera. Doellingeria is the sister group to E. elegans, and the Doellingeria + E. elegans subclade in turn is sister to the subclade containing all remaining species of Eucephalus. In the plastid phylogeny, the two subclades are deeply divergent, a pattern that is consistent with an ancient hybridization event involving ancestral species of the Doellingeria-Eucephalus clade and an ancestral taxon of a related North American or South American group. Divergence of the two Doellingeria-Eucephalus subclades may have occurred in association with northward migration from South American ancestors. We combine these two genera under the older of the two names, Doellingeria, and propose 12 new combinations (10 species and two varieties) for all species of Eucephalus.


Sign in / Sign up

Export Citation Format

Share Document