Health informatics. Genomic sequence variation markup language (GSVML)

2009 ◽  
2010 ◽  
Vol 79 (2) ◽  
pp. 130-142 ◽  
Author(s):  
Jun Nakaya ◽  
Michio Kimura ◽  
Kaei Hiroi ◽  
Keisuke Ido ◽  
Woosung Yang ◽  
...  

2018 ◽  
Author(s):  
C. Titus Brown ◽  
Dominik Moritz ◽  
Michael P. O’Brien ◽  
Felix Reidl ◽  
Taylor Reiter ◽  
...  

Genomes computationally inferred from large metagenomic data sets are often incomplete and may be missing functionally important content and strain variation. We introduce an information retrieval system for large metagenomic data sets that exploits the sparsity of DNA assembly graphs to efficiently extract subgraphs surrounding an inferred genome. We apply this system to recover missing content from genome bins and show that substantial genomic sequence variation is present in a real metagenome. Our software implementation is available at https://github.com/spacegraphcats/ spacegraphcats under the 3-Clause BSD License.


2006 ◽  
Vol 72 (2) ◽  
pp. 1226-1230 ◽  
Author(s):  
Jill R. Stewart ◽  
Jan Vinjé ◽  
Sjon J. G. Oudejans ◽  
Geoff I. Scott ◽  
Mark D. Sobsey

ABSTRACT Typing of F-specific RNA (FRNA) coliphages has been proposed as a useful method for distinguishing human from animal fecal contamination in environmental samples. Group II and III FRNA coliphages are generally associated with human wastes, but several exceptions have been noted. In the present study, we have genotyped and partially sequenced group III FRNA coliphage field isolates from swine lagoons in North Carolina (NC) and South Carolina (SC), along with isolates from surface waters and municipal wastewaters. Phylogenetic analysis of a region of the 5′ end of the maturation protein gene revealed two genetically different group III FRNA subclusters with 36.6% sequence variation. The SC swine lagoon isolates were more closely related to group III prototype virus M11, whereas the isolates from a swine lagoon in NC, surface waters, and wastewaters grouped with prototype virus Q-beta. These results suggest that refining phage genotyping systems to discriminate M11-like phages from Q-beta-like phages would not necessarily provide greater discriminatory power in distinguishing human from animal sources of pollution. Within the group III subclusters, nucleotide sequence diversity ranged from 0% to 6.9% for M11-like strains and from 0% to 8.7% for Q-beta-like strains. It is demonstrated here that nucleotide sequencing of closely related FRNA strains can be used to help track sources of contamination in surface waters. A similar use of phage genomic sequence information to track fecal pollution promises more reliable results than phage typing by nucleic acid hybridization and may hold more potential for field applications.


Genome ◽  
2006 ◽  
Vol 49 (8) ◽  
pp. 950-958 ◽  
Author(s):  
Xue-Feng Ma ◽  
J Perry Gustafson

The timing and rate of genomic variation induced by allopolyploidization in the intergeneric wheat-rye (Triticum spp. – Secale cereale L.) hybrid triticale (× Triticosecale Wittmack) was studied using amplified fragment length polymorphism (AFLP) analyses with 2 sets of primers, EcoRI–MseI (E–M) and PstI–MseI (P–M), which primarily amplify repetitive and low-copy sequences, respectively. The results showed that allopolyploidization induced genome sequence variation in triticale and that a great degree of the genome variation occurred immediately following wide hybridization. Specifically, about 46.3% and 36.2% of the wheat parental band loss and 74.5% and 68.4% of the rye parental band loss occurred in the F1 hybrids (before chromosome doubling) for E–M and P–M primers, respectively. The sequence variation events that followed chromosome doubling consisted of continuous modifications that occurred at a very small rate compared with the rate of variation before chromosome doubling. However, the rate of sequence variation involving the rye parental genome was much higher in the first 5 generations following chromosome doubling than in any subsequent generation. Surprisingly, the highest rate of rye genomic variation occurring after chromosome doubling was in C3 or later, but not in C1. The data suggested that the cytoplasm and the degree of the relationship between the parental genomes were the key factors in determining the direction, amount, timing, and rate of genomic sequence variation occurring during intergeneric allopolyploidization.Key words: genome evolution, sequence variation, allopolyploid, triticale, AFLP.


Sign in / Sign up

Export Citation Format

Share Document