Progress towards a reference genome for sunflower

Botany ◽  
2011 ◽  
Vol 89 (7) ◽  
pp. 429-437 ◽  
Author(s):  
N.C. Kane ◽  
N. Gill ◽  
M.G. King ◽  
J.E. Bowers ◽  
H. Berges ◽  
...  

The Compositae is one of the largest and most economically important families of flowering plants and includes a diverse array of food crops, horticultural crops, medicinals, and noxious weeds. Despite its size and economic importance, there is no reference genome sequence for the Compositae, which impedes research and improvement efforts. We report on progress toward sequencing the 3.5 Gb genome of cultivated sunflower ( Helianthus annuus ), the most important crop in the family. Our sequencing strategy combines whole-genome shotgun sequencing using the Solexa and 454 platforms with the generation of high-density genetic and physical maps that serve as scaffolds for the linear assembly of whole-genome shotgun sequences. The performance of this approach is enhanced by the construction of a sequence-based physical map, which provides unique sequence-based tags every 5–6 kb across the genome. Thus far, our physical map covers ∼85% of the sunflower genome, and we have generated ∼80× genome coverage with Solexa reads and 15.5× with 454 reads. Preliminary analyses indicated that ∼78% of the sunflower genome consists of repetitive sequences. Nonetheless, ∼76% of contigs >5 kb in size can be assigned to either the physical or genetic map or to both, suggesting that our approach is likely to deliver a highly accurate and contiguous reference genome for sunflower.

Genome ◽  
2012 ◽  
Vol 55 (2) ◽  
pp. 172-175 ◽  
Author(s):  
Akihiko Koga

It is widely accepted in a conceptual framework that repetitive sequences, especially those with high sequence homogeneity among copies, tend to be under-represented in whole-genome shotgun sequence databases, because of the difficulty of assembling sequence reads into contigs. Although this is easily inferred, there is no quantitative illsutration of this phenomenon. An example using a currently used database is expected to contribute to the intuitive understanding of how serious the under-representation is. The present study provides the first quantitative example (in the case of 16 copies of virtually identical, 4.7-kb sequences in a genome of 7 × 10 8 bp) by comparing the results of BLAST searches of a sequence database (contig N50; 9.8 kb) with those of Southern blot analysis of genomic DNA. This has revealed that the internal regions of the repetitive sequences are under-represented to a striking extent.


Author(s):  
Shangzhe Zhang ◽  
Wenyu Liu ◽  
Xinfeng Liu ◽  
Xin Du ◽  
Ke Zhang ◽  
...  

Abstract Structural variants (SVs) represent an important genetic resource for both natural and artificial selection. Here we present a chromosome-scale reference genome for domestic yak (Bos grunniens) that has longer contigs and scaffolds (N50 44.72Mb and 114.39 Mb, respectively) than reported for any other ruminant genome. We further obtained long-read resequencing data for 6 wild and 23 domestic yaks and constructed a genetic SV map of 37,220 SVs that covers the geographic range of the yaks. The majority of the SVs contains repetitive sequences and several are in or near genes. By comparing SVs in domestic and wild yaks, we identified genes that are predominantly related to the nervous system, behavior, immunity and reproduction and may have been targeted by artificial selection during yak domestication. These findings provide new insights in the domestication of animals living at high altitude and highlight the importance of SVs in animal domestication.


Genetics ◽  
1996 ◽  
Vol 142 (2) ◽  
pp. 537-548 ◽  
Author(s):  
Michael W Nachman ◽  
Gary A Churchill

Abstract If loci are randomly distributed on a physical map, the density of markers on a genetic map will be inversely proportional to recombination rate. First proposed by MARY LYON, we have used this idea to estimate recombination rates from the Drosophila melanogaster linkage map. These results were compared with results of two other studies that estimated regional recombination rates in D. melanogaster using both physical and genetic maps. The three methods were largely concordant in identifying large-scale genomic patterns of recombination. The marker density method was then applied to the Mus musculus microsatellite linkage map. The distribution of microsatellites provided evidence for heterogeneity in recombination rates. Centromeric regions for several mouse chromosomes had significantly greater numbers of markers than expected, suggesting that recombination rates were lower in these regions. In contrast, most telomeric regions contained significantly fewer markers than expected. This indicates that recombination rates are elevated at the telomeres of many mouse chromosomes and is consistent with a comparison of the genetic and cytogenetic maps in these regions. The density of markers on a genetic map may provide a generally useful way to estimate regional recombination rates in species for which genetic, but not physical, maps are available.


2021 ◽  
Vol 160 (6) ◽  
pp. S-569
Author(s):  
Manoj Dadlani ◽  
Kelly Moffat ◽  
Huai Li ◽  
Xin Zhou ◽  
Rita Colwell

2018 ◽  
Vol 6 (26) ◽  
Author(s):  
Zhong Liang ◽  
Melissa Stephens ◽  
Victoria A. Ploplis ◽  
Shaun W. Lee ◽  
Francis J. Castellino

Whole-genome shotgun sequences and bottom-up assembly of contigs of six skin isolates of Streptococcus pyogenes, viz., NS88.3 (emm98.1), NS223 (emm91), NS455 (emm52), SS1448 (emm86.2), SS1572 (emm223), and SS1574 (emm224), are presented here. All contigs were annotated, and the gene arrangements and the inferred proteins were consistent with a pattern D classification.


Sign in / Sign up

Export Citation Format

Share Document