An annotated draft genome for Radix auricularia (Gastropoda, Mollusca)

Mapping Intimacies ◽

10.1101/087254 ◽

2016 ◽

Cited By ~ 1

Author(s):

Tilman Schell ◽

Barbara Feldmeyer ◽

Hanno Schmidt ◽

Bastian Greshake ◽

Oliver Tills ◽

...

Keyword(s):

Draft Genome ◽

Genetic Research ◽

Freshwater Snail ◽

Core Gene ◽

Protein Coding ◽

Repeat Content ◽

Starting Point ◽

Cumulative Length ◽

Gene Collection ◽

Radix Auricularia

Molluscs are the second most species-rich phylum in the animal kingdom, yet only eleven genomes of this group have been published so far. Here, we present the draft genome sequence of the pulmonate freshwater snail Radix auricularia. Six whole genome shotgun libraries with different layouts were sequenced. The resulting assembly comprises 4,823 scaffolds with a cumulative length of 910 Mb and an overall read coverage of 72x. The assembly contains 94.6% of a metazoan core gene collection, indicating an almost complete coverage of the coding fraction. The discrepancy of ~690 Mb compared to the estimated genome size of R. auricularia (1.6 Gb) results from a high repeat content of 70%, mainly comprising DNA transposons. The annotation of 17,338 protein-coding genes was supported by the use of publicly available transcriptome data. This draft will serve as a starting point for further genomic and population genetic research in this scientifically important phylum.
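The size discrepancy quoted in the abstract above follows from a simple subtraction. A minimal Python sketch, using only the two figures stated in the abstract (the variable names are illustrative, not from the paper):

```python
# Figures taken from the abstract; variable names are illustrative.
estimated_genome_mb = 1600  # estimated genome size of R. auricularia (1.6 Gb)
assembly_mb = 910           # cumulative scaffold length of the assembly

# Sequence absent from the assembly, which the abstract attributes to repeats.
discrepancy_mb = estimated_genome_mb - assembly_mb

# Share of the estimated genome that the assembly represents.
assembled_fraction = assembly_mb / estimated_genome_mb

print(f"discrepancy: {discrepancy_mb} Mb")
print(f"assembled fraction: {assembled_fraction:.3f}")
```

The result matches the ~690 Mb stated in the abstract and shows that the assembly covers a little over half of the estimated genome.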

Genome assembly of the Australian black tiger shrimp (Penaeus monodon) reveals a fragmented IHHNV EVE sequence

10.1101/2021.11.11.468259 ◽

2021 ◽

Author(s):

Roger Huerlimann ◽

Jeff A Cowley ◽

Nicholas M Wade ◽

Yinan Wang ◽

Naga Kasinadhuni ◽

...

Keyword(s):

Genome Assembly ◽

Draft Genome ◽

Penaeus Monodon ◽

Penaeid Shrimp ◽

Protein Coding ◽

Black Tiger Shrimp ◽

Draft Genome Assembly ◽

Repeat Content ◽

Tiger Shrimp ◽

Endogenous Viral Elements

Shrimp are a valuable aquaculture species globally; however, disease remains a major hindrance to shrimp aquaculture sustainability and growth. Mechanisms mediated by endogenous viral elements (EVEs) have been proposed as a means by which shrimp that encounter a new virus start to accommodate rather than succumb to infection over time. However, evidence on the nature of such EVEs and how they mediate viral accommodation is limited. More extensive genomic data on Penaeid shrimp from different geographical locations should assist in exposing the diversity of EVEs. In this context, reported here is a PacBio Sequel-based draft genome assembly of an Australian black tiger shrimp (Penaeus monodon) inbred for one generation. The 1.89 Gbp draft genome comprises 31,922 scaffolds (N50: 496,398 bp) covering 85.9% of the projected genome size. The genome repeat content (61.8% with 30% representing simple sequence repeats) is almost the highest identified for any species. The functional annotation identified 35,517 gene models, of which 25,809 were protein-coding and 17,158 were annotated using InterProScan. Scaffold scanning for specific EVEs identified an element comprising a 9,045 bp stretch of repeated, inverted and jumbled genome fragments of Infectious hypodermal and hematopoietic necrosis virus (IHHNV) bounded by a repeated 591/590 bp host sequence. As only near complete linear ~4 kb IHHNV genomes have been found integrated in the genome of P. monodon previously, its discovery has implications regarding the validity of PCR tests designed to specifically detect such linear EVE types. The existence of conjoined inverted IHHNV genome fragments also provides a means by which hairpin dsRNAs could be expressed and processed by the shrimp RNA interference (RNAi) machinery.

A draft genome assembly of the eastern banjo frog Limnodynastes dumerilii dumerilii (Anura: Limnodynastidae)

10.1101/2020.03.03.971721 ◽

2020 ◽

Cited By ~ 1

Author(s):

Qiye Li ◽

Qunfei Guo ◽

Yang Zhou ◽

Huishuang Tan ◽

Terry Bertozzi ◽

...

Keyword(s):

Genome Assembly ◽

Draft Genome ◽

Protein Coding ◽

Large Genome ◽

Draft Genome Assembly ◽

Protein Coding Genes ◽

Repeat Content ◽

Australian Continent ◽

Large Genome Size ◽

Reference Genomes

Amphibian genomes are usually challenging to assemble due to large genome size and high repeat content. The Limnodynastidae is a family of frogs native to Australia, Tasmania and New Guinea. As an anuran lineage that successfully diversified on the Australian continent, it represents an important lineage in the amphibian tree of life but lacks reference genomes. Here we sequenced and annotated the genome of the eastern banjo frog Limnodynastes dumerilii dumerilii to fill this gap. The total length of the genome assembly is 2.38 Gb with a scaffold N50 of 285.9 kb. We identified 1.21 Gb of non-redundant sequences as repetitive elements and annotated 24,548 protein-coding genes in the assembly. BUSCO assessment indicated that more than 94% of the expected vertebrate genes were present in the genome assembly and the gene set. We anticipate that this annotated genome assembly will advance the future study of anuran phylogeny and amphibian genome evolution.

Four high-quality draft genome assemblies of the marine heterotrophic nanoflagellate Cafeteria roenbergensis

10.1101/751586 ◽

2019 ◽

Cited By ~ 1

Author(s):

Thomas Hackl ◽

Roman Martin ◽

Karina Barenhoff ◽

Sarah Duponchel ◽

Dominik Heider ◽

...

Keyword(s):

De Novo ◽

Draft Genome ◽

Gc Content ◽

Illumina Miseq ◽

Evolutionary Analysis ◽

High Quality ◽

Protein Coding ◽

Repeat Content ◽

Genome Assemblies ◽

Cafeteria Roenbergensis

The heterotrophic stramenopile Cafeteria roenbergensis is a globally distributed marine bacterivorous protist. This unicellular flagellate is host to the giant DNA virus CroV and the virophage mavirus. We sequenced the genomes of four cultured C. roenbergensis strains and generated 23.53 Gb of Illumina MiSeq data (99-282× coverage per strain) and 5.09 Gb of PacBio RSII data (13-54× coverage). Using the Canu assembler and customized curation procedures, we obtained high-quality draft genome assemblies with a total length of 34-36 Mbp per strain and contig N50 lengths of 148 kbp to 464 kbp. The C. roenbergensis genome has a GC content of ~70%, a repeat content of ~28%, and is predicted to contain approximately 7857-8483 protein-coding genes based on a combination of de novo, homology-based and transcriptome-supported annotation. These first high-quality genome assemblies of a Bicosoecid fill an important gap in sequenced Stramenopile representatives and enable a more detailed evolutionary analysis of heterotrophic protists.

The draft chromosome-level genome assembly of tetraploid ground cherry (Prunus fruticosa Pall.) from long reads

10.1101/2021.06.01.446499 ◽

2021 ◽

Author(s):

Thomas W Woehner ◽

Ofere Francis Emeriewen ◽

Alexander Wittenberg ◽

Harrie Schneiders ◽

Ilse Vrijenhoek ◽

...

Keyword(s):

Genome Sequence ◽

Genome Assembly ◽

Draft Genome ◽

Sour Cherry ◽

Evolutionary Analysis ◽

Protein Coding ◽

Draft Genome Assembly ◽

Repeat Content ◽

Long Read ◽

Chromosome Level

Background: Cherries are stone fruits and belong to the economically important plant family Rosaceae, with different species cultivated worldwide. The ground cherry, Prunus fruticosa Pall., is one ancestor of cultivated sour cherry, an important tetraploid cherry species. Here, we present a long-read chromosome-level draft genome assembly and related plastid sequences using the Oxford Nanopore Technologies PromethION platform and R10.3 pore type. Finding: The final assemblies were obtained from 117.3 Gb of cleaned reads, representing 97x coverage of the expected 1.2 Gb tetraploid (2n=4x=32) and 0.3 Gb haploid (1n=8) genome sequence of P. fruticosa. The N50 contig length ranged between 0.3 and 0.5 Mb, with the longest contig being ~6 Mb. BUSCO estimated a completeness of 98.7% for the 4n and 96.1% for the 1n datasets. Using a homology- and reference-based scaffolding method, we generated a final consensus genome sequence of 366 Mb comprising eight chromosomes. The scaffold N50 was ~44 Mb, with the longest chromosome being 66.5 Mb. The repeat content was estimated at ~190 Mb (52%), and 58,880 protein-coding genes were annotated. The chloroplast and mitochondrial genomes were 158,217 bp and 383,281 bp long, which is in accordance with previously published plastid sequences. Conclusion: This is the first report of the genome of ground cherry (P. fruticosa) sequenced by long-read technology only. The datasets obtained from this study provide a foundation for future breeding, molecular and evolutionary analysis in Prunus studies.
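The coverage and repeat percentages in the abstract above can be checked with two divisions. The following Python sketch assumes the usual definition of mean sequencing depth (total read bases divided by genome size); the function name `coverage` is illustrative and not taken from the paper:

```python
def coverage(total_bases_gb: float, genome_size_gb: float) -> float:
    """Mean sequencing depth: total read bases divided by genome size."""
    return total_bases_gb / genome_size_gb


# 117.3 Gb of cleaned reads over the expected 1.2 Gb tetraploid genome.
depth = coverage(117.3, 1.2)

# ~190 Mb of repeats within the 366 Mb consensus sequence.
repeat_fraction = 190 / 366

print(f"depth: {depth:.2f}x")
print(f"repeat fraction: {repeat_fraction:.3f}")
```

Both values agree with the abstract to within rounding: the depth falls between 97x and 98x, and the repeat share is close to 52%.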

Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing

Genome Biology and Evolution ◽

10.1093/gbe/evz135 ◽

2019 ◽

Vol 11 (7) ◽

pp. 2045-2054 ◽

Cited By ~ 9

Author(s):

Martin Helmkampf ◽

M Renee Bellinger ◽

Scott M Geib ◽

Sheina B Sim ◽

Misaki Takabayashi

Keyword(s):

Tandem Repeats ◽

De Novo ◽

Draft Genome ◽

Cost Effective ◽

Sequence Length ◽

High Molecular Weight Dna ◽

Protein Coding ◽

Montipora Capitata ◽

Considerable Uncertainty ◽

Repeat Content

The rice coral, Montipora capitata, is widely distributed throughout the Indo-Pacific and comprises one of the most important reef-building species in the Hawaiian Islands. Here, we describe a de novo assembly of its genome based on a linked-read sequencing approach developed by 10x Genomics. The final draft assembly consisted of 27,870 scaffolds with an N50 size of 186 kb and contained a fairly complete set (81%) of metazoan benchmarking (BUSCO) genes. Based on haploid assembly size (615 Mb) and read k-mer profiles, we estimated the genome size to fall between 600 and 700 Mb, although the high fraction of repetitive sequence introduced considerable uncertainty. Repeat analysis indicated that 42% of the assembly consisted of interspersed, mostly unclassified repeats, and almost 3% tandem repeats. We also identified 36,691 protein-coding genes with a median coding sequence length of 807 bp, together spanning 7% of the assembly. The high repeat content and heterozygosity of the genome proved a challenging scenario for assembly, requiring additional steps to merge haplotypes and resulting in a higher than expected fragmentation at the scaffold level. Despite these challenges, the assembly turned out to be comparable in most quality measures to that of other available coral genomes while being considerably more cost-effective, especially with respect to long-read sequencing methods. Provided high-molecular-weight DNA is available, linked-read technology may thus serve as a valuable alternative capable of providing quality genome assemblies of nonmodel organisms.

Whole Genome Sequence of the Newly Prescribed Subspecies Oreochromis spilurus saudii: A Valuable Genetic Resource for Aquaculture in Saudi Arabia

Journal of Marine Science and Engineering ◽

10.3390/jmse9050506 ◽

2021 ◽

Vol 9 (5) ◽

pp. 506

Author(s):

Mohammed Othman Aljahdali ◽

Mohammad Habibur Rahman Molla ◽

Wessam Mansour Filfilan

Keyword(s):

Genome Sequence ◽

Saline Water ◽

Salt Water ◽

Draft Genome ◽

Gc Content ◽

Whole Genome Sequence ◽

Whole Genome ◽

Protein Coding ◽

Repeat Content ◽

High Yields

Tilapia (Oreochromis spp.) have significant potential for aquaculture production around the world. There is an increasing demand among tilapia producers for strains with higher yields and for fish that can survive in highly saline water. Novel strains and consistent seedstock are critically important objectives for sustainable aquaculture, but progress toward these targets remains insufficient. Therefore, this study describes the genome sequence of Oreochromis spilurus to support the seawater culture of tilapia. The draft genome is 0.768 Gb (gigabases), with a scaffold N50 (half of the assembly lies in scaffolds of at least this length) of 0.22 Mb (megabases). The GC content is 40.4%, the heterozygosity rate is 0.35%, and the repeat content is 47.97%. The predicted protein-coding peptide encoded 51,642 and predicted 10,641 protein-coding genes in the O. spilurus genome. A total of 262 antimicrobial peptides were predicted, bringing new hope for further research. This whole genome sequence provides new insights for biomedical and molecular research and will also improve the breeding of tilapia for high yields, resistance to disease, and adaptation to salt water.
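The scaffold N50 mentioned in the abstract above can be computed from a list of scaffold lengths. The following Python sketch uses the common definition (the largest length L such that scaffolds of length at least L together cover half of the assembly); the function name `n50` and the toy lengths are hypothetical and are not data from the paper:

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    The N50 is the largest length L such that sequences of length >= L
    together account for at least half of the total length.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating length.
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point issues with total / 2.
        if 2 * running >= total:
            return length


# Toy scaffold lengths (hypothetical, for illustration only).
toy_scaffolds = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_scaffolds))
```

In the toy example the two longest scaffolds (8 + 8 = 16) already make up half of the total length of 32, so the N50 is 8.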

The draft genome of Tibetan hulless barley reveals adaptive patterns to the high stressful Tibetan Plateau

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1423628112 ◽

2015 ◽

Vol 112 (4) ◽

pp. 1095-1100 ◽

Cited By ~ 69

Author(s):

Xingquan Zeng ◽

Hai Long ◽

Zhuo Wang ◽

Shancen Zhao ◽

Yawei Tang ◽

...

Keyword(s):

Tibetan Plateau ◽

Stress Responses ◽

Crop Improvement ◽

Draft Genome ◽

Genetic Research ◽

The Tibetan Plateau ◽

Protein Coding ◽

Hulless Barley ◽

Tibetan Wild Barley ◽

Tibetan Hulless Barley

The Tibetan hulless barley (Hordeum vulgare L. var. nudum), also called “Qingke” in Chinese and “Ne” in Tibetan, is the staple food for Tibetans and an important livestock feed in the Tibetan Plateau. The diploid nature and adaptation to diverse environments of the highland give it unique resources for genetic research and crop improvement. Here we produced a 3.89-Gb draft assembly of Tibetan hulless barley with 36,151 predicted protein-coding genes. Comparative analyses revealed the divergence times and synteny between barley and other representative Poaceae genomes. The expansion of the gene family related to stress responses was found in Tibetan hulless barley. Resequencing of 10 barley accessions uncovered high levels of genetic variation in Tibetan wild barley and genetic divergence between Tibetan and non-Tibetan barley genomes. Selective sweep analyses demonstrate adaptive correlations of genes under selection with extensive environmental variables. Our results not only construct a genomic framework for crop improvement but also provide evolutionary insights of highland adaptation of Tibetan hulless barley.

A draft genome assembly of the eastern banjo frog Limnodynastes dumerilii dumerilii (Anura: Limnodynastidae)

Gigabyte ◽

10.46471/gigabyte.2 ◽

2020 ◽

Vol 2020 ◽

pp. 1-13

Author(s):

Qiye Li ◽

Qunfei Guo ◽

Yang Zhou ◽

Huishuang Tan ◽

Terry Bertozzi ◽

...

Keyword(s):

Genome Assembly ◽

Draft Genome ◽

Protein Coding ◽

Large Genome ◽

Draft Genome Assembly ◽

Protein Coding Genes ◽

Repeat Content ◽

Australian Continent ◽

Large Genome Size ◽

Reference Genomes

Amphibian genomes are usually challenging to assemble due to their large genome size and high repeat content. The Limnodynastidae is a family of frogs native to Australia, Tasmania and New Guinea. As an anuran lineage that successfully diversified on the Australian continent, it represents an important lineage in the amphibian tree of life but lacks reference genomes. Here we sequenced and annotated the genome of the eastern banjo frog Limnodynastes dumerilii dumerilii to fill this gap. The total length of the genome assembly is 2.38 Gb with a scaffold N50 of 285.9 kb. We identified 1.21 Gb of non-redundant sequences as repetitive elements and annotated 24,548 protein-coding genes in the assembly. BUSCO assessment indicated that more than 94% of the expected vertebrate genes were present in the genome assembly and the gene set. We anticipate that this annotated genome assembly will advance the future study of anuran phylogeny and amphibian genome evolution.

Draft Genome Sequence of Urease-Producing Pseudorhodobacter sp. Strain E13, Isolated from the Yellow Sea in Gunsan, South Korea

Microbiology Resource Announcements ◽

10.1128/mra.00189-19 ◽

2019 ◽

Vol 8 (23) ◽

Author(s):

Si Chul Kim ◽

Hyo Jung Lee

Keyword(s):

South Korea ◽

Genome Sequence ◽

Yellow Sea ◽

Draft Genome ◽

The Yellow Sea ◽

Draft Genome Sequence ◽

Protein Coding ◽

Coding Sequences ◽

Gram Negative ◽

Content Type

Here, we report the draft genome sequence of Pseudorhodobacter sp. strain E13, a Gram-negative, aerobic, nonflagellated, and rod-shaped bacterium which was isolated from the Yellow Sea in South Korea. The assembled genome sequence is 3,878,578 bp long with 3,646 protein-coding sequences in 159 contigs.

Combined genomic, transcriptomic, and metabolomic analyses provide insights into chayote (Sechium edule) evolution and fruit development

Horticulture Research ◽

10.1038/s41438-021-00487-1 ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Anzhen Fu ◽

Qing Wang ◽

Jianlou Mu ◽

Lili Ma ◽

Changlong Wen ◽

...

Keyword(s):

Fruit Development ◽

Repetitive Sequences ◽

Genetic Research ◽

Future Research ◽

Agricultural Crop ◽

Protein Coding ◽

Third Generation Sequencing ◽

Sechium Edule ◽

Generation Sequencing ◽

Cucurbitaceae Family

Chayote (Sechium edule) is an agricultural crop in the Cucurbitaceae family that is rich in bioactive components. To enhance genetic research on chayote, we used Nanopore third-generation sequencing combined with Hi-C data to assemble a draft chayote genome. A chromosome-level assembly anchored on 14 chromosomes (N50 contig and scaffold sizes of 8.40 and 46.56 Mb, respectively) estimated the genome size as 606.42 Mb, which is large for the Cucurbitaceae, with 65.94% (401.08 Mb) of the genome comprising repetitive sequences; 28,237 protein-coding genes were predicted. Comparative genome analysis indicated that chayote and snake gourd diverged from sponge gourd and that a whole-genome duplication (WGD) event occurred in chayote at 25 ± 4 Mya. Transcriptional and metabolic analysis revealed genes involved in fruit texture, pigment, flavor, flavonoids, antioxidants, and plant hormones during chayote fruit development. The analysis of the genome, transcriptome, and metabolome provides insights into chayote evolution and lays the groundwork for future research on fruit and tuber development and genetic improvements in chayote.
