Genome size and identification of abundant repetitive sequences in Vallisneria spinulosa

PeerJ ◽

10.7717/peerj.3982 ◽

2017 ◽

Vol 5 ◽

pp. e3982 ◽

Cited By ~ 3

Author(s):

RuiJuan Feng ◽

Xin Wang ◽

Min Tao ◽

Guanchao Du ◽

Qishuo Wang

Keyword(s):

Genome Size ◽

Aquatic Plant ◽

Nuclear Dna ◽

De Novo ◽

Repetitive Sequences ◽

Nuclear Dna Content ◽

Ltr Retrotransposons ◽

Sequencing Data ◽

Next Generation Sequencing Ngs ◽

Generation Sequencing

Vallisneria spinulosa is a freshwater aquatic plant of ecological and economic importance. However, there is limited cytogenetic and genomics information on Vallisneria. In this study, we measured the nuclear DNA content of Vallisneria spinulosa by flow cytometry, performed a de novo assembly, and annotated repetitive sequences by using a combination of next-generation sequencing (NGS) and bioinformatics tools. The genome size of Vallisneria spinulosa is approximately 3,595 Mbp, in which nearly 60% of the genome consists of repetitive sequences. The majority of the repetitive sequences are LTR-retrotransposons comprising 43% of the genome. Although the amount of sequencing data used in this study was not sufficient for a whole-genome assembly, it could generate an overview of representative elements in the genome. These results will lay a new foundation for further studies on various species that belong to the Vallisneria genus.

Download Full-text

Cytophotometric and biochemical analyses of DNA in pentaploid and diploid Agave species

Genome ◽

10.1139/g96-036 ◽

1996 ◽

Vol 39 (2) ◽

pp. 266-271 ◽

Cited By ~ 7

Author(s):

A. Cavallini ◽

L. Natali ◽

G. Cionini ◽

I. Castorena-Sanchez

Keyword(s):

Genome Size ◽

Chromatin Structure ◽

Nuclear Dna ◽

Repetitive Sequences ◽

Point Mutations ◽

Diploid Species ◽

Nuclear Dna Content ◽

Dapi Staining ◽

Gene Copies ◽

Biochemical Analyses

Nuclear DNA content, chromatin structure, and DNA composition were investigated in four Agave species: two diploid, Agave tequilana Weber and Agave angustifolia Haworth var. marginata Hort., and two pentaploid, Agave fourcroydes Lemaire and Agave sisalana Perrine. It was determined that the genome size of pentaploid species is nearly 2.5 times that of diploid ones. Cytophotometric analyses of chromatin structure were performed following Feulgen or DAPI staining to determine optical density profiles of interphase nuclei. Pentaploid species showed higher frequencies of condensed chromatin (heterochromatin) than diploid species. On the other hand, a lower frequency of A-T rich (DAPI stained) heterochromatin was found in pentaploid species than in diploid ones, indicating that heterochromatin in pentaploid species is made up of sequences with base compositions different from those of diploid species. Since thermal denaturation profiles of extracted DNA showed minor variations in the base composition of the genomes of the four species, it is supposed that, in pentaploid species, the large heterochromatin content is not due to an overrepresentation of G-C repetitive sequences but rather to the condensation of nonrepetitive sequences, such as, for example, redundant gene copies switched off in the polyploid complement. It is suggested that speciation in the genus Agave occurs through point mutations and minor DNA rearrangements, as is also indicated by the relative stability of the karyotype of this genus. Key words : Agave, DNA cytophotometry, DNA melting profiles, chromatin structure, genome size.

Download Full-text

RepAHR: an improved approach for de novo repeat identification by assembly of the high-frequency reads

BMC Bioinformatics ◽

10.1186/s12859-020-03779-w ◽

2020 ◽

Vol 21 (1) ◽

Author(s):

Xingyu Liao ◽

Xin Gao ◽

Xiankai Zhang ◽

Fang-Xiang Wu ◽

Jianxin Wang

Keyword(s):

High Frequency ◽

Structural Variation ◽

De Novo ◽

Repetitive Sequences ◽

Data Sets ◽

Sequence Coverage ◽

Coverage Ratio ◽

Next Generation Sequencing Ngs ◽

Genome Assembler ◽

Generation Sequencing

Abstract Background Repetitive sequences account for a large proportion of eukaryotes genomes. Identification of repetitive sequences plays a significant role in many applications, such as structural variation detection and genome assembly. Many existing de novo repeat identification pipelines or tools make use of assembly of the high-frequency k-mers to obtain repeats. However, a certain degree of sequence coverage is required for assemblers to get the desired assemblies. On the other hand, assemblers cut the reads into shorter k-mers for assembly, which may destroy the structure of the repetitive regions. For the above reasons, it is difficult to obtain complete and accurate repetitive regions in the genome by using existing tools. Results In this study, we present a new method called RepAHR for de novo repeat identification by assembly of the high-frequency reads. Firstly, RepAHR scans next-generation sequencing (NGS) reads to find the high-frequency k-mers. Secondly, RepAHR filters the high-frequency reads from whole NGS reads according to certain rules based on the high-frequency k-mer. Finally, the high-frequency reads are assembled to generate repeats by using SPAdes, which is considered as an outstanding genome assembler with NGS sequences. Conlusions We test RepAHR on five data sets, and the experimental results show that RepAHR outperforms RepARK and REPdenovo for detecting repeats in terms of N50, reference alignment ratio, coverage ratio of reference, mask ratio of Repbase and some other metrics.

Download Full-text

First estimates of genome size in ribbon worms (phylum Nemertea) using flow cytometry and Feulgen image analysis densitometry

Canadian Journal of Zoology ◽

10.1139/cjz-2014-0068 ◽

2014 ◽

Vol 92 (10) ◽

pp. 847-851 ◽

Cited By ~ 3

Author(s):

Kelly L. Mulligan ◽

Terra C. Hiebert ◽

Nicholas W. Jeffery ◽

T. Ryan Gregory

Keyword(s):

Flow Cytometry ◽

Image Analysis ◽

Body Size ◽

Genome Size ◽

Positive Relationship ◽

Nuclear Dna ◽

Nuclear Dna Content ◽

Size Estimation ◽

Size Diversity ◽

Size Estimates

Ribbon worms (phylum Nemertea) are among several animal groups that have been overlooked in past studies of genome-size diversity. Here, we report genome-size estimates for eight species of nemerteans, including representatives of the major lineages in the phylum. Genome sizes in these species ranged more than fivefold, and there was some indication of a positive relationship with body size. Somatic endopolyploidy also appears to be common in these animals. Importantly, this study demonstrates that both of the most common methods of genome-size estimation (flow cytometry and Feulgen image analysis densitometry) can be used to assess genome size in ribbon worms, thereby facilitating additional efforts to investigate patterns of variability in nuclear DNA content in this phylum.

Download Full-text

Genome size in cyclopoid copepods (Copepoda: Cyclopoida): chromatin diminution as a hypothesized mechanism of evolutionary constraint

Journal of Crustacean Biology ◽

10.1093/jcbiol/ruab043 ◽

2021 ◽

Vol 41 (3) ◽

Author(s):

Emilly Schutt ◽

Maria Hołyńska ◽

Grace A Wyngaard

Keyword(s):

Genome Size ◽

Life Histories ◽

Nuclear Dna ◽

Cell Lineage ◽

Fundamental Property ◽

Nuclear Dna Content ◽

Evolutionary Constraint ◽

Chromatin Diminution ◽

Somatic Genome ◽

Copepoda Cyclopoida

Abstract Genome size is a fundamental property of organisms that impacts their molecular evolution and life histories. The hypothesis that somatic genome sizes in copepods in the order Cyclopoida are small and evolutionary constrained relative to those in the order Calanoida was proposed 15 years ago. Since then, the number of estimates has almost doubled and the taxon sampling has broadened. Here we add 14 new estimates from eight genera of freshwater cyclopoids that vary from 0.2 to 6.6 pg of DNA per nucleus in the soma; all except one are 2.0 pg DNA per nucleus or smaller. This new sample adds to the pattern of genome size in copepods and is remarkably similar to the distribution on which the original hypothesis was based, as well as those of subsequently published estimates. Embryonic chromatin diminution, during which large portions of DNA are excised from the presomatic cell lineage, is reported in Paracyclops affinis (G.O. Sars, 1863). This diminution results in a somatic genome that is one half the size of the germline genome. When the sizes of the germline genomes carried in presomatic cells of cyclopoid species that possess chromatin diminution are considered, the prediminuted germline genome sizes of cyclopoid embryos overlap with the distribution of calanoid somatic genome sizes, supporting the hypothesis that chromatin diminution has functioned as a mechanism to constrain somatic nuclear DNA content in cyclopoid copepods. Geographically based variation in genome size among populations is also reviewed.

Download Full-text

De novo Genome Assembly from Next-Generation Sequencing (NGS) Reads

Next-Generation Sequencing Data Analysis ◽

10.1201/b19532-11 ◽

2016 ◽

pp. 144-155

Keyword(s):

Next Generation Sequencing ◽

Genome Assembly ◽

De Novo ◽

Next Generation ◽

De Novo Genome Assembly ◽

Next Generation Sequencing Ngs ◽

Generation Sequencing

Download Full-text

The ICR96 exon CNV validation series: a resource for orthogonal assessment of exon CNV calling in NGS data

Wellcome Open Research ◽

10.12688/wellcomeopenres.11689.1 ◽

2017 ◽

Vol 2 ◽

pp. 35 ◽

Cited By ~ 7

Author(s):

Shazia Mahamdallie ◽

Elise Ruark ◽

Shawn Yost ◽

Emma Ramsay ◽

Imran Uddin ◽

...

Keyword(s):

Sequencing Data ◽

Targeted Next Generation Sequencing ◽

Negative Results ◽

Targeted Ngs ◽

Predisposition Genes ◽

Next Generation Sequencing Ngs ◽

Ngs Data ◽

Validation Series ◽

Generation Sequencing ◽

Dependent Probe

Detection of deletions and duplications of whole exons (exon CNVs) is a key requirement of genetic testing. Accurate detection of this variant type has proved very challenging in targeted next-generation sequencing (NGS) data, particularly if only a single exon is involved. Many different NGS exon CNV calling methods have been developed over the last five years. Such methods are usually evaluated using simulated and/or in-house data due to a lack of publicly-available datasets with orthogonally generated results. This hinders tool comparisons, transparency and reproducibility. To provide a community resource for assessment of exon CNV calling methods in targeted NGS data, we here present the ICR96 exon CNV validation series. The dataset includes high-quality sequencing data from a targeted NGS assay (the TruSight Cancer Panel) together with Multiplex Ligation-dependent Probe Amplification (MLPA) results for 96 independent samples. 66 samples contain at least one validated exon CNV and 30 samples have validated negative results for exon CNVs in 26 genes. The dataset includes 46 exon CNVs in BRCA1, BRCA2, TP53, MLH1, MSH2, MSH6, PMS2, EPCAM or PTEN, giving excellent representation of the cancer predisposition genes most frequently tested in clinical practice. Moreover, the validated exon CNVs include 25 single exon CNVs, the most difficult type of exon CNV to detect. The FASTQ files for the ICR96 exon CNV validation series can be accessed through the European-Genome phenome Archive (EGA) under the accession number EGAS00001002428.

Download Full-text

Genome size mediates the effect of environmental filtering in determining plant β-diversity across temperate grasslands

10.22541/au.162081024.47885857/v1 ◽

2021 ◽

Author(s):

Hai-Yang Zhang ◽

Xiaotao Lü ◽

cunzheng wei ◽

Jeff Powell ◽

Xiaobo Wang ◽

...

Keyword(s):

Genome Size ◽

Community Assembly ◽

Nuclear Dna ◽

Environmental Filtering ◽

Nuclear Dna Content ◽

Mean Annual Precipitation ◽

Mean Annual Temperature ◽

Nitrogen And Phosphorus ◽

Temperate Grasslands ◽

Β Diversity

Elucidating mechanisms underlying community assembly and biodiversity patterns is central to ecology and evolution. Genome size (GS, i.e. nuclear DNA content) determines species’ capacity to tolerate environmental stress or to exploit new environments and therefore potentially drive community assembly. However, its role in driving β-diversity (i.e., the site-to-site variability in species composition) remains unclear. We measured GS for 169 plant species and investigated their occurrences within plant communities across 52 sites spanning a 3200-km transect in the temperate grasslands of China. We found environmental factors showed larger effects on β-diversity of large-GS than that of small-GS species. Community weighted mean GS increased with mean annual precipitation, soil total nitrogen and phosphorus concentrations, but decreased with mean annual temperature, suggesting a negative selection against species with large GS in resources-limited or warmer climates. These findings highlight the roles for GS in driving community assembly and predicting species responses to climate change.

Download Full-text

Miscanthus: Inter- and Intraspecific Genome Size Variation Among M. × Giganteus, M. Sinensis, M. Sacchariflorus Accessions

Acta Biologica Cracoviensia s Botanica ◽

10.1515/abcsb-2015-0013 ◽

2015 ◽

Vol 57 (1) ◽

pp. 104-113

Author(s):

Sandra Cichorz ◽

Maria Gośka ◽

Monika Rewers

Keyword(s):

Genome Size ◽

Dna Content ◽

Nuclear Dna ◽

Size Variation ◽

Nuclear Dna Content ◽

Interspecific Variation ◽

Genome Size Variation ◽

2C Dna Content ◽

Cytological Characteristics ◽

Stomatal Length

AbstractSinceM. sinensisAnderss.,M. sacchariflorus(Maxim.) Hack. andM. ×giganteusJ.M.Greef & Deuter ex Hodk. and Renvoize have considerably the highest potential for biomass production amongMiscanthusAnderss. species, there is an urgent need to broaden the knowledge about cytological characteristics required for their improvement. In this study our objectives were to assess the genome size variation among eighteenMiscanthusaccessions, as well as estimation of the monoploid genome size (2C and Cx) of theM. sinensiscultivars, which have not been analyzed yet. The characterization of threeMiscanthusspecies was performed with the use of flow cytometry and analysis of the stomatal length. The triploid (2n = 3x = 57)M. sinensis‘Goliath’ andM. ×giganteusclones possessed the highest 2C DNA content (8.34 pg and 7.43 pg, respectively). The intermediate 2C-values were found in the nuclei of the diploid (2n = 2x = 38)M. sinensisaccessions (5.52–5.72 pg), whereas they were the lowest in the diploid (2n = 2x = 38)M. sacchariflorusecotypes (4.58–4.59 pg). The presented study revealed interspecific variation of nuclear DNA content (P<0.01) and therefore allowed for recognition of particular taxa, inter- and intraspecific hybrids and prediction of potential parental components. Moreover, intraspecific genome size variation (P<0.01) was observed inM. sinensiscultivars at 3.62%. The values of the stomatal size obtained for the triploidM. ×giganteus‘Great Britain’ (mean 30.70 μm) or ‘Canada’ (mean 29.67 μm) and diploidM. sinensis‘Graziella’ (mean 29.96 μm) did not differ significantly, therefore this parameter is not recommended for ploidy estimation.

Download Full-text

De Novo Genome Assembly of Next-Generation Sequencing Data

Compendium of Plant Genomes - The Brassica rapa Genome ◽

10.1007/978-3-662-47901-8_4 ◽

2015 ◽

pp. 41-51

Author(s):

Min Liu ◽

Dongyuan Liu ◽

Hongkun Zheng

Keyword(s):

Next Generation Sequencing ◽

Genome Assembly ◽

De Novo ◽

Next Generation Sequencing Data ◽

Next Generation ◽

Sequencing Data ◽

De Novo Genome Assembly ◽

Generation Sequencing

Download Full-text

Computational Approaches for Transcriptome Assembly Based on Sequencing Technologies

Current Bioinformatics ◽

10.2174/1574893614666190410155603 ◽

2020 ◽

Vol 15 (1) ◽

pp. 2-16

Author(s):

Yuwen Luo ◽

Xingyu Liao ◽

Fang-Xiang Wu ◽

Jianxin Wang

Keyword(s):

De Novo ◽

Transcriptome Assembly ◽

Critical Role ◽

High Sensitivity ◽

Biological Properties ◽

Sequencing Data ◽

Sequencing Technologies ◽

Long Reads ◽

Massive Sequencing ◽

Generation Sequencing

Transcriptome assembly plays a critical role in studying biological properties and examining the expression levels of genomes in specific cells. It is also the basis of many downstream analyses. With the increase of speed and the decrease in cost, massive sequencing data continues to accumulate. A large number of assembly strategies based on different computational methods and experiments have been developed. How to efficiently perform transcriptome assembly with high sensitivity and accuracy becomes a key issue. In this work, the issues with transcriptome assembly are explored based on different sequencing technologies. Specifically, transcriptome assemblies with next-generation sequencing reads are divided into reference-based assemblies and de novo assemblies. The examples of different species are used to illustrate that long reads produced by the third-generation sequencing technologies can cover fulllength transcripts without assemblies. In addition, different transcriptome assemblies using the Hybrid-seq methods and other tools are also summarized. Finally, we discuss the future directions of transcriptome assemblies.

Download Full-text