scholarly journals Genome size and identification of abundant repetitive sequences in Vallisneria spinulosa

PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e3982 ◽  
Author(s):  
RuiJuan Feng ◽  
Xin Wang ◽  
Min Tao ◽  
Guanchao Du ◽  
Qishuo Wang

Vallisneria spinulosa is a freshwater aquatic plant of ecological and economic importance. However, there is limited cytogenetic and genomics information on Vallisneria. In this study, we measured the nuclear DNA content of Vallisneria spinulosa by flow cytometry, performed a de novo assembly, and annotated repetitive sequences by using a combination of next-generation sequencing (NGS) and bioinformatics tools. The genome size of Vallisneria spinulosa is approximately 3,595 Mbp, in which nearly 60% of the genome consists of repetitive sequences. The majority of the repetitive sequences are LTR-retrotransposons comprising 43% of the genome. Although the amount of sequencing data used in this study was not sufficient for a whole-genome assembly, it could generate an overview of representative elements in the genome. These results will lay a new foundation for further studies on various species that belong to the Vallisneria genus.

Genome ◽  
1996 ◽  
Vol 39 (2) ◽  
pp. 266-271 ◽  
Author(s):  
A. Cavallini ◽  
L. Natali ◽  
G. Cionini ◽  
I. Castorena-Sanchez

Nuclear DNA content, chromatin structure, and DNA composition were investigated in four Agave species: two diploid, Agave tequilana Weber and Agave angustifolia Haworth var. marginata Hort., and two pentaploid, Agave fourcroydes Lemaire and Agave sisalana Perrine. It was determined that the genome size of pentaploid species is nearly 2.5 times that of diploid ones. Cytophotometric analyses of chromatin structure were performed following Feulgen or DAPI staining to determine optical density profiles of interphase nuclei. Pentaploid species showed higher frequencies of condensed chromatin (heterochromatin) than diploid species. On the other hand, a lower frequency of A-T rich (DAPI stained) heterochromatin was found in pentaploid species than in diploid ones, indicating that heterochromatin in pentaploid species is made up of sequences with base compositions different from those of diploid species. Since thermal denaturation profiles of extracted DNA showed minor variations in the base composition of the genomes of the four species, it is supposed that, in pentaploid species, the large heterochromatin content is not due to an overrepresentation of G-C repetitive sequences but rather to the condensation of nonrepetitive sequences, such as, for example, redundant gene copies switched off in the polyploid complement. It is suggested that speciation in the genus Agave occurs through point mutations and minor DNA rearrangements, as is also indicated by the relative stability of the karyotype of this genus. Key words : Agave, DNA cytophotometry, DNA melting profiles, chromatin structure, genome size.


2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Xingyu Liao ◽  
Xin Gao ◽  
Xiankai Zhang ◽  
Fang-Xiang Wu ◽  
Jianxin Wang

Abstract Background Repetitive sequences account for a large proportion of eukaryotes genomes. Identification of repetitive sequences plays a significant role in many applications, such as structural variation detection and genome assembly. Many existing de novo repeat identification pipelines or tools make use of assembly of the high-frequency k-mers to obtain repeats. However, a certain degree of sequence coverage is required for assemblers to get the desired assemblies. On the other hand, assemblers cut the reads into shorter k-mers for assembly, which may destroy the structure of the repetitive regions. For the above reasons, it is difficult to obtain complete and accurate repetitive regions in the genome by using existing tools. Results In this study, we present a new method called RepAHR for de novo repeat identification by assembly of the high-frequency reads. Firstly, RepAHR scans next-generation sequencing (NGS) reads to find the high-frequency k-mers. Secondly, RepAHR filters the high-frequency reads from whole NGS reads according to certain rules based on the high-frequency k-mer. Finally, the high-frequency reads are assembled to generate repeats by using SPAdes, which is considered as an outstanding genome assembler with NGS sequences. Conlusions We test RepAHR on five data sets, and the experimental results show that RepAHR outperforms RepARK and REPdenovo for detecting repeats in terms of N50, reference alignment ratio, coverage ratio of reference, mask ratio of Repbase and some other metrics.


2014 ◽  
Vol 92 (10) ◽  
pp. 847-851 ◽  
Author(s):  
Kelly L. Mulligan ◽  
Terra C. Hiebert ◽  
Nicholas W. Jeffery ◽  
T. Ryan Gregory

Ribbon worms (phylum Nemertea) are among several animal groups that have been overlooked in past studies of genome-size diversity. Here, we report genome-size estimates for eight species of nemerteans, including representatives of the major lineages in the phylum. Genome sizes in these species ranged more than fivefold, and there was some indication of a positive relationship with body size. Somatic endopolyploidy also appears to be common in these animals. Importantly, this study demonstrates that both of the most common methods of genome-size estimation (flow cytometry and Feulgen image analysis densitometry) can be used to assess genome size in ribbon worms, thereby facilitating additional efforts to investigate patterns of variability in nuclear DNA content in this phylum.


2021 ◽  
Vol 41 (3) ◽  
Author(s):  
Emilly Schutt ◽  
Maria Hołyńska ◽  
Grace A Wyngaard

Abstract Genome size is a fundamental property of organisms that impacts their molecular evolution and life histories. The hypothesis that somatic genome sizes in copepods in the order Cyclopoida are small and evolutionary constrained relative to those in the order Calanoida was proposed 15 years ago. Since then, the number of estimates has almost doubled and the taxon sampling has broadened. Here we add 14 new estimates from eight genera of freshwater cyclopoids that vary from 0.2 to 6.6 pg of DNA per nucleus in the soma; all except one are 2.0 pg DNA per nucleus or smaller. This new sample adds to the pattern of genome size in copepods and is remarkably similar to the distribution on which the original hypothesis was based, as well as those of subsequently published estimates. Embryonic chromatin diminution, during which large portions of DNA are excised from the presomatic cell lineage, is reported in Paracyclops affinis (G.O. Sars, 1863). This diminution results in a somatic genome that is one half the size of the germline genome. When the sizes of the germline genomes carried in presomatic cells of cyclopoid species that possess chromatin diminution are considered, the prediminuted germline genome sizes of cyclopoid embryos overlap with the distribution of calanoid somatic genome sizes, supporting the hypothesis that chromatin diminution has functioned as a mechanism to constrain somatic nuclear DNA content in cyclopoid copepods. Geographically based variation in genome size among populations is also reviewed.


2017 ◽  
Vol 2 ◽  
pp. 35 ◽  
Author(s):  
Shazia Mahamdallie ◽  
Elise Ruark ◽  
Shawn Yost ◽  
Emma Ramsay ◽  
Imran Uddin ◽  
...  

Detection of deletions and duplications of whole exons (exon CNVs) is a key requirement of genetic testing. Accurate detection of this variant type has proved very challenging in targeted next-generation sequencing (NGS) data, particularly if only a single exon is involved. Many different NGS exon CNV calling methods have been developed over the last five years. Such methods are usually evaluated using simulated and/or in-house data due to a lack of publicly-available datasets with orthogonally generated results. This hinders tool comparisons, transparency and reproducibility. To provide a community resource for assessment of exon CNV calling methods in targeted NGS data, we here present the ICR96 exon CNV validation series. The dataset includes high-quality sequencing data from a targeted NGS assay (the TruSight Cancer Panel) together with Multiplex Ligation-dependent Probe Amplification (MLPA) results for 96 independent samples. 66 samples contain at least one validated exon CNV and 30 samples have validated negative results for exon CNVs in 26 genes. The dataset includes 46 exon CNVs in BRCA1, BRCA2, TP53, MLH1, MSH2, MSH6, PMS2, EPCAM or PTEN, giving excellent representation of the cancer predisposition genes most frequently tested in clinical practice. Moreover, the validated exon CNVs include 25 single exon CNVs, the most difficult type of exon CNV to detect. The FASTQ files for the ICR96 exon CNV validation series can be accessed through the European-Genome phenome Archive (EGA) under the accession number EGAS00001002428.


Author(s):  
Hai-Yang Zhang ◽  
Xiaotao Lü ◽  
cunzheng wei ◽  
Jeff Powell ◽  
Xiaobo Wang ◽  
...  

Elucidating mechanisms underlying community assembly and biodiversity patterns is central to ecology and evolution. Genome size (GS, i.e. nuclear DNA content) determines species’ capacity to tolerate environmental stress or to exploit new environments and therefore potentially drive community assembly. However, its role in driving β-diversity (i.e., the site-to-site variability in species composition) remains unclear. We measured GS for 169 plant species and investigated their occurrences within plant communities across 52 sites spanning a 3200-km transect in the temperate grasslands of China. We found environmental factors showed larger effects on β-diversity of large-GS than that of small-GS species. Community weighted mean GS increased with mean annual precipitation, soil total nitrogen and phosphorus concentrations, but decreased with mean annual temperature, suggesting a negative selection against species with large GS in resources-limited or warmer climates. These findings highlight the roles for GS in driving community assembly and predicting species responses to climate change.


2015 ◽  
Vol 57 (1) ◽  
pp. 104-113
Author(s):  
Sandra Cichorz ◽  
Maria Gośka ◽  
Monika Rewers

AbstractSinceM. sinensisAnderss.,M. sacchariflorus(Maxim.) Hack. andM. ×giganteusJ.M.Greef & Deuter ex Hodk. and Renvoize have considerably the highest potential for biomass production amongMiscanthusAnderss. species, there is an urgent need to broaden the knowledge about cytological characteristics required for their improvement. In this study our objectives were to assess the genome size variation among eighteenMiscanthusaccessions, as well as estimation of the monoploid genome size (2C and Cx) of theM. sinensiscultivars, which have not been analyzed yet. The characterization of threeMiscanthusspecies was performed with the use of flow cytometry and analysis of the stomatal length. The triploid (2n = 3x = 57)M. sinensis‘Goliath’ andM. ×giganteusclones possessed the highest 2C DNA content (8.34 pg and 7.43 pg, respectively). The intermediate 2C-values were found in the nuclei of the diploid (2n = 2x = 38)M. sinensisaccessions (5.52–5.72 pg), whereas they were the lowest in the diploid (2n = 2x = 38)M. sacchariflorusecotypes (4.58–4.59 pg). The presented study revealed interspecific variation of nuclear DNA content (P<0.01) and therefore allowed for recognition of particular taxa, inter- and intraspecific hybrids and prediction of potential parental components. Moreover, intraspecific genome size variation (P<0.01) was observed inM. sinensiscultivars at 3.62%. The values of the stomatal size obtained for the triploidM. ×giganteus‘Great Britain’ (mean 30.70 μm) or ‘Canada’ (mean 29.67 μm) and diploidM. sinensis‘Graziella’ (mean 29.96 μm) did not differ significantly, therefore this parameter is not recommended for ploidy estimation.


2020 ◽  
Vol 15 (1) ◽  
pp. 2-16
Author(s):  
Yuwen Luo ◽  
Xingyu Liao ◽  
Fang-Xiang Wu ◽  
Jianxin Wang

Transcriptome assembly plays a critical role in studying biological properties and examining the expression levels of genomes in specific cells. It is also the basis of many downstream analyses. With the increase of speed and the decrease in cost, massive sequencing data continues to accumulate. A large number of assembly strategies based on different computational methods and experiments have been developed. How to efficiently perform transcriptome assembly with high sensitivity and accuracy becomes a key issue. In this work, the issues with transcriptome assembly are explored based on different sequencing technologies. Specifically, transcriptome assemblies with next-generation sequencing reads are divided into reference-based assemblies and de novo assemblies. The examples of different species are used to illustrate that long reads produced by the third-generation sequencing technologies can cover fulllength transcripts without assemblies. In addition, different transcriptome assemblies using the Hybrid-seq methods and other tools are also summarized. Finally, we discuss the future directions of transcriptome assemblies.


Sign in / Sign up

Export Citation Format

Share Document