scholarly journals De-novoassembly of zucchini genome reveals a whole genome duplication associated with the origin of theCucurbitagenus

2017 ◽  
Author(s):  
Javier Montero-Pau ◽  
José Blanca ◽  
Aureliano Bombarely ◽  
Peio Ziarsolo ◽  
Cristina Esteras ◽  
...  

AbstractTheCucurbitagenus (squashes, pumpkins, gourds) includes important domesticated species such asC. pepo,C. maximaandC. moschata. In this study, we present a high-quality draft of the zucchini (C. pepo) genome. The assembly has a size of 263 Mb, a scaffold N50 of 1.8 Mb, 34,240 gene models, includes 92% of the conserved BUSCO core gene set, and it is estimated to cover 93.0% of the genome. The genome is organized in 20 pseudomolecules, that represent 81.4% of the assembly, and it is integrated with a genetic map of 7,718 SNPs. Despite its small genome size three independent evidences support that theC. pepogenome is the result of a Whole Genome Duplication: the topology of the gene family phylogenies, the karyotype organization, and the distribution of 4DTv distances. Additionally, 40 transcriptomes of 12 species of the genus were assembled and analyzed together with all the other published genomes of the Cucurbitaceae family. The duplication was detected in all theCucurbitaspecies analyzed, includingC. maximaandC. moschata, but not in the more distant cucurbits belonging to theCucumisandCitrullusgenera, and it is likely to have happened 30 ± 4 Mya in the ancestral species that gave rise to the genus.

2019 ◽  
Author(s):  
Alex Trouern-Trend ◽  
Taylor Falk ◽  
Sumaira Zaman ◽  
Madison Caballero ◽  
David B. Neale ◽  
...  

ABSTRACTJuglans (walnuts), the most speciose genus in the walnut family (Juglandaceae) represents most of the family’s commercially valuable fruit and wood-producing trees and includes several species used as rootstock in agriculture for their resistance to various abiotic and biotic stressors. We present the full structural and functional genome annotations of six Juglans species and one outgroup within Juglandaceae (Juglans regia, J. cathayensis, J. hindsii, J. microcarpa, J. nigra, J. sigillata and Pterocarya stenoptera) produced using BRAKER2 semi-unsupervised gene prediction pipeline and additional in-house developed tools. For each annotation, gene predictors were trained using 19 tissue-specific J. regia transcriptomes aligned to the genomes. Additional functional evidence and filters were applied to multiexonic and monoexonic putative genes to yield between 27,000 and 44,000 high-confidence gene models per species. Comparison of gene models to the BUSCO embryophyta dataset suggested that, on average, genome annotation completeness was 89.6%. We utilized these high quality annotations to assess gene family evolution within Juglans and among Juglans and selected Eurosid species, which revealed significant contractions in several gene families in J. hindsii including disease resistance-related Wall-associated Kinase (WAK) and Catharanthus roseus Receptor-like Kinase (CrRLK1L) and others involved in abiotic stress response. Finally, we confirmed an ancient whole genome duplication that took place in a common ancestor of Juglandaceae using site substitution comparative analysis.SIGNIFICANCEHigh-quality full genome annotations for six species of walnut (Juglans) and a wingnut (Pterocarya) outgroup were constructed using semi-unsupervised gene prediction followed by gene model filtering and functional characterization. These annotations represent the most comprehensive set for any hardwood genus to date. Comparative analyses based on the gene models uncovered rapid evolution in multiple gene families related to disease-response and a whole genome duplication in a Juglandaceae common ancestor.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
David A. Ayala-Usma ◽  
Martha Cárdenas ◽  
Romain Guyot ◽  
Maryam Chaib De Mares ◽  
Adriana Bernal ◽  
...  

Abstract Background Pathogens of the genus Phytophthora are the etiological agents of many devastating diseases in several high-value crops and forestry species such as potato, tomato, cocoa, and oak, among many others. Phytophthora betacei is a recently described species that causes late blight almost exclusively in tree tomatoes, and it is closely related to Phytophthora infestans that causes the disease in potato crops and other Solanaceae. This study reports the assembly and annotation of the genomes of P. betacei P8084, the first of its species, and P. infestans RC1-10, a Colombian strain from the EC-1 lineage, using long-read SMRT sequencing technology. Results Our results show that P. betacei has the largest sequenced genome size of the Phytophthora genus so far with 270 Mb. A moderate transposable element invasion and a whole genome duplication likely explain its genome size expansion when compared to P. infestans, whereas P. infestans RC1-10 has expanded its genome under the activity of transposable elements. The high diversity and abundance (in terms of copy number) of classified and unclassified transposable elements in P. infestans RC1-10 relative to P. betacei bears testimony of the power of long-read technologies to discover novel repetitive elements in the genomes of organisms. Our data also provides support for the phylogenetic placement of P. betacei as a standalone species and as a sister group of P. infestans. Finally, we found no evidence to support the idea that the genome of P. betacei P8084 follows the same gene-dense/gense-sparse architecture proposed for P. infestans and other filamentous plant pathogens. Conclusions This study provides the first genome-wide picture of P. betacei and expands the genomic resources available for P. infestans. This is a contribution towards the understanding of the genome biology and evolutionary history of Phytophthora species belonging to the subclade 1c.


PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e3400 ◽  
Author(s):  
Yunpeng Cao ◽  
Yahui Han ◽  
Dandan Meng ◽  
Dahui Li ◽  
Qing Jin ◽  
...  

The ethylene-insensitive3/ethylene-insensitive3-like (EIN3/EIL) proteins are a type of nuclear-localized protein with DNA-binding activity in plants. Although the EIN3/EIL gene family has been studied in several plant species, little is known about comprehensive study of the EIN3/EIL gene family in Rosaceae. In this study, ten, five, four, and five EIN3/EIL genes were identified in the genomes of pear (Pyrus bretschneideri), mei (Prunus mume), peach (Prunus persica) and strawberry (Fragaria vesca), respectively. Twenty-eight chromosomal segments of EIL/EIN3 gene family were found in four Rosaceae species, and these segments could form seven orthologous or paralogous groups based on interspecies or intraspecies gene colinearity (microsynteny) analysis. Moreover, the highly conserved regions of microsynteny were found in four Rosaceae species. Subsequently it was found that both whole genome duplication and tandem duplication events significantly contributed to the EIL/EIN3 gene family expansion. Gene expression analysis of the EIL/EIN3 genes in the pear revealed subfunctionalization for several PbEIL genes derived from whole genome duplication. It is noteworthy that according to environmental selection pressure analysis, the strong purifying selection should dominate the maintenance of the EIL/EIN3 gene family in four Rosaceae species. These results provided useful information on Rosaceae EIL/EIN3 genes, as well as insights into the evolution of this gene family in four Rosaceae species. Furthermore, high level of microsynteny in the four Rosaceae plants suggested that a large-scale genome duplication event in the EIL/EIN3 gene family was predated to speciation.


2020 ◽  
Vol 18 (9) ◽  
pp. 1848-1850 ◽  
Author(s):  
Junpei Zhang ◽  
Wenting Zhang ◽  
Feiyang Ji ◽  
Jie Qiu ◽  
Xiaobo Song ◽  
...  

GigaScience ◽  
2021 ◽  
Vol 10 (3) ◽  
Author(s):  
Zheng Fan ◽  
Tao Yuan ◽  
Piao Liu ◽  
Lu-Yu Wang ◽  
Jian-Feng Jin ◽  
...  

Abstract Background The spider Trichonephila antipodiana (Araneidae), commonly known as the batik golden web spider, preys on arthropods with body sizes ranging from ∼2 mm in length to insects larger than itself (>20‒50 mm), indicating its polyphagy and strong dietary detoxification abilities. Although it has been reported that an ancient whole-genome duplication event occurred in spiders, lack of a high-quality genome has limited characterization of this event. Results We present a chromosome-level T. antipodiana genome constructed on the basis of PacBio and Hi-C sequencing. The assembled genome is 2.29 Gb in size with a scaffold N50 of 172.89 Mb. Hi-C scaffolding assigned 98.5% of the bases to 13 pseudo-chromosomes, and BUSCO completeness analysis revealed that the assembly included 94.8% of the complete arthropod universal single-copy orthologs (n = 1,066). Repetitive elements account for 59.21% of the genome. We predicted 19,001 protein-coding genes, of which 96.78% were supported by transcriptome-based evidence and 96.32% matched protein records in the UniProt database. The genome also shows substantial expansions in several detoxification-associated gene families, including cytochrome P450 mono-oxygenases, carboxyl/cholinesterases, glutathione-S-transferases, and ATP-binding cassette transporters, reflecting the possible genomic basis of polyphagy. Further analysis of the T. antipodiana genome architecture reveals an ancient whole-genome duplication event, based on 2 lines of evidence: (i) large-scale duplications from inter-chromosome synteny analysis and (ii) duplicated clusters of Hox genes. Conclusions The high-quality T. antipodiana genome represents a valuable resource for spider research and provides insights into this species’ adaptation to the environment.


2020 ◽  
Author(s):  
Jonna Sofia Eriksson ◽  
Christine D. Bacon ◽  
Dominic J. Bennett ◽  
Bernard E. Pfeil ◽  
Bengt Oxelman ◽  
...  

Abstract Background: The great diversity in plant genome size and chromosome number is partly due to polyploidization (i.e., genome doubling events). The differences in genome size and chromosome number among diploid plant species can be a window into the intriguing phenomenon of past genome doubling that may be obscured through time by the process of diploidization. The genus Hibiscus L. (Malvaceae) has a wide diversity of chromosome numbers and a complex genomic history. Hibiscus is ideal for exploring past genomic events because although two ancient genome duplication events have been identified, more are likely to be found due to its diversity of chromosome numbers. To reappraise the history of whole genome duplication events, we tested a series of scenarios describing different polyploidization events.Results: Using target sequence capture, we generated 87 orthologous genes from four diploid species. We detected paralogues in >54% putative single-copy genes. 34 of these genes were selected for testing three different genome duplication scenarios using gene counting. Species of Hibiscus shared one genome duplication with H. syriacus and one whole genome duplication occurred along the branch leading to H. syriacus.Conclusions: Here, we corroborated the independent genome doubling previously found in the lineage leading to H. syriacus and a shared genome doubling of this lineage and the remainder of Hibiscus. Additionally, we found a previously undiscovered genome duplication shared by the /Pavonia and /Malvaviscus clades (both nested within Hibiscus) with the occurrences of two copies in what were otherwise single-copy genes. Our results highlight the complexity of genomic diversity in some plant groups, which makes orthology assessment and accurate phylogenomic inference difficult.


2020 ◽  
Author(s):  
Pavitra Ramdas ◽  
Vipin Bhardwaj ◽  
Aman Singh ◽  
Nagarjun Vijay ◽  
Ajit Chande

AbstractThe SERINC gene family comprises of five paralogs in humans of which SERINC3 and SERINC5 inhibit HIV-1 infectivity and are counteracted by Nef. The origin of this anti-retroviral activity, its prevalence among the remaining paralogs, and its ability to target retroviruses remain largely unknown. Here we show that despite their early divergence, the anti-retroviral activity is functionally conserved among four human SERINC paralogs with SERINC2 being an exception. The lack of activity in human SERINC2 is associated with its post-whole genome duplication (WGD) divergence, as evidenced by the ability of pre-WGD orthologs from yeast, fly, and a post-WGD-proximate SERINC2 from coelacanth to inhibit nef-defective HIV-1. Intriguingly, potent retroviral factors from HIV-1 and MLV are not able to relieve the SERINC2-mediated particle infectivity inhibition, indicating that such activity was directed towards other retroviruses that are found in coelacanth (like foamy viruses). However, foamy-derived vectors are intrinsically resistant to the action of SERINC2, and we show that a foamy virus envelope confers this resistance. Despite the presence of weak arms-race signatures, the functional reciprocal adaptation among SERINC2 and SERINC5 and, in response, the emergence of antagonizing ability in foamy virus appears to have resulted from a long-term conflict with the host.


PLoS ONE ◽  
2017 ◽  
Vol 12 (7) ◽  
pp. e0180936 ◽  
Author(s):  
Emilien Voldoire ◽  
Frédéric Brunet ◽  
Magali Naville ◽  
Jean-Nicolas Volff ◽  
Delphine Galiana

Plants ◽  
2021 ◽  
Vol 10 (1) ◽  
pp. 167
Author(s):  
Sara Sangi ◽  
Paula M. Araújo ◽  
Fernanda S. Coelho ◽  
Rajesh K. Gazara ◽  
Fabrício Almeida-Silva ◽  
...  

The COBRA-like (COBL) gene family has been associated with the regulation of cell wall expansion and cellulose deposition. COBL mutants result in reduced levels and disorganized deposition of cellulose causing defects in the cell wall and inhibiting plant development. In this study, we report the identification of 24 COBL genes (GmCOBL) in the soybean genome. Phylogenetic analysis revealed that the COBL proteins are divided into two groups, which differ by about 170 amino acids in the N-terminal region. The GmCOBL genes were heterogeneously distributed in 14 of the 20 soybean chromosomes. This study showed that segmental duplication has contributed significantly to the expansion of the COBL family in soybean during all Glycine-specific whole-genome duplication events. The expression profile revealed that the expression of the paralogous genes is highly variable between organs and tissues of the plant. Only 20% of the paralogous gene pairs showed similar expression patterns. The high expression levels of some GmCOBLs suggest they are likely essential for regulating cell expansion during the whole soybean life cycle. Our comprehensive overview of the COBL gene family in soybean provides useful information for further understanding the evolution and diversification of COBL genes in soybean.


Sign in / Sign up

Export Citation Format

Share Document