GenFam: A web application and database for gene family‐based classification and functional enrichment analysis

Plant Direct ◽  
2019 ◽  
Vol 3 (12) ◽  
Author(s):  
Renesh Bedre ◽  
Kranthi Mandadi
2021 ◽  
Vol 3 (4) ◽  
Author(s):  
Fotis A Baltoumas ◽  
Sofia Zafeiropoulou ◽  
Evangelos Karatzas ◽  
Savvas Paragkamian ◽  
Foteini Thanati ◽  
...  

Abstract Extracting and processing information from documents is of great importance as lots of experimental results and findings are stored in local files. Therefore, extracting and analyzing biomedical terms from such files in an automated way is absolutely necessary. In this article, we present OnTheFly2.0, a web application for extracting biomedical entities from individual files such as plain texts, office documents, PDF files or images. OnTheFly2.0 can generate informative summaries in popup windows containing knowledge related to the identified terms along with links to various databases. It uses the EXTRACT tagging service to perform named entity recognition (NER) for genes/proteins, chemical compounds, organisms, tissues, environments, diseases, phenotypes and gene ontology terms. Multiple files can be analyzed, whereas identified terms such as proteins or genes can be explored through functional enrichment analysis or be associated with diseases and PubMed entries. Finally, protein–protein and protein–chemical networks can be generated with the use of STRING and STITCH services. To demonstrate its capacity for knowledge discovery, we interrogated published meta-analyses of clinical biomarkers of severe COVID-19 and uncovered inflammatory and senescence pathways that impact disease pathogenesis. OnTheFly2.0 currently supports 197 species and is available at http://bib.fleming.gr:3838/OnTheFly/ and http://onthefly.pavlopouloslab.info.


2018 ◽  
Author(s):  
Renesh Bedre ◽  
Kranthi Mandadi

ABSTRACT Genome-scale studies using high-throughput sequencing (HTS) technologies generate substantial lists of differentially expressed genes under different experimental conditions. These gene lists need to be further mined to narrow down biologically relevant genes and associated functions in order to guide downstream functional genetic analyses. A popular approach is to determine statistically overrepresented genes in a user-defined list through enrichment analysis tools, which rely on functional annotations of genes based on Gene Ontology (GO) terms. Here, we propose a new approach, GenFam, which allows classification and enrichment of genes based on their gene family, thus simplifying identification of candidate gene families and associated genes that may be relevant to the query. GenFam and its integrated database comprise three hundred and eighty-four unique gene families and support gene family classification and enrichment analyses for sixty plant genomes. Four comparative case studies with plant species belonging to different clades and families were performed using GenFam, which demonstrated its robustness and comprehensiveness over preexisting functional enrichment tools. To make it readily accessible for plant biologists, GenFam is available as a web-based application where users can input gene IDs and export enrichment results in both tabular and graphical formats. Users can also customize analysis parameters by choosing from the various statistical enrichment tests and multiple testing correction methods. Additionally, the web-based application, source code and database are freely available to use and download. Website: http://mandadilab.webfactional.com/home/. Source code and database: http://mandadilab.webfactional.com/home/dload/.
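The abstract above mentions statistical enrichment tests and multiple testing correction without naming specific ones. As a rough illustration only, the sketch below assumes a one-sided hypergeometric test for over-representation of a gene family and a Benjamini-Hochberg adjustment; these are common choices for this kind of analysis, not necessarily the ones GenFam implements. The function names and the example counts are hypothetical.

```python
from math import comb


def hypergeom_sf(k, N, K, n):
    """Return P(X >= k) for X ~ Hypergeometric.

    N: genes in the background genome
    K: background genes belonging to the gene family
    n: genes in the user's query list
    k: query genes belonging to the gene family
    """
    upper = min(K, n)
    total = comb(N, n)
    # Sum the probability mass from k up to the largest possible overlap.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


if __name__ == "__main__":
    # Hypothetical counts: (query hits, family size) per gene family,
    # with a background of 30000 genes and a query list of 200 genes.
    families = {"family_A": (12, 150), "family_B": (2, 300), "family_C": (5, 80)}
    names = list(families)
    raw = [hypergeom_sf(families[f][0], 30000, families[f][1], 200) for f in names]
    for name, p, q in zip(names, raw, benjamini_hochberg(raw)):
        print(f"{name}\tp={p:.3g}\tadjusted={q:.3g}")
```

A family would typically be reported as enriched when its adjusted p-value falls below a chosen threshold such as 0.05; the threshold is likewise an assumption here.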


MicroRNA ◽  
2020 ◽  
Vol 9 (2) ◽  
pp. 153-166 ◽  
Author(s):  
Dimitrios E. Magouliotis ◽  
Vasiliki S. Tasiopoulou ◽  
Ioannis Baloyiannis ◽  
Ioannis Mamaloudis ◽  
George Tzovaras

Background: Rectal Cancer (RC) is a common type of cancer with poor prognosis. The identification of biomarkers for RC diagnosis, monitoring, and prognosis is crucial. Objectives: The purpose of the present study was to evaluate the differential expression of the Aquaporin (AQP) gene family network in RC, and the effect of Radiotherapy (RT) on their expression profile, to indicate novel biomarkers and prognostic factors. Methods: We used data mining techniques to construct the network of the AQP-associated genes to determine the Differentially Expressed Genes (DEGs) in RC and in irradiated as compared to non-irradiated RC patients. Furthermore, survival data of The Cancer Genome Atlas (TCGA) were analysed to assess the prognostic role of the DEGs, along with the functional enrichment of gene ontologies and miRNAs related to the DEGs in RC. Results: Microarray data of one PubMed GEO dataset were extracted, incorporating 22 RC and 20 normal rectal tissue samples. Eight DEGs were reported. Four DEGs were up-regulated and four were down-regulated in RC. Correlations were identified among the DEGs. Deming regression analysis was performed in order to derive the equations describing these correlations. One gene (Aquaporin 3) was down-regulated in irradiated RC samples compared with non-irradiated samples. The most significantly affected biological pathways and miRNAs were identified by functional enrichment analysis. Conclusion: The present study demonstrates an eight-gene molecular panel that could serve as biomarkers for RC patients, which are potential targets of five miRNA families. Finally, our results highlight the effect of radiotherapy on AQPs and the associated pathways in RC.
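The abstract above refers to Deming regression, which fits a line when both variables carry measurement error. A minimal sketch of the standard closed-form estimator follows, assuming an error-variance ratio `delta` of 1 (orthogonal regression); the abstract does not state which ratio or software the authors used, and the function name and sample values are hypothetical.

```python
from math import sqrt


def deming_regression(xs, ys, delta=1.0):
    """Fit y = intercept + slope * x by Deming regression.

    delta is the assumed ratio of the error variance in y to the error
    variance in x; delta = 1.0 gives orthogonal regression.
    Returns (slope, intercept).
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("xs and ys must have equal length of at least 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs) / (n - 1)
    syy = sum((y - mean_y) ** 2 for y in ys) / (n - 1)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    if sxy == 0:
        raise ValueError("zero covariance: slope is undefined")
    diff = syy - delta * sxx
    slope = (diff + sqrt(diff ** 2 + 4 * delta * sxy ** 2)) / (2 * sxy)
    intercept = mean_y - slope * mean_x
    return slope, intercept


if __name__ == "__main__":
    # Hypothetical expression values for two genes across six samples.
    gene_a = [1.0, 2.1, 2.9, 4.2, 5.0, 6.1]
    gene_b = [2.9, 5.2, 6.8, 9.1, 11.0, 13.2]
    slope, intercept = deming_regression(gene_a, gene_b)
    print(f"gene_b = {intercept:.3f} + {slope:.3f} * gene_a")
```

Unlike ordinary least squares, the fitted line does not depend on which gene is treated as the predictor when `delta` is set consistently, which is why the method suits pairwise expression correlations.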


2021 ◽  
Author(s):  
Fotis A Baltoumas ◽  
Sofia Zafeiropoulou ◽  
Evangelos Karatzas ◽  
Savvas Paragkamian ◽  
Foteini Thanati ◽  
...  

Extracting and processing information from documents is of great importance as lots of experimental results and findings are stored in local files. Therefore, extracting and analysing biomedical terms from such files in an automated way is absolutely necessary. In this article, we present OnTheFly2.0, a web application for extracting biomedical entities from individual files such as plain texts, Office documents, PDF files or images. OnTheFly2.0 can generate informative summaries in popup windows containing knowledge related to the identified terms along with links to various databases. It uses the EXTRACT tagging service to perform Named Entity Recognition (NER) for genes/proteins, chemical compounds, organisms, tissues, environments, diseases, phenotypes and Gene Ontology terms. Multiple files can be analysed, whereas identified terms such as proteins or genes can be explored through functional enrichment analysis or be associated with diseases and PubMed entries. Finally, protein-protein and protein-chemical networks can be generated with the use of STRING and STITCH services. To demonstrate its capacity for knowledge discovery, we interrogated published meta-analyses of clinical biomarkers of severe COVID-19 and uncovered inflammatory and senescence pathways that impact disease pathogenesis. OnTheFly2.0 currently supports 197 species and is available at http://onthefly.pavlopouloslab.info.


2019 ◽  
Vol 14 (7) ◽  
pp. 591-601 ◽  
Author(s):  
Aravind K. Konda ◽  
Parasappa R. Sabale ◽  
Khela R. Soren ◽  
Shanmugavadivel P. Subramaniam ◽  
Pallavi Singh ◽  
...  

Background: Chickpea is a nutritionally rich premier pulse crop, but its production encounters setbacks due to various stresses, and understanding the underlying molecular mechanisms is of foremost importance. Objective: The investigation was carried out to identify the differentially expressed WRKY TFs in chickpea in response to herbicide stress and decipher their interacting partners. Methods: For this purpose, transcriptome-wide identification of WRKY TFs in chickpea was done. Behavior of the differentially expressed TFs was compared across other stress conditions. Orthology-based co-functional gene networks were derived from Arabidopsis. Gene ontology and functional enrichment analyses were performed using Blast2GO and STRING software. A Gene Coexpression Network (GCN) was constructed in chickpea using publicly available transcriptome data. The expression pattern of the identified gene network was studied in chickpea-Fusarium interactions. Results: A unique WRKY TF (Ca_08086) was found to be significantly (q value = 0.02) upregulated not only under herbicide stress but also in other stresses. A co-functional network of 14 genes, namely Ca_08086, Ca_19657, Ca_01317, Ca_20172, Ca_12226, Ca_15326, Ca_04218, Ca_07256, Ca_14620, Ca_12474, Ca_11595, Ca_15291, Ca_11762 and Ca_03543, was identified. The GCN revealed 95 hub genes based on the significant probability scores. Functional annotation indicated a role in callose deposition and response to chitin. Interestingly, contrasting expression patterns of the 14 network genes were observed in wilt-resistant and susceptible chickpea genotypes infected with Fusarium. Conclusion: This is the first report of identification of a multi-stress responsive WRKY TF and its associated GCN in chickpea.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Zhenyang Liao ◽  
Xunxiao Zhang ◽  
Shengcheng Zhang ◽  
Zhicong Lin ◽  
Xingtan Zhang ◽  
...  

Abstract Background Structural variations (SVs) are a type of mutation that has not been widely detected in plant genomes, and studies in animals have shown their role in the process of domestication. An in-depth study of SVs will help us to further understand the impact of SVs on the phenotype and environmental adaptability during papaya domestication and provide genomic resources for the development of molecular markers. Results We detected a total of 8083 SVs, including 5260 deletions, 552 tandem duplications and 2271 insertions, with deletions being the predominant type, indicating the universality of deletion in the evolution of the papaya genome. The distribution of these SVs is non-random in each chromosome. A total of 1794 genes overlap with SVs, of which 1350 genes are expressed in at least one tissue. The weighted correlation network analysis (WGCNA) of these expressed genes reveals co-expression relationships between SVs-genes and different tissues, and functional enrichment analysis shows their role in biological growth and environmental responses. We also identified some domesticated SVs genes related to environmental adaptability, sexual reproduction, and important agronomic traits during the domestication of papaya. Analysis of artificially selected copy number variant genes (CNV-genes) also revealed genes associated with plant growth and environmental stress. Conclusions SVs played an indispensable role in the process of papaya domestication, especially in the reproduction traits of hermaphrodite plants. The detection of genome-wide SVs and CNV-genes between cultivated gynodioecious populations and wild dioecious populations provides a reference for further understanding of the evolution process from male to hermaphrodite in papaya.


Open Medicine ◽  
2020 ◽  
Vol 15 (1) ◽  
pp. 672-688
Author(s):  
Yanbo Dong ◽  
Siyu Lu ◽  
Zhenxiao Wang ◽  
Liangfa Liu

Abstract The chaperonin-containing T-complex protein 1 (CCT) subunits participate in diverse diseases. However, little is known about their expression and prognostic values in human head and neck squamous cancer (HNSC). This article aims to evaluate the effects of CCT subunits regarding their prognostic values for HNSC. We mined the transcriptional and survival data of CCTs in HNSC patients from online databases. A protein–protein interaction network was constructed and a functional enrichment analysis of target genes was performed. We observed that the mRNA expression levels of CCT1/2/3/4/5/6/7/8 were higher in HNSC tissues than in normal tissues. Survival analysis revealed that the high mRNA transcriptional levels of CCT3/4/5/6/7/8 were associated with a low overall survival. The expression levels of CCT4/7 were correlated with advanced tumor stage, and the overexpression of CCT4 was associated with a higher N stage of patients. Validation of CCTs’ differential expression and prognostic values was achieved by the Human Protein Atlas and GEO datasets. Mechanistic exploration of CCT subunits by the functional enrichment analysis suggests that these genes may influence the HNSC prognosis by regulating PI3K-Akt and other pathways. This study implies that CCT3/4/6/7/8 are promising biomarkers for the prognosis of HNSC.


2021 ◽  
Vol 28 (1) ◽  
pp. 20-33
Author(s):  
Lydia-Eirini Giannakou ◽  
Athanasios-Stefanos Giannopoulos ◽  
Chrissi Hatzoglou ◽  
Konstantinos I. Gourgoulianis ◽  
Erasmia Rouka ◽  
...  

Haemophilus influenzae (Hi), Moraxella catarrhalis (MorCa) and Pseudomonas aeruginosa (Psa) are three of the most common gram-negative bacteria responsible for human respiratory diseases. In this study, we aimed to identify, using the functional enrichment analysis (FEA), the human gene interaction network with the aforementioned bacteria in order to elucidate the full spectrum of induced pathogenicity. The Human Pathogen Interaction Database (HPIDB 3.0) was used to identify the human proteins that interact with the three pathogens. FEA was performed via the ToppFun tool of the ToppGene Suite and the GeneCodis database so as to identify enriched gene ontologies (GO) of biological processes (BP), cellular components (CC) and diseases. In total, 11 human proteins were found to interact with the bacterial pathogens. FEA of BP GOs revealed associations with mitochondrial membrane permeability relative to apoptotic pathways. FEA of CC GOs revealed associations with focal adhesion, cell junctions and exosomes. The most significantly enriched annotations in diseases and pathways were lung adenocarcinoma and cell cycle, respectively. Our results suggest that the Hi, MorCa and Psa pathogens could be related to the pathogenesis and/or progression of lung adenocarcinoma via the targeting of the epithelial cellular junctions and the subsequent deregulation of the cell adhesion and apoptotic pathways. These hypotheses should be experimentally validated.


AMB Express ◽  
2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Zhiyong Liu ◽  
Kai Dang ◽  
Cunzhi Li ◽  
Junhong Gao ◽  
Hong Wang ◽  
...  

Abstract Hexanitrohexaazaisowurtzitane (CL-20) is a compound with a polycyclic cage and an N-nitro group that has been shown to play an unfavorable role in environmental fate, biosafety, and physical health. The aim of this study was to isolate the microbial community and to identify a single microbial strain that can degrade CL-20 with desirable efficiency. Metagenomic sequencing methods were performed to investigate the dynamic changes in the composition of the community diversity. The most varied genus among the microbial community was Pseudomonas, which increased from 1.46% to 44.63% during the period of incubation (MC0–MC4). Furthermore, the new strain was isolated and identified from the activated sludge by bacterial morphological and 16S rRNA sequencing analyses. The CL-20 concentrations decreased by 75.21 μg/mL and 74.02 μg/mL in 48 h by MC4 and Pseudomonas sp. ZyL-01, respectively. Moreover, ZyL-01 could decompose 98% of the CL-20 in the real effluent in 14 days of incubation with glucose as the carbon source. Finally, a draft genome sequence was obtained to predict possible degrading enzymes involved in the biodegradation of CL-20. Specifically, 330 genes that are involved in energy production and conversion were annotated by Gene Ontology functional enrichment analysis, and some of these candidates may encode enzymes that are responsible for CL-20 degradation. In summary, our studies indicate that microbes might be a valuable biological resource for the treatment of environmental contamination caused by CL-20 and that Pseudomonas sp. ZyL-01 might be a promising candidate for eradicating CL-20 to achieve a more biosafe environment and improve public health.

