scholarly journals GenFam: A web application and database for gene family-based classification and functional enrichment analysis

2018 ◽  
Author(s):  
Renesh Bedre ◽  
Kranthi Mandadi

ABSTRACTGenome-scale studies using high-throughput sequencing (HTS) technologies generate substantial lists of differentially expressed genes under different experimental conditions. These gene lists need to be further mined to narrow down biologically relevant genes and associated functions in order to guide downstream functional genetic analyses. A popular approach is to determine statistically overrepresented genes in a user-defined list through enrichment analysis tools, which rely on functional annotations of genes based on Gene Ontology (GO) terms. Here, we propose a new approach, GenFam, which allows classification and enrichment of genes based on their gene family, thus simplifying identification of candidate gene families and associated genes that may be relevant to the query. GenFam and its integrated database comprises of three-hundred and eighty-four unique gene families and supports gene family classification and enrichment analyses for sixty plant genomes. Four comparative case studies with plant species belonging to different clades and families were performed using GenFam which demonstrated its robustness and comprehensiveness over preexisting functional enrichment tools. To make it readily accessible for plant biologists, GenFam is available as a web-based application where users can input gene IDs and export enrichment results in both tabular and graphical formats. Users can also customize analysis parameters by choosing from the various statistical enrichment tests and multiple testing correction methods. Additionally, the web-based application, source code and database are freely available to use and download. Website: http://mandadilab.webfactional.com/home/. Source code and database: http://mandadilab.webfactional.com/home/dload/.

2017 ◽  
Author(s):  
Zhenqing Ye ◽  
Tao Ma ◽  
Michael T. Kalmbach ◽  
Surendra Dasari ◽  
Jean-Pierre A. Kocher ◽  
...  

AbstractBackgroundThe sequence logo has been widely used to represent DNA or RNA motifs for more than three decades. Despite its intelligibility and intuitiveness, the traditional sequence logo is unable to display the intra-motif dependencies and therefore is insufficient to fully characterize nucleotide motifs. Many methods have been developed to quantify the intra-motif dependencies, but fewer tools are available for visualization.ResultWe developed CircularLogo, a web-based interactive application, which is able to not only visualize the position-specific nucleotide consensus and diversity but also display the intra-motif dependencies. Applying CircularLogo to HNF6 binding sites and tRNA sequences demonstrated its ability to show intra-motif dependencies and intuitively reveal biomolecular structure. CircularLogo is implemented in JavaScript and Python based on the Django web framework. The program’s source code and user’s manual are freely available at http://circularlogo.sourceforge.net. CircularLogo web server can be accessed from http://bioinformaticstools.mayo.edu/circularlogo/index.html.ConclusionCircularLogo is an innovative web application that is specifically designed to visualize and interactively explore intra-motif dependencies.


2019 ◽  
Author(s):  
Wenlong Jia ◽  
Hechen Li ◽  
Shiying Li ◽  
Shuaicheng Li

ABSTRACTSummaryVisualizing integrated-level data from genomic research remains a challenge, as it requires sufficient coding skills and experience. Here, we present LandScapeoviz, a web-based application for interactive and real-time visualization of summarized genetic information. LandScape utilizes a well-designed file format that is capable of handling various data types, and offers a series of built-in functions to customize the appearance, explore results, and export high-quality diagrams that are available for publication.Availability and implementationLandScape is deployed at bio.oviz.org/demo-project/analyses/landscape for online use. Documentation and demo data are freely available on this website and GitHub (github.com/Nobel-Justin/Oviz-Bio-demo)[email protected]


2021 ◽  
Vol 3 (4) ◽  
Author(s):  
Fotis A Baltoumas ◽  
Sofia Zafeiropoulou ◽  
Evangelos Karatzas ◽  
Savvas Paragkamian ◽  
Foteini Thanati ◽  
...  

Abstract Extracting and processing information from documents is of great importance as lots of experimental results and findings are stored in local files. Therefore, extracting and analyzing biomedical terms from such files in an automated way is absolutely necessary. In this article, we present OnTheFly2.0, a web application for extracting biomedical entities from individual files such as plain texts, office documents, PDF files or images. OnTheFly2.0 can generate informative summaries in popup windows containing knowledge related to the identified terms along with links to various databases. It uses the EXTRACT tagging service to perform named entity recognition (NER) for genes/proteins, chemical compounds, organisms, tissues, environments, diseases, phenotypes and gene ontology terms. Multiple files can be analyzed, whereas identified terms such as proteins or genes can be explored through functional enrichment analysis or be associated with diseases and PubMed entries. Finally, protein–protein and protein–chemical networks can be generated with the use of STRING and STITCH services. To demonstrate its capacity for knowledge discovery, we interrogated published meta-analyses of clinical biomarkers of severe COVID-19 and uncovered inflammatory and senescence pathways that impact disease pathogenesis. OnTheFly2.0 currently supports 197 species and is available at http://bib.fleming.gr:3838/OnTheFly/ and http://onthefly.pavlopouloslab.info.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8831 ◽  
Author(s):  
Xiaojiao Guan ◽  
Yao Yao ◽  
Guangyao Bao ◽  
Yue Wang ◽  
Aimeng Zhang ◽  
...  

Esophageal cancer is a common malignant tumor in the world, and the aim of this study was to screen key genes related to the development of esophageal cancer using a variety of bioinformatics analysis tools and analyze their biological functions. The data of esophageal squamous cell carcinoma from the Gene Expression Omnibus (GEO) were selected as the research object, processed and analyzed to screen differentially expressed microRNAs (miRNAs) and differential methylation genes. The competing endogenous RNAs (ceRNAs) interaction network of differentially expressed genes was constructed by bioinformatics tools DAVID, String, and Cytoscape. Biofunctional enrichment analysis was performed using Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG). The expression of the screened genes and the survival of the patients were verified. By analyzing GSE59973 and GSE114110, we found three down-regulated and nine up-regulated miRNAs. The gene expression matrix of GSE120356 was calculated by Pearson correlation coefficient, and the 11696 pairs of ceRNA relation were determined. In the ceRNA network, 643 lncRNAs and 147 mRNAs showed methylation difference. Functional enrichment analysis showed that these differentially expressed genes were mainly concentrated in the FoxO signaling pathway and were involved in the corresponding cascade of calcineurin. By analyzing the clinical data in The Cancer Genome Atlas (TCGA) database, it was found that four lncRNAs had an important impact on the survival and prognosis of esophageal carcinoma patients. QRT-PCR was also conducted to identify the expression of the key lncRNAs (RNF217-AS1, HCP5, ZFPM2-AS1 and HCG22) in ESCC samples. The selected key genes can provide theoretical guidance for further research on the molecular mechanism of esophageal carcinoma and the screening of molecular markers.


F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 709 ◽  
Author(s):  
Liis Kolberg ◽  
Uku Raudvere ◽  
Ivan Kuzmin ◽  
Jaak Vilo ◽  
Hedi Peterson

g:Profiler (https://biit.cs.ut.ee/gprofiler) is a widely used gene list functional profiling and namespace conversion toolset that has been contributing to reproducible biological data analysis already since 2007. Here we introduce the accompanying R package, gprofiler2, developed to facilitate programmatic access to g:Profiler computations and databases via REST API. The gprofiler2 package provides an easy-to-use functionality that enables researchers to incorporate functional enrichment analysis into automated analysis pipelines written in R. The package also implements interactive visualisation methods to help to interpret the enrichment results and to illustrate them for publications. In addition, gprofiler2 gives access to the versatile gene/protein identifier conversion functionality in g:Profiler enabling to map between hundreds of different identifier types or orthologous species. The gprofiler2 package is freely available at the CRAN repository.


2020 ◽  
Author(s):  
Kumari Sonal Choudhary ◽  
Eoin Fahy ◽  
Kevin Coakley ◽  
Manish Sud ◽  
Mano R Maurya ◽  
...  

ABSTRACTWith the advent of high throughput mass spectrometric methods, metabolomics has emerged as an essential area of research in biomedicine with the potential to provide deep biological insights into normal and diseased functions in physiology. However, to achieve the potential offered by metabolomics measures, there is a need for biologist-friendly integrative analysis tools that can transform data into mechanisms that relate to phenotypes. Here, we describe MetENP, an R package, and a user-friendly web application deployed at the Metabolomics Workbench site extending the metabolomics enrichment analysis to include species-specific pathway analysis, pathway enrichment scores, gene-enzyme information, and enzymatic activities of the significantly altered metabolites. MetENP provides a highly customizable workflow through various user-specified options and includes support for all metabolite species with available KEGG pathways. MetENPweb is a web application for calculating metabolite and pathway enrichment analysis.Availability and ImplementationThe MetENP package is freely available from Metabolomics Workbench GitHub: (https://github.com/metabolomicsworkbench/MetENP), the web application, is freely available at (https://www.metabolomicsworkbench.org/data/analyze.php)


2019 ◽  
Vol 35 (21) ◽  
pp. 4462-4464
Author(s):  
Jordan H Creed ◽  
Garrick Aden-Buie ◽  
Alvaro N Monteiro ◽  
Travis A Gerke

Abstract Summary Complementary advances in genomic technology and public data resources have created opportunities for researchers to conduct multifaceted examination of the genome on a large scale. To meet the need for integrative genome wide exploration, we present epiTAD. This web-based tool enables researchers to compare genomic 3D organization and annotations across multiple databases in an interactive manner to facilitate in silico discovery. Availability and implementation epiTAD can be accessed at https://apps.gerkelab.com/epiTAD/ where we have additionally made publicly available the source code and a Docker containerized version of the application.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11333
Author(s):  
Daniyar Karabayev ◽  
Askhat Molkenov ◽  
Kaiyrgali Yerulanuly ◽  
Ilyas Kabimoldayev ◽  
Asset Daniyarov ◽  
...  

Background High-throughput sequencing platforms generate a massive amount of high-dimensional genomic datasets that are available for analysis. Modern and user-friendly bioinformatics tools for analysis and interpretation of genomics data becomes essential during the analysis of sequencing data. Different standard data types and file formats have been developed to store and analyze sequence and genomics data. Variant Call Format (VCF) is the most widespread genomics file type and standard format containing genomic information and variants of sequenced samples. Results Existing tools for processing VCF files don’t usually have an intuitive graphical interface, but instead have just a command-line interface that may be challenging to use for the broader biomedical community interested in genomics data analysis. re-Searcher solves this problem by pre-processing VCF files by chunks to not load RAM of computer. The tool can be used as standalone user-friendly multiplatform GUI application as well as web application (https://nla-lbsb.nu.edu.kz). The software including source code as well as tested VCF files and additional information are publicly available on the GitHub repository (https://github.com/LabBandSB/re-Searcher).


2016 ◽  
Author(s):  
Richard Bruskiewich ◽  
Kenneth Huellas-Bruskiewicz ◽  
Farzin Ahmed ◽  
Rajaram Kaliyaperumal ◽  
Mark Thompson ◽  
...  

AbstractKnowledge.Bio is a web platform that enhances access and interpretation of knowledge networks extracted from biomedical research literature. The interaction is mediated through a collaborative graphical user interface for building and evaluating maps of concepts and their relationships, alongside associated evidence. In the first release of this platform, conceptual relations are drawn from the Semantic Medline Database and the Implicitome, two compleme ntary resources derived from text mining of PubMed abstracts.Availability— Knowledge.Bio is hosted at http://knowledge.bio/ and the open source code is available at http://bitbucket.org/sulab/kb1/.Contact— [email protected]; [email protected]


Sign in / Sign up

Export Citation Format

Share Document