Semantic Publication of Agricultural Scientific Literature Using Property Graphs

Francisco Abad-Navarro; José Antonio Bernabé-Diaz; Alexander García-Castro; Jesualdo Tomás Fernandez-Breis

doi:10.3390/app10030861

Semantic Publication of Agricultural Scientific Literature Using Property Graphs

Applied Sciences ◽

10.3390/app10030861 ◽

2020 ◽

Vol 10 (3) ◽

pp. 861

Author(s):

Francisco Abad-Navarro ◽

José Antonio Bernabé-Diaz ◽

Alexander García-Castro ◽

Jesualdo Tomás Fernandez-Breis

Keyword(s):

Language Processing ◽

Domain Knowledge ◽

Scientific Literature ◽

Semantic Annotation ◽

Knowledge Graph ◽

Processing Technologies ◽

Extra Effort ◽

Literature Searches ◽

Rdf Data ◽

Knowledge Graphs

During the last decades, there have been significant changes in science that have provoked a big increase in the number of articles published every year. This increment implies a new difficulty for scientists, who have to do an extra effort for selecting literature relevant for their activity. In this work, we present a pipeline for the generation of scientific literature knowledge graphs in the agriculture domain. The pipeline combines Semantic Web and natural language processing technologies, which make data understandable by computer agents, empowering the development of final user applications for literature searches. This workflow consists of (1) RDF generation, including metadata and contents; (2) semantic annotation of the content; and (3) property graph population by adding domain knowledge from ontologies, in addition to the previously generated RDF data describing the articles. This pipeline was applied to a set of 127 agriculture articles, generating a knowledge graph implemented in Neo4j, publicly available on Docker. The potential of our model is illustrated through a series of queries and use cases, which not only include queries about authors or references but also deal with article similarity or clustering based on semantic annotation, which is facilitated by the inclusion of domain ontologies in the graph.

Download Full-text

Conversation Concepts: Understanding Topics and Building Taxonomies for Financial Services

Information ◽

10.3390/info12040160 ◽

2021 ◽

Vol 12 (4) ◽

pp. 160

Author(s):

John P. McCrae ◽

Pranab Mohanty ◽

Siddharth Narayanan ◽

Bianca Pereira ◽

Paul Buitelaar ◽

...

Keyword(s):

Language Processing ◽

Financial Services ◽

State Of The Art ◽

Knowledge Graph ◽

Financial Industry ◽

High Quality ◽

Processing Technologies ◽

Knowledge Graphs ◽

Graph System ◽

New Applications

Knowledge graphs are proving to be an increasingly important part of modern enterprises, and new applications of such enterprise knowledge graphs are still being found. In this paper, we report on the experience with the use of an automatic knowledge graph system called Saffron in the context of a large financial enterprise and show how this has found applications within this enterprise as part of the “Conversation Concepts Artificial Intelligence” tool. In particular, we analyse the use cases for knowledge graphs within this enterprise, and this led us to a new extension to the knowledge graph system. We present the results of these adaptations, including the introduction of a semi-supervised taxonomy extraction system, which includes analysts in-the-loop. Further, we extend the kinds of relations extracted by the system and show how the use of the BERTand ELMomodels can produce high-quality results. Thus, we show how this tool can help realize a smart enterprise and how requirements in the financial industry can be realised by state-of-the-art natural language processing technologies.

Download Full-text

Development of Knowledge Graph for Data Management Related to Flooding Disasters Using Open Data

Future Internet ◽

10.3390/fi13050124 ◽

2021 ◽

Vol 13 (5) ◽

pp. 124

Author(s):

Jiseong Son ◽

Chul-Su Lim ◽

Hyoung-Seop Shim ◽

Ji-Sun Kang

Keyword(s):

Artificial Intelligence ◽

Domain Knowledge ◽

Open Data ◽

Heterogeneous Data ◽

Big Data Analysis ◽

Knowledge Graph ◽

Cross Domain ◽

Disaster Data ◽

Knowledge Graphs ◽

Open Datasets

Despite the development of various technologies and systems using artificial intelligence (AI) to solve problems related to disasters, difficult challenges are still being encountered. Data are the foundation to solving diverse disaster problems using AI, big data analysis, and so on. Therefore, we must focus on these various data. Disaster data depend on the domain by disaster type and include heterogeneous data and lack interoperability. In particular, in the case of open data related to disasters, there are several issues, where the source and format of data are different because various data are collected by different organizations. Moreover, the vocabularies used for each domain are inconsistent. This study proposes a knowledge graph to resolve the heterogeneity among various disaster data and provide interoperability among domains. Among disaster domains, we describe the knowledge graph for flooding disasters using Korean open datasets and cross-domain knowledge graphs. Furthermore, the proposed knowledge graph is used to assist, solve, and manage disaster problems.

Download Full-text

BUILD KNOWLEDGE GRAPH FROM HETEROGENEOUS DOCUMENTS

Journal of Science and Technology - IUH ◽

10.46242/jst-iuh.v47i05.761 ◽

2021 ◽

Vol 47 (05) ◽

Author(s):

NGUYỄN CHÍ HIẾU

Keyword(s):

Information Retrieval ◽

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Question Answering ◽

Semantic Analysis ◽

Knowledge Graph ◽

Question Answering Systems ◽

Knowledge Graphs

Knowledge Graphs are applied in many fields such as search engines, semantic analysis, and question answering in recent years. However, there are many obstacles for building knowledge graphs as methodologies, data and tools. This paper introduces a novel methodology to build knowledge graph from heterogeneous documents. We use the methodologies of Natural Language Processing and deep learning to build this graph. The knowledge graph can use in Question answering systems and Information retrieval especially in Computing domain

Download Full-text

Virtual Knowledge Graphs: An Overview of Systems and Use Cases

Data Intelligence ◽

10.1162/dint_a_00011 ◽

2019 ◽

Vol 1 (3) ◽

pp. 201-223 ◽

Cited By ~ 17

Author(s):

Guohui Xiao ◽

Linfang Ding ◽

Benjamin Cogrel ◽

Diego Calvanese

Keyword(s):

Data Integration ◽

Domain Knowledge ◽

Data Access ◽

Use Cases ◽

Future Research ◽

Knowledge Graph ◽

Research Directions ◽

Wide Range ◽

Future Research Directions ◽

Knowledge Graphs

In this paper, we present the virtual knowledge graph (VKG) paradigm for data integration and access, also known in the literature as Ontology-based Data Access. Instead of structuring the integration layer as a collection of relational tables, the VKG paradigm replaces the rigid structure of tables with the flexibility of graphs that are kept virtual and embed domain knowledge. We explain the main notions of this paradigm, its tooling ecosystem and significant use cases in a wide range of applications. Finally, we discuss future research directions.

Download Full-text

Predictive article recommendation using natural language processing and machine learning to support evidence updates in domain-specific knowledge graphs

JAMIA Open ◽

10.1093/jamiaopen/ooaa028 ◽

2020 ◽

Vol 3 (3) ◽

pp. 332-337

Author(s):

Bhuvan Sharma ◽

Van C Willis ◽

Claudia S Huettner ◽

Kirk Beaty ◽

Jane L Snowdon ◽

...

Keyword(s):

Machine Learning ◽

Language Processing ◽

Named Entity Recognition ◽

Entity Recognition ◽

Human Cognition ◽

Current Evidence ◽

Knowledge Graph ◽

Space Modeling ◽

Domain Specific Knowledge ◽

Knowledge Graphs

Abstract Objectives Describe an augmented intelligence approach to facilitate the update of evidence for associations in knowledge graphs. Methods New publications are filtered through multiple machine learning study classifiers, and filtered publications are combined with articles already included as evidence in the knowledge graph. The corpus is then subjected to named entity recognition, semantic dictionary mapping, term vector space modeling, pairwise similarity, and focal entity match to identify highly related publications. Subject matter experts review recommended articles to assess inclusion in the knowledge graph; discrepancies are resolved by consensus. Results Study classifiers achieved F-scores from 0.88 to 0.94, and similarity thresholds for each study type were determined by experimentation. Our approach reduces human literature review load by 99%, and over the past 12 months, 41% of recommendations were accepted to update the knowledge graph. Conclusion Integrated search and recommendation exploiting current evidence in a knowledge graph is useful for reducing human cognition load.

Download Full-text

A Survey on Knowledge Graph Embeddings for Link Prediction

Symmetry ◽

10.3390/sym13030485 ◽

2021 ◽

Vol 13 (3) ◽

pp. 485

Author(s):

Meihong Wang ◽

Linling Qiu ◽

Xiaoli Wang

Keyword(s):

Language Processing ◽

Link Prediction ◽

Knowledge Graph ◽

Practical Utilization ◽

Complete Knowledge ◽

Benchmark Datasets ◽

Comprehensive Survey ◽

Knowledge Graphs ◽

Representation Techniques ◽

Open Nature

Knowledge graphs (KGs) have been widely used in the field of artificial intelligence, such as in information retrieval, natural language processing, recommendation systems, etc. However, the open nature of KGs often implies that they are incomplete, having self-defects. This creates the need to build a more complete knowledge graph for enhancing the practical utilization of KGs. Link prediction is a fundamental task in knowledge graph completion that utilizes existing relations to infer new relations so as to build a more complete knowledge graph. Numerous methods have been proposed to perform the link-prediction task based on various representation techniques. Among them, KG-embedding models have significantly advanced the state of the art in the past few years. In this paper, we provide a comprehensive survey on KG-embedding models for link prediction in knowledge graphs. We first provide a theoretical analysis and comparison of existing methods proposed to date for generating KG embedding. Then, we investigate several representative models that are classified into five categories. Finally, we conducted experiments on two benchmark datasets to report comprehensive findings and provide some new insights into the strengths and weaknesses of existing models.

Download Full-text

Discovering alignment relations with Graph Convolutional Networks: A biomedical case study

Semantic Web ◽

10.3233/sw-210452 ◽

2021 ◽

pp. 1-20

Author(s):

Pierre Monnin ◽

Chedy Raïssi ◽

Amedeo Napoli ◽

Adrien Coulet

Keyword(s):

Domain Knowledge ◽

Knowledge Graph ◽

Convolutional Networks ◽

Real World Application ◽

Web Of Data ◽

Knowledge Graphs ◽

The Web ◽

Node Embeddings ◽

Weak Similarity

Knowledge graphs are freely aggregated, published, and edited in the Web of data, and thus may overlap. Hence, a key task resides in aligning (or matching) their content. This task encompasses the identification, within an aggregated knowledge graph, of nodes that are equivalent, more specific, or weakly related. In this article, we propose to match nodes within a knowledge graph by (i) learning node embeddings with Graph Convolutional Networks such that similar nodes have low distances in the embedding space, and (ii) clustering nodes based on their embeddings, in order to suggest alignment relations between nodes of a same cluster. We conducted experiments with this approach on the real world application of aligning knowledge in the field of pharmacogenomics, which motivated our study. We particularly investigated the interplay between domain knowledge and GCN models with the two following focuses. First, we applied inference rules associated with domain knowledge, independently or combined, before learning node embeddings, and we measured the improvements in matching results. Second, while our GCN model is agnostic to the exact alignment relations (e.g., equivalence, weak similarity), we observed that distances in the embedding space are coherent with the “strength” of these different relations (e.g., smaller distances for equivalences), letting us considering clustering and distances in the embedding space as a means to suggest alignment relations in our case study.

Download Full-text

Phenoscape: Semantic analysis of organismal traits and genes yields insights in evolutionary biology

10.7287/peerj.preprints.26988v1 ◽

2018 ◽

Cited By ~ 1

Author(s):

Paula M Mabee ◽

Wasila M Dahdul ◽

James P Balhoff ◽

Hilmar Lapp ◽

Prashanti Manda ◽

...

Keyword(s):

Language Processing ◽

Evolutionary Biology ◽

Domain Knowledge ◽

Large Scale ◽

Semantic Analysis ◽

Comparative Anatomy ◽

Semantic Annotation ◽

Model Organisms ◽

Free Text ◽

Mutant Phenotypes

The study of how the observable features of organisms, i.e., their phenotypes, result from the complex interplay between genetics, development, and the environment, is central to much research in biology. The varied language used in the description of phenotypes, however, impedes the large scale and interdisciplinary analysis of phenotypes by computational methods. The Phenoscape project (www.phenoscape.org) has developed semantic annotation tools and a gene–phenotype knowledgebase, the Phenoscape KB, that uses machine reasoning to connect evolutionary phenotypes from the comparative literature to mutant phenotypes from model organisms. The semantically annotated data enables the linking of novel species phenotypes with candidate genes that may underlie them. Semantic annotation of evolutionary phenotypes further enables previously difficult or novel analyses of comparative anatomy and evolution. These include generating large, synthetic character matrices of presence/absence phenotypes based on inference, and searching for taxa and genes with similar variation profiles using semantic similarity. Phenoscape is further extending these tools to enable users to automatically generate synthetic supermatrices for diverse character types, and use the domain knowledge encoded in ontologies for evolutionary trait analysis. Curating the annotated phenotypes necessary for this research requires significant human curator effort, although semi-automated natural language processing tools promise to expedite the curation of free text. As semantic tools and methods are developed for the biodiversity sciences, new insights from the increasingly connected stores of interoperable phenotypic and genetic data are anticipated.

Download Full-text

Automatically Semantic Annotation of Network Document Based on Domain Knowledge Graph

2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC) ◽

10.1109/ispa/iucc.2017.00111 ◽

2017 ◽

Cited By ~ 3

Author(s):

Yuezhong Wu ◽

Zhihong Wang ◽

Shuhong Chen ◽

Guojun Wang ◽

Changyun Li

Keyword(s):

Domain Knowledge ◽

Semantic Annotation ◽

Knowledge Graph

Download Full-text

Few-Shot Knowledge Graph Completion

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i03.5698 ◽

2020 ◽

Vol 34 (03) ◽

pp. 3041-3048 ◽

Cited By ~ 2

Author(s):

Chuxu Zhang ◽

Huaxiu Yao ◽

Chao Huang ◽

Meng Jiang ◽

Zhenhui Li ◽

...

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

State Of The Art ◽

Knowledge Graph ◽

Graph Structure ◽

Reference Set ◽

Knowledge Graphs ◽

Public Datasets ◽

Relation Learning

Knowledge graphs (KGs) serve as useful resources for various natural language processing applications. Previous KG completion approaches require a large number of training instances (i.e., head-tail entity pairs) for every relation. The real case is that for most of the relations, very few entity pairs are available. Existing work of one-shot learning limits method generalizability for few-shot scenarios and does not fully use the supervisory information; however, few-shot KG completion has not been well studied yet. In this work, we propose a novel few-shot relation learning model (FSRL) that aims at discovering facts of new relations with few-shot references. FSRL can effectively capture knowledge from heterogeneous graph structure, aggregate representations of few-shot references, and match similar entity pairs of reference set for every relation. Extensive experiments on two public datasets demonstrate that FSRL outperforms the state-of-the-art.

Download Full-text