An Optimized Network Representation Learning Algorithm Using Multi-Relational Data

Zhonglin Ye; Haixing Zhao; Ke Zhang; Yu Zhu; Zhaoyang Wang

doi:10.3390/math7050460

An Optimized Network Representation Learning Algorithm Using Multi-Relational Data

Mathematics ◽

10.3390/math7050460 ◽

2019 ◽

Vol 7 (5) ◽

pp. 460

Author(s):

Zhonglin Ye ◽

Haixing Zhao ◽

Ke Zhang ◽

Yu Zhu ◽

Zhaoyang Wang

Keyword(s):

Knowledge Representation ◽

Learning Algorithm ◽

Representation Learning ◽

Knowledge Bases ◽

Relational Data ◽

New Approach ◽

Network Representation ◽

Structural Relationships ◽

Low Dimensional

Representation learning aims to encode the relationships of research objects into low-dimensional, compressible, and distributed representation vectors. The purpose of network representation learning is to learn the structural relationships between network vertices. Knowledge representation learning is oriented to model the entities and relationships in knowledge bases. In this paper, we first introduce the idea of knowledge representation learning into network representation learning, namely, we propose a new approach to model the vertex triplet relationships based on DeepWalk without TransE. Consequently, we propose an optimized network representation learning algorithm using multi-relational data, MRNR, which introduces the multi-relational data between vertices into the procedures of network representation learning. Importantly, we adopted a kind of higher order transformation strategy to optimize the learnt network representation vectors. The purpose of MRNR is that multi-relational data (triplets) can effectively guide and constrain the procedures of network representation learning. The experimental results demonstrate that the proposed MRNR can learn the discriminative network representations, which show better performance on network classification, visualization, and case study tasks compared to the proposed baseline algorithms in this paper.

Download Full-text

Multi-View Network Representation Learning Algorithm Research

Algorithms ◽

10.3390/a12030062 ◽

2019 ◽

Vol 12 (3) ◽

pp. 62 ◽

Cited By ~ 1

Author(s):

Zhonglin Ye ◽

Haixing Zhao ◽

Ke Zhang ◽

Yu Zhu

Keyword(s):

Learning Algorithm ◽

Representation Learning ◽

Research Field ◽

Dimensional Representation ◽

Multi Scale ◽

Network Representation ◽

Excellent Research ◽

Network Feature ◽

The Matrix ◽

Low Dimensional

Network representation learning is a key research field in network data mining. In this paper, we propose a novel multi-view network representation algorithm (MVNR), which embeds multi-scale relations of network vertices into the low dimensional representation space. In contrast to existing approaches, MVNR explicitly encodes higher order information using k-step networks. In addition, we introduce the matrix forest index as a kind of network feature, which can be applied to balance the representation weights of different network views. We also research the relevance amongst MVNR and several excellent research achievements, including DeepWalk, node2vec and GraRep and so forth. We conduct our experiment on several real-world citation datasets and demonstrate that MVNR outperforms some new approaches using neural matrix factorization. Specifically, we demonstrate the efficiency of MVNR on network classification, visualization and link prediction tasks.

Download Full-text

The Network Representation Learning Algorithm Based on Semi-Supervised Random Walk

IEEE Access ◽

10.1109/access.2020.3044367 ◽

2020 ◽

Vol 8 ◽

pp. 222956-222965

Author(s):

Dong Liu ◽

Qinpeng Li ◽

Yan Ru ◽

Jun Zhang

Keyword(s):

Random Walk ◽

Learning Algorithm ◽

Representation Learning ◽

Network Representation

Download Full-text

Network Representation Learning Algorithm Combined with Node Text Information

Journal of Physics Conference Series ◽

10.1088/1742-6596/1769/1/012054 ◽

2021 ◽

Vol 1769 (1) ◽

pp. 012054

Author(s):

Rui Wang ◽

Yu Liu ◽

Jiawang Chen

Keyword(s):

Learning Algorithm ◽

Representation Learning ◽

Network Representation ◽

Text Information

Download Full-text

Negative Sampling in Knowledge Representation Learning: A Mini-Review

10.5121/csit.2020.101519 ◽

2020 ◽

Author(s):

Jing Qian ◽

Gangmin Li ◽

Katie Atkinson ◽

Yong Yue

Keyword(s):

Knowledge Representation ◽

Link Prediction ◽

Representation Learning ◽

Cluster Sampling ◽

Continuous Space ◽

Knowledge Representations ◽

Space Efficiency ◽

Low Dimensional ◽

Fixed Distribution

Knowledge representation learning (KRL) aims at encoding components of a knowledge graph (KG) into a low-dimensional continuous space, which has brought considerable successes in applying deep learning to graph embedding. Most famous KGs contain only positive instances for space efficiency. Typical KRL techniques, especially translational distance-based models, are trained through discriminating positive and negative samples. Thus, negative sampling is unquestionably a non-trivial step in KG embedding. The quality of generated negative samples can directly influence the performance of final knowledge representations in downstream tasks, such as link prediction and triple classification. This review summarizes current negative sampling methods in KRL and we categorize them into three sorts, fixed distribution-based, generative adversarial net (GAN)-based and cluster sampling. Based on this categorization we discuss the most prevalent existing approaches and their characteristics.

Download Full-text

Exponential Family Graph Embeddings

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.5737 ◽

2020 ◽

Vol 34 (04) ◽

pp. 3357-3364

Author(s):

Abdulkadir Celikkanat ◽

Fragkiskos D. Malliaros

Keyword(s):

Random Walk ◽

Exponential Family ◽

Representation Learning ◽

Learning Problems ◽

Interaction Patterns ◽

Network Representation ◽

Learning Tasks ◽

Learning Techniques ◽

Real World Datasets ◽

Low Dimensional

Representing networks in a low dimensional latent space is a crucial task with many interesting applications in graph learning problems, such as link prediction and node classification. A widely applied network representation learning paradigm is based on the combination of random walks for sampling context nodes and the traditional Skip-Gram model to capture center-context node relationships. In this paper, we emphasize on exponential family distributions to capture rich interaction patterns between nodes in random walk sequences. We introduce the generic exponential family graph embedding model, that generalizes random walk-based network representation learning techniques to exponential family conditional distributions. We study three particular instances of this model, analyzing their properties and showing their relationship to existing unsupervised learning models. Our experimental evaluation on real-world datasets demonstrates that the proposed techniques outperform well-known baseline methods in two downstream machine learning tasks.

Download Full-text

Evolutionary dataset optimisation: learning algorithm quality through evolution

Applied Intelligence ◽

10.1007/s10489-019-01592-4 ◽

2019 ◽

Vol 50 (4) ◽

pp. 1172-1191 ◽

Cited By ~ 2

Author(s):

Henry Wilde ◽

Vincent Knight ◽

Jonathan Gillard

Keyword(s):

Finite Number ◽

Open Source ◽

Learning Algorithm ◽

Clustering Algorithms ◽

Genetic Evolution ◽

Artificial Data ◽

New Approach ◽

Benchmark Datasets ◽

Novel Method

AbstractIn this paper we propose a novel method for learning how algorithms perform. Classically, algorithms are compared on a finite number of existing (or newly simulated) benchmark datasets based on some fixed metrics. The algorithm(s) with the smallest value of this metric are chosen to be the ‘best performing’. We offer a new approach to flip this paradigm. We instead aim to gain a richer picture of the performance of an algorithm by generating artificial data through genetic evolution, the purpose of which is to create populations of datasets for which a particular algorithm performs well on a given metric. These datasets can be studied so as to learn what attributes lead to a particular progression of a given algorithm. Following a detailed description of the algorithm as well as a brief description of an open source implementation, a case study in clustering is presented. This case study demonstrates the performance and nuances of the method which we call Evolutionary Dataset Optimisation. In this study, a number of known properties about preferable datasets for the clustering algorithms known as k-means and DBSCAN are realised in the generated datasets.

Download Full-text

Network Representation Learning-Based Drug Mechanism Discovery and Anti-Inflammatory Response Against COVID-19

10.26434/chemrxiv.12531314.v3 ◽

2021 ◽

Author(s):

Wang Xiaoqi ◽

Bin Xin ◽

Zhijian Xu ◽

Kenli LI ◽

Fei Li ◽

...

Keyword(s):

Inflammatory Response ◽

Inflammatory Responses ◽

Representation Learning ◽

Binding Modes ◽

Network Representation ◽

Drug Mechanism ◽

Docking Program ◽

Therapeutic Development ◽

Anti Inflammatory ◽

Low Dimensional

<p>Recent studies have been demonstrated that the excessive inflammatory response is an important factor of death in COVID-19 patients. In this study, we proposed a network representation learning-based methodology, termed AIdrug2cov, to discover drug mechanism and anti-inflammatory response for patients with COVID-19. This work explores the multi-hub characteristic of a heterogeneous drug network integrating 8 unique networks. Inspired by the multi-hub characteristic, we design three billion special meta paths to train a deep representation model for learning low-dimensional vectors that integrate long-range structure dependency and complex semantic relation among network nodes. Using the representation vectors, AIdrug2cov identifies 40 potential targets and 22 high-confidence drugs that bind to tumor necrosis factor(TNF)-α or interleukin(IL)-6 to prevent excessive inflammatory responses in COVID-19 patients. Finally, we analyze mechanisms of action based on PubMed publications and ongoing clinical trials, and explore the possible binding modes between the new predicted drugs and targets via docking program. In addition, the results in 5 pharmacological application suggested that AIdrug2cov significantly outperforms 5 other state-of-the-art network representation approaches, future demonstrating the availability of AIdrug2cov in drug development field. In summary, AIdrug2cov is practically useful for accelerating COVID-19 therapeutic development. The source code and data can be downloaded from https://github.com/pengsl-lab/AIdrug2cov.git.</p>

Download Full-text

Propagators and Solvers for the Algebra of Modular Systems

10.29007/t7r9 ◽

2018 ◽

Author(s):

Bart Bogaerts ◽

Eugenia Ternovska ◽

David Mitchell

Keyword(s):

Knowledge Representation ◽

Learning Algorithm ◽

Knowledge Bases ◽

Black Box ◽

Modular System ◽

General Notion ◽

Combined System ◽

Modular Systems ◽

Problem Solvers ◽

Model Expansion

Solving complex problems can involve non-trivial combinations of distinct knowledge bases and problem solvers. The Algebra of Modular Systems is a knowledge representation framework that provides a method for formally specifying such systems in purely semantic terms. Many practical systems based on expressive formalisms solve the model expansion task. In this paper, we con- struct a solver for the model expansion task for a complex modular system from an expression in the algebra and black-box propagators or solvers for the primitive modules. To this end, we define a general notion of propagators equipped with an explanation mechanism, an extension of the algebra to propagators, and a lazy conflict-driven learning algorithm. The result is a framework for seamlessly combining solving technology from different domains to produce a solver for a combined system.

Download Full-text

Feature Hashing for Network Representation Learning

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/390 ◽

2018 ◽

Cited By ~ 2

Author(s):

Qixiang Wang ◽

Shanfeng Wang ◽

Maoguo Gong ◽

Yue Wu

Keyword(s):

Link Prediction ◽

Feature Space ◽

Representation Learning ◽

Learning Approaches ◽

Network Representation ◽

Proximity Matrix ◽

Low Dimensional ◽

Vector Representations ◽

Feature Hashing ◽

Node Embeddings

The goal of network representation learning is to embed nodes so as to encode the proximity structures of a graph into a continuous low-dimensional feature space. In this paper, we propose a novel algorithm called node2hash based on feature hashing for generating node embeddings. This approach follows the encoder-decoder framework. There are two main mapping functions in this framework. The first is an encoder to map each node into high-dimensional vectors. The second is a decoder to hash these vectors into a lower dimensional feature space. More specifically, we firstly derive a proximity measurement called expected distance as target which combines position distribution and co-occurrence statistics of nodes over random walks so as to build a proximity matrix, then introduce a set of T different hash functions into feature hashing to generate uniformly distributed vector representations of nodes from the proximity matrix. Compared with the existing state-of-the-art network representation learning approaches, node2hash shows a competitive performance on multi-class node classification and link prediction tasks on three real-world networks from various domains.

Download Full-text

An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks

Mathematics ◽

10.3390/math9151767 ◽

2021 ◽

Vol 9 (15) ◽

pp. 1767

Author(s):

Xin Xu ◽

Yang Lu ◽

Yupeng Zhou ◽

Zhiguo Fu ◽

Yanjie Fu ◽

...

Keyword(s):

Random Walk ◽

Representation Learning ◽

Local Information ◽

Learning Framework ◽

Network Representation ◽

Label Node ◽

Label Information ◽

Classification Tasks ◽

Node Classification ◽

Low Dimensional

Network representation learning aims to learn low-dimensional, compressible, and distributed representational vectors of nodes in networks. Due to the expensive costs of obtaining label information of nodes in networks, many unsupervised network representation learning methods have been proposed, where random walk strategy is one of the wildly utilized approaches. However, the existing random walk based methods have some challenges, including: 1. The insufficiency of explaining what network knowledge in the walking path-samplings; 2. The adverse effects caused by the mixture of different information in networks; 3. The poor generality of the methods with hyper-parameters on different networks. This paper proposes an information-explainable random walk based unsupervised network representation learning framework named Probabilistic Accepted Walk (PAW) to obtain network representation from the perspective of the stationary distribution of networks. In the framework, we design two stationary distributions based on nodes’ self-information and local-information of networks to guide our proposed random walk strategy to learn representational vectors of networks through sampling paths of nodes. Numerous experimental results demonstrated that the PAW could obtain more expressive representation than the other six widely used unsupervised network representation learning baselines on four real-world networks in single-label and multi-label node classification tasks.

Download Full-text