AI-CTO: Knowledge graph for automated and dependable software stack solution

Xiaoyun Xu; Jingzheng Wu; Mutian Yang; Tianyue Luo; Qianru Meng; Weiheng Li; Yanjun Wu

doi:10.3233/jifs-200899

AI-CTO: Knowledge graph for automated and dependable software stack solution

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-200899 ◽

2021 ◽

Vol 40 (1) ◽

pp. 799-812

Author(s):

Xiaoyun Xu ◽

Jingzheng Wu ◽

Mutian Yang ◽

Tianyue Luo ◽

Qianru Meng ◽

...

Keyword(s):

Current Practice ◽

Graph Embedding ◽

Software Systems ◽

Professional Experience ◽

Knowledge Graph ◽

Dimensional Vector ◽

Dimensional Vector Space ◽

Svm Model ◽

Low Dimensional ◽

Industry Experience

As the scale of software systems continues expanding, software architecture is receiving more and more attention as the blueprint for the complex software system. An outstanding architecture requires a lot of professional experience and expertise. In current practice, architects try to find solutions manually, which is time-consuming and error-prone because of the knowledge barrier between newcomers and experienced architects. The problem can be solved by easing the process of apply experience from prominent architects. To this end, this paper proposes a novel graph-embedding-based method, AI-CTO, to automatically suggest software stack solutions according to the knowledge and experience of prominent architects. Firstly, AI-CTO converts existing industry experience to knowledge, i.e., knowledge graph. Secondly, the knowledge graph is embedded in a low-dimensional vector space. Then, the entity vectors are used to predict valuable software stack solutions by an SVM model. We evaluate AI-CTO with two case studies and compare its solutions with the software stacks of large companies. The experiment results show that AI-CTO can find effective and correct stack solutions and it outperforms other baseline methods.

Download Full-text

Understanding Negative Sampling in Knowledge Graph Embedding

International Journal of Artificial Intelligence & Applications ◽

10.5121/ijaia.2021.12105 ◽

2021 ◽

Vol 12 (1) ◽

pp. 71-81

Author(s):

Jing Qian ◽

Gangmin Li ◽

Katie Atkinson ◽

Yong Yue

Keyword(s):

Link Prediction ◽

Graph Embedding ◽

Knowledge Graph ◽

Direct Impact ◽

Dimensional Vector Space ◽

Dynamic Distribution ◽

Space Efficiency ◽

Node Classification ◽

Low Dimensional

Knowledge graph embedding (KGE) is to project entities and relations of a knowledge graph (KG) into a low-dimensional vector space, which has made steady progress in recent years. Conventional KGE methods, especially translational distance-based models, are trained through discriminating positive samples from negative ones. Most KGs store only positive samples for space efficiency. Negative sampling thus plays a crucial role in encoding triples of a KG. The quality of generated negative samples has a direct impact on the performance of learnt knowledge representation in a myriad of downstream tasks, such as recommendation, link prediction and node classification. We summarize current negative sampling approaches in KGE into three categories, static distribution-based, dynamic distribution-based and custom cluster-based respectively. Based on this categorization we discuss the most prevalent existing approaches and their characteristics. It is a hope that this review can provide some guidelines for new thoughts about negative sampling in KGE.

Download Full-text

How much topological structure is preserved by graph embeddings?

Computer Science and Information Systems ◽

10.2298/csis181001011l ◽

2019 ◽

Vol 16 (2) ◽

pp. 597-614 ◽

Cited By ~ 1

Author(s):

Xin Liu ◽

Chenyi Zhuang ◽

Tsuyoshi Murata ◽

Kyoung-Sook Kim ◽

Natthawut Kertkeidkachorn

Keyword(s):

Topological Structure ◽

Link Prediction ◽

State Of The Art ◽

Graph Embedding ◽

Graph Embeddings ◽

Dimensional Vector ◽

Original Graph ◽

Dimensional Vector Space ◽

Low Dimensional ◽

Link Distribution

Graph embedding aims at learning representations of nodes in a low dimensional vector space. Good embeddings should preserve the graph topological structure. To study how much such structure can be preserved, we propose evaluation methods from four aspects: 1) How well the graph can be reconstructed based on the embeddings, 2) The divergence of the original link distribution and the embedding-derived distribution, 3) The consistency of communities discovered from the graph and embeddings, and 4) To what extent we can employ embeddings to facilitate link prediction. We find that it is insufficient to rely on the embeddings to reconstruct the original graph, to discover communities, and to predict links at a high precision. Thus, the embeddings by the state-of-the-art approaches can only preserve part of the topological structure.

Download Full-text

TransET: Knowledge Graph Embedding with Entity Types

Electronics ◽

10.3390/electronics10121407 ◽

2021 ◽

Vol 10 (12) ◽

pp. 1407

Author(s):

Peng Wang ◽

Jing Zhou ◽

Yuzhang Liu ◽

Xingchen Zhou

Keyword(s):

Link Prediction ◽

State Of The Art ◽

Score Function ◽

Graph Embedding ◽

Vector Spaces ◽

Knowledge Graph ◽

Semantic Features ◽

Knowledge Graphs ◽

Real World Datasets ◽

Low Dimensional

Knowledge graph embedding aims to embed entities and relations into low-dimensional vector spaces. Most existing methods only focus on triple facts in knowledge graphs. In addition, models based on translation or distance measurement cannot fully represent complex relations. As well-constructed prior knowledge, entity types can be employed to learn the representations of entities and relations. In this paper, we propose a novel knowledge graph embedding model named TransET, which takes advantage of entity types to learn more semantic features. More specifically, circle convolution based on the embeddings of entity and entity types is utilized to map head entity and tail entity to type-specific representations, then translation-based score function is used to learn the presentation triples. We evaluated our model on real-world datasets with two benchmark tasks of link prediction and triple classification. Experimental results demonstrate that it outperforms state-of-the-art models in most cases.

Download Full-text

An Approach to Knowledge Base Completion by a Committee-Based Knowledge Graph Embedding

Applied Sciences ◽

10.3390/app10082651 ◽

2020 ◽

Vol 10 (8) ◽

pp. 2651

Author(s):

Su Jeong Choi ◽

Hyun-Je Song ◽

Seong-Bae Park

Keyword(s):

Knowledge Base ◽

Language Processing ◽

Graph Embedding ◽

Knowledge Bases ◽

Knowledge Graph ◽

Data Sets ◽

Complete Knowledge ◽

Proposed Model ◽

Ranking Task ◽

Low Dimensional

Knowledge bases such as Freebase, YAGO, DBPedia, and Nell contain a number of facts with various entities and relations. Since they store many facts, they are regarded as core resources for many natural language processing tasks. Nevertheless, they are not normally complete and have many missing facts. Such missing facts keep them from being used in diverse applications in spite of their usefulness. Therefore, it is significant to complete knowledge bases. Knowledge graph embedding is one of the promising approaches to completing a knowledge base and thus many variants of knowledge graph embedding have been proposed. It maps all entities and relations in knowledge base onto a low dimensional vector space. Then, candidate facts that are plausible in the space are determined as missing facts. However, any single knowledge graph embedding is insufficient to complete a knowledge base. As a solution to this problem, this paper defines knowledge base completion as a ranking task and proposes a committee-based knowledge graph embedding model for improving the performance of knowledge base completion. Since each knowledge graph embedding has its own idiosyncrasy, we make up a committee of various knowledge graph embeddings to reflect various perspectives. After ranking all candidate facts according to their plausibility computed by the committee, the top-k facts are chosen as missing facts. Our experimental results on two data sets show that the proposed model achieves higher performance than any single knowledge graph embedding and shows robust performances regardless of k. These results prove that the proposed model considers various perspectives in measuring the plausibility of candidate facts.

Download Full-text

Logic Attention Based Neighborhood Aggregation for Inductive Knowledge Graph Embedding

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33017152 ◽

2019 ◽

Vol 33 ◽

pp. 7152-7159 ◽

Cited By ~ 4

Author(s):

Peifeng Wang ◽

Jialong Han ◽

Chenliang Li ◽

Rong Pan

Keyword(s):

Real World ◽

Graph Embedding ◽

Daily Basis ◽

Knowledge Graph ◽

World Knowledge ◽

Attention Network ◽

Knowledge Graphs ◽

Low Dimensional

Knowledge graph embedding aims at modeling entities and relations with low-dimensional vectors. Most previous methods require that all entities should be seen during training, which is unpractical for real-world knowledge graphs with new entities emerging on a daily basis. Recent efforts on this issue suggest training a neighborhood aggregator in conjunction with the conventional entity and relation embeddings, which may help embed new entities inductively via their existing neighbors. However, their neighborhood aggregators neglect the unordered and unequal natures of an entity’s neighbors. To this end, we summarize the desired properties that may lead to effective neighborhood aggregators. We also introduce a novel aggregator, namely, Logic Attention Network (LAN), which addresses the properties by aggregating neighbors with both rules- and network-based attention weights. By comparing with conventional aggregators on two knowledge graph completion tasks, we experimentally validate LAN’s superiority in terms of the desired properties.

Download Full-text

Bootstrapping Entity Alignment with Knowledge Graph Embedding

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/611 ◽

2018 ◽

Cited By ~ 35

Author(s):

Zequn Sun ◽

Wei Hu ◽

Qingheng Zhang ◽

Yuzhong Qu

Keyword(s):

Performance Improvement ◽

Real World ◽

State Of The Art ◽

Graph Embedding ◽

Training Data ◽

Knowledge Graph ◽

Error Accumulation ◽

Knowledge Graphs ◽

Real World Datasets ◽

Low Dimensional

Embedding-based entity alignment represents different knowledge graphs (KGs) as low-dimensional embeddings and finds entity alignment by measuring the similarities between entity embeddings. Existing approaches have achieved promising results, however, they are still challenged by the lack of enough prior alignment as labeled training data. In this paper, we propose a bootstrapping approach to embedding-based entity alignment. It iteratively labels likely entity alignment as training data for learning alignment-oriented KG embeddings. Furthermore, it employs an alignment editing method to reduce error accumulation during iterations. Our experiments on real-world datasets showed that the proposed approach significantly outperformed the state-of-the-art embedding-based ones for entity alignment. The proposed alignment-oriented KG embedding, bootstrapping process and alignment editing method all contributed to the performance improvement.

Download Full-text

TransConv: Relationship Embedding in Social Networks

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33014130 ◽

2019 ◽

Vol 33 ◽

pp. 4130-4138 ◽

Cited By ~ 2

Author(s):

Yi-Yu Lai ◽

Jennifer Neville ◽

Dan Goldwasser

Keyword(s):

Social Networks ◽

Social Relationships ◽

Graph Embedding ◽

Representation Learning ◽

Knowledge Graph ◽

Social Network Data ◽

Novel Approach ◽

The Rich ◽

Low Dimensional ◽

Improve Representation

Representation learning (RL) for social networks facilitates real-world tasks such as visualization, link prediction and friend recommendation. Traditional knowledge graph embedding models learn continuous low-dimensional embedding of entities and relations. However, when applied to social networks, existing approaches do not consider the rich textual communications between users, which contains valuable information to describe social relationships. In this paper, we propose TransConv, a novel approach that incorporates textual interactions between pair of users to improve representation learning of both users and relationships. Our experiments on real social network data show TransConv learns better user and relationship embeddings compared to other state-of-theart knowledge graph embedding models. Moreover, the results illustrate that our model is more robust for sparse relationships where there are fewer examples.

Download Full-text

Meta-Learning on Graph with Curvature-Based Analysis (Student Abstract)

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i10.7210 ◽

2020 ◽

Vol 34 (10) ◽

pp. 13875-13876

Author(s):

Tae Hong Moon ◽

Sungsu Lim

Keyword(s):

Vector Space ◽

Dimensional Vector ◽

Topological Information ◽

Dimensional Vector Space ◽

Meta Learning ◽

Data Points ◽

Flexible Framework ◽

Latent Representations ◽

Low Dimensional ◽

Local Proximity

Learning latent representations in graphs is finding a mapping that embeds nodes or edges as data points in a low-dimensional vector space. This paper introduces a flexible framework to enhance existing methodologies that have difficulty capturing local proximity and global relationships at the same time. Our approach generates a virtual edge between non-adjacent nodes based on the Forman-Ricci curvature in network. By analyzing the network using topological information, global relationships structurally similar can easily be detected and successfully integrated with previous works.

Download Full-text

A Novel Negative Sampling Based on Frequency of Relational Association Entities for Knowledge Graph Embedding

Journal of Web Engineering ◽

10.13052/jwe1540-9589.2068 ◽

2021 ◽

Author(s):

Wanhua Cao ◽

Yi Zhang ◽

Juntao Liu ◽

Ziyun Rao

Keyword(s):

Link Prediction ◽

State Of The Art ◽

Evaluation Criteria ◽

Relation Extraction ◽

Graph Embedding ◽

Semantic Space ◽

Knowledge Graph ◽

Knowledge Reasoning ◽

Relation Prediction ◽

Low Dimensional

Knowledge graph embedding improves the performance of relation extraction and knowledge reasoning by encoding entities and relationships in low-dimensional semantic space. During training, negative samples are usually constructed by replacing the head/tail entity. And the different replacing relationships lead to different accuracy of the prediction results. This paper develops a negative triplets construction framework according to the frequency of relational association entities. The proposed construction framework can fully consider the quantitative of relations and entities in the dataset to assign the proportion of relation and entity replacement and the frequency of the entities associated with each relationship to set reasonable proportions for different relations. To verify the validity of the proposed construction framework, it is integrated into the state-of-the-art knowledge graph embedding models, such as TransE, TransH, DistMult, ComplEx, and Analogy. And both the evaluation criteria of relation prediction and entity prediction are used to evaluate the performance of link prediction more comprehensively. The experimental results on two commonly used datasets, WN18 and FB15K, show that the proposed method improves entity link and triplet classification accuracy, especially the accuracy of relational link prediction.

Download Full-text

A Transition-Based Knowledge Graph Embedding with Adapting New Entities

Contemporary Research Trend of IT Convergence Technology ◽

10.21742/asehl.2016.4.03 ◽

2016 ◽

Author(s):

A-Yeong Kim ◽

◽

Hee-Guen Yoon ◽

Seong-Bae Park ◽

Se-Young Park ◽

...

Keyword(s):

Graph Embedding ◽

Knowledge Graph

Download Full-text