Link Prediction via Sparse Gaussian Graphical Model

Mathematical Problems in Engineering ◽

10.1155/2016/7213432 ◽

2016 ◽

Vol 2016 ◽

pp. 1-11

Author(s):

Liangliang Zhang ◽

Longqi Yang ◽

Guyu Hu ◽

Zhisong Pan ◽

Zhen Li

Keyword(s):

Link Prediction ◽

Graphical Model ◽

Area Under The Curve ◽

Superior Performance ◽

Gaussian Graphical Model ◽

Training Set ◽

Baseline Method ◽

Inverse Covariance Matrix ◽

AUC Value ◽

Real World Datasets

Link prediction is an important task in complex network analysis. Traditional link prediction methods are limited by network topology and by the lack of node property information, which makes predicting links challenging. In this study, we address link prediction using a sparse Gaussian graphical model and demonstrate its theoretical and practical effectiveness. In theory, link prediction is performed by estimating the inverse covariance matrix of samples to overcome information limits. The proposed method was evaluated on four small and four large real-world datasets. The experimental results show that, compared to 13 mainstream similarity methods, the area under the curve (AUC) value obtained by the proposed method improved by an average of 3% on the small datasets and 12.5% on the large datasets. The method outperforms the baseline method, and its prediction accuracy is superior to that of mainstream methods even when using only 80% of the training set. The method also provides significantly higher AUC values when using only 60% of the training set on the Dolphin and Taro datasets. Furthermore, the proposed method achieves a lower error rate than mainstream methods on all datasets.
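
To illustrate the idea of reading links off an inverse covariance matrix, here is a minimal sketch. It assumes a sparse precision matrix has already been estimated (the abstract does not say how the samples are built or which estimator is used), and it scores unobserved node pairs by the magnitude of their partial correlation. The matrix `theta`, the set `known` and both function names are hypothetical.

```python
import math

def partial_correlations(precision):
    """Convert a precision (inverse covariance) matrix into partial correlations.

    For i != j: rho_ij = -theta_ij / sqrt(theta_ii * theta_jj).
    A zero entry means the two nodes are conditionally independent.
    """
    n = len(precision)
    scores = {}
    for i in range(n):
        for j in range(i + 1, n):
            denom = math.sqrt(precision[i][i] * precision[j][j])
            scores[(i, j)] = -precision[i][j] / denom
    return scores

def rank_candidate_links(precision, known_edges):
    """Rank node pairs that are not yet known edges by |partial correlation|."""
    scores = partial_correlations(precision)
    candidates = [(pair, abs(s)) for pair, s in scores.items()
                  if pair not in known_edges]
    return sorted(candidates, key=lambda item: item[1], reverse=True)

# Hypothetical sparse precision matrix for 4 nodes (illustration only).
theta = [
    [2.0, -0.8, 0.0, 0.0],
    [-0.8, 2.0, -0.5, 0.0],
    [0.0, -0.5, 2.0, -0.2],
    [0.0, 0.0, -0.2, 2.0],
]
known = {(0, 1)}  # edges already observed in the training network
ranking = rank_candidate_links(theta, known)
for pair, score in ranking:
    print(pair, round(score, 3))
```

Pairs with a zero precision entry receive a score of zero, which is why sparsity in the estimate translates directly into a short list of candidate links.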

Mean Received Resources Meet Machine Learning Algorithms to Improve Link Prediction Methods

Information ◽

10.3390/info13010035 ◽

2022 ◽

Vol 13 (1) ◽

pp. 35

Author(s):

Jibouni Ayoub ◽

Dounia Lotfi ◽

Ahmed Hammouch

Keyword(s):

Machine Learning ◽

Link Prediction ◽

Learning Algorithms ◽

Area Under The Curve ◽

Machine Learning Algorithms ◽

Actual State ◽

The Future ◽

AUC Value ◽

The Mean ◽

Analysis Of Social Networks

The analysis of social networks has attracted a lot of attention during the last two decades. These networks are dynamic: links appear and disappear over time. Link prediction is the problem of inferring the links that will appear in the future from the current state of the network. We use information from nodes and edges to calculate the similarity between users. The more similar two users are, the higher the probability that they will connect in the future. Similarity metrics therefore play an important role in the link prediction field. Because of their simplicity and flexibility, many metrics, such as Jaccard, AA, and Katz, have been proposed and evaluated using the area under the curve (AUC). In this paper, we propose a new parameterized method that enhances the AUC value of link prediction metrics by combining them with the mean received resources (MRRs). Experiments show that the proposed method improves the performance of the state-of-the-art metrics. Moreover, we used machine learning algorithms to classify links and confirm the efficiency of the proposed combination.
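
As a rough illustration of how similarity metrics are scored with AUC in link prediction, the sketch below implements Jaccard and Adamic-Adar (AA) on a toy graph and estimates AUC as the chance that a held-out true link scores higher than a non-existent link, with ties counting one half. The graph, the held-out edge and the helper names are hypothetical, and the MRR combination proposed in the paper is not reproduced here.

```python
import math

def build_adjacency(edges):
    """Undirected adjacency sets from a list of (u, v) training edges."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj

def jaccard(adj, u, v):
    """Shared neighbours divided by the size of the neighbourhood union."""
    union = adj[u] | adj[v]
    return len(adj[u] & adj[v]) / len(union) if union else 0.0

def adamic_adar(adj, u, v):
    """Sum of 1 / log(degree) over common neighbours.

    A common neighbour touches both u and v, so its degree is at least 2
    and the logarithm is strictly positive.
    """
    return sum(1.0 / math.log(len(adj[z])) for z in adj[u] & adj[v])

def link_auc(adj, score, held_out, non_edges):
    """Fraction of (held-out edge, non-edge) comparisons won; ties count 0.5."""
    wins = 0.0
    for p in held_out:
        for q in non_edges:
            sp, sq = score(adj, *p), score(adj, *q)
            if sp > sq:
                wins += 1.0
            elif sp == sq:
                wins += 0.5
    return wins / (len(held_out) * len(non_edges))

# Toy data (hypothetical): training edges, one hidden true edge, true non-edges.
train_edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4)]
held_out = [(1, 3)]
non_edges = [(0, 3), (0, 4), (1, 4), (2, 4)]

adj = build_adjacency(train_edges)
auc_jaccard = link_auc(adj, jaccard, held_out, non_edges)
auc_aa = link_auc(adj, adamic_adar, held_out, non_edges)
print(auc_jaccard, auc_aa)
```

An AUC of 0.5 corresponds to random scoring, so a metric is only useful to the extent that it pushes this value toward 1.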

Con&Net: A Cross-Network Anchor Link Discovery Method Based on Embedding Representation

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3469083 ◽

2022 ◽

Vol 16 (2) ◽

pp. 1-18

Author(s):

Xueyuan Wang ◽

Hongpo Zhang ◽

Zongmin Wang ◽

Yaqiong Qiao ◽

Jiangtao Ma ◽

...

Keyword(s):

Area Under The Curve ◽

Research Problem ◽

Cosine Similarity ◽

Baseline Method ◽

Latent Space ◽

Link Discovery ◽

Cross Platform ◽

AUC Value ◽

The Stability ◽

Discovery Method

Cross-network anchor link discovery is an important research problem with many applications in heterogeneous social networks. Existing schemes for cross-network anchor link discovery can provide reasonable link discovery results, but the quality of these results depends on the features of the platform, so there is no theoretical guarantee of stability. This article employs user embedding features to model the relationship between cross-platform accounts: the more similar the user embedding features are, the more similar the two accounts are. The similarity of user embedding features is determined by the distance between the user features in the latent space. Based on the user embedding features, this article proposes an embedding representation-based method, Con&Net (Content and Network), to solve the cross-network anchor link discovery problem. Con&Net combines the user's profile features, user-generated content (UGC) features, and social structure features to measure the similarity of two user accounts. Con&Net first trains on the user's profile features to obtain a profile embedding. It then trains on the network structure of the nodes to obtain a structure embedding. It joins the two embeddings by vector concatenation and calculates the cosine similarity of the resulting embedding vectors. This cosine similarity is used to measure the similarity of the user accounts. Finally, Con&Net predicts links for account pairs across the two networks based on this similarity. Extensive experiments on the Sina Weibo and Twitter networks show that Con&Net outperforms state-of-the-art methods. The area under the curve (AUC) value of the receiver operating characteristic (ROC) curve for anchor link prediction is 11% higher than that of the baseline method, and Precision@30 is 25% higher than that of the baseline method.
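
The concatenate-then-compare step described above can be sketched as follows. This is only an illustration under stated assumptions: the embeddings are hand-written toy vectors rather than trained ones, and the account names and helper functions are hypothetical.

```python
import math

def concat_embedding(profile_vec, structure_vec):
    """Join a profile embedding and a structure embedding into one vector."""
    return list(profile_vec) + list(structure_vec)

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def best_match(query_vec, candidates):
    """Return (name, similarity) of the candidate closest to the query."""
    scored = [(name, cosine_similarity(query_vec, vec))
              for name, vec in candidates.items()]
    return max(scored, key=lambda item: item[1])

# Toy embeddings (hypothetical values, not trained).
account_a = concat_embedding([1.0, 0.0], [0.0, 1.0])   # account in network 1
candidates = {                                          # accounts in network 2
    "b1": concat_embedding([1.0, 0.0], [0.0, 1.0]),
    "b2": concat_embedding([0.0, 1.0], [1.0, 0.0]),
}
match_name, match_sim = best_match(account_a, candidates)
print(match_name, match_sim)
```

Because cosine similarity ignores vector length, the two embedding parts contribute in proportion to their own norms; whether and how Con&Net rescales them before concatenation is not stated in the abstract.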

Comparison of two inference approaches in Gaussian graphical models

Turkish Journal of Biochemistry ◽

10.1515/tjb-2016-0298 ◽

2017 ◽

Vol 42 (2) ◽

Cited By ~ 1

Author(s):

Vilda Purutçuoğlu ◽

Ezgi Ayyıldız ◽

Ernst Wit

Keyword(s):

Graphical Models ◽

Gradient Descent ◽

Probabilistic Models ◽

Graphical Model ◽

Computational Cost ◽

Area Under The Curve ◽

Gaussian Graphical Model ◽

Gaussian Graphical Models ◽

Graphical Lasso ◽

Conditional Independency

Introduction: The Gaussian Graphical Model (GGM) is a well-known probabilistic model based on the conditional independence of nodes in a biological system. Here, we compare the estimates of the GGM parameters obtained by the graphical lasso (glasso) method and by the threshold gradient descent (TGD) algorithm. Methods: We evaluate the performance of both techniques using measures such as specificity, F-measure, and AUC (area under the curve). The analyses are conducted by Monte Carlo runs on systems of different dimensions. Results: The results indicate that the TGD algorithm is more accurate than the glasso method on all selected criteria, but it is also more computationally demanding. Discussion and conclusion: Therefore, in high-dimensional systems, we recommend glasso for its computational efficiency despite its loss in accuracy, and we believe that the computational cost of the TGD algorithm can be reduced by introducing alternative steps in the inference of the network.
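
For orientation, the graphical lasso is commonly written as an L1-penalized Gaussian log-likelihood problem. The symbols below are assumptions, not taken from the abstract: $S$ is the sample covariance matrix, $\Theta$ the precision matrix, and $\lambda \ge 0$ the penalty weight. Some formulations penalize only the off-diagonal entries of $\Theta$.

```latex
\[
\hat{\Theta} \;=\; \arg\max_{\Theta \succ 0}\;
  \log\det\Theta \;-\; \operatorname{tr}(S\Theta) \;-\; \lambda \lVert \Theta \rVert_{1},
\qquad
\Theta_{ij} = 0 \;\Longleftrightarrow\; X_i \perp X_j \mid X_{\setminus\{i,j\}} .
\]
```

The second relation is the conditional independence property that lets zeros in the estimated precision matrix be read as missing edges in the network.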

Discriminating Malignancy in Thyroid Nodules: The Nomogram Versus the Kwak and ACR TI-RADS

Otolaryngology ◽

10.1177/0194599820939071 ◽

2020 ◽

Vol 163 (6) ◽

pp. 1156-1165

Author(s):

Juan Xiao ◽

Qiang Xiao ◽

Wei Cong ◽

Ting Li ◽

Shouluan Ding ◽

...

Keyword(s):

Thyroid Nodules ◽

Characteristic Curve ◽

Area Under The Curve ◽

Diagnostic Study ◽

Diagnostic Efficiency ◽

Training Set ◽

Multivariable Logistic Regression Model ◽

Predictive Values ◽

Validation Set ◽

Sensitivity Specificity

Objective: To develop an easy-to-use nomogram for discriminating malignant thyroid nodules and to compare its diagnostic efficiency with that of the Kwak and American College of Radiology (ACR) Thyroid Imaging, Reporting and Data System (TI-RADS). Study Design: Retrospective diagnostic study. Setting: The Second Hospital of Shandong University. Subjects and Methods: From March 2017 to April 2019, 792 patients with 1940 thyroid nodules were included in the training set; from May 2019 to December 2019, 174 patients with 389 nodules were included in the validation set. A multivariable logistic regression model was used to develop a nomogram for discriminating malignant nodules. To compare the diagnostic performance of the nomogram with the Kwak and ACR TI-RADS, the area under the receiver operating characteristic curve, sensitivity, specificity, and positive and negative predictive values were calculated. Results: The nomogram consisted of 7 factors: composition, orientation, echogenicity, border, margin, extrathyroidal extension, and calcification. In the training set, for all nodules, the area under the curve (AUC) for the nomogram was 0.844, which was higher than that of the Kwak TI-RADS (0.826, P = .008) and the ACR TI-RADS (0.810, P < .001). For the 822 nodules >1 cm, the AUC of the nomogram was 0.891, which was higher than that of the Kwak TI-RADS (0.852, P < .001) and the ACR TI-RADS (0.853, P < .001). In the validation set, the AUC of the nomogram was also higher than that of the Kwak and ACR TI-RADS (P < .05), both in the whole series and separately for nodules >1 cm or ≤1 cm. Conclusions: Compared with the Kwak and ACR TI-RADS, the nomogram performed better in discriminating malignant thyroid nodules.
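
The diagnostic measures named above all derive from a 2x2 confusion table. The sketch below shows the arithmetic with made-up counts; the numbers are hypothetical and are not taken from the study.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Sensitivity, specificity, PPV and NPV from confusion-table counts.

    tp/fn: malignant nodules called malignant/benign.
    tn/fp: benign nodules called benign/malignant.
    """
    return {
        "sensitivity": tp / (tp + fn),  # share of malignant nodules detected
        "specificity": tn / (tn + fp),  # share of benign nodules cleared
        "ppv": tp / (tp + fp),          # chance a positive call is malignant
        "npv": tn / (tn + fn),          # chance a negative call is benign
    }

# Hypothetical counts for illustration only.
metrics = diagnostic_metrics(tp=80, fp=30, tn=170, fn=20)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Unlike sensitivity and specificity, the two predictive values depend on how common malignancy is in the sample, so they do not transfer directly between cohorts with different prevalence.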

Retinal Vessel Segmentation by Deep Residual Learning with Wide Activation

Computational Intelligence and Neuroscience ◽

10.1155/2020/8822407 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Yuliang Ma ◽

Xue Li ◽

Xiaopeng Duan ◽

Yun Peng ◽

Yingchun Zhang

Keyword(s):

Blood Vessel ◽

Blood Vessels ◽

Area Under The Curve ◽

Retinal Vessel ◽

Vessel Segmentation ◽

Superior Performance ◽

Retinal Blood Vessel ◽

Low Contrast ◽

Blood Vessel Segmentation ◽

Small Vessels

Purpose. Retinal blood vessel image segmentation is an important step in ophthalmological analysis. However, it is difficult to segment small vessels accurately because of the low contrast and complex feature information of blood vessels. The objective of this study is to develop an improved retinal blood vessel segmentation structure (WA-Net) to overcome these challenges. Methods. This paper mainly focuses on the width of deep learning. The channels of the ResNet block were broadened to propagate more low-level features, and the identity mapping pathway was slimmed to maintain parameter complexity. A residual atrous spatial pyramid module was used to capture the retinal vessels at various scales. We applied weight normalization to eliminate the impacts of the mini-batch and improve segmentation accuracy. The experiments were performed on the DRIVE and STARE datasets. To show the generalizability of WA-Net, we performed cross-training between datasets. Results. Within datasets, the global accuracy was 95.66% and 96.45%, and the specificity was 98.13% and 98.71%, for the two datasets, respectively. The accuracy and area under the curve in the cross-dataset setting differed by only 1% to 2% from the corresponding within-dataset performance. Conclusion. All the results show that WA-Net extracts more detailed blood vessels and shows superior performance on retinal blood vessel segmentation tasks.

TransET: Knowledge Graph Embedding with Entity Types

Electronics ◽

10.3390/electronics10121407 ◽

2021 ◽

Vol 10 (12) ◽

pp. 1407

Author(s):

Peng Wang ◽

Jing Zhou ◽

Yuzhang Liu ◽

Xingchen Zhou

Keyword(s):

Link Prediction ◽

State Of The Art ◽

Score Function ◽

Graph Embedding ◽

Vector Spaces ◽

Knowledge Graph ◽

Semantic Features ◽

Knowledge Graphs ◽

Real World Datasets ◽

Low Dimensional

Knowledge graph embedding aims to embed entities and relations into low-dimensional vector spaces. Most existing methods focus only on triple facts in knowledge graphs. In addition, models based on translation or distance measurement cannot fully represent complex relations. As well-constructed prior knowledge, entity types can be employed to learn the representations of entities and relations. In this paper, we propose a novel knowledge graph embedding model named TransET, which takes advantage of entity types to learn more semantic features. More specifically, circle convolution over the embeddings of entities and entity types is used to map the head and tail entities to type-specific representations, and a translation-based score function is then used to learn the representations of triples. We evaluated our model on real-world datasets with two benchmark tasks, link prediction and triple classification. Experimental results demonstrate that it outperforms state-of-the-art models in most cases.
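
One possible reading of this pipeline is sketched below: each entity embedding is combined with its type embedding by circular convolution, and the triple is then scored with a TransE-style distance. The exact operator and score function used by TransET are not given in the abstract, so the formulas, vectors and function names here are assumptions for illustration.

```python
import math

def circular_convolution(a, b):
    """(a * b)[k] = sum_i a[i] * b[(k - i) mod d] for equal-length vectors."""
    d = len(a)
    return [sum(a[i] * b[(k - i) % d] for i in range(d)) for k in range(d)]

def translation_score(head, head_type, relation, tail, tail_type):
    """Distance ||h' + r - t'||, where h' and t' are type-specific vectors.

    Lower is better: a plausible triple should have h' + r close to t'.
    """
    h = circular_convolution(head, head_type)
    t = circular_convolution(tail, tail_type)
    return math.sqrt(sum((hi + ri - ti) ** 2
                         for hi, ri, ti in zip(h, relation, t)))

# Toy vectors (hypothetical). A type of [1, 0, 0] leaves the entity unchanged.
identity_type = [1.0, 0.0, 0.0]
head = [1.0, 2.0, 3.0]
relation = [1.0, 1.0, 1.0]
true_tail = [2.0, 3.0, 4.0]
corrupt_tail = [0.0, 0.0, 0.0]

score_true = translation_score(head, identity_type, relation,
                               true_tail, identity_type)
score_corrupt = translation_score(head, identity_type, relation,
                                  corrupt_tail, identity_type)
shifted = circular_convolution(head, [0.0, 1.0, 0.0])
print(score_true, score_corrupt, shifted)
```

In link prediction, candidate tails would be ranked by this score, with the lowest-distance candidates proposed as the missing entity.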

A modified expectation‐maximization algorithm for latent Gaussian graphical model

Canadian Journal of Statistics ◽

10.1002/cjs.11643 ◽

2021 ◽

Author(s):

Chaowen Zheng ◽

Jingfang Huang ◽

Ian A. Wood ◽

Yichao Wu

Keyword(s):

Expectation Maximization ◽

Graphical Model ◽

Expectation Maximization Algorithm ◽

Gaussian Graphical Model

Unsupervised Anomaly Detection with Distillated Teacher-Student Network Ensemble

Entropy ◽

10.3390/e23020201 ◽

2021 ◽

Vol 23 (2) ◽

pp. 201

Author(s):

Qinfeng Xiao ◽

Jing Wang ◽

Youfang Lin ◽

Wenbo Gongsa ◽

Ganghui Hu ◽

...

Keyword(s):

Anomaly Detection ◽

Multivariate Data ◽

Failure Detection ◽

Superior Performance ◽

Detection Algorithms ◽

Teacher Student ◽

Model Complex ◽

Unsupervised Anomaly Detection ◽

Real World Datasets ◽

Complex Features

We address the problem of unsupervised anomaly detection for multivariate data. Traditional machine learning based anomaly detection algorithms rely on specific assumptions about normal patterns and fail to model complex feature interactions and relations. Recent deep learning based methods are promising for extracting representations from complex features. These methods train an auxiliary task, e.g., reconstruction or prediction, on normal samples. They further assume that the model performs poorly on the auxiliary task for anomalies, since anomalies are never used during model optimization. However, this assumption does not always hold in practice. Deep models may also perform the auxiliary task well on anomalous samples, which causes anomalies to go undetected. To effectively detect anomalies in multivariate data, this paper introduces a teacher-student distillation based framework, Distillated Teacher-Student Network Ensemble (DTSNE). The teacher-student distillation paradigm is able to deal with high-dimensional complex features. In addition, an ensemble of student networks provides a better capability to avoid generalizing the auxiliary task performance to anomalous samples. To validate the effectiveness of our model, we conduct extensive experiments on real-world datasets. Experimental results show superior performance of DTSNE over competing methods. Analysis and discussion of the behavior of our model are also provided in the experiment section.

External Validation and Test-Retest Reliability of Postpartum Bonding Questionnaire in Spanish Mothers

The Spanish Journal of Psychology ◽

10.1017/sjp.2021.44 ◽

2021 ◽

Vol 24 ◽

Author(s):

Anna Torres-Giménez ◽

Alba Roca-Lecumberri ◽

Bàrbara Sureda ◽

Susana Andrés-Perpiña ◽

Bruma Palacios-Hernández ◽

...

Keyword(s):

Mental Health ◽

Maternal Mental Health ◽

Characteristic Curve ◽

External Validation ◽

Area Under The Curve ◽

Retest Reliability ◽

Outpatient Unit ◽

AUC Value ◽

Postpartum Bonding ◽

Test Retest Reliability

The aim of the present study was to validate the Spanish Postpartum Bonding Questionnaire (PBQ) against external criteria of bonding disorder, as well as to establish its test-retest reliability. One hundred fifty-six postpartum women consecutively recruited from a perinatal mental health outpatient unit completed the PBQ at 4–6 weeks postpartum. Four weeks later, all mothers completed the PBQ again and were interviewed using the Birmingham Interview for Maternal Mental Health to establish the presence of a bonding disorder. Receiver operating characteristic curve analysis revealed an area under the curve (AUC) value for the PBQ total score of 0.93, 95% CI [0.88, 0.98], with an optimal cut-off of 13 for detecting bonding disorders (sensitivity: 92%, specificity: 87%). Optimal cut-off scores for each scale were also obtained. The test-retest reliability coefficients were moderate to good. Our data confirm the validity of the PBQ for detecting bonding disorders in the Spanish population.
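
A cut-off such as the one reported above is typically found by scanning candidate thresholds and comparing sensitivity and specificity at each. The sketch below uses Youden's J (sensitivity + specificity - 1) as the selection rule, which is an assumption: the abstract does not say which criterion was used. The scores and labels are made up.

```python
def sens_spec_at(scores, labels, cutoff):
    """Sensitivity and specificity when score >= cutoff counts as positive."""
    tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= cutoff)
    fn = sum(1 for s, y in zip(scores, labels) if y == 1 and s < cutoff)
    tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < cutoff)
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= cutoff)
    return tp / (tp + fn), tn / (tn + fp)

def best_cutoff(scores, labels):
    """Return (cutoff, sensitivity, specificity) maximizing Youden's J."""
    best = None
    best_j = None
    for cutoff in sorted(set(scores)):
        sens, spec = sens_spec_at(scores, labels, cutoff)
        j = sens + spec - 1.0
        if best_j is None or j > best_j:
            best, best_j = (cutoff, sens, spec), j
    return best

# Hypothetical questionnaire totals and interview diagnoses (1 = disorder).
scores = [2, 5, 8, 10, 12, 14, 15, 18, 20, 25]
labels = [0, 0, 0, 0, 0, 1, 0, 1, 1, 1]

cutoff, sensitivity, specificity = best_cutoff(scores, labels)
print(cutoff, sensitivity, round(specificity, 3))
```

Other selection rules, such as fixing a minimum sensitivity for screening purposes, can give a different cut-off on the same data.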

Link Prediction: A Graphical Model Approach

2020 European Control Conference (ECC) ◽

10.23919/ecc51009.2020.9143706 ◽

2020 ◽

Author(s):

Daniele Alpago ◽

Mattia Zorzi ◽

Augusto Ferrante

Keyword(s):

Link Prediction ◽

Graphical Model ◽

Model Approach
