Using Aggregated Relational Data to Feasibly Identify Network Structure without Network Data

The American Economic Review ◽

10.1257/aer.20170861 ◽

2020 ◽

Vol 110 (8) ◽

pp. 2454-2484 ◽

Cited By ~ 3

Author(s):

Emily Breza ◽

Arun G. Chandrasekhar ◽

Tyler H. McCormick ◽

Mengjie Pan

Keyword(s):

Social Network ◽

Network Structure ◽

Field Experiments ◽

Network Formation ◽

Relational Data ◽

Network Data ◽

Formation Model ◽

Social Network Data ◽

Level Statistics ◽

Aggregated Relational Data

Social network data are often prohibitively expensive to collect, limiting empirical network research. We propose an inexpensive and feasible strategy for network elicitation using Aggregated Relational Data (ARD): responses to questions of the form “how many of your links have trait k?” Our method uses ARD to recover parameters of a network formation model, which permits sampling from a distribution over node- or graph-level statistics. We replicate the results of two field experiments that used network data and draw similar conclusions with ARD alone. (JEL C81, C93, D85, Z13)


GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks

Bioinformatics ◽

10.1093/bioinformatics/btaa459 ◽

2020 ◽

Vol 36 (Supplement_1) ◽

pp. i464-i473

Author(s):

Kapil Devkota ◽

James M Murphy ◽

Lenore J Cowen

Keyword(s):

Network Structure ◽

Biological Networks ◽

Link Prediction ◽

Prediction Method ◽

Global Network ◽

Local Network ◽

Supplementary Information ◽

Network Data ◽

Ppi Network ◽

Diffusion State

Motivation: One of the core problems in the analysis of biological networks is the link prediction problem. In particular, existing interaction networks are noisy and incomplete snapshots of the true network, with many true links missing because those interactions have not yet been experimentally observed. Methods to predict missing links have been more extensively studied for social than for biological networks; it was recently argued that there is some special structure in protein–protein interaction (PPI) network data that might allow alternative methods to outperform the best methods for social networks. Based on a generalization of the diffusion state distance, we design a new embedding-based link prediction method called global and local integrated diffusion embedding (GLIDE). GLIDE is designed to effectively capture global network structure, combined with alternative network type-specific customized measures that capture local network structure. We test GLIDE on a collection of three recently curated human biological networks derived from the 2016 DREAM disease module identification challenge, as well as a classical version of the yeast PPI network, in rigorous cross-validation experiments.

Results: We indeed find that different local network structure is dominant in different types of biological networks. We find that the simple local network measures are dominant in the highly connected network core between hub genes, but that GLIDE’s global embedding measure adds value in the rest of the network. For example, we make GLIDE-based link predictions from genes known to be involved in Crohn’s disease to genes that are not known to have an association, and make some new predictions, finding support in other network data and the literature.

Availability and implementation: GLIDE can be downloaded at https://bitbucket.org/kap_devkota/glide.

Supplementary information: Supplementary data are available at Bioinformatics online.


A Delay-Aware Network Structure for Wireless Sensor Networks With In-Network Data Fusion

IEEE Sensors Journal ◽

10.1109/jsen.2013.2240617 ◽

2013 ◽

Vol 13 (5) ◽

pp. 1622-1631 ◽

Cited By ~ 52

Author(s):

Chi-Tsun Cheng ◽

Henry Leung ◽

Patrick Maupin

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Data Fusion ◽

Network Structure ◽

Wireless Sensor ◽

Network Data


Network statistics and measurement error

10.1093/oso/9780198805090.003.0009 ◽

2018 ◽

Author(s):

Mark Newman

Keyword(s):

Measurement Error ◽

Em Algorithm ◽

Error Correction ◽

Network Structure ◽

Expectation Maximization ◽

Link Prediction ◽

Network Data ◽

Types Of Error

This chapter introduces the mathematics of network statistics, the quantification of errors in network data, and the estimation of network structure in the presence of error. The discussion starts with a summary of the types of error that can occur in network data and the empirical sources of those errors. The remainder of the chapter is given over to a discussion of the theory of network statistics, beginning with a review of the theory for ordinary real-valued (non-network) data, then developing the expectation-maximization (EM) algorithm for estimating network structure and error levels in the presence of error, with example applications. The chapter ends with a discussion of error correction methods such as link prediction and node disambiguation.


Network structure from relational data: Measurement and inference in four operational models

Social Networks ◽

10.1016/0378-8733(89)90008-7 ◽

1989 ◽

Vol 11 (2) ◽

pp. 89-134 ◽

Cited By ~ 8

Author(s):

Raymond Trevor Bradley ◽

Nancy C. Roberts

Keyword(s):

Network Structure ◽

Relational Data ◽

Data Measurement ◽

Operational Models


A Large-Scale Comparative Study of Informal Social Networks in Firms

Management Science ◽

10.1287/mnsc.2021.3997 ◽

2021 ◽

Author(s):

Abigail Z. Jacobs ◽

Duncan J. Watts

Keyword(s):

Network Structure ◽

Large Scale ◽

Network Size ◽

Emergent Properties ◽

Network Data ◽

Data Set ◽

Network Characteristics ◽

Individual Level ◽

Industrial Sectors ◽

Wide Range

Theories of organizations are sympathetic to long-standing ideas from network science that organizational networks should be regarded as multiscale and capable of displaying emergent properties. However, the historical difficulty of collecting individual-level network data for many (N ≫ 1) organizations, each of which comprises many (n ≫ 1) individuals, has hobbled efforts to develop specific, theoretically motivated hypotheses connecting micro- (i.e., individual-level) network structure with macro-organizational properties. In this paper we seek to stimulate such efforts with an exploratory analysis of a unique data set of aggregated, anonymized email data from an enterprise email system that includes 1.8 billion messages sent by 1.4 million users from 65 publicly traded U.S. firms spanning a wide range of sizes and 7 industrial sectors. We uncover wide heterogeneity among firms with respect to all measured network characteristics, and we find robust network and organizational variation as a result of size. Interestingly, we find no clear associations between organizational network structure and firm age, industry, or performance; however, we do find that centralization increases with geographical dispersion—a result that is not explained by network size. Although preliminary, these results raise new questions for organizational theory as well as new issues for collecting, processing, and interpreting digital network data. This paper was accepted by David Simchi-Levi, Special Issue of Management Science: 65th Anniversary.


Leveraging Network Structure to Infer Missing Values in Relational Data

10.2172/1082426 ◽

2007 ◽

Cited By ~ 1

Author(s):

B Gallagher ◽

T Eliassi-Rad

Keyword(s):

Network Structure ◽

Missing Values ◽

Relational Data


Inference and influence of network structure using snapshot social behavior without network data

Science Advances ◽

10.1126/sciadv.abb8762 ◽

2021 ◽

Vol 7 (23) ◽

pp. eabb8762

Author(s):

Antonia Godoy-Lorite ◽

Nick S. Jones

Keyword(s):

European Union ◽

Network Structure ◽

Social Preferences ◽

Behavioral Outcomes ◽

Population Level ◽

Behavioral Data ◽

Network Data ◽

The European Union ◽

Mayoral Elections ◽

Time Point

Population behavior, like voting and vaccination, depends on the structure of social networks. This structure can differ depending on behavior type and is typically hidden. However, we do often have behavioral data, albeit only snapshots taken at one time point. We present a method jointly inferring a model for both network structure and human behavior using only snapshot population-level behavioral data. This exploits the simplicity of a few-parameter, geometric sociodemographic network model and a spin-based model of behavior. We illustrate, for the European Union referendum and two London mayoral elections, how the model offers both prediction and the interpretation of the homophilic inclinations of the population. Beyond extracting behavior-specific network structure from behavioral datasets, our approach yields a framework linking inequalities and social preferences to behavioral outcomes. We illustrate potential network-sensitive policies: how changes to income inequality, social temperature, and homophilic preferences might have reduced polarization in a recent election.


The network data envelopment analysis models for non-homogenous decision making units based on the sun network structure

Central European Journal of Operations Research ◽

10.1007/s10100-018-0560-9 ◽

2018 ◽

Vol 27 (4) ◽

pp. 1221-1244 ◽

Cited By ~ 3

Author(s):

Qingyou Yan ◽

Fei Zhao ◽

Xu Wang ◽

Guoliang Yang ◽

Tomas Baležentis ◽

...

Keyword(s):

Decision Making ◽

Data Envelopment Analysis ◽

Network Structure ◽

Network Data ◽

Data Envelopment ◽

The Sun ◽

Network Data Envelopment Analysis ◽

Analysis Models ◽

Decision Making Units


A SURVEY ON PRIVACY PRESERVING TECHNIQUES FOR SOCIAL NETWORK DATA

Asian Journal of Pharmaceutical and Clinical Research ◽

10.22159/ajpcr.2017.v10s1.19587 ◽

2017 ◽

Vol 10 (13) ◽

pp. 112

Author(s):

Sharath Kumar J ◽

Maheswari N

Keyword(s):

Social Network ◽

Online Social Network ◽

Information Leakage ◽

Personal Data ◽

Relational Data ◽

Network Data ◽

Tabular Data ◽

New Techniques ◽

Social Network Data ◽

Social Graphs

In the 21st century, online social networks such as Facebook and Twitter play a very important role in everyone’s life. Social network data about any individual or organization can be published online at any time, which creates a risk of leaking personal data. Preserving the privacy of individuals, organizations, and companies is therefore needed before data are published online. Research in this area has been carried out for many years and is still ongoing. Various existing techniques preserve privacy for tabular data, also called relational data, and for social network data represented as graphs. Techniques designed for tabular data cannot be applied directly to complex structured graph data, which consist of vertices representing individuals and edges representing some kind of connection or relationship between them. Techniques such as K-anonymity, L-diversity, and T-closeness provide privacy for nodes, while techniques such as edge perturbation and edge randomization provide privacy for edges in social graphs. New techniques that integrate existing ones, such as K-anonymity, edge perturbation, edge randomization, and L-diversity, are still being developed to provide more privacy for relational data and social network data.
