ArticlePDF Available

Abstract and Figures

The identification of causal peer effects (also known as social contagion or induction) from observational data in social networks is challenged by two distinct sources of bias: latent homophily and unobserved confounding. In this paper, we investigate how causal peer effects of traits and behaviors can be identified using genes (or other structurally isomorphic variables) as instrumental variables (IV) in a large set of data generating models with homophily and confounding. We use directed acyclic graphs to represent these models and employ multiple IV strategies and report three main identification results. First, using a single fixed gene (or allele) as an IV will generally fail to identify peer effects if the gene affects past values of the treatment. Second, multiple fixed genes/alleles, or, more promisingly, time-varying gene expression, can identify peer effects if we instrument exclusion violations as well as the focal treatment. Third, we show that IV identification of peer effects remains possible even under multiple complications often regarded as lethal for IV identification of intra-individual effects, such as pleiotropy on observables and unobservables, homophily on past phenotype, past and ongoing homophily on genotype, inter-phenotype peer effects, population stratification, gene expression that is endogenous to past phenotype and past gene expression, and others. We apply our identification results to estimating peer effects of body mass index (BMI) among friends and spouses in the Framingham Heart Study. Results suggest a positive causal peer effect of BMI between friends.
Content may be subject to copyright.
A preview of the PDF is not available
... We found the effect of an omitted confounder would need to have a combined relationship with the outcome (employees' purchases at transaction t) and predictor (coworkers' purchase with the employee at t − 1) larger than any measured for the observed covariates in our models before our statistically significant measures were rendered non-significant (Supplementary Table 10). Finally, we conducted an instrumental variables analysis using purchases by a second-degree coworker (the coworker's coworker) as a putatively exogenous source of variation in the coworker's purchasing, thus eliminating concerns of homophily to the extent that it manifests as unmeasured confounding ( Supplementary Fig. 9) 47 . For these analyses, we began with the same subset of data used above in Fig. 3 involving the strictest definition of a social tie (tie probability ≥0.6), but to justify the instrumental variable exclusion restriction, we excluded any second-degree alters of a given ego that were also first-degree alters of that same ego during the study period, as well second-degree alters that were egos in the analysis themselves. ...
... The exception to that caveat comes from our instrumental variables analysis that, if its assumptions hold, would remove all bias due to homophily, confounding or simultaneity. We reiterate our caution in accepting that our available instrument satisfies the exclusion restriction in the presence of homophily, confounding or simultaneity and the independence assumption required for valid instrumental variables estimation 47 . Future work could consider using different instrumental variables (including the special case of randomized assignment to peers/networks) in a completely identified, closed network, although these are challenging methods to execute in real-world scenarios. ...
... The data setup for each stage (that is, the timing of purchases included as dependent and independent variables) was identical to that used in the GEE analyses. In the first stage regression, purchases by second-degree alters (the primary alters' alters, Supplementary Fig. 9) were used as an instrument for primary alters' purchases 47 . To strengthen the plausibility of the exclusion restriction, only purchases by secondary alters who did not appear elsewhere in the data as either first-degree alters or egos were included as instruments. ...
Article
Full-text available
Unhealthy food choice is an important driver of obesity, but research examining the relationship of food choices and social influence has been limited. We sought to assess associations in the healthfulness of workplace food choices among a large population of diverse employees whose food-related social connections were identified using passively collected data in a validated model. Data were drawn from 3 million encounters where pairs of employees made purchases together in 2015–2016. The healthfulness of food items was defined by ‘traffic light’ labels. Cross-sectional simultaneously autoregressive models revealed that proportions of both healthy and unhealthy items purchased were positively associated between connected employees. Longitudinal generalized estimating equation models also found positive associations between an employee’s current food purchase and the most recent previous food purchase a coworker made together with the employee. These data indicate that workplace interventions to promote healthy eating and reduce obesity should test peer-based strategies.
... In sociology and social sciences, the purpose of causal inference is to examine the association between social network and behaviors, also known as peer effects, social contagion or induction [173]- [175]. The peer effect means that the behavior, traits, or characteristics of an individual's peers (those he is connected to or alters) would affect his behavior [175]. ...
... In sociology and social sciences, the purpose of causal inference is to examine the association between social network and behaviors, also known as peer effects, social contagion or induction [173]- [175]. The peer effect means that the behavior, traits, or characteristics of an individual's peers (those he is connected to or alters) would affect his behavior [175]. Due to contextual confounding, peer selection, simultaneity bias and measurement error, [176] points that it is very difficult to estimate the peer effects from observational data but instrumental variables (IVs) can help to address these problems. ...
Preprint
Full-text available
Causal inference is the process of using assumptions, study designs, and estimation strategies to draw conclusions about the causal relationships between variables based on data. This allows researchers to better understand the underlying mechanisms at work in complex systems and make more informed decisions. In many settings, we may not fully observe all the confounders that affect both the treatment and outcome variables, complicating the estimation of causal effects. To address this problem, a growing literature in both causal inference and machine learning proposes to use Instrumental Variables (IV). This paper serves as the first effort to systematically and comprehensively introduce and discuss the IV methods and their applications in both causal inference and machine learning. First, we provide the formal definition of IVs and discuss the identification problem of IV regression methods under different assumptions. Second, we categorize the existing work on IV methods into three streams according to the focus on the proposed methods, including two-stage least squares with IVs, control function with IVs, and evaluation of IVs. For each stream, we present both the classical causal inference methods, and recent developments in the machine learning literature. Then, we introduce a variety of applications of IV methods in real-world scenarios and provide a summary of the available datasets and algorithms. Finally, we summarize the literature, discuss the open problems and suggest promising future research directions for IV methods and their applications. We also develop a toolkit of IVs methods reviewed in this survey at https://github.com/causal-machine-learning-lab/mliv.
... Past studies on inferring network effects have attempted to develop a better understanding of the role of homophily in network formation, and to test hypotheses about the relative prevalence of homophily and peer influence in social behavior using techniques such as instrumental variables [89], propensity score matching [90], SAOMs [17], [65]- [69], ERGMs [16], and latent space models [91], [92]. In this section, we will illustrate how our simulation strategy builds upon and extends [17], [93] to obtain estimates of homophily and influence based on network and behavioral characteristics across different sampling conditions. ...
Preprint
Evaluating the impact of policy interventions on respondents who are embedded in a social network is often challenging due to the presence of network interference within the treatment groups, as well as between treatment and non-treatment groups throughout the network. In this paper, we propose a modeling strategy that combines existing work on stochastic actor-oriented models (SAOM) and diffusion contagion models with a novel network sampling method based on the identification of independent sets. By assigning respondents from an independent set to the treatment, we are able to block any direct spillover of the treatment, thereby allowing us to isolate the direct effect of the treatment from the indirect network-induced effects. As a result, our method allows for the estimation of both the direct as well as the net effect of a chosen policy intervention, in the presence of network effects in the population. We perform a comparative simulation analysis to show that the choice of sampling technique leads to significantly distinct estimates for both direct and net effects of the policy, as well as for the relevant network effects, such as homophily. Furthermore, using a modified diffusion contagion model, we show that our proposed sampling technique leads to greater and faster spread of the policy-linked behavior through the network. This study highlights the importance of network sampling techniques in improving policy evaluation studies and has the potential to help researchers and policymakers with better planning, designing, and anticipating policy responses in a networked society.
... It is plausible that there may be cross-trait social genetic effects given large phenotypic and genotypic correlations between BMI, drinking, and smoking (Liu et al. 2019) and the possibility that genes have 'pleiotropic effects' (having effects on multiple phenotypes; Lee et al. 2012;Visscher et al. 2017). A suggestion to study social influence effects using genes as instrumental variables forms another approach that may give insight here, but suffers from stricter assumptions (O'Malley et al. 2014). ...
Article
Full-text available
Partners resemble each other in health behaviors and outcomes such as alcohol use, smoking, physical activity, and obesity. While this is consistent with social contagion theory suggesting partner influence, it is notoriously difficult to establish causality because of assortative mating and contextual confounding. We offer a novel approach to studying social contagion in health in long-term partnerships by combining genetic data of both partners in married/cohabiting couples with longitudinal data on their health behaviors and outcomes. We examine the influence of the partner’s genetic predisposition for three health outcomes and behaviors (BMI, smoking, and drinking) among married/cohabiting couples. We use longitudinal data from the Health and Retirement Study and the English Longitudinal Study of Ageing with data on health outcomes and genotypes for both partners. Results show that changes over time in BMI, smoking, and drinking depend on the partner’s genetic predispositions to these traits. These findings underline the importance of people’s social surroundings for their health and highlight the potential of targeting health interventions at couples. Supplementary Information The online version contains supplementary material available at 10.1007/s10519-023-10147-w.
... Addressing collider bias in the causal modelling of social network data and learning-based hypotheses presents additional challenges (Lyons, 2011;Shalizi & Thomas, 2011). While some traditional approaches for addressing bias may assist with collider bias [i.e., counterfactual (Elwert & Christakis, 2008) or instrumental variable approaches (O'Malley et al., 2014)], several statistical models unique to network analysis have also been introduced as possible solutions. Shalizi and Thomas (2011) propose a model of tie formation and dissolution, aiming to identify instances where behaviors spread in response to tie formation and cease in response to tie dissolution, eliminating instances where an outcome is maintained by similar characteristics rather than social contagion (Krivitsky & Handcock, 2014). ...
Article
Full-text available
Objectives We provide a brief overview of collider bias and its implications for criminological research. Methods Owing to the nature of the topics studied, as well as the common data sources used to carry out much of this research, work in the field may often become vulnerable to a specific methodological problem known as collider bias. Collider bias occurs when exposure variables and outcomes independently cause a third variable, and this variable is included in statistical models. Colliders represent somewhat of a paradox in that there is scholarship discussing the issue, yet it has managed to remain a relatively cryptic threat compared to other sources of bias. Results We argue that, far from being an obscure concern, colliders almost certainly have pervasive impact in criminal justice and criminology. Conclusion We close by offering a general set of strategies for addressing the challenges posed by collider bias. While there is no panacea, there are better practices, many of which are underutilized in the disciplines that study crime and it's attendant topics.
... Individual outcomes can also be influenced by the macro features of a network, such as cohesion, hierarchy, clustering, and composition. Cohesion describes how densely connected a network is The instrumental variables approach is discussed by Bramoullé et al. (2009), O'Malley et al. (2014, and An (2015a). Randomized peer treatment is discussed by An ( , 2015b. ...
Article
Fueled by recent advances in statistical modeling and the rapid growth of network data, social network analysis has become increasingly popular in sociology and related disciplines. However, a significant amount of work in the field has been descriptive and correlational, which prevents the findings from being more rigorously translated into practices and policies. This article provides a review of the popular models and methods for causal network analysis, with a focus on causal inference threats (such as measurement error, missing data, network endogeneity, contextual confounding, simultaneity, and collinearity) and potential solutions (such as instrumental variables, specialized experiments, and leveraging longitudinal data). It covers major models and methods for both network formation and network effects and for both sociocentric networks and egocentric networks. Lastly, this review also discusses future directions for causal network analysis. Expected final online publication date for the Annual Review of Sociology, Volume 48 is July 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Article
Many studies have examined the diffusion of health care innovation but less is known about the diffusion of health care fraud. In this paper, we consider the diffusion of potentially fraudulent Medicare home health care billing in the United States during 2002-16, with a focus on the 21 hospital referral regions (HRRs) covered by local Department of Justice (DOJ) anti-fraud "strike force" offices. We hypothesize that patient-sharing across home health care agencies (HHAs) provides a mechanism for the rapid diffusion of fraudulent strategies. We measure such activity using a novel bipartite mixture (or BMIX) network index, which captures patient sharing across multiple agencies and thus conveys more information about the diffusion process than conventional unipartite network measures. Using a complete population of fee-for-service Medicare claims data, we first find a remarkable increase in home health care activity between 2002 and 2009 in many regions targeted by the DOJ; average billing per Medicare enrollee in McAllen TX and Miami increased by $2127 and $2422 compared to just an average $289 increase in other HRRs not targeted by the DOJ. Second, we establish that the HRR-level BMIX (but not other network measures) was a strong predictor of above-average home health care expenditures across HRRs. Third, within HRRs, agencies sharing more patients with other agencies were predicted to increase billing. Finally, the initial 2002 BMIX index was a strong predictor of subsequent changes in HRR-level home health billing during 2002-9. These results highlight the importance of bipartite network structure in diffusion and in infection and contagion models more generally.
Article
In this article, I present a new multivariate regression model for analyzing outcomes with network dependence. The model is capable to account for two types of outcome dependence including the mean dependence that allows the outcome to depend on selected features of a known dependence network and the error dependence that allows the outcome to be additionally correlated based on patterned connections in the dependence network (e.g., according to whether the ties are asymmetric, mutual, or triadic). For example, when predicting a group of students’ smoking status, the outcome can depend on the students’ positions in their friendship network and also be correlated among friends. I show that analyses ignoring the mean dependence can lead to severe bias in the estimated coefficients while analyses ignoring the error dependence can lead to inefficient inferences and failures in recognizing unmeasured social processes. I compare the new model with related models such as multilevel models, spatial regression models, and exponential random graph models and show their connections and differences. I propose a two-step, feasible generalized least squares estimator to estimate the model that is computationally fast and robust. Simulations show the validity of the new model (and the estimator) while four empirical examples demonstrate its versatility. Associated R package “fglsnet” is available for public use.
Article
A prominent threat to causal inference about peer effects in social science studies is the presence of homophily bias, that is, social influence between friends and families is entangled with common characteristics or underlying similarities that form close connections. Analysis of social study data has suggested that certain health conditions such as obesity and psychological states including happiness and loneliness can spread between friends and relatives. However, such analyses of peer effects or contagion effects have come under criticism because homophily bias may compromise the causal statement. We develop a regression‐based approach which leverages a negative control exposure for identification and estimation of contagion effects on additive or multiplicative scales, in the presence of homophily bias. We apply our methods to evaluate the peer effect of obesity in Framingham Offspring Study. This article is protected by copyright. All rights reserved
Chapter
Full-text available
This chapter discusses the use of directed acyclic graphs (DAGs) for causal inference in the observational social sciences. It focuses on DAGs’ main uses, discusses central principles, and gives applied examples. DAGs are visual representations of qualitative causal assumptions: They encode researchers’ beliefs about how the world works. Straightforward rules map these causal assumptions onto the associations and independencies in observable data. The two primary uses of DAGs are (1) determining the identifiability of causal effects from observed data and (2) deriving the testable implications of a causal model. Concepts covered in this chapter include identification, d-separation, confounding, endogenous selection, and overcontrol. Illustrative applications then demonstrate that conditioning on variables at any stage in a causal process can induce as well as remove bias, that confounding is a fundamentally causal rather than an associational concept, that conventional approaches to causal mediation analysis are often biased, and that causal inference in social networks inherently faces endogenous selection bias. The chapter discusses several graphical criteria for the identification of causal effects of single, time-point treatments (including the famous backdoor criterion), as well identification criteria for multiple, time-varying treatments.
Article
We outline a framework for causal inference in settings where assignment to a binary treatment is ignorable, but compliance with the assignment is not perfect so that the receipt of treatment is nonignorable. To address the problems associated with comparing subjects by the ignorable assignment - an "intention-to-treat analysis" - we make use of instrumental variables, which have long been used by economists in the context of regression models with constant treatment effects. We show that the instrumental variables (IV) estimand can be embedded within the Rubin Causal Model (RCM) and that under some simple and easily interpretable assumptions, the IV estimand is the average causal effect for a subgroup of units, the compliers. Without these assumptions, the IV estimand is simply the ratio of intention-to-treat causal estimands with no interpretation as an average causal effect. The advantages of embedding the IV approach in the RCM are that it clarifies the nature of critical assumptions needed for a causal interpretation, and moreover allows us to consider sensitivity of the results to deviations from key assumptions in a straightforward manner. We apply our analysis to estimate the effect of veteran status in the Vietnam era on mortality, using the lottery number that assigned priority for the draft as an instrument, and we use our results to investigate the sensitivity of the conclusions to critical assumptions.
Article
Background: Alcohol consumption has important health-related consequences and numerous biological and social determinants. Objective: To explore quantitatively whether alcohol consumption behavior spreads from person to person in a large social network of friends, coworkers, siblings, spouses, and neighbors, followed for 32 years. Design: Longitudinal network cohort study. Setting: The Framingham Heart Study. Participants: 12 067 persons assessed at several time points between 1971 and 2003. Measurements: Self-reported alcohol consumption (number of drinks per week on average over the past year and number of days drinking within the past week) and social network ties, measured at each time point. Results: Clusters of drinkers and abstainers were present in the network at all time points, and the clusters extended to 3 degrees of separation. These clusters were not only due to selective formation of social ties among drinkers but also seem to reflect interpersonal influence. Changes in the alcohol consumption behavior of a person's social network had a statistically significant effect on that person's subsequent alcohol consumption behavior. The behaviors of immediate neighbors and coworkers were not significantly associated with a person's drinking behavior, but the behavior of relatives and friends was. Limitations: A nonclinical measure of alcohol consumption was used. Also, it is unclear whether the effects on long-term health are positive or negative, because alcohol has been shown to be both harmful and protective. Finally, not all network ties were observed. Conclusion: Network phenomena seem to influence alcohol consumption behavior. This has implications for clinical and public health interventions and further supports group-level interventions to reduce problematic drinking.
Article
The prevalence of obesity has increased substantially over the past 30 years. We performed a quantitative analysis of the nature and extent of the person-to-person spread of obesity as a possible factor contributing to the obesity epidemic. We evaluated a densely interconnected social network of 12,067 people assessed repeatedly from 1971 to 2003 as part of the Framingham Heart Study. The body-mass index was available for all subjects. We used longitudinal statistical models to examine whether weight gain in one person was associated with weight gain in his or her friends, siblings, spouse, and neighbors. Discernible clusters of obese persons (body-mass index [the weight in kilograms divided by the square of the height in meters], > or =30) were present in the network at all time points, and the clusters extended to three degrees of separation. These clusters did not appear to be solely attributable to the selective formation of social ties among obese persons. A person's chances of becoming obese increased by 57% (95% confidence interval [CI], 6 to 123) if he or she had a friend who became obese in a given interval. Among pairs of adult siblings, if one sibling became obese, the chance that the other would become obese increased by 40% (95% CI, 21 to 60). If one spouse became obese, the likelihood that the other spouse would become obese increased by 37% (95% CI, 7 to 73). These effects were not seen among neighbors in the immediate geographic location. Persons of the same sex had relatively greater influence on each other than those of the opposite sex. The spread of smoking cessation did not account for the spread of obesity in the network. Network phenomena appear to be relevant to the biologic and behavioral trait of obesity, and obesity appears to spread through social ties. These findings have implications for clinical and public health interventions.
Article
Lyons (2011) offered several critiques of the social network analyses of Christakis and Fowler, including issues of confounding, model inconsistency, and statistical dependence in networks. Here we show that in some settings, social network analyses of the type employed by Christakis and Fowler will still yield valid tests of the null of no social contagion, even though estimates and confidence intervals may not be valid. In particular, we show that if the alter's state is lagged by an additional period, then under the null of no contagion, the problems of model inconsistency and statistical dependence effectively disappear which allow for testing for contagion. Our results clarify the setting in which even "flawed" social network analyses are still useful for assessing social contagion and social influence.
Article
Written by one of the preeminent researchers in the field, this book provides a comprehensive exposition of modern analysis of causation. It shows how causality has grown from a nebulous concept into a mathematical theory with significant applications in the fields of statistics, artificial intelligence, economics, philosophy, cognitive science, and the health and social sciences. Judea Pearl presents and unifies the probabilistic, manipulative, counterfactual, and structural approaches to causation and devises simple mathematical tools for studying the relationships between causal connections and statistical associations. The book will open the way for including causal analysis in the standard curricula of statistics, artificial intelligence, business, epidemiology, social sciences, and economics. Students in these fields will find natural models, simple inferential procedures, and precise mathematical definitions of causal concepts that traditional texts have evaded or made unduly complicated. The first edition of Causality has led to a paradigmatic change in the way that causality is treated in statistics, philosophy, computer science, social science, and economics. Cited in more than 5,000 scientific publications, it continues to liberate scientists from the traditional molds of statistical thinking. In this revised edition, Judea Pearl elucidates thorny issues, answers readers’ questions, and offers a panoramic view of recent advances in this field of research. Causality will be of interests to students and professionals in a wide variety of fields. Anyone who wishes to elucidate meaningful relationships from data, predict effects of actions and policies, assess explanations of reported events, or form theories of causal understanding and causal speech will find this book stimulating and invaluable.
Article
SUMMARY The primary aim of this paper is to show how graphical models can be used as a mathematical language for integrating statistical and subject-matter information. In par- ticular, the paper develops a principled, nonparametric framework for causal inference, in which diagrams are queried to determine if the assumptions available are sufficient for identifying causal effects from nonexperimental data. If so the diagrams can be queried to produce mathematical expressions for causal effects in terms of observed distributions; otherwise, the diagrams can be queried to suggest additional observations or auxiliary experiments from which the desired inferences can be obtained.