Jiahui Xu’s research while affiliated with Pennsylvania State University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (3)


Flexibly Detecting Effect Heterogeneity with an Application to the Effects of College on Reducing Poverty
  • Preprint

September 2024

·

6 Reads

Jiahui Xu

·

Jennie E. Brand

·

Tanvi Shinkre

·

Units of analysis in social research do not respond uniformly to events and interventions. Yet it is not always clear which axes of heterogeneity are most important to consider before data analysis. We use causal forests to nonparametrically uncover heterogeneous treatment effects. We then adapt causal forests and advance causal mediation forests to assess heterogeneous direct and indirect effects. This novel adaptation explores heterogeneity in the causal paths linking a treatment to an outcome through a binary, multinomial, or continuous mediator. Bothcausal forests and causal mediation forests robustly adjust for high-dimensional confounders, yielding asymptotically normal and n1/2 consistent estimates. We show that forest-based approaches often outperform alternative methods in identifying effect heterogeneity. We apply the forest-based methods to study the heterogeneous effects of four-year college on reducing poverty with data from the National Longitudinal Survey of Youth 1979 and 1997 cohorts and find large gains for disadvantaged youth.


Uncovering Sociological Effect Heterogeneity Using Tree-Based Machine Learning

March 2021

·

47 Reads

·

34 Citations

Sociological Methodology

Individuals do not respond uniformly to treatments, such as events or interventions. Sociologists routinely partition samples into subgroups to explore how the effects of treatments vary by selected covariates, such as race and gender, on the basis of theoretical priors. Data-driven discoveries are also routine, yet the analyses by which sociologists typically go about them are often problematic and seldom move us beyond our biases to explore new meaningful subgroups. Emerging machine learning methods based on decision trees allow researchers to explore sources of variation that they may not have previously considered or envisaged. In this article, the authors use tree-based machine learning, that is, causal trees, to recursively partition the sample to uncover sources of effect heterogeneity. Assessing a central topic in social inequality, college effects on wages, the authors compare what is learned from covariate and propensity score–based partitioning approaches with recursive partitioning based on causal trees. Decision trees, although superseded by forests for estimation, can be used to uncover subpopulations responsive to treatments. Using observational data, the authors expand on the existing causal tree literature by applying leaf-specific effect estimation strategies to adjust for observed confounding, including inverse propensity weighting, nearest neighbor matching, and doubly robust causal forests. We also assess localized balance metrics and sensitivity analyses to address the possibility of differential imbalance and unobserved confounding. The authors encourage researchers to follow similar data exploration practices in their work on variation in sociological effects and offer a straightforward framework by which to do so.


Uncovering Sociological Effect Heterogeneity Using Machine Learning

September 2019

·

975 Reads

·

4 Citations

Individuals do not respond uniformly to treatments, events, or interventions. Sociologists routinely partition samples into subgroups to explore how the effects of treatments vary by covariates like race, gender, and socioeconomic status. In so doing, analysts determine the key subpopulations based on theoretical priors. Data-driven discoveries are also routine, yet the analyses by which sociologists typically go about them are problematic and seldom move us beyond our expectations, and biases, to explore new meaningful subgroups. Emerging machine learning methods allow researchers to explore sources of variation that they may not have previously considered, or envisaged. In this paper, we use causal trees to recursively partition the sample and uncover sources of treatment effect heterogeneity. We use honest estimation, splitting the sample into a training sample to grow the tree and an estimation sample to estimate leaf-specific effects. Assessing a central topic in the social inequality literature, college effects on wages, we compare what we learn from conventional approaches for exploring variation in effects to causal trees. Given our use of observational data, we use leaf-specific matching and sensitivity analyses to address confounding and offer interpretations of effects based on observed and unobserved heterogeneity. We encourage researchers to follow similar practices in their work on variation in sociological effects.

Citations (2)


... This investigation intentionally circumvents the multinomial logit model to prevent regression to a variable-centered approach and to avoid the "curse of dimensionality" that arises when integrating a wide array of interaction terms into the model. Instead, the research employs a conditional inference tree model-a tree-based methodology recognized as a supervised learning algorithm conducive to causal inference (Athey and Imbens, 2016;Wager and Athey, 2018;Brand et al., 2021). By utilizing techniques such as heterogeneity maximization and adaptive nearest neighbor matching within recursive partitioning, these models enable the segmentation of the dataset into distinct sub-samples. ...

Reference:

From “transitions” to “trajectories”: towards a holistic interactionistic analysis of educational inequality in contemporary China
Uncovering Sociological Effect Heterogeneity Using Tree-Based Machine Learning
  • Citing Article
  • March 2021

Sociological Methodology