Xiao Xu’s research while affiliated with University of Groningen and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (3)


Understanding Narratives of Uncertainty in Fertility Intentions of Dutch Women: A Neural Topic Modeling Approach
  • Article

August 2024

·

13 Reads

Xiao Xu

·

Anne Gauthier

·

·

Uncertainty in fertility intentions is a major obstacle to understanding contemporary trends in fertility decision-making and its outcomes. Quantifying this uncertainty by structural factors such as income, ethnicity, and housing conditions is recognized as insufficient. A recently proposed framework on subjective narratives has opened up a new way to gauge factors behind fertility decision-making and uncertainty. Through surveys, such narratives can be elicited with open-ended questions (OEQs). However, analyzing answers to OEQs typically involves extensive human coding, imposing constraints on sample size. Natural Language Processing (NLP) techniques assist researchers in grasping aspects of the underlying reasoning behind responses with much less human effort. In this study, using automatic neural topic modeling methods, we identify and interpret topics and themes underlying the narratives on fertility intention uncertainty of women in the Netherlands. We used Contextualized Topic Models (CTMs), a neural topic model using pre-trained representations of Dutch language, to conduct our analyses. Our results show that nine topics dominate the narratives about fertility planning, with age and health-related issues as the most prominent ones. In addition, we found that uncertainty in fertility intentions is not homogeneous, as women who feel uncertain due to real-life constraints and those who have no fertility plans at all put their stress on vastly different narratives.


The five outcomes that models had to predict.
Predictive ability (R²) for different models (x-axis and colours) on different outcomes (panels). Full: model including all variables; ego: model including only ego characteristics; composition: only network composition variables; structure: only network structure variables. The dot is the average out-of-sample R² from a LASSO regression across ten folds; the maximum and minimum heights of the bars represent one standard error above and below the average. The standard error is based on the standard deviation in R² across the ten folds divided by the square root of ten. The diamonds represent in-sample R² based on a linear regression.
The magnitude of the LASSO regression coefficients that were not shrunk to zero in the model including all variables. For none of the outcomes, structural characteristics were kept in the model. No characteristics were kept in the model in predicting happiness in relation to having children.
A data-driven approach shows that individuals' characteristics are more important than their networks in predicting fertility preferences
  • Article
  • Full-text available

December 2023

·

88 Reads

·

2 Citations

People's networks are considered key in explaining fertility outcomes—whether people want and have children. Existing research on social influences on fertility is limited because data often come from small networks or highly selective samples, only few network variables are considered, and the strength of network effects is not properly assessed. We use data from a representative sample of Dutch women reporting on over 18 000 relationships. A data-driven approach including many network characteristics accounted for 0 to 40% of the out-of-sample variation in different outcomes related to fertility preferences. Individual characteristics were more important for all outcomes than network variables. Network composition was also important, particularly those people in the network desiring children or those choosing to be childfree. Structural network characteristics, which feature prominently in social influence theories and are based on the relations between people in the networks, hardly mattered. We discuss to what extent our results provide support for different mechanisms of social influence, and the advantages and disadvantages of our data-driven approach in comparison to traditional approaches.

Download

Citations (1)


... In the social sciences, there is a growing recognition that quantifying (out-ofsample) predictability of an outcome can improve our scientific understanding of it and assess the practical relevance of the theories explaining it [27][28][29][30][31]. Despite the potential of a focus on prediction, it remains under-utilised in the social sciences and demography in particular, although notable exceptions do exist [32][33][34][35][36][37][38]. One of the methods to measure predictability is a data challenge, where several teams compete to predict a particular outcome using the same dataset and evaluation criteria. ...

Reference:

Combining the strengths of Dutch survey and register data in a data challenge to predict fertility (PreFer)
A data-driven approach shows that individuals' characteristics are more important than their networks in predicting fertility preferences