Christopher M. Danforth’s research while affiliated with University of Vermont and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (313)


An Assessment of Measuring Local Levels of Homelessness Through Proxy Social Media Signals
  • Article
  • Full-text available

January 2025

·

12 Reads

International Journal on Homelessness

Yoshi Bird

·

Sarah Grobe

·

Michael Arnold

·

[...]

·

Peter Sheridan Dodds

Although nearly 600,000 people experience homelessness in the United States (U.S) every year, efforts to address this public health crisis are limited by the underperformance of standard methods to estimate localized and nationwide homelessness. Recent studies suggest social media activity can function as a proxy for measures of state-level public health, detectable through straightforward applications of natural language processing. We present the results of our efforts to apply this approach to estimate homelessness at the state level throughout the US during the period 2010-2019 and 2022 using a dataset of roughly 1 million geotagged tweets containing the substring “homeless.” Correlations between homelessness-related tweet counts and ranked per capita homelessness volume, but not general-population densities, suggest a relationship between the likelihood of Twitter users to personally encounter or observe homelessness in their everyday lives and their likelihood to communicate about it online. An increase in the log-odds of the word “homeless” appearing in an English-language tweet, as well as an acceleration in the increase in average tweet sentiment, suggest that tweets about homelessness are also affected by trends at the nation-scale. Additionally, changes to the lexical content of tweets over time suggest that reversals to the polarity of national or state-level trends may be detectable through an increase in political or service-sector language over the semantics of charity or direct appeals. While a computational approach to social media analysis may provide a low-cost, real-time dataset rich with information about nationwide and localized impacts of homelessness and homelessness policy, we find that practical issues abound, limiting the potential of social media as a proxy to complement other measures of homelessness.

Download

Collective sleep and activity patterns of college students from wearable devices

December 2024

·

12 Reads

To optimize interventions for improving wellness, it is essential to understand habits, which wearable devices can measure with greater precision. Using high temporal resolution biometric data taken from the Oura Gen3 ring, we examine daily and weekly sleep and activity patterns of a cohort of young adults (N=582) in their first semester of college. A high compliance rate is observed for both daily and nightly wear, with slight dips in wear compliance observed shortly after waking up and also in the evening. Most students have a late-night chronotype with a median midpoint of sleep at 5AM, with males and those with mental health impairment having more delayed sleep periods. Social jetlag, or the difference in sleep times between free days and school days, is prevalent in our sample. While sleep periods generally shift earlier on weekdays and later on weekends, sleep duration on both weekdays and weekends is shorter than during prolonged school breaks, suggesting chronic sleep debt when school is in session. Synchronized spikes in activity consistent with class schedules are also observed, suggesting that walking in between classes is a widespread behavior in our sample that substantially contributes to physical activity. Lower active calorie expenditure is associated with weekends and a delayed but longer sleep period the night before, suggesting that for our cohort, active calorie expenditure is affected less by deviations from natural circadian rhythms and more by the timing associated with activities. Our study shows that regular sleep and activity routines may be inferred from consumer wearable devices if high temporal resolution and long data collection periods are available.


Tokens, the oft-overlooked appetizer: Large language models, the distributional hypothesis, and meaning

December 2024

·

8 Reads

Tokenization is a necessary component within the current architecture of many language models, including the transformer-based large language models (LLMs) of Generative AI, yet its impact on the model's cognition is often overlooked. We argue that LLMs demonstrate that the Distributional Hypothesis (DM) is sufficient for reasonably human-like language performance, and that the emergence of human-meaningful linguistic units among tokens motivates linguistically-informed interventions in existing, linguistically-agnostic tokenization techniques, particularly with respect to their roles as (1) semantic primitives and as (2) vehicles for conveying salient distributional patterns from human language to the model. We explore tokenizations from a BPE tokenizer; extant model vocabularies obtained from Hugging Face and tiktoken; and the information in exemplar token vectors as they move through the layers of a RoBERTa (large) model. Besides creating sub-optimal semantic building blocks and obscuring the model's access to the necessary distributional patterns, we describe how tokenization pretraining can be a backdoor for bias and other unwanted content, which current alignment practices may not remediate. Additionally, we relay evidence that the tokenization algorithm's objective function impacts the LLM's cognition, despite being meaningfully insulated from the main system intelligence.


Percentage difference between individuals who chose health goals by dichotomous demographic, health, and lifestyle factors
Gender (men—women); Sexual Orientation (heterosexual—sexual minority); Income (family earnings < $50,000—family earnings > $50,000); Age (17–30–50–76); Mental Health (MH) Diagnosis (At least one mental health diagnosis—none); Physical Health (PH) Diagnosis (At least one physical health diagnosis—none); Wearable Use (Wears a wearable device—does not); Meditation App Use (Uses Calm or Headspace Apps—not); Extraversion (High—Low; cut at mean); Perseverance (high–low; cut at mean).
Rank biased overlap scores for each health goal across demographics
Gender (woman vs man); Sexual Orientation (heterosexual vs sexual minority); Income (family earnings < vs > $50,000); Age (17–30 vs 50–76); Mental Health (MH) Diagnosis (At least one mental health diagnosis vs none); Physical Health (PH) Diagnosis (At least one physical health diagnosis vs none); Wearable Use (Wears a wearable device vs does not); Meditation App Use (Uses Calm or Headspace Apps vs not); Extraversion (High vs Low cut at mean); Perseverance (high vs low cut at mean). RBO is a similarity measure that falls between 0–1, where an RBO of 0 means the lists are disjoint, and an RBO of 1 means the lists are identical.
Sankey diagrams
(A) Top ten practices for improving sleep as preferred by participants with and without the Headspace or Calm apps. (B) Top ten practices for improving physical health as preferred by participants with and without cardiovascular disease. (C) Top ten practices for improving emotional health as preferred by participants with depression compared to those without any mental health condition. (D) Top ten practices for improving social wellness as preferred by participants with anxiety compared to those without any mental health conditions.
Top ten practices for social wellness by region of the United States
The base layer map of the United States depicted in this figure is adapted from an open source freely available at https://www.fla-shop.com/svg/usa/, which permits use for both commercial and personal purposes with proper attribution, under the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
Dashboard screenshots
(A) The landing page for the dashboard allows users to select the dimension they are interested in exploring further. (B) Upon selecting a health goal, users will see the top 10 practices for that particular health goal which they can filter by demographic by selecting filters from the dropdown on the left side of the screen. In the top right corner is a “Download CSV” button that allows users to download the raw data that was used to identify the top practices. (C) By choosing to compare all goals on the landing page, users can select goals and filters to view a bar graph of the RBO scores. (D) Users can view a Sankey diagram of the wellness practices chosen for a specific goal separated by a particular demographic, health, or lifestyle factor by clicking on a specific bar in the bar graph. The images and clip-art were created using both free and Pro elements from Canva (https://www.canva.com/). All elements are used in accordance with Canva’s Content License Agreement (https://www.canva.com/policies/content-license-agreement/).

+2

Meeting people where they are: Crowdsourcing goal-specific personalized wellness practices

November 2024

·

28 Reads

Objectives Despite the development of efficacious wellness interventions, sustainable wellness behavior change remains challenging. To optimize engagement, initiating small behaviors that build upon existing practices congruent with individuals’ lifestyles may promote sustainable wellness behavior change. In this study, we crowd-sourced helpful, flexible, and engaging wellness practices to identify a list of those commonly used for improving sleep, productivity, and physical, emotional, and social wellness from participants who felt they had been successful in these dimensions. Method We recruited a representative sample of 992 U.S. residents to survey the wellness dimensions in which they had achieved success and their specific wellness practices. Results Responses were aggregated across demographic, health, lifestyle factors, and wellness dimension. Exploration of these data revealed that there was little overlap in preferred practices across wellness dimensions. Within wellness dimensions, preferred practices were similar across demographic factors, especially within the top 3–4 most selected practices. Interestingly, daily wellness practices differ from those typically recommended as efficacious by research studies and seem to be impacted by health status (e.g., depression, cardiovascular disease). Additionally, we developed and provide for public use a web dashboard that visualizes and enables exploration of the study results. Conclusions Findings identify personalized, sustainable wellness practices targeted at specific wellness dimensions. Future studies could leverage tailored practices as recommendations for optimizing the development of healthier behaviors.


Figure 2: Top Ten Mortality Causes by Gender in Film versus Real-Life: Female suicide deaths and male accidents, cerebrovascular disease, and heart disease are over-represented in film. Men are over-represented in all other causes of death. We only include causes of death present in Cinemorgue dataset.
Hollywood's misrepresentation of death: A comparison of overall and by-gender mortality causes in film and the real world

November 2024

·

32 Reads

The common phrase 'representation matters' asserts that media has a measurable and important impact on civic society's perception of self and others. The representation of health in media, in particular, may reflect and perpetuate a society's disease burden. Here, for the top 10 major causes of death in the United States, we examine how cinematic representation of overall and by-gender mortality diverges from reality. Using crowd-sourced data on film deaths from Cinemorgue Wiki, we employ natural language processing (NLP) techniques to analyze shifts in representation of deaths in movies versus the 2021 National Vital Statistic Survey (NVSS) top ten mortality causes. Overall, movies strongly overrepresent suicide and, to a lesser degree, accidents. In terms of gender, movies overrepresent men and underrepresent women for nearly every major mortality cause, including heart disease and cerebrovascular disease. The two exceptions for which women are overrepresented are suicide and accidents. We discuss the implications of under- and over-representing causes of death overall and by gender, as well as areas of future research.



A quantitative analysis of the affirmative furtherance of fair housing in the Housing Choice Voucher program

August 2024

·

3 Reads

The history of fair housing policy in the United States has fallen short of two of its primary objectives: Deconcentration of poverty and community desegregation. We compare distributions of Housing Choice Voucher (HCV) households at various scales to broader US distributions to better understand the program's successes and shortcomings. We find that HCV households at the nation-scale have been consistently more highly distributed among lower-Area Median Income (AMI) census tracts and racially or ethnically concentrated areas of poverty (RECAPs). We specifically consider the potential of affirmatively furthering fair housing within the HCV program to simultaneously boost overall distributions in higher-AMI census tracts and to decrease RECAP concentrations of HCV households, observing that targeted deployment of mobility-based interventions can significantly decrease the proportion of HCV households residing in RECAPs. We further consider the impact of small area fair market rents (SAFMRs) and state-level source-of-income protections on HCV household distributions, finding that both interventions may boost HCV households into higher-AMI census tracts.


Fig. 1. Cluster consistency across different training subsets and initializations. Each curve shows a cluster centroid found for a given number of clusters k using a randomized 10% subset of the data and a randomized centroid initialization to run k-means. With 30 different randomized subsets and 30 different centroid initializations, there are 900*k curves in each plot. Highly consistent cluster centroids are found for k = 2 but are not found for k = 3 and higher (see online suppl. Fig. S1 for k > 3).
Descriptive statistics of the sleep measures for each cluster
Results of logistic mixed-effects regression models for predicting the cluster membership of a sleep period using variables related to the individual, with the participant ID as a random effect
Results of logistic regression models predicting the response variable from the fraction of sleep periods an individual has in cluster 1
The Two Fundamental Shapes of Sleep Heart Rate Dynamics and Their Connection to Mental Health in College Students

July 2024

·

75 Reads

·

4 Citations

Digital Biomarkers

Introduction Wearable devices are rapidly improving our ability to observe health-related processes for extended durations in an unintrusive manner. In this study, we use wearable devices to understand how the shape of the heart rate curve during sleep relates to mental health. Methods As part of the Lived Experiences Measured Using Rings Study (LEMURS), we collected heart rate measurements using the Oura ring (Gen3) for over 25,000 sleep periods and self-reported mental health indicators from roughly 600 first-year university students in the USA during the fall semester of 2022. Using clustering techniques, we find that the sleeping heart rate curves can be broadly separated into two categories that are mainly differentiated by how far along the sleep period the lowest heart rate is reached. Results Sleep periods characterized by reaching the lowest heart rate later during sleep are also associated with shorter deep and REM sleep and longer light sleep, but not a difference in total sleep duration. Aggregating sleep periods at the individual level, we find that consistently reaching the lowest heart rate later during sleep is a significant predictor of (1) self-reported impairment due to anxiety or depression, (2) a prior mental health diagnosis, and (3) firsthand experience in traumatic events. This association is more pronounced among females. Conclusion Our results show that the shape of the sleeping heart rate curve, which is only weakly correlated with descriptive statistics such as the average or the minimum heart rate, is a viable but mostly overlooked metric that can help quantify the relationship between sleep and mental health.


Participant demographics, stress, and sleep characteristics of fall semester for 525 first-year college students from 3,112 Oura sleep measures
Proportions, averages, and standard deviations of sleep measures with perceived stress scores (PSS).
Model fit for Mixed-Effects Multi-linear Regressions with PSS as a continuous outcome measure
In this model the fixed effects are gender, week, and sleep measures TST, RHR, HRV, and ARR. The random effects are participant and week of study. The likelihood ratio (LR) compares the model to the previous model fit.
Model fit for Mixed-Effects Multi-linear Regressions with PSS-Moderate (PSS> = 14) as a binary outcome measure
In this model the fixed effects are gender, week, and mean sleep measures TST, RHR, HRV, and ARR. The random effects are participant and week of study. The likelihood ratio (LR) compares the model to the previous model fit.
Association between sleep measures and changes in stress and deviations in stress among first-year college students (N = 3,112)
In this model the fixed effects are gender, week, and the sleep measures TST, RHR, HRV, and ARR. The random effects are participant and week of study. The likelihood ratio (LR) compares the model to the previous model fit.
Predicting stress in first-year college students using sleep data from wearable devices

April 2024

·

124 Reads

·

10 Citations

Consumer wearables have been successful at measuring sleep and may be useful in predicting changes in mental health measures such as stress. A key challenge remains in quantifying the relationship between sleep measures associated with physiologic stress and a user’s experience of stress. Students from a public university enrolled in the Lived Experiences Measured Using Rings Study (LEMURS) provided continuous biometric data and answered weekly surveys during their first semester of college between October-December 2022. We analyzed weekly associations between estimated sleep measures and perceived stress for participants (N = 525). Through mixed-effects regression models, we identified consistent associations between perceived stress scores and average nightly total sleep time (TST), resting heart rate (RHR), heart rate variability (HRV), and respiratory rate (ARR). These effects persisted after controlling for gender and week of the semester. Specifically, for every additional hour of TST, the odds of experiencing moderate-to-high stress decreased by 0.617 or by 38.3% (p<0.01). For each 1 beat per minute increase in RHR, the odds of experiencing moderate-to-high stress increased by 1.036 or by 3.6% (p<0.01). For each 1 millisecond increase in HRV, the odds of experiencing moderate-to-high stress decreased by 0.988 or by 1.2% (p<0.05). For each additional breath per minute increase in ARR, the odds of experiencing moderate-to-high stress increased by 1.230 or by 23.0% (p<0.01). Consistent with previous research, participants who did not identify as male (i.e., female, nonbinary, and transgender participants) had significantly higher self-reported stress throughout the study. The week of the semester was also a significant predictor of stress. Sleep data from wearable devices may help us understand and to better predict stress, a strong signal of the ongoing mental health epidemic among college students.


Expecting the Unexpected: Predicting Panic Attacks From Mood, Twitter, and Apple Watch Data

January 2024

·

170 Reads

·

6 Citations

IEEE Open Journal of Engineering in Medicine and Biology

Objective: Panic attacks are an impairing mental health problem that affects 11% of adults every year. Current criteria describe them as occurring without warning, despite evidence suggesting individuals can often identify attack triggers. We aimed to prospectively explore qualitative and quantitative factors associated with the onset of panic attacks. Results: Of 87 participants, 95% retrospectively identified a trigger for their panic attacks. Worse individually reported mood and state-level mood, as indicated by Twitter ratings, were related to greater likelihood of next-day panic attack. In a subsample of participants who uploaded their wearable sensor data (n=32), louder ambient noise and higher resting heart rate were related to greater likelihood of next-day panic attack. Conclusions: These promising results suggest that individuals who experience panic attacks may be able to anticipate their next attack which could be used to inform future prevention and intervention efforts.


Citations (41)


... As students adapt to the novel social contexts of the college environment, they can form new habits [42] that can improve or degrade their well-being [43]. To understand the wellness behaviors and habits of and ultimately develop engaging interventions for this age group, the Lived Experiences Measured Using Rings Study (LEMURS) [44][45][46] observed behaviors among first-year college students using data from both surveys and the Oura Gen3 ring, a wearable device that monitors sleep and physical activity. All participants were recruited from the same class year at a single university over an 8-week study period creating a controlled, comparable shared context that shaped daily routines. ...

Reference:

Collective sleep and activity patterns of college students from wearable devices
The Two Fundamental Shapes of Sleep Heart Rate Dynamics and Their Connection to Mental Health in College Students

Digital Biomarkers

... As students adapt to the novel social contexts of the college environment, they can form new habits [42] that can improve or degrade their well-being [43]. To understand the wellness behaviors and habits of and ultimately develop engaging interventions for this age group, the Lived Experiences Measured Using Rings Study (LEMURS) [44][45][46] observed behaviors among first-year college students using data from both surveys and the Oura Gen3 ring, a wearable device that monitors sleep and physical activity. All participants were recruited from the same class year at a single university over an 8-week study period creating a controlled, comparable shared context that shaped daily routines. ...

Predicting stress in first-year college students using sleep data from wearable devices

... DHTs provide a compelling opportunity to capture a better picture of a patient's mental health, but they also enable remote delivery of intervention wherever and whenever it is needed. Mental health disorders are ripe for just-intime adaptive interventions delivered via DHTs because of the contextual fluctuations over time, to be alerted, within moments of your reactivity, to intervention recommendations such as additional supports during a depressive episode [52], when faced with a trigger for alcohol use disorder [53], or when experiencing a panic attack [54][55][56]. DHTs grant users access to data, but also services when they are most useful to them. It is very challenging to have the insight to engage in even known interventions at the moment when our conditions are most activated (e.g., [54]), thus interventions (e.g., alerts, reminders) based on personalized metrics that are grounded in human-computer interaction research [57][58][59][60][61], could help remind us when to deploy the management strategies we already know. ...

Expecting the Unexpected: Predicting Panic Attacks From Mood, Twitter, and Apple Watch Data

IEEE Open Journal of Engineering in Medicine and Biology

... Real-world data such as gait analysis or time/event-contingent actigraphy data using ecological momentary assessment might provide additional markers predicting TR-AD [96][97][98][99] . Machine learning approaches could aid in integrating biological, biographical and ecological momentary assessment markers 82 . ...

Discovering Digital Biomarkers of Panic Attack Risk in Consumer Wearables Data
  • Citing Conference Paper
  • July 2023

... Allotaxonometry is the comparison of any two complex systems with internally diverse structures (Dodds et al., 2020). For any two text 7 We included 2016 in the latter period because the first PiT count to register a nationwide increase took place in January corpora, allotaxonometry with rank-turbulence divergence uses the relative frequency of a word in each respective text in order to identify which words contribute the most to the difference between the texts. ...

Allotaxonometry and rank-turbulence divergence: a universal instrument for comparing complex systems

EPJ Data Science

... As students adapt to the novel social contexts of the college environment, they can form new habits [42] that can improve or degrade their well-being [43]. To understand the wellness behaviors and habits of and ultimately develop engaging interventions for this age group, the Lived Experiences Measured Using Rings Study (LEMURS) [44][45][46] observed behaviors among first-year college students using data from both surveys and the Oura Gen3 ring, a wearable device that monitors sleep and physical activity. All participants were recruited from the same class year at a single university over an 8-week study period creating a controlled, comparable shared context that shaped daily routines. ...

A large clinical trial to improve well-being during the transition to college using wearables: The lived experiences measured using rings study
  • Citing Article
  • September 2023

Contemporary Clinical Trials

... Parks and Peters (2023) discuss a "dialogical" machine-assisted framework which is not quantitizing but also relies on NLP tools. "Distant reading" in digital humanities (Moretti 2013) and "ousiometrics" (Fudolig et al. 2023) are machine-assisted approaches, but typically rely on counting words or topic clusters and not manual coding. ...

A decomposition of book structure through ousiometric fluctuations in cumulative word-time

Humanities and Social Sciences Communications

... Changing lifestyles with increased use of electronic media ('videophilia', Minor et al., 2023), negative attitudes towards nature in children ('biophobia', Soga et al., 2020) and altered parenting approaches combined with time pressure (Orr, 2002;Skar et al., 2016) often provide children and adolescents less freedom for outdoor activities and contribute to the declining orientation towards nature in younger generations. Providing opportunities through providing biodiverse urban green spaces close to people thus does not automatically counteract the extinction of experience if the orientation towards nature is not simultaneously strengthened (Colléony et al., 2020;Lin et al., 2014). ...

Nature Exposure is Associated With Reduced Smartphone Use
  • Citing Article
  • April 2023

Environment and Behavior

... For example, radio host Alex Jones disparaged Clinton saying: "She's a witch… Look at her face… All she needs is green skin" (Taylor-Coleman 2016). Weaving et al. (2023) documented how Hillary Clinton experienced "a deluge of misogyny": they found 64,285 misogynistic word usages associated with Hillary Clinton on Twitter between 2014 and 2018. Commonplace were offensive campaign slogans such as "Life's a b*tch: don't vote for one" (Beinhart 2016, 15), and the use of monster metaphors to describe Clinton's body. ...

Twitter misogyny associated with Hillary Clinton increased throughout the 2016 U.S. election campaign

... At present, researchers have used this method to measure the public's reaction to government policies such as mental health and social prejudice (38,39). In the aspect of emotional attitude research, we also began to use this method to carry out relevant research (40). The use of large samples in big data mining implies strong objectivity, high timeliness, and significant impact, making it a more suitable method for this study. ...

Expecting the Unexpected: Predicting Panic Attacks from Mood and Twitter