Daniel A. Bowen’s research while affiliated with Centers for Disease Control and Prevention and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (18)


Intercoder reliability (Krippendorf’s α) of two coders for all emergent subthemes
Emergent themes (and clusters) from forum coding, with sample-wide frequencies.
Intercoder reliability (Krippendorf’s α) of two coders for all emergent themes.
“I did it without hesitation. Am I the bad guy?”: Online conversations in response to controversial in-game violence
  • Article
  • Publisher preview available

April 2024

·

82 Reads

·

7 Citations

·

Daniel A Bowen

·

·

[...]

·

Philippe de Villemor Chauveau

Video game content has evolved over the last six decades, from a basic focus on challenge and competition to include more serious and introspective narratives capable of encouraging critical contemplation within gamers. The “No Russian” mission from Call of Duty: Modern Warfare 2 casts players as terrorists responsible for the murder of innocent bystanders, sparking debate around how players engage and react to wanton violence in modern video games. Through thematic analysis of 649 Reddit posts discussing the mission, 10 themes emerged representing complexity in player experiences. Those themes were grouped into categories representing (descending order), (1) rote gameplay experiences, (2) dark humor, (3) comparing the mission to other games and real-world events, and (4) self-reflective eudaimonic reactions to the mission. Although less common, the presence of eudaimonic media effects (in at least 15% of posts) holds promise for the use of video games as reflective spaces for violence prevention.

View access options

Predicting state level suicide fatalities in the united states with realtime data and machine learning

January 2024

·

61 Reads

·

5 Citations

npj Mental Health Research

Digital trace data and machine learning techniques are increasingly being adopted to predict suicide-related outcomes at the individual level; however, there is also considerable public health need for timely data about suicide trends at the population level. Although significant geographic variation in suicide rates exist by state within the United States, national systems for reporting state suicide trends typically lag by one or more years. We developed and validated a deep learning based approach to utilize real-time, state-level online (Mental Health America web-based depression screenings; Google and YouTube Search Trends), social media (Twitter), and health administrative data (National Syndromic Surveillance Program emergency department visits) to estimate weekly suicide counts in four participating states. Specifically, per state, we built a long short-term memory (LSTM) neural network model to combine signals from the real-time data sources and compared predicted values of suicide deaths from our model to observed values in the same state. Our LSTM model produced accurate estimates of state-specific suicide rates in all four states (percentage error in suicide rate of −2.768% for Utah, −2.823% for Louisiana, −3.449% for New York, and −5.323% for Colorado). Furthermore, our deep learning based approach outperformed current gold-standard baseline autoregressive models that use historical death data alone. We demonstrate an approach to incorporate signals from multiple proxy real-time data sources that can potentially provide more timely estimates of suicide trends at the state level. Timely suicide data at the state level has the potential to improve suicide prevention planning and response tailored to the needs of specific geographic communities.


Using Transformer-Based Topic Modeling to Examine Discussions of Delta-8 Tetrahydrocannabinol: Content Analysis

December 2023

·

30 Reads

·

1 Citation

Journal of Medical Internet Research

Background Delta-8 tetrahydrocannabinol (THC) is a psychoactive cannabinoid found in small amounts naturally in the cannabis plant; it can also be synthetically produced in larger quantities from hemp-derived cannabidiol. Most states permit the sale of hemp and hemp-derived cannabidiol products; thus, hemp-derived delta-8 THC products have become widely available in many state hemp marketplaces, even where delta-9 THC, the most prominently occurring THC isomer in cannabis, is not currently legal. Health concerns related to the processing of delta-8 THC products and their psychoactive effects remain understudied. Objective The goal of this study is to implement a novel topic modeling approach based on transformers, a state-of-the-art natural language processing architecture, to identify and describe emerging trends and topics of discussion about delta-8 THC from social media discourse, including potential symptoms and adverse health outcomes experienced by people using delta-8 THC products. Methods Posts from January 2008 to December 2021 discussing delta-8 THC were isolated from cannabis-related drug forums on Reddit (Reddit Inc), a social media platform that hosts the largest web-based drug forums worldwide. Unsupervised topic modeling with state-of-the-art transformer-based models was used to cluster posts into topics and assign labels describing the kinds of issues being discussed with respect to delta-8 THC. Results were then validated by human subject matter experts. Results There were 41,191 delta-8 THC posts identified and 81 topics isolated, the most prevalent being (1) discussion of specific brands or products, (2) comparison of delta-8 THC to other hemp-derived cannabinoids, and (3) safety warnings. About 5% (n=1220) of posts from the resulting topics included content discussing health-related symptoms such as anxiety, sleep disturbance, and breathing problems. Until 2020, Reddit posts contained fewer than 10 mentions of delta-8-THC for every 100,000 cannabis posts annually. However, in 2020, these rates increased by 13 times the 2019 rate (to 99.2 mentions per 100,000 cannabis posts) and continued to increase into 2021 (349.5 mentions per 100,000 cannabis posts). Conclusions Our study provides insights into emerging public health concerns around delta-8 THC, a novel substance about which little is known. Furthermore, we demonstrate the use of transformer-based unsupervised learning approaches to derive intelligible topics from highly unstructured discussions of delta-8 THC, which may help improve the timeliness of identification of emerging health concerns related to new substances.


Transformer-Based Topic Modeling to Examine Discussions of Delta-8 THC (Preprint)

May 2023

·

15 Reads

BACKGROUND Delta-8 tetrahydrocannabinol (THC) is a psychoactive cannabinoid found in small amounts naturally in the cannabis plant; it can also be synthetically produced in larger quantities from hemp-derived cannabidiol, or CBD. Most states permit the sale of hemp and hemp-derived CBD products; thus, hemp-derived delta-8 THC products have become widely available in many state hemp marketplaces, even where delta-9 THC, the most prominently occurring THC isomer in cannabis, is not currently legal. Health concerns related to the processing of delta-8 THC products and their psychoactive effects remain understudied. OBJECTIVE The goal of this study is to implement a novel topic modeling approach based on transformers, a state-of-the art natural language processing architecture, to identify and describe emerging trends and topics of discussion about delta-8 THC from social media discourse, including potential symptoms and adverse health outcomes experienced by people using delta-8 THC products. METHODS Posts from January 2008 to December 2021 discussing delta-8 THC were isolated from cannabis-related drug forums on Reddit, a social media platform which hosts the largest online drug forums worldwide. Using Python, unsupervised topic modeling leveraging state-of-the-art transformer-based models was employed. The models cluster posts into topics and assign labels describing the kinds of issues being discussed with respect to delta-8 THC. Results were then validated by human subject matter experts. RESULTS There were 41,191 delta-8 THC posts identified and 81 topics isolated, the most prevalent being 1) discussion of specific brands/products, 2) comparison of delta-8 THC to other hemp-derived cannabinoids, 3) and safety warnings. About 5% of the resulting topics included posts discussing health-related symptoms such as anxiety, sleep disturbance, and breathing problems. Until 2020, Reddit posts contained less than 10 mentions of delta-8-THC for every 100,000 cannabis posts annually. However, in 2020 these rates increased by 13 times the 2019 rate (to 99.2 mentions per 100,000 cannabis posts) and continued to increase into 2021 (349.5 mentions per 100,000 cannabis posts). CONCLUSIONS Our study provides insights into emerging public health concerns around delta-8 THC, a novel substance about which little is known. Furthermore, we demonstrate the utility of transformer-based unsupervised learning approaches to derive intelligible topics from highly unstructured discussions of delta-8 THC, which may help improve the timeliness of identification of emerging health concerns related to new substances. CLINICALTRIAL N/A


Development of a Machine Learning Model to Estimate US Firearm Homicides in Near Real Time

March 2023

·

61 Reads

·

4 Citations

JAMA Network Open

Importance: Firearm homicides are a major public health concern; lack of timely mortality data presents considerable challenges to effective response. Near real-time data sources offer potential for more timely estimation of firearm homicides. Objective: To estimate near real-time burden of weekly and annual firearm homicides in the US. Design, setting, and participants: In this prognostic study, anonymous, longitudinal time series data were obtained from multiple data sources, including Google and YouTube search trends related to firearms (2014-2019), emergency department visits for firearm injuries (National Syndromic Surveillance Program, 2014-2019), emergency medical service activations for firearm-related injuries (biospatial, 2014-2019), and National Domestic Violence Hotline contacts flagged with the keyword firearm (2016-2019). Data analysis was performed from September 2021 to September 2022. Main outcomes and measures: Weekly estimates of US firearm homicides were calculated using a 2-phase pipeline, first fitting optimal machine learning models for each data stream and then combining the best individual models into a stacked ensemble model. Model accuracy was assessed by comparing predictions of firearm homicides in 2019 to actual firearm homicides identified by National Vital Statistics System death certificates. Results were also compared with a SARIMA (seasonal autoregressive integrated moving average) model, a common method to forecast injury mortality. Results: Both individual and ensemble models yielded highly accurate estimates of firearm homicides. Individual models' mean error for weekly estimates of firearm homicides (root mean square error) varied from 24.95 for emergency department visits to 31.29 for SARIMA forecasting. Ensemble models combining data sources had lower weekly mean error and higher annual accuracy than individual data sources: the all-source ensemble model had a weekly root mean square error of 24.46 deaths and full-year accuracy of 99.74%, predicting the total number of firearm homicides in 2019 within 38 deaths for the entire year (compared with 95.48% accuracy and 652 deaths for the SARIMA model). The model decreased the time lag of reporting weekly firearm homicides from 7 to 8 months to approximately 6 weeks. Conclusions and relevance: In this prognostic study of diverse secondary data on machine learning, ensemble modeling produced accurate near real-time estimates of weekly and annual firearm homicides and substantially decreased data source time lags. Ensemble model forecasts can accelerate public health practitioners' and policy makers' ability to respond to unanticipated shifts in firearm homicides.


Are Home Evictions Associated with Child Welfare System Involvement? Empirical Evidence from National Eviction Records and Child Protective Services Data

September 2022

·

48 Reads

·

2 Citations

Child Maltreatment

This study aimed to understand the relationship between home eviction and child welfare system involvement at the county level. Using administrative data, we examined associations of home eviction and eviction filing rates with child abuse and neglect (CAN) reports and foster care entries. We found one additional eviction per 100 renter-occupied homes in a county was associated with a 1.3% increase in the rate of CAN reports and a 1.6% increase in foster care entries. The association between eviction and foster care entries was strongest among Hispanic children with an 8.1% increase. Assisting parents in providing stable housing may reduce the risk of child welfare system involvement, including out-of-home child placement. Primary and secondary prevention strategies could include housing assistance, increasing access to affordable and safe housing, as well as providing economic support for families (e.g., tax credits, childcare subsidies) that reduce parental financial burden to access stable housing.


Estimating Weekly National Opioid Overdose Deaths in Near Real Time Using Multiple Proxy Data Sources

July 2022

·

26 Reads

·

19 Citations

JAMA Network Open

Importance: Opioid overdose is a leading public health problem in the United States; however, national data on overdose deaths are delayed by several months or more. Objectives: To build and validate a statistical model for estimating national opioid overdose deaths in near real time. Design, setting, and participants: In this cross-sectional study, signals from 5 overdose-related, proxy data sources encompassing health, law enforcement, and online data from 2014 to 2019 in the US were combined using a LASSO (least absolute shrinkage and selection operator) regression model, and weekly predictions of opioid overdose deaths were made for 2018 and 2019 to validate model performance. Results were also compared with those from a baseline SARIMA (seasonal autoregressive integrated moving average) model, one of the most used approaches to forecasting injury mortality. Exposures: Time series data from 2014 to 2019 on emergency department visits for opioid overdose from the National Syndromic Surveillance Program, data on the volume of heroin and synthetic opioids circulating in illicit markets via the National Forensic Laboratory Information System, data on the search volume for heroin and synthetic opioids on Google, and data on post volume on heroin and synthetic opioids on Twitter and Reddit were used to train and validate prediction models of opioid overdose deaths. Main outcomes and measures: Model-based predictions of weekly opioid overdose deaths in the United States were made for 2018 and 2019 and compared with actual observed opioid overdose deaths from the National Vital Statistics System. Results: Statistical models using the 5 real-time proxy data sources estimated the national opioid overdose death rate for 2018 and 2019 with an error of 1.01% and -1.05%, respectively. When considering the accuracy of weekly predictions, the machine learning-based approach possessed a mean error in its weekly estimates (root mean squared error) of 60.3 overdose deaths for 2018 (compared with 310.2 overdose deaths for the SARIMA model) and 67.2 overdose deaths for 2019 (compared with 83.3 overdose deaths for the SARIMA model). Conclusions and relevance: Results of this serial cross-sectional study suggest that proxy administrative data sources can be used to estimate national opioid overdose mortality trends to provide a more timely understanding of this public health problem.


Using the Centers for Disease Control and Prevention's National Syndromic Surveillance Program Data to Monitor Trends in US Emergency Department Visits for Firearm Injuries, 2018 to 2019

May 2022

·

19 Reads

·

13 Citations

Annals of Emergency Medicine

Study objective: We describe trends in emergency department (ED) visits for initial firearm injury encounters in the United States. Methods: Using data from the Centers for Disease Control and Prevention's National Syndromic Surveillance Program, we analyzed monthly and yearly trends in ED visit rates involving a firearm injury (calculated as the number of firearm injury-related ED visits divided by the total number of ED visits for each month and multiplied by 100,000) by sex-specific age group and US region from 2018 to 2019 and conducted Joinpoint regression to detect trend significance. Results: Among approximately 215 million ED visits captured in the National Syndromic Surveillance Program from January 2018 to December 2019, 132,767 involved a firearm injury (61.6 per 100,000 ED visits). Among males, rates of firearm injury-related ED visits significantly increased for all age groups between 15 and 64 years during the study period. Among females, rates of firearm injury-related ED visits significantly increased for all age groups between 15 and 54 years during the study period. By region, rates significantly changed in the northeast, southeast, and southwest for males and females during the study period. Conclusion: These analyses highlight a novel data source for monitoring trends in ED visits for firearm injuries. With increased and effective use of state and local syndromic surveillance data, in addition to improvements to firearm injury syndrome definitions by intent, public health professionals could better detect unusual patterns of firearm injuries across the United States for improved prevention and tailored response efforts.


Figure 1: Characteristics of the 25 Parcels Experiencing the Highest Levels of Violent Crime, 2012-2017, Atlanta, GA
Figure 2: Geographic Distribution of the 25 Parcels Experiencing the Highest Levels of Violent Crime Footnote: Geographic subdivisions or boundary lines represent census block groups. White dots represent the location of the 25 parcels with the largest amount of violent crime across the entire city.
Frequency of Violent Crime by Parcel Category, 2012-2017, Atlanta, GA.
Violent Crime Surrounding the 25 Parcels Experiencing the Highest Levels of Violent Crime, 2012-2017, Atlanta, GA.
Business and property types experiencing excess violent crime: a micro-spatial analysis

November 2021

·

42 Reads

·

4 Citations

Journal of Injury & Violence Research

Background: Beyond alcohol retail establishments, most business and property types receive limited attention in studies of violent crime. We sought to provide a comprehensive examination of which properties experience the most violent crime in a city and how that violence is distributed throughout a city. Methods: For a large urban city, we merged violent incident data from police reports with municipal tax assessor data from 2012-2017 and tabulated patterns of violent crime for 15 commercial and public property types. To describe outlier establishments, we calculated the proportion of individual parcels within each property-type that experienced more than 5 times the average number of crimes for that property-type and also mapped the 25 parcels with the highest number of violent incidents to explore what proportion of violent crime in these block groups were contributed by the outlier establishments. Results: While the hotel/lodging property-type experienced the highest number of violent crimes per parcel (2.72), each property-type had outlier establishments experiencing more than 5 times the average number of violent crimes per business. Twelve of 15 property-types (80%) had establishments with more than 10 times the mean number of violent incidents. The 25 parcels with the most violent crime comprised a wide variety of establishments, ranging from a shopping center, grocery store, gas station, motel, public park, vacant lot, public street, office building, transit station, hospital, pharmacy, school, community center, and movie theatre, and were distributed across the city. Eight of the 25 parcels with the highest amount of violent crime, accounted for 50% or more of the violent crime within a 400-meter buffer. Conclusions: All property-types had outlier establishments experiencing elevated counts of violent crimes. Furthermore, the 25 most violent properties in the city demonstrated remarkable diversity in property-type. Further studies assessing the risk of violent crime among additional property-types may aid in violence prevention.


Performance of Individual Models Built Using Each Data Source at the Intermediate Stage (First Phase)
Performance of the 6 Ensemble Models Built Using Different Combinations of the Data Sources
Development of a Machine Learning Model Using Multiple, Heterogeneous Data Sources to Estimate Weekly US Suicide Fatalities

December 2020

·

63 Reads

·

28 Citations

JAMA Network Open

Importance Suicide is a leading cause of death in the US. However, official national statistics on suicide rates are delayed by 1 to 2 years, hampering evidence-based public health planning and decision-making. Objective To estimate weekly suicide fatalities in the US in near real time. Design, Setting, and Participants This cross-sectional national study used a machine learning pipeline to combine signals from several streams of real-time information to estimate weekly suicide fatalities in the US in near real time. This 2-phase approach first fits optimal machine learning models to each individual data stream and subsequently combines predictions made from each data stream via an artificial neural network. National-level US administrative data on suicide deaths, health services, and economic, meteorological, and online data were variously obtained from 2014 to 2017. Data were analyzed from January 1, 2014, to December 31, 2017. Exposures Longitudinal data on suicide-related exposures were obtained from multiple, heterogeneous streams: emergency department visits for suicide ideation and attempts collected via the National Syndromic Surveillance Program (2015-2017); calls to the National Suicide Prevention Lifeline (2014-2017); calls to US poison control centers for intentional self-harm (2014-2017); consumer price index and seasonality-adjusted unemployment rate, hourly earnings, home price index, and 3-month and 10-year yield curves from the Federal Reserve Economic Data (2014-2017); weekly daylight hours (2014-2017); Google and YouTube search trends related to suicide (2014-2017); and public posts on suicide on Reddit (2 314 533 posts), Twitter (9 327 472 tweets; 2015-2017), and Tumblr (1 670 378 posts; 2014-2017). Main Outcomes and Measures Weekly estimates of suicide fatalities in the US were obtained through a machine learning pipeline that integrated the above data sources. Estimates were compared statistically with actual fatalities recorded by the National Vital Statistics System. Results Combining information from multiple data streams, the machine learning method yielded estimates of weekly suicide deaths with high correlation to actual counts and trends (Pearson correlation, 0.811; P < .001), while estimating annual suicide rates with low error (0.55%). Conclusions and Relevance The proposed ensemble machine learning framework reduces the error for annual suicide rate estimation to less than one-tenth of that of current forecasting approaches that use only historical information on suicide deaths. These findings establish a novel approach for tracking suicide fatalities in near real time and provide the potential for an effective public health response such as supporting budgetary decisions or deploying interventions.


Citations (15)


... Novel alternatives have been explored, whereby suicide numbers are modelled on the basis of other available data. 56 Cluster response plans recommend specific actions to prevent or contain clusters, including working with media organisations and supporting at-risk individuals. ...

Reference:

Public health measures related to the transmissibility of suicide
Predicting state level suicide fatalities in the united states with realtime data and machine learning

npj Mental Health Research

... Mientras que (Swedo et al. 2023), enfatizan que los homicidios con armas de fuego son un importante problema de salud pública; la falta de datos oportunos sobre mortalidad presenta desafíos considerables para una respuesta eficaz. Por lo tanto, los actuales datos investigativos vienen a proporcionar un aporte significativo, para contribuir en la generación de políticas públicas a partir de los aportes de la estadística. ...

Development of a Machine Learning Model to Estimate US Firearm Homicides in Near Real Time

JAMA Network Open

... Research finds that eviction, threat of eviction, and housing instability have significant health impacts on families and children, including premature birth, low birthweight, maternal depression, and more parenting stress [7][8][9][10] . Housing instability and eviction have been linked to decreased social support 11 , food insecurity 12,13 , increased conflict in the home 14 , and harsh parenting [15][16][17][18][19] . As such the multiple challenges associated with eviction may cascade to significantly impact child development and mental health [20][21][22][23] . ...

Are Home Evictions Associated with Child Welfare System Involvement? Empirical Evidence from National Eviction Records and Child Protective Services Data
  • Citing Article
  • September 2022

Child Maltreatment

... All models adjusted for confounders, including age, sex, race/ethnicity, education level, PIR, smoking status, alcohol consumption, BMI, physical activity, HEI, diabetes, and hypertension, to reduce bias and improve accuracy. Detailed information is available in Supplementary SM 1 [33][34][35][36][37][38][39]. ...

Estimating Weekly National Opioid Overdose Deaths in Near Real Time Using Multiple Proxy Data Sources

JAMA Network Open

... If loaded, respondents were further asked if any of the loaded firearms were unlocked. Based on responses to these questions, we constructed a categorical measure of firearm storage practices entailing the following three categories: (1) not loaded, (2) loaded but locked and (3) loaded and unlocked. The outcome variable was a three-category polychotomous variable with not loaded serving as the base, or reference, category. ...

Using the Centers for Disease Control and Prevention's National Syndromic Surveillance Program Data to Monitor Trends in US Emergency Department Visits for Firearm Injuries, 2018 to 2019
  • Citing Article
  • May 2022

Annals of Emergency Medicine

... Players should be considered an essential element when making sense of this emotional aesthetic. Through their interactions with games, they have the capacity to critically consume and reflect on their in-game actions (Bowman et al., 2022). Through considering the complexities of emotions in war games, it is important to understand how players make meaning from this aesthetic, especially when considering the possibility of videogames presenting players with more controversial aspects of war. ...

“I did it without hesitation. Am I the bad guy?”: Online conversations in response to controversial in-game violence

... Hotel buildings are classified as risky facilities (Eck et al., 2007). While most research results suggest that hotel buildings contribute to an increase in crime density in their surroundings (Sun et al., 2022) and on the plots on which they are located (Bowen et al., 2022), with a significant number of property-related crimes committed within the hotel buildings themselves (Huang et al., 1998), our research did not confirm these findings. Instead, we obtained a contrasting result, indicating that hotel buildings reduce the rate of drug-related crimes in the neighborhood. ...

Business and property types experiencing excess violent crime: a micro-spatial analysis

Journal of Injury & Violence Research

... Finally, these automated assessments only utilised information available at a single point in time. Further improvement in model performance may be achieved through the use of models to account for the evolving nature of the risk of SSI in the postoperative period [31][32][33] . This would have the potential to provide new insights into the diagnosis of SSI at a subclinical stage, and even prediction of wounds at high-risk to allow preventative interventions. ...

Development of a Machine Learning Model Using Multiple, Heterogeneous Data Sources to Estimate Weekly US Suicide Fatalities

JAMA Network Open

... Sentiment analysis could reveal public presumptions and emotions of people related to the COVID-19 outbreak [116], [117]. Other public opinions were widely discussed in this cluster were public mental health [118], vaccines and anti-vaccines opinion [119], [120], HIV prevention [121], fake news and misinformation detection [122], and hate speech and racial bullying related to the outbreak [123]. Moreover, social media data was also widely used for disease spread predictions [124], new cases and event detection [125], and illicit opioid and drug abuse detection [126]. ...

Conversational topics of social media messages associated with state-level mental distress rates
  • Citing Article
  • March 2020

... Unfortunately, the suicide rate increased by 30% from 1999 to 2019. 3 Suicide-related visits to the emergency department also increased by 42% from 2001 to 2016. 4 For certain groups (eg, adolescents, young adults, and racial and ethnic minority youth), suicide risk has risen even faster in the last decade. 5,6 Past directors of the US National Institute of Mental Health (NIMH) have reiterated a call for renewed efforts to combat this public health problem and reduce the rate of suicide. ...

Syndromic Surveillance of Suicidal Ideation and Self-Directed Violence - United States, January 2017-December 2018

MMWR. Morbidity and mortality weekly report