Florian von Wangenheim’s research while affiliated with ETH Zurich and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (61)


Von welchen Apps profitieren Kinder und Jugendliche mit Diabetes?
  • Research

November 2024

·

1 Read

Udo Meinhardt

·

Marc-Robin Gruener

·

·

Florian von Wangenheim


PRISMA flow diagram
Article count and year-to-year growth rate in the core dataset (2013–2023)
Network visualization of frequently co-cited articles. Each node represents a specific article. Thicker links represent a higher number of co-citations, while larger nodes indicate a greater number of occurrences
Network visualization of frequently co-cited sources (cited dataset). Each node represents a source in the cited dataset. Thicker links represent a higher number of co-citations, while larger nodes indicate a greater number of occurrences
Keyword co-occurrence network based on titles and abstracts. Subfigure (a) shows the co-occurrence and frequency of keywords in the dataset. Subfigure (b) shows the temporal evolution of the keyword co-occurrence network for the years 2019–2023

+5

Twenty-four years of empirical research on trust in AI: a bibliometric review of trends, overlooked issues, and future directions
  • Article
  • Full-text available

October 2024

·

92 Reads

AI & SOCIETY

Trust is widely regarded as a critical component to building artificial intelligence (AI) systems that people will use and safely rely upon. As research in this area continues to evolve, it becomes imperative that the research community synchronizes its empirical efforts and aligns on the path toward effective knowledge creation. To lay the groundwork toward achieving this objective, we performed a comprehensive bibliometric analysis, supplemented with a qualitative content analysis of over two decades of empirical research measuring trust in AI, comprising 1’156 core articles and 36’306 cited articles across multiple disciplines. Our analysis reveals several “elephants in the room” pertaining to missing perspectives in global discussions on trust in AI, a lack of contextualized theoretical models and a reliance on exploratory methodologies. We highlight strategies for the empirical research community that are aimed at fostering an in-depth understanding of trust in AI.

Download

Figure 1. Sequential Multiple Assignment Randomization Trial (SMART) design for LvL UP. Note:
The LvL UP Trial: Protocol for a Sequential, Multiple Assignment, Randomized Controlled Trial to Assess the Effectiveness of a Blended Mobile Lifestyle Intervention

August 2024

·

129 Reads

Background: Blended mobile health (mHealth) interventions – combining self-guided and human support components – could play a major role in preventing non-communicable diseases (NCDs) and common mental disorders (CMDs). This protocol paper describes a sequential, multiple assignment, randomised trial aimed at (i) evaluating the effectiveness and cost-effectiveness of LvL UP, an mHealth lifestyle intervention for the prevention of NCDs and CMDs, and (ii) establishing the optimal blended approach in LvL UP that balances effective personalised lifestyle support with scalability.Methods: LvL UP is a 6-month mHealth holistic intervention targeting physical activity, diet, and emotional regulation. In this trial, young and middle-aged Singaporean adults at risk of developing NCDs or CMDs will be randomly allocated to one of two initial conditions (‘LvL UP’ or ‘comparison’). After 4 weeks, participants categorised as non-responders from the LvL UP group will be re-randomised into second-stage conditions: (i) continuing with the initial intervention (LvL UP) or (ii) additional motivational interviewing (MI) support sessions by trained health coaches (LvL UP + adaptive MI). The primary outcome is mental well-being (via the Warwick-Edinburgh Mental Wellbeing Scale). Secondary outcomes include anthropometric measurements, resting blood pressure, blood metabolic profile, health status, health behaviours (physical activity, diet), work productivity, and healthcare utilisation. Outcomes will be measured at baseline, 6 months (post-intervention), and 12 months (follow-up).Discussion: In addition to evaluating the effectiveness of LvL UP, the proposed study design will contribute to increasing evidence on how to introduce human support in mHealth interventions to maximise their effectiveness while remaining scalable.Trial registration: The LvL UP Pilot trial was prospectively registered with ClinicalTrials.gov (NCT06360029) on 7 April 2024.


Figure 2: Illustration of time interval interface (left), case selection (middle), and highlighted visibility detection in recommendation slates (right) within EventChat's front end
Figure 3: Conceptual architecture of EventChat's back end
EventChat: Implementation and user-centric evaluation of a large language model-driven conversational recommender system for exploring leisure events in an SME context

July 2024

·

36 Reads

Large language models (LLMs) present an enormous evolution in the strategic potential of conversational recommender systems (CRS). Yet to date, research has predominantly focused upon technical frameworks to implement LLM-driven CRS, rather than end-user evaluations or strategic implications for firms, particularly from the perspective of a small to medium enterprises (SME) that makeup the bedrock of the global economy. In the current paper, we detail the design of an LLM-driven CRS in an SME setting, and its subsequent performance in the field using both objective system metrics and subjective user evaluations. While doing so, we additionally outline a short-form revised ResQue model for evaluating LLM-driven CRS, enabling replicability in a rapidly evolving field. Our results reveal good system performance from a user experience perspective (85.5% recommendation accuracy) but underscore latency, cost, and quality issues challenging business viability. Notably, with a median cost of $0.04 per interaction and a latency of 5.7s, cost-effectiveness and response time emerge as crucial areas for achieving a more user-friendly and economically viable LLM-driven CRS for SME settings. One major driver of these costs is the use of an advanced LLM as a ranker within the retrieval-augmented generation (RAG) technique. Our results additionally indicate that relying solely on approaches such as Prompt-based learning with ChatGPT as the underlying LLM makes it challenging to achieve satisfying quality in a production environment. Strategic considerations for SMEs deploying an LLM-driven CRS are outlined, particularly considering trade-offs in the current technical landscape.




Taking Behavioral Science to the next Level: Opportunities for the Use of Ontologies to Enable Artificial Intelligence-Driven Evidence Synthesis and Prediction

January 2024

·

117 Reads

·

1 Citation

Decades of research have created a vast archive of information on human behavior, with relevant new studies being published daily. Despite these advances, knowledge generated by behavioral science – the social and biological sciences concerned with the study of human behavior – is not efficiently translated for those who will apply it to benefit individuals and society. The gap between what is known and the capacity to act on that knowledge continues to widen as current evidence synthesis methods struggle to process a large, ever-growing body of evidence characterized by its complexity and lack of shared terminologies. The purpose of the present position paper is twofold: (i) to highlight the pitfalls of traditional evidence synthesis methods in supporting effective knowledge translation to applied settings, and (ii) to sketch a potential alternative evidence synthesis approach which leverages on the use of ontologies – formal systems for organizing knowledge – to enable a more effec tive, artificial intelligence-driven accumulation and implementation of knowledge. The paper concludes with future research directions across behavioral, computer, and information sciences to help realize such innovative approach to evidence synthesis, allowing behavioral science to advance at a faster pace.


Fig. 1 Study protocol. Abbreviations: INT = Intervention group; WLC = Wait-list control group
Sample characteristics of the intervention group (INT) and wait-list control group (WLC)
The effectiveness and user experience of a biofeedback intervention program for stress management supported by virtual reality and mobile technology: A randomized controlled study

October 2023

·

143 Reads

·

4 Citations

BMC Digital Health

Background Heart rate variability biofeedback (HRV-BF) can be used for stress management. Recent feasibility studies suggest that delivering HRV-BF in virtual reality (VR) is associated with better user experience (UX) and might yield more beneficial changes in HRV than two-dimensional screens. The effectiveness of a VR-supported HRV-BF intervention program has, however, not been investigated yet. Methods In this study, 87 healthy women and men were assigned to a VR-supported HRV-BF intervention (INT; 𝑛=44 ) or a wait-list control (WLC; 𝑛=43 ) group. The INT came to the lab for four weekly HRV-BF sessions in VR using a head-mounted display. Between lab sessions, participants were asked to perform breathing exercises without biofeedback supported by a mobile application. Stress-related psychological and psychophysiological outcomes were assessed pre- and post-intervention and at a follow-up four weeks after the intervention in both groups. A psychosocial stress test was conducted post-intervention to investigate changes in stress reactivity. UX was assessed after each HRV-BF session in the INT. Results Analysis revealed that LF increased significantly from pre- to post-, whereas pNN50 increased and chronic stress decreased significantly from pre-intervention to follow-up in the INT compared to the WLC. Anxiety and mental fatigue decreased significantly, while mindfulness and health-related quality of life increased significantly from pre- to post- and from pre-intervention to follow-up in the INT compared to the WLC (all small effects). The two groups did not differ in their stress reactivity post-intervention. As for UX in the INT, the degree of feeling autonomous concerning technology adoption significantly decreased over time. Competence, involvement, and immersion, however, increased significantly from the first to the last HRV-BF session, while hedonic motivation significantly peaked in the second session and then gradually returned to first-session levels. Conclusions This HRV-BF intervention program, supported by VR and mobile technology, was able to significantly improve stress indicators and stress-related symptoms and achieved good to very good UX. Future studies should control for potential placebo effects and emphasize higher degrees of personalization and adaptability to increase autonomy and, thereby, long-term health and well-being. These findings may serve as a first step towards future HRV-BF applications of cutting-edge, increasingly accessible technologies, such as wearables, VR, and smartphones, in the service of mental health and healthcare. Trial registration The study was registered retrospectively as a clinical trial on ISRCTN registry (ISRCTN11331226, 26 May 2023).


(A) Example screenshots of engagement intervention components. (B) Example screenshots of lifestyle intervention components.
(A) Reasons for downloading Elena+ for English and Spanish speakers. (B) Chronic disease category classification for English and Spanish speakers. PA, physical activity; P.T.S.D, post-traumatic stress disorder.
Flowchart of outcome assessment responses by assessment periods for each health topic. HA, hours active; SB, sedentary behavior.
(A) Anxiety outcome assessment score in assessment periods 1 and 2. (B) Depression outcome assessment score across assessment periods 1, 2, and 3. GAD-7, general anxiety disorder 7-item instrument; PHQ-2, patient health questionnaire 2-item instrument. *p < 0.05, ns, non-significant.
(A) Anxiety and (B) depression outcome assessments: average causal mediated effect, average direct effect, and total effects. ACME, average causal mediated effect; ADE, average direct effect.
Can digital health researchers make a difference during the pandemic? Results of the single-arm, chatbot-led Elena+: Care for COVID-19 interventional study

August 2023

·

95 Reads

·

6 Citations

Background The current paper details findings from Elena+: Care for COVID-19, an app developed to tackle the collateral damage of lockdowns and social distancing, by offering pandemic lifestyle coaching across seven health areas: anxiety, loneliness, mental resources, sleep, diet and nutrition, physical activity, and COVID-19 information. Methods The Elena+ app functions as a single-arm interventional study, with participants recruited predominantly via social media. We used paired samples T-tests and within subjects ANOVA to examine changes in health outcome assessments and user experience evaluations over time. To investigate the mediating role of behavioral activation (i.e., users setting behavioral intentions and reporting actual behaviors) we use mixed-effect regression models. Free-text entries were analyzed qualitatively. Results Results show strong demand for publicly available lifestyle coaching during the pandemic, with total downloads (N = 7′135) and 55.8% of downloaders opening the app (n = 3,928) with 9.8% completing at least one subtopic (n = 698). Greatest areas of health vulnerability as assessed with screening measures were physical activity with 62% (n = 1,000) and anxiety with 46.5% (n = 760). The app was effective in the treatment of mental health; with a significant decrease in depression between first (14 days), second (28 days), and third (42 days) assessments: F2,38 = 7.01, p = 0.003, with a large effect size (η2G = 0.14), and anxiety between first and second assessments: t54 = 3.7, p = <0.001 with a medium effect size (Cohen d = 0.499). Those that followed the coaching program increased in net promoter score between the first and second assessment: t36 = 2.08, p = 0.045 with a small to medium effect size (Cohen d = 0.342). Mediation analyses showed that while increasing number of subtopics completed increased behavioral activation (i.e., match between behavioral intentions and self-reported actual behaviors), behavioral activation did not mediate the relationship to improvements in health outcome assessments. Conclusions Findings show that: (i) there is public demand for chatbot led digital coaching, (ii) such tools can be effective in delivering treatment success, and (iii) they are highly valued by their long-term user base. As the current intervention was developed at rapid speed to meet the emergency pandemic context, the future looks bright for other public health focused chatbot-led digital health interventions.


Citations (36)


... While good annotation tools and processes are important, the production of high-quality training data relies heavily on the data available for annotation. Behaviour change intervention reports, however, tend to use unclear and ambiguous language and this often makes it difficult to accurately interpret and classify data (Castro et al., 2024;West et al., 2023a). The HBCP found that intervention reports need to be much more structured and consistent in the way they present data. ...

Reference:

Creating a body of physical activity evidence to test the generalisation of annotation methods for automated evidence synthesis
Taking Behavioral Science to the next Level: Opportunities for the Use of Ontologies to Enable Artificial Intelligence-Driven Evidence Synthesis and Prediction

... Innovations like virtual reality (VR) therapy, biofeedback systems, and neurofeedback techniques are gaining attention for their potential in desensitizing individuals to traumatic memories or environments and retraining neural responses. VR therapy, for instance, recreates trauma-related scenarios in a controlled environment, providing a safe space for individuals to process and confront their traumatic experiences under therapeutic guidance (Calcerano & Ciurlia, 2023;Choi et al., 2023;Kerr et al., 2023). ...

The effectiveness and user experience of a biofeedback intervention program for stress management supported by virtual reality and mobile technology: A randomized controlled study

BMC Digital Health

... The initial screening (titles and abstracts) and full-text assessment of the searched studies will be conducted by the first author (WL) and subsequently verified by an independent reviewer (SCL). Since subject recruitment can be conducted through internet-based methods (eg, social media) which can target populations of different countries, 11 identification of whether the studies were conducted in Asian populations will be carried out through the full-text assessment (except relevant information explicitly specified in the abstracts or titles). To avoid missing any potentially eligible studies, the reference lists of the searched relevant systematic reviews 2-4 12-20 and all the included articles will also be screened for additional potential studies. ...

Can digital health researchers make a difference during the pandemic? Results of the single-arm, chatbot-led Elena+: Care for COVID-19 interventional study

... This conflicts with contemporary views on mental health, which emphasise the care of both mind and body and highlight the significance of the whole human entity and the interdependence of its parts, including body, mind, connectedness, and spirituality [90]. Qualitative studies with potential users have also shown a generally positive attitude towards holistic health [91,92] and thus it is likely that future commercial or research-based DHIDs will benefit from incorporating a range of lifestyle components. ...

Development of “LvL UP 1.0”: a smartphone-based, conversational agent-delivered holistic lifestyle intervention for the prevention of non-communicable diseases and common mental disorders

Frontiers in Digital Health

... This conflicts with contemporary views on mental health, which emphasise the care of both mind and body and highlight the significance of the whole human entity and the interdependence of its parts, including body, mind, connectedness, and spirituality [90]. Qualitative studies with potential users have also shown a generally positive attitude towards holistic health [91,92] and thus it is likely that future commercial or research-based DHIDs will benefit from incorporating a range of lifestyle components. ...

Exploring the potential of mobile health interventions to address behavioural risk factors for the prevention of non-communicable diseases in Asian populations: a qualitative study

BMC Public Health

... OS is highly prevalent among employees and varies according to their role and nature of their work. Various terms are used interchangeably to indicate stress and pressure in the workplace, such as occupational stress, job-induced stress, and work-related stress [1,2]. Te term "occupational stress" is used by psychologists and organizations to formally describe workplace stress and pressure [2]; hence, it has been adopted in the present study. ...

Investigating Employees’ Concerns and Wishes for Digital Stress Management Interventions with Value Sensitive Design: Mixed Methods Study

Journal of Medical Internet Research

... Mouse and keyboard data Behavioral data, such as keystrokes, mouse movements [15], and mobile phone activity [16], are used to identify behavioral patterns and derive emotional states. Recent studies by Naegelin et al. [17] and Shinde et al. [18] illustrate that the merging of physiological and behavioral data using machine learning models can lead to improved detection of workplace stress. Naegelin et al. [17] developed a machine learning method for stress detection based on multimodal data (mouse, keyboard, and cardiac data) and tested it in a simulated group office environment. ...

An interpretable machine learning approach to multimodal stress detection in a simulated office environment
  • Citing Article
  • January 2023

Journal of Biomedical Informatics

... Learnability emphasizes varied, engaging feedback for effective skill consolidation. Technological options, including visual, auditory, haptic methods, and virtual reality (VR), address these aspects, enhancing traditional MB-based interventions [78,79]. ...

Virtual reality-supported biofeedback for stress management: Beneficial effects on heart rate variability and user experience
  • Citing Article
  • December 2022

Computers in Human Behavior

... There is no definite way to measure how an influencer's video advertising a product influences its sales. Comparably, a study by Gross and von Wangenheim (2022) found that there are users who are more engaged with sponsored posts that tap into informational appeals by macro influencers compared to micro influencers. Additionally, the study found that there is a relationship between the type of post and social media engagement. ...

Influencer Marketing on Instagram: Empirical Research on Social Media Engagement with Sponsored Posts

Journal of Interactive Advertising

... 12 Regarding the task type, the Human-Computer Interaction (HCI) community has explored VAs to help older adults with various tasks, from functional tasks such as medication adherence [13][14][15] to social tasks such as chatting for entertainment or companionship. 16,17 However, past research did not experimentally investigate how task type as a factor might influence older adults' acceptance. Although a previous study examined the effect of interaction type on language tutors, 18 it did not explore VAs specifically for older adults. ...

The Effects of Health Care Chatbot Personas With Different Social Roles on the Client-Chatbot Bond and Usage Intentions: Development of a Design Codebook and Web-Based Study

Journal of Medical Internet Research