Ermal Toto’s research while affiliated with Worcester Polytechnic Institute and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (23)


Early Mental Health Uncovering with Short Scripted and Unscripted Voice Recordings
  • Chapter

November 2022

·

26 Reads

·

9 Citations

Advances in Intelligent Systems and Computing

M. L. Tlachac

·

·

Ermal Toto

·

Mental illnesses are often undiagnosed, highlighting the need for an effective alternative to traditional screening surveys. We propose our Early Mental Health Uncovering (EMU) framework that conducts rapid mental illness screening with active and passive modalities. We designed, deployed, and evaluated the EMU app to passively collect retrospective digital phenotype data and actively collect short voice recordings. The EMU app also administered a depression screening survey to label the data. We collected data from crowdsourced and student populations, both of whom shared sufficient voice recordings for modeling. We thus assess the classification ability of machine learning and deep learning models trained with scripted and unscripted voice recordings. For the crowdsourced participants, machine learning models screened for depression with an AUC of 0.78 and suicidal ideation with an AUC of 0.73. For the student participants, deep learning models screened for depression with an AUC of 0.70 and suicidal ideation with an AUC of 0.72. Combining datasets did not improve screening capabilities, though the best performing models on the combined dataset notably required voice transcripts. This research facilitates a better understanding of modality selection for mobile screening. We will make the features publicly available to further advance mental illness screening research.


Transfer Learning for Depression Screening from Follow-Up Clinical Interview Questions

November 2022

·

38 Reads

·

11 Citations

Advances in Intelligent Systems and Computing

Depression is a common mental health disorder with large social and economic consequences. It can be costly and difficult to detect, traditionally requiring hours of assessment by a trained clinical. Recently, machine learning models have been trained to screen for depression with patient voice recordings collected during an interview with a virtual agent. To engage the patient in a conversation and increase the quantity of responses, the virtual interviewer asks a series of follow-up questions. However, asking fewer questions would reduce the time burden of screening for the participant. We, therefore, assess if these follow-up questions have a tangible impact on the performance of deep learning models for depression classification. Specifically, we study the effect of including the vocal and transcribed replies to one, two, three, four, five, or all follow-up questions in the depression screening models. We notably achieve this using unimodal and multimodal pre-trained transfer learning models. Our findings reveal that follow-up questions can help increase F1 scores for the majority of the interview questions. This research can be leveraged for the design of future mental illness screening applications by providing important information about both question selection and the best number of follow-up questions.


StudentSADD: Rapid Mobile Depression and Suicidal Ideation Screening of College Students during the Coronavirus Pandemic

July 2022

·

46 Reads

·

14 Citations

Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies

ML Tlachac

·

·

Miranda Reisch

·

[...]

·

The growing prevalence of depression and suicidal ideation among college students further exacerbated by the Coronavirus pandemic is alarming, highlighting the need for universal mental illness screening technology. With traditional screening questionnaires too burdensome to achieve universal screening in this population, data collected through mobile applications has the potential to rapidly identify at-risk students. While prior research has mostly focused on collecting passive smartphone modalities from students, smartphone sensors are also capable of capturing active modalities. The general public has demonstrated more willingness to share active than passive modalities through an app, yet no such dataset of active mobile modalities for mental illness screening exists for students. Knowing which active modalities hold strong screening capabilities for student populations is critical for developing targeted mental illness screening technology. Thus, we deployed a mobile application to over 300 students during the COVID-19 pandemic to collect the Student Suicidal Ideation and Depression Detection (StudentSADD) dataset. We report on a rich variety of machine learning models including cutting-edge multimodal pretrained deep learning classifiers on active text and voice replies to screen for depression and suicidal ideation. This unique StudentSADD dataset is a valuable resource for the community for developing mobile mental illness screening tools.








Moodable: On feasibility of instantaneous depression assessment using machine learning on voice samples with retrospectively harvested smartphone and social media data

April 2020

·

174 Reads

·

51 Citations

Smart Health

Depression is a leading cause of disability and is associated with suicide risk. However, a quarter of patients with major depression remain undiagnosed. Prior work has demonstrated that a smartphone user's depression level can be detected by analyzing data gathered from their smartphone's sensors or from their social media posts over a few weeks after enrollment in a user study. These studies typically utilize a prospective study design, which is burdensome as it requires participants smartphone data to be gathered for prolonged periods before their depression level can be assessed. In contrast, we present a feasibility study of our Mood Assessment Capable Framework (Moodable) that facilitates almost instantaneous mood assessment by analyzing instantaneous voice samples provided by the user as well as historical sensor data harvested (scraped) from their smartphone and recent social media posts. Our retrospective, low-burden approach means that Moodable no longer requires study participants to engage with their phone for weeks before a depression score can be inferred. Moodable has the potential to minimize user data collection burden, increase user compliance, avoid study awareness bias and offer a near instantaneous depression screening. To lay a solid foundation for Moodable, we first surveyed 202 volunteer participants about their willingness to share voice samples and various smartphone and social media data types for mental health assessment. Based on these findings, we then developed the Moodable app. Thereafter, we utilized Moodable to collect short voice samples, and a rich array of retrospectively harvested data from users' smartphones (location, browser history, call logs) and social media accounts (instagram, twitter and facebook), with appropriate permissions, of 335 volunteer participants who also responded to 9 depression related questions of the Patient Health Questionaire (PHQ-9). Moodable then used machine learning to build classification models and classify the user's depression and suicidal ideation, for users which scores where unknown to the models. Results of Moodable's screening capability are promising. In particular, for the depression classification task we achieved F1 scores (the harmonic mean of the precision and recall) of 0.766, sensitivity of 0.750, and specificity of 0.792. For the suicidal ideation task we achieved F1 scores of 0.848, sensitivity of 0.864, and specificity of 0.725. This work could significantly increase depression-screening at the population level and opens numerous avenues for further research into this newly proposed paradigm of instantaneously screening depression and suicide risk levels from voice samples and retrospective smartphone and social media data.


Citations (19)


... Furthermore, as Multimodal Sentiment Analysis (MSA) [49,67] gained momentum with the recognition of various verbal and non-verbal symptoms of depression in psychological research, researchers made significant attempts to incorporate context-aware attention [10] and multimodal attention [26] information across multiple modalities [18,23,37,62,86,88]. Recently, there have been attempts that analyze the word-sentence relations on interviewee's answers [45,59,81,87,92] as well as the correlation between question-answer pairs [24,52,80], which can be cued in identifying depression. Unfortunately, no research exists yet that explains how an attention score of modality manifests in a specific question or how the degree of modality reflection changes with the sequence of primary and follow-up questions and answers, which can be crucial in modeling and analyzing the structure of a clinical interview. ...

Reference:

HiQuE: Hierarchical Question Embedding Network for Multimodal Depression Detection
Transfer Learning for Depression Screening from Follow-Up Clinical Interview Questions
  • Citing Chapter
  • November 2022

Advances in Intelligent Systems and Computing

... The application of these tools in mental health have shown even more explosive growth with a more than 4000% increase in publications in the last decade. Initial efforts targeting mental health have largely considered their use in adults, leveraging passively collected sensor data from smart phones and other sensors to identify phenotypes of mental health disorders and changes in their associated symptoms (e.g., [32][33][34][35][36][37]). A particular focus has been on vocal biomarkers of mental health [38][39][40][41] which have quickly emerged as one of the most promising and feasible measures to consider. More recent efforts are beginning to expand to consider additional data sources including biomarkers derived from wearable movement sensors [16,17,22], videos of body and facial movements [42,43], and a variety of physiological measurements such as heart rate, heart rate variability, respirations, and galvanic skin response [24][25][26]28,29]. ...

Early Mental Health Uncovering with Short Scripted and Unscripted Voice Recordings
  • Citing Chapter
  • November 2022

Advances in Intelligent Systems and Computing

... In the next step, they screened the 103 potentially relevant full texts and resolved any conflicts with an independent researcher, revealing 27 eligible articles. This included eleven studies investigating the predictive value of passive sensing for the prediction of STB [26][27][28][29][30][31][32][33][34][35][36] , ten trials focusing on the feasibility of passive sensing 28,[37][38][39][40][41][42][43][44][45] , and seven study protocols [46][47][48][49][50][51][52] . One article reported on two studies: a feasibility investigation and a prediction study 28 . ...

StudentSADD: Rapid Mobile Depression and Suicidal Ideation Screening of College Students during the Coronavirus Pandemic
  • Citing Article
  • July 2022

Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies

... d) DAIC-WOZ: In this platform, clinical interview data is provided. It is created to support the diagnosis of psychological distress conditions (anxiety, depression, and post-traumatic stress disorder) [20]. e) UCI: The UCI machine learning repository contains a collection of databases and domain theories. ...

Depression Screening Using Deep Learning on Follow-up Questions in Clinical Interviews
  • Citing Conference Paper
  • December 2021

... Despite searching for articles published from 2012 onwards, most of the studies were published in the last 5 years (n = 28, 84.8%; 26, 28, 53, 55, 57-59, 61-65, 67-71, 73, 75-84). The majority of studies were conducted in North America (n = 10, 30.3%; 53,57,60,64,66,72,75,78,80,84) or Europe (n = 9, 27.3%; 26,28,55,63,65,67,69,71,74), followed by those conducted in Asia (n = 8, 24.2%; 59, 61-62, 68, 70, 73, 76-77) and Australia (n = 1, 3.0%; 82). The remaining studies used international samples (n = 2, 6.1%; 56, 81) or did not contain information on geographical location (n = 3, 9.1%; 58,79,83). ...

EMU: Early Mental Health Uncovering Framework and Dataset
  • Citing Conference Paper
  • December 2021

... Transformer-based models gained their popularity in multi-modal depression detection given their high performance in the field of Natural Language Processing (NLP) and Computer Vision (CV). Audibert [18] designed by Ermal et al. is a depression detection model that takes two streams of input. A textual stream is sent to pre-trained BERT while an audio stream is fed into pre-trained audio networks such as Wav2vec [19] and SincNet [20], both of which are followed by a BiLSTM and their output are aggregated for depression prediction. ...

AudiBERT: A Deep Transfer Learning Multimodal Classification Framework for Depression Screening
  • Citing Conference Paper
  • October 2021

... The Pegasus and OOD tools will also be integrated with the AC-CESS Support Portal (ASP), so that a user can login to one place to use all of these tools. The ASP has underpinnings in the Connect.CI portal [4]. This tool provides a window into the ACCESS service, allowing users to monitor and apply for allocations and support, conduct account management activities, submit tickets, and join affinity groups. ...

The Connect.Cyberinfrastructure Portal
  • Citing Conference Paper
  • July 2021

... The Distress Analysis Interview Corpus/Wizard-of-Oz set (DAIC-WOZ) dataset DeVault et al., 2014) comprises voice and text samples from 189 interviewed healthy and control persons, as wells as their PHQ-8 depression detection questionnaire. This dataset is commonly used in many of the depression detection research works, including (Gong & Poellabauer, 2017;Sun et al., 2017) for text-based detection, (Dubagunta et al., 2019;Toto et al., 2020;Tlachac et al., 2020) for voice-based detection, and in multi-modal architectures such as (Alhanai et al., 2018;Yang et al., 2021). We also used both ...

Audio-based Depression Screening using Sliding Window Sub-clip Pooling
  • Citing Conference Paper
  • December 2020

... Wang et al. [19] validated that the topological features could identify the left temporal region variations and developed a seizure state detection approach. Tlachac et al. explored the audio topological features of depression screening with audio clips from open-ended clinical interviews and scripted crowd-sourced recordings [20]. Saba et al. [21] used the topological barcodes lifetime as a feature toward wheeze detection, similar works performed for preliminary action recognition were introduced in [22] and [23]. ...

Topological Data Analysis to Engineer Features from Audio Signals for Depression Detection
  • Citing Conference Paper
  • December 2020

... 2) Graphs were built using data from specific sensors and labels. Some sensor data can be privacy sensitive (e.g., GPS), and less than 50% of users are willing to grant permission access permission [13]. In contrast, our approach derives the graph directly from CHAR data based solely on label cooccurrence information observed in the training set. ...

Moodable: On feasibility of instantaneous depression assessment using machine learning on voice samples with retrospectively harvested smartphone and social media data
  • Citing Article
  • April 2020

Smart Health