Daimin Shi’s research while affiliated with Northwest Normal University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (9)


Speech Depression Recognition from the Selfreference Effect Using LSTM with ResNet
  • Conference Paper

December 2024

·

2 Reads

Daimin Shi

·

·

Yang Liu

·

[...]

·

Tao Pan


Fig. 1. The architecture of our system of depression recognition.
Fig. 2. The pipeline of the proposed architecture for the recognition of depression.
Fig. 4. Diagram of channel attention.
Fig. 7. Steps for calculating MFCC coefficients.
Fig. 9. The neural network structure for the proposed technique.

+10

Speech depression recognition based on attentional residual network
  • Article
  • Full-text available

December 2021

·

56 Reads

·

24 Citations

Background: Depressive disorder is a common affective disorder, also known as depression, which is characterized by sadness, loss of interest, feelings of guilt or low self-worth and poor concentration. As speech is easy to obtain non-offensively with low-cost, many researchers explore the possibility of depression prediction through speech. Adopting speech signals to recognize depression has important practical significance. Aiming at the problem of the complex structure of the deep neural network method used in the recognition of speech depression and the traditional machine learning methods need to manually extract the features and the low recognition rate. Methods: This paper proposes a model that combines residual thinking and attention mechanism. First, depression corpus is designed based on the classic psychological experimental paradigm self-reference effect (SRE), and the speech dataset is labeled; then the attention module is introduced into the residual, and the channel attention is used to learn the features of the channel dimension, the spatial attention feedback the features of the spatial dimension, and the combination of the two to obtain the attention residual unit; finally the stacking unit constructs a speech depression recognition model based on the attention residual network. Results: Experimental results show that compared with traditional machine learning methods, this model obtains better results in the recognition of depression, which can meet the need for actual recognition application of depression. Conclusions: In this study, we not only predict whether person is depressed, but also estimate the severity of depression. In the designed corpus, the depression binary classification of an individual is given based on the severity of depression which is measured using BDI-II scores. Experimental results show that spontaneous speech can obtain better results than automatic speech, and the classification of speech features corresponding to negative questions is better than other tasks under negative emotions. Besides, the recognition accuracy rate of both male and female subjects is higher than that under other emotions.

Download






Citations (6)


... Bailey [13] found that the gender bias in DAIC-WOZ may affect classification performance. Liu [14] used gender recognition as an auxiliary task for depression recognition, and the average accuracy was improved by 6.1% compared with the nonauxiliary task. Verma [15] divided the features into four categories according to gender and emotion and tested the influence of gender and emotion on depression recognition, and used 1D-Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) for depression recognition. ...

Reference:

Hierarchical Multi-Task Learning Based on Interactive Multi-Head Attention Feature Fusion for Speech Depression Recognition
Improved Depression Recognition Using Attention and Multitask Learning of Gender Recognition
  • Citing Conference Paper
  • December 2021

... The metal samples produced by 4D printing exhibit greater mechanical qualities when compared with the majority of 4D printing materials, like hydrogels and polymers, demonstrating that this approach offers a wide variety of technical applications. In addition, laser stimulation's metallic 4D printing technology enables very flexible deformations of Carbon-based materials, such as carbon nanotubes (CNTs) or graphene 10 2 to 10 5 S/m Non-conductive polymers and composites 10 -12 to 10 2 S/m metallic 3D components [74,75]. As demonstrated in Fig. 9, the shape-morphing theory involves using a laser to serve as a stimulating source of heat to deliberately generate internal thermal stress in the specimen in order in order to accomplish 2D-to-3D structural transformation. ...

Depressive Tendency Recognition Using the Gated Recurrent Unit From Speech and Text Features
  • Citing Conference Paper
  • December 2021

... Similarly, [24] and [111] introduced models emphasizing comprehensive representation and non-content speech parameters, both showing promise in the accurate detection of depressive states. Adding to this, [71] introduced an attention-based residual network that outperforms traditional methods by using speech features related to negative questions under negative emotions, showing high accuracy for both male and female subjects. [80] and [130] used audiovisual and multimodal data, respectively, to predict depression and its relapse with promising results. ...

Speech depression recognition based on attentional residual network

... Arroz et al. [35] compared algorithms for unimodal, automatic, and multimodal classification conversations, with LSTM and gated recurrent units (GRU). Alternative approaches to multimodal depression detection encompass the examination of various indicators such as the dynamics of acoustic, facial, head movement [27], [39], behavioural and physiological signals [40], brain functional abnormalities, heart rate variability, hemodynamic parameters [41], and partially convergent structural features [23]. ...

Design on Modeling of Multimodal Depression Aided Diagnosis from Psychological Perspective
  • Citing Conference Paper
  • December 2020

... However, this research stream has primarily focused on specific aspects of emotions, such as identifying depression (Rejaibi et al. 2022;Lin et al. 2020;An et al. 2019), early signs of diseases from patients' speech patterns (DeSouza et al. 2021;Khanbhai et al. 2021;Beltrami et al. 2018;Perez et al. 2018), or challenges in emotion recognition among older adults (Schuller et al. 2020), without fully harnessing the potential of those technologies for the automated assessment and monitoring of subjective wellbeing. By utilizing research on NLP analytic capabilities, we aim to address the global challenge of healthy aging by assessing subjective well-being through speech analysis. ...

Mental Health Detection from Speech Signal: A Convolution Neural Networks Approach
  • Citing Conference Paper
  • December 2019