Lab

Socio Cognitive


About the lab

Socio Cognitive is a data science lab devoted to social media mining, opinion mining, and social computing directed by Dr. Saddam Hossian Mukta. The primary focus of this lab is to analyze human psychological attributes from their interactions with Social Networking Sites (SNS). We aim to build machine learning models with real-life applications using extracted cognitive attributes from multi-sourced data.

Featured research (17)

Traditional Machine Learning (ML) models are generally preferred for classification tasks on tabular datasets, which often produce unsatisfactory results in complex tabular datasets. Recent works, using Convolutional Neural Networks (CNN) with embedding techniques, outperform the traditional classifiers on tabular dataset. However, these embedding techniques fail to use an automated approach after analyzing the importance of the features in the dataset accurately. This study introduces a novel feature embedding technique named Dynamic Weighted Tabular Method (DWTM), which dynamically uses feature weights based on their strength of the correlations to the class labels during applying any CNN architectures on the tabular datasets. DWTM converts each data point into images and then feeds to a CNN architecture. It dynamically embeds the features of the tabular dataset based on their strength and assigns pixel positions to the appropriate features in the image canvas space instead of using any static configuration. In this paper, DWTM embedding method is applied over six benchmark tabular datasets independently by using three different CNN architectures (i.e., ResNet-18, DenseNet and InceptionV1) and an outstanding performance (an average accuracy of 98%) has obtained, which outperforms any traditional and CNN based classifiers as well.
Background Many people suffer from insomnia, a sleep disorder characterized by difficulty falling and staying asleep during the night. As social media have become a ubiquitous platform to share users’ thoughts, opinions, activities, and preferences with their friends and acquaintances, the shared content across these platforms can be used to diagnose different health problems, including insomnia. Only a few recent studies have examined the prediction of insomnia from Twitter data, and we found research gaps in predicting insomnia from word usage patterns and correlations between users’ insomnia and their Big 5 personality traits as derived from social media interactions. Objective The purpose of this study is to build an insomnia prediction model from users’ psycholinguistic patterns, including the elements of word usage, semantics, and their Big 5 personality traits as derived from tweets. Methods In this paper, we exploited both psycholinguistic and personality traits derived from tweets to identify insomnia patients. First, we built psycholinguistic profiles of the users from their word choices and the semantic relationships between the words of their tweets. We then determined the relationship between a users’ personality traits and insomnia. Finally, we built a double-weighted ensemble classification model to predict insomnia from both psycholinguistic and personality traits as derived from user tweets. Results Our classification model showed strong prediction potential (78.8%) to predict insomnia from tweets. As insomniacs are generally ill-tempered and feel more stress and mental exhaustion, we observed significant correlations of certain word usage patterns among them. They tend to use negative words (eg, “no,” “not,” “never”). Some people frequently use swear words (eg, “damn,” “piss,” “fuck”) with strong temperament. They also use anxious (eg, “worried,” “fearful,” “nervous”) and sad (eg, “crying,” “grief,” “sad”) words in their tweets. We also found that the users with high neuroticism and conscientiousness scores for the Big 5 personality traits likely have strong correlations with insomnia. Additionally, we observed that users with high conscientiousness scores have strong correlations with insomnia patterns, while negative correlation between extraversion and insomnia was also found. Conclusions Our model can help predict insomnia from users’ social media interactions. Thus, incorporating our model into a software system can help family members detect insomnia problems in individuals before they become worse. The software system can also help doctors to diagnose possible insomnia in patients.
Basic human values represent a set of values such as security, independence, success, kindness, and pleasure, which we deem important to our lives. Each of us holds different values with different degrees of significance. Existing studies show that values of a person can be identified from their social network usage. However, the value priority of a person may change over time due to different factors such as life experiences, influence, social structure and technology. Existing studies do not conduct any analysis regarding the change of users' value from the social influence, i.e., group persuasion, form the social media usage. In our research, first, we predict users' value score by the influence of friends from their social media usage. We propose a Bounded Confidence Model (BCM) based value dynamics model from 275 different ego networks in Facebook that predicts how social influence may persuade a person to change their value over time. Then, to predict better, we use particle swarm optimization based hyperparameter tuning technique. We observe that these optimized hyperparameters produce accurate future value score. We also run our approach with different machine learning based methods and find support vector regression (SVR) outperforms other regressor models. By using SVR with the best hyperparameters of BCM model, we find the lowest Mean Squared Error (MSE) score 0.00347.
Federated Learning (FL) is a new technology that has been a hot research topic. It enables training an algorithm across multiple decentralized edge devices or servers holding local data samples, without exchanging them. There are many application domains where large amounts of properly labeled and complete data are not available in a centralized location, for example, doctors’ diagnosis from medical image analysis. There are also growing concerns over data and user privacy as Artificial Intelligence is becoming ubiquitous in new application domains. As such, very recently, a lot of research has been conducted in several areas within the nascent field of FL. A variety of surveys on different subtopics exist in current literature, focusing on specific challenges, design aspects and application domains. In this paper, we review existing contemporary works in the related areas in order to understand the challenges and topics that are emphasized by each type of FL surveys. Furthermore, we categorize FL research in terms of challenges, design factors and applications, conducting a holistic review of each and outlining promising research directions.
Basic human values represent a set of values such as security, independence, success, kindness, and pleasure, which we deem important to our lives. The value priority of a person may change over time due to different factors such as life experiences, influence, social structure and technology. In this study, we show that we can predict the value change of a person by considering both the influence of her friends and her social media usage. This is the first work in the literature that relates the influence of social media friends on the human value dynamics of a user. We propose a Bounded Confidence Model (BCM) based value dynamics model from 275 different ego networks in Facebook that predicts how social influence may persuade a person to change her value over time. Then, to predict better, we use a particle swarm optimization based hyperparameter tuning technique. We observe that these optimized hyperparameters produce more accurate future value score. We also run our approach with different machine learning based methods and find support vector regressor (SVR) outperforms other regressor models. By using SVR with the best hyperparameters of BCM model, we find the lowest Mean Squared Error (MSE) score as 0.00347.

Lab head

Md. Saddam Hossain
Department
  • Department of Computer Science and Engineering
About Md. Saddam Hossain
  • My research interest includes social network analysis and mining, social computing, data mining, and machine learning.

Members (2)

Subrina Shahid Binty
  • United International University
Proshanta Kumer Das
  • United International University

Alumni (2)

Ahmed Shahriar Sakib
  • University of Waterloo
Tanjima Nasreen Jenia
  • University of Alberta