Scott Counts’s research while affiliated with Microsoft and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (82)


Spontaneous Inference of Personality Traits and Effects on Memory for Online Profiles
  • Article

September 2021

·

2 Citations

Proceedings of the International AAAI Conference on Web and Social Media

Kristin Stecher

·

Scott Counts

As users navigate online social spaces, they encounter numerous personal profiles, each displaying a unique constellation of attributes. How do users make sense of this information? In our first study, we provide evidence that users spontaneously make personality trait inferences about people from profiles they encounter online, and for certain profiles, preferentially remember this inferred trait content over actual profile content. Study 2 uses several measures of profile coherence to assess how the coherence of user profiles interacts with trait inferences to influence memory for profiles. Findings provide a better understanding of specific profile content that makes profiles memorable and the social-cognitive process utilized when extracting information from profiles.


Thin Slices of Online Profile Attributes

September 2021

·

4 Reads

·

1 Citation

Proceedings of the International AAAI Conference on Web and Social Media

People form consistent impressions of others given surprisingly little information. With the advent of social networks, impressions now may form online rather than in a face-to-face context. This research explores aspects of online impression formation and discusses the crucial role of user profiles in this process. By examining users' decisions in an experimentally controlled social network, we show that users need only a thin slice of profile information in order to form impressions of others online. Additionally, specific profile attributes are evaluated for their perceived utility (how much do users choose to view these attributes), predictiveness (how well they serve as a proxy for a full profile), and diagnosticity (their ability to help users choose between online profiles). Findings provide design suggestions for better profile displays when space is restricted.


Understanding Anti-Vaccination Attitudes in Social Media

August 2021

·

56 Reads

·

128 Citations

Proceedings of the International AAAI Conference on Web and Social Media

The anti-vaccination movement threatens public health by reducing the likelihood of disease eradication. With social media’s purported role in disseminating anti-vaccine information, it is imperative to understand the drivers of attitudes among participants involved in the vaccination debate on a communication channel critical to the movement: Twitter. Using four years of longitudinal data capturing vaccine discussions on Twitter, we identify users who persistently hold pro and anti attitudes, and those who newly adopt anti attitudes towards vaccination. After gathering each user’s entire Twitter timeline, totaling to over 3 million tweets, we explore differences in the individual narratives across the user cohorts. We find that those with long-term anti-vaccination attitudes manifest conspiratorial thinking, mistrust in government, and are resolute and in-group focused in language. New adoptees appear to be predisposed to form anti-vaccination attitudes via similar government distrust and general paranoia, but are more social and less certain than their long-term counterparts. We discuss how this apparent predisposition can interact with social media-fueled events to bring newcomers into the anti-vaccination movement. Given the strong base of conspiratorial thinking underlying anti-vaccination attitudes, we conclude by highlighting the need for alternatives to traditional methods of using authoritative sources such as the government when correcting misleading vaccination claims.


Taking It All In? Visual Attention in Microblog Consumption

August 2021

·

6 Reads

·

11 Citations

Proceedings of the International AAAI Conference on Web and Social Media

Microblogging environments such as Twitter present a modality for interacting with information characterized by exposure to information “streams”. In this work, we examine what information in that stream is attended to, and how that attention corresponds to other aspects of microblog consumption and participation. To do this, we measured eye gaze, memory for content, interest ratings, and intended behavior of active Twitter users as they read their tweet steams. Our analyses focus on three sets of alignments: first, whether attention corresponds to other measures of user cognition such as memory (e.g., do people even remember what they attend to?); second, whether attention corresponds to behavior (e.g., are users likely to retweet content that is given the most attention); and third, whether attention corresponds to other attributes of the content and its presentation (e.g., do links attract attention?). We show a positive but imperfect alignment between user attention and other measures of user cognition like memory and interest, and between attention and behaviors like retweeting. To the third alignment, we show that the relationship between attention and attributes of tweets, such as whether it contains a link or is from a friend versus an organization, are complicated and in some cases counterintuitive. We discuss findings in relation to large scale phenomena like information diffusion and also suggest design directions to help maximize user attention in microblog environments


Predicting Depression via Social Media

August 2021

·

157 Reads

·

509 Citations

Proceedings of the International AAAI Conference on Web and Social Media

Major depression constitutes a serious challenge in personal and public health. Tens of millions of people each year suffer from depression and only a fraction receives adequate treatment. We explore the potential to use social media to detect and diagnose major depressive disorder in individuals. We first employ crowdsourcing to compile a set of Twitter users who report being diagnosed with clinical depression, based on a standard psychometric instrument. Through their social media postings over a year preceding the onset of depression, we measure behavioral attributes relating to social engagement, emotion, language and linguistic styles, ego network, and mentions of antidepressant medications. We leverage these behavioral cues, to build a statistical classifier that provides estimates of the risk of depression, before the reported onset. We find that social media contains useful signals for characterizing the onset of depression in individuals, as measured through decrease in social activity, raised negative affect, highly clustered egonetworks, heightened relational and medicinal concerns, and greater expression of religious involvement. We believe our findings and methods may be useful in developing tools for identifying the onset of major depression, for use by healthcare agencies; or on behalf of individuals, enabling those suffering from depression to be more proactive about their mental health.


Find Me the Right Content! Diversity-Based Sampling of Social Media Spaces for Topic-Centric Search

August 2021

·

5 Reads

·

2 Citations

Proceedings of the International AAAI Conference on Web and Social Media

Social media and networking websites, such as Twitter and Facebook, generate large quantities of information and have become mechanisms for real-time content dissipation to users. An important question that arises is: how do we sample such social media information spaces in order to deliver relevant content on a topic to end users? Notice that these large-scale information spaces are inherently diverse, featuring a wide array of attributes such as location, recency, degree of diffusion effects in the network and so on. Naturally, for the end user, different levels of diversity in social media content can significantly impact the information consumption experience: low diversity can provide focused content that may be simpler to understand, while high diversity can increase breadth in the exposure to multiple opinions and perspectives. Hence to address our research question, we turn to diversity as a core concept in our proposed sampling methodology. Here we are motivated by ideas in the "compressive sensing" literature and utilize the notion of sparsity in social media information to represent such large spaces via a small number of basis components. Thereafter we use a greedy iterative clustering technique on this transformed space to construct samples matching a desired level of diversity. Based on Twitter Firehose data, we demonstrate quantitatively that our method is robust, and performs better than other baseline techniques over a variety of trending topics. In a user study, we further show that users find samples generated by our method to be more interesting and subjectively engaging compared to techniques inspired by state-of-the-art systems, with improvements in the range of 15--45%.


What's in a @name? How Name Value Biases Judgment of Microblog Authors

August 2021

·

2 Reads

·

3 Citations

Proceedings of the International AAAI Conference on Web and Social Media

Bias can be defined as selective favoritism exhibited by human beings when posed with a task of decision making across multiple options. Online communities present plenty of decision making opportunities to their users. Users exhibit biases in their attachments, voting and ratings and other tasks of decision making. We study bias amongst microblog users due to the value of an author's name. We describe the relationship between name value bias and number of followers, and cluster authors and readers based on patterns of bias they receive and exhibit, respectively. For authors we show that content from known names (e.g., @CNN) is rated artificially high, while content from unknown names is rated artificially low. For readers, our results indicate that there are two types: slightly biased, heavily biased. A subsequent analysis of Twitter author names revealed attributes of names that underlie this bias, including effects for gender, type of name (individual versus organization), and degree of topical relevance. We discuss how our work can be instructive to content distributors and search engines in leveraging and presenting microblog content.


Distribution of area and population of census tracts
A: Frequency distribution of area in kilometer squared and B: population of census tracts included in the study.
Correlation of prediction on training data against the number of nearby census tracts used to perform prediction (m)
A: Search query based model for asthma. B: Search query based model for COPD. C: Land cover model for asthma. D: Land cover model for COPD.
Predicted against original prevalence of asthma
A: Model based on search queries. B: Model based on search queries and land cover data. C: Model based on search queries, census and land cover data. Each point denotes a distinct census tract in the dataset. Dotted line represent x = y.
Predicted against original prevalence of COPD
A: Model based on search queries. B: Model based on search queries and land cover data. C: Model based on search queries, census and land cover data. Each point denotes a distinct census tract in the dataset. Dotted line represent x = y.
Census tract level prevalence of asthma for the city of Chicago in U.S
A: Ground truth as estimated by CDC. B: Estimates from search query based model. C: Estimates from model based on search queries, land cover and census data. Greyed areas show census tracts for which data was not available from any one of the data sources.

+4

Neighborhood level chronic respiratory disease prevalence estimation using search query data
  • Article
  • Full-text available

June 2021

·

23 Reads

·

2 Citations

Estimation of disease prevalence at sub-city neighborhood scale allows early and targeted interventions that can help save lives and reduce public health burdens. However, the cost-prohibitive nature of highly localized data collection and sparsity of representative signals, has made it challenging to identify neighborhood scale prevalence of disease. To overcome this challenge, we utilize alternative data sources, which are both less sparse and representative of localized disease prevalence: using query data from a large commercial search engine, we identify the prevalence of respiratory illness in the United States, localized to census tract geographic granularity. Focusing on asthma and Chronic Obstructive Pulmonary Disease (COPD), we construct a set of features based on searches for symptoms, medications, and disease-related information, and use these to identify illness rates in more than 23 thousand tracts in 500 cities across the United States. Out of sample model estimates from search data alone correlate with ground truth illness rate estimates from the CDC at 0.69 to 0.76, with simple additions to these models raising those correlations to as high as 0.84. We then show that in practice search query data can be added to other relevant data such as census or land cover data to boost results, with models that incorporate all data sources correlating with ground truth data at 0.91 for asthma and 0.88 for COPD.

Download



Citations (65)


... В настоящее время большинство исследований посвящены распространению [24,25] и радикализации [26,27] антивакцинаторских настроений в социальных сетях. Большую роль в понимании механизма трансформации критических мнений и настроений по поводу вакцинации играют исследования влияния различных стационарных и интерактивных элементов сайтов (посты, комментарии, лайки) на формирование и проявление антивакцинаторских установок [28,29,30]. В статьях также рассматриваются структурные характеристики такого онлайн-взаимодействия. ...

Reference:

Possibilities of analyzing the network connectivity of ideological and monothematic radical online communities on VKontakte
Understanding Anti-Vaccination Attitudes in Social Media
  • Citing Article
  • August 2021

Proceedings of the International AAAI Conference on Web and Social Media

... By analyzing various data sources such as social media posts, surveys, and EHRs, SVMs can identify individuals at risk for conditions like depression, anxiety, and suicidal tendencies. For instance, De Choudhury et al. (2013) used SVMs to analyze social media data to predict the onset of depression, identifying behavioral markers that could serve as early warning signs [15]. This approach to mental health monitoring allowed for proactive intervention before the onset of severe symptoms, improving outcomes for individuals at risk. ...

Predicting Depression via Social Media
  • Citing Article
  • August 2021

Proceedings of the International AAAI Conference on Web and Social Media

... These approaches have been adopted on social media to examine the effects of COVID-19 on mental health (Saha et al. 2020;Jha et al. 2021;Vowels et al. 2023). In the context of college students, research has adopted causal approaches in social media data on the effects of on-campus gun violence (Saha and De Choudhury 2017), public service announcements after student deaths (Saha et al. 2018), hateful speech , and alcohol use (Kıcıman, Counts, and Gasser 2018) on college campuses. Methodologically, our work draws on the above causal-inference literature, particularly the potential outcomes framework (Rubin 2005), interrupted time series (McDowall, McCleary, and Bartos 2019), and causal impact estimation (Brodersen et al. 2015) to mitigate the confounds and unveil temporal effects of the pandemic on the mental health of college students. ...

Using Longitudinal Social Media Analysis to Understand the Effects of Early College Alcohol Use
  • Citing Article
  • June 2018

Proceedings of the International AAAI Conference on Web and Social Media

... Related to workplace wellbeing in particular, several studies in CSCW and HCI have leveraged web and social media data to study employee behavior [26,40,71,78,99]. Ehrlich and Shami studied employees' motivations for using social media [48]. ...

Measuring Professional Skill Development in U.S. Cities Using Internet Search Queries
  • Citing Article
  • July 2019

Proceedings of the International AAAI Conference on Web and Social Media

... Although there have been a lot of works on social recommendation, most of them ignore the attention factor which results in the constraint that only a small portion of information can be processed in real time by each individual due to her limited mind strength and brain capacity [13,27]. Recent works [4,8,10,37] have also confirmed the important role this factor plays in affecting people's behaviours and their interactions in social media. Actually people now with online social networks are easier to get connected, especially for those who are not close enough to become friends off-line, causing the fact that many of our friends on social networks may produce noisy/useless information. ...

Taking It All In? Visual Attention in Microblog Consumption
  • Citing Article
  • August 2021

Proceedings of the International AAAI Conference on Web and Social Media

... M e s h in (http://www.meshin.com) and Salsa [35] achieve this by providing insights about the worker's inbox from outside sources. These tools are all centripetal to email, as they pull in relevant information from various sources and past activity while relating it to emails in an inbox. ...

Salsa: Leveraging Email to Create a Social Network for the Enterprise
  • Citing Article
  • March 2009

Proceedings of the International AAAI Conference on Web and Social Media

... As to specific internal motives in engaging in retweets, scholars have suggested, "to amplify or spread tweets to new audiences", "to publicly agree with someone" and "to validate others' thoughts" among others [28]. In addition, studies show that users adopt their communication behaviors to their audiences' preferences [29] and that they use Twitter to make statements about their positions in a social context [30], therefore, retweeting could be said to have a strong social component [31]. ...

Predicting the Speed, Scale, and Range of Information Diffusion in Twitter
  • Citing Article
  • May 2010

Proceedings of the International AAAI Conference on Web and Social Media

... A ranking of workers will be considered unfair if it is biased toward certain groups of people, such as white males. This commonly happens since ranking usually depends on the social feedback received by workers in the form of reviews and ratings, and on the number of their past jobs, both of which perpetuate bias against certain groups of workers [9,16,17,29]. In this paper, we propose the first theoretical framework that can be used to assess and compare worker fairness of multiple jobs on multiple platforms. ...

An Experimental Study of Bias in Platform Worker Ratings: The Role of Performance Quality and Gender
  • Citing Article
  • April 2020

... [18] developed a personalized tweet ranking method based on a retweet metric, useful in reordering feeds or distributing items to users more likely to retweet. Paek et al. [19] asked Facebook users about the perceived importance of items in their timeline, developed classifiers to identify important messages and friends, and studied the predictive power of a number of features including likes, number of comments, presence of links and images, textual information, and shared background information. Both the tie strength and network structure approaches rely on explicit interaction as a tool for estimating tie strength; just as with retweet prediction, being able to identify non-explicit responses might improve these models. ...

Predicting the Importance of Newsfeed Posts and Social Network Friends
  • Citing Article
  • July 2010

Proceedings of the AAAI Conference on Artificial Intelligence

... Data sources used by the studies included longitudinal survey data (n = 27, 41.54%) [20, 27-31, 33, 34, 38-41, 43-45, 49, 51, 54, 55, 57, 63, 64, 71, 74, 75, 78, 82], biomedical databases (n = 9, 13.85%) [21,24,26,36,50,53,56,59,80], electronic medical records (n = 16, 24.62%) [22,25,32,35,37,52,58,62,65,66,70,73,77,81,83,84], social media textual elements (n = 1, 1.54%) [23], administrative claims (n = 8, 12.31%) [47,48,60,61,67,69,72,79,82], laboratory data (n = 1, 1.54%) [42], cellular data (n = 1, 1.54%) [76], search-engine queries (n = 1, 1.54%) [68], and wearable sensors (n = 1, 1.54%) [46] . ...

Neighborhood level chronic respiratory disease prevalence estimation using search query data