ArticlePDF Available

Abstract and Figures

Filler words (I mean, you know, like, uh, um) are commonly used in spoken conversation. The authors analyzed these five filler words from transcripts recorded by a device called the Electronically Activated Recorder (EAR), which sampled participants' language use in daily conversations over several days. By examining filler words from 263 transcriptions of natural language from five separate studies, the current research sought to clarify the psychometric properties of filler words. An exploratory factor analysis extracted two factors from the five filler words: filled pauses (uh, um) and discourse markers (I mean, you know, like). Overall, filled pauses were used at comparable rates across genders and ages. Discourse markers, however, were more common among women, younger participants, and more conscientious people. These findings suggest that filler word use can be considered a potential social and personality marker.
Content may be subject to copyright.
http://jls.sagepub.com/
Psychology
Journal of Language and Social
http://jls.sagepub.com/content/33/3/328
The online version of this article can be found at:
DOI: 10.1177/0261927X14526993
March 2014
2014 33: 328 originally published online 27Journal of Language and Social Psychology
Charlyn M. Laserna, Yi-Tai Seih and James W. Pennebaker
Gender, and Personality
: Filler Word Use as a Function of Age,You Know . . . Who Like Says Um
Published by:
http://www.sagepublications.com
at: can be foundJournal of Language and Social PsychologyAdditional services and information for
http://jls.sagepub.com/cgi/alertsEmail Alerts:
http://jls.sagepub.com/subscriptionsSubscriptions:
http://www.sagepub.com/journalsReprints.navReprints:
http://www.sagepub.com/journalsPermissions.navPermissions:
http://jls.sagepub.com/content/33/3/328.refs.htmlCitations:
What is This?
- Mar 27, 2014OnlineFirst Version of Record
- May 8, 2014Version of Record >>
by guest on May 8, 2014jls.sagepub.comDownloaded from by guest on May 8, 2014jls.sagepub.comDownloaded from
Journal of Language and Social Psychology
2014, Vol. 33(3) 328 –338
© 2014 SAGE Publications
DOI: 10.1177/0261927X14526993
jls.sagepub.com
Short Research Reports
Um . . . Who Like Says You
Know: Filler Word Use as a
Function of Age, Gender, and
Personality
Charlyn M. Laserna1, Yi-Tai Seih1,
and James W. Pennebaker1
Abstract
Filler words (I mean, you know, like, uh, um) are commonly used in spoken conversation.
The authors analyzed these five filler words from transcripts recorded by a device
called the Electronically Activated Recorder (EAR), which sampled participants’
language use in daily conversations over several days. By examining filler words
from 263 transcriptions of natural language from five separate studies, the current
research sought to clarify the psychometric properties of filler words. An exploratory
factor analysis extracted two factors from the five filler words: filled pauses (uh,
um) and discourse markers (I mean, you know, like). Overall, filled pauses were used
at comparable rates across genders and ages. Discourse markers, however, were
more common among women, younger participants, and more conscientious people.
These findings suggest that filler word use can be considered a potential social and
personality marker.
Keywords
filler word, filled pause, discourse marker, LIWC, EAR
The way we use language in natural spoken conversation is revealing. For instance,
certain aspects of language such as dialects and colloquialisms can be used to deter-
mine where a person was raised. How someone speaks can also indicate whether the
listener is a friend or a stranger. Language may even reveal characteristics such as
1The University of Texas at Austin, TX, USA
Corresponding Author:
Yi-Tai Seih, Department of Psychology, University of Texas, 1 University Place, 108 East Dean Keeton,
Austin, TX 78712, USA.
Email: yitai@utexas.edu
526993JLSXXX10.1177/0261927X14526993Laserna et al.
research-article2014
by guest on May 8, 2014jls.sagepub.comDownloaded from
Laserna et al. 329
gender, age, and personality. One widely used but often overlooked feature of lan-
guage are filler words, which are speech irregularities used in spoken conversation and
commonly regarded as superfluous language spoken by careless speakers (Strassel,
2004). Surprisingly little is known about whether filler words have psychological
implications with regard to communication. The current study examines filled pauses
and discourse markers, two primary categories of filler words. Unlike traditional lin-
guistic research, which investigates how filler words are used, we examined individual
differences to determine who is using these filler words when they converse. To set the
stage for our research, we review filled pauses and discourse markers in language use.
Filled pauses are short utterances commonly used in spontaneous speech (Brennan
& Williams, 1995; Swerts, 1998), uh and um being two of the most frequently used
filled pauses within the English language (Strassel, 2004). In verbal communication,
filled pauses are hypothesized to either act as an unconscious sign of speech disflu-
ency or serve as a signal sent by speakers to convey a certain message. The content of
this message varies and may inform listeners that the speaker needs a pause to collect
his or her thoughts (Fox Tree, 2007) or block the listener from taking the speaker’s
turn away (Maclay & Osgood, 1959). The use of filled pauses tends to increase when
a speaker is faced with challenging choices (Christenfeld, 1994), yet at the same time,
listeners view speakers as less anxious when the speakers use filled pauses (Christenfeld,
1995). Listeners also tend to view filled pauses as an indication that speakers are
unsure about what is being said, suggesting that filled pauses may be a more deliberate
signal sent from the speaker (Brennan & Williams, 1995; Fox Tree, 2007). In either
theory, filled pauses appear to be associated with the processing of complex thoughts.
Since filled pauses have a linguistic effect on spontaneous speech, is this effect
influenced by any variables? In a study performed by Bortfeld, Leon, Bloom, Schober,
and Brennan (2001) where transcriptions of conversation pairs were analyzed, an
increase in the use of uh and um in addition to other disfluency rates were associated
with being older, discussing unfamiliar domains, and taking on a directive role during
conversation. Another study by Tottie (2011) analyzed the frequency of uh and um in
two subcorpora from the British National Corpus that consists of transcribed tele-
phone, face-to-face, and interview conversations. The study discovered that older
people, males, and those with higher levels of education used more filled pauses in
speech than younger people, females, and individuals with lower levels of education.
In a sense, filled pauses may act as markers that identify speakers’ gender, age, and
socioeconomic status.
Unlike filled pauses, discourse markers are short phrases that do not contain any
grammatical information yet are prevalent in natural speech (Fox Tree & Schrock,
2002; Fuller, 2003; Matei, 2011; Strassel, 2004). Although they do not serve a gram-
matical purpose, both laypeople and researchers alike perceive discourse markers as
purposeful signals to a listener rather than as mere signs of disfluency (Fox Tree,
2007). They are generally proposed to act as transitions between different sections of
conversation (Clark, 1996), but discourse marker use seems to heavily depend on the
specific discourse marker. Often, the actual basic meanings of the words that consti-
tute a discourse marker determine its function. For example, the phrase I mean serves
by guest on May 8, 2014jls.sagepub.comDownloaded from
330 Journal of Language and Social Psychology 33(3)
as an indication that a speaker is planning to modify what is said, and you know is used
when the speaker is asking a listener to make inferences about the conversation (e.g.,
Fox Tree, 2007). Other research suggests that another purpose of you know is to con-
firm the understanding of a listener (Erman, 2001). The purpose of the discourse
marker like is more ambiguous, but some studies suggest that speakers use it as a
hedge when they do not want to fully commit to what they say (Fuller, 2003; Sharifian
& Malcolm, 2003). However, Liu and Fox Tree (2012) have countered the suggestion
that like acts as a hedge by showing that this discourse marker exhibits different pat-
terns from other hedges and likely has its own unique function.
Filled pauses and discourse markers are considered to be two categories of filler
words. If the use of filled pauses is affected by certain demographic variables, is the
use of discourse markers also affected by similar variables? A study has examined the
frequencies of discourse markers like and you know with the MICASE corpus (Schleef,
2005). This corpus contains 68 people (18 instructors and 50 students) and consists of
8 hours of lectures and 10 hours of seminars from an equal number of male and female
instructors. The results showed that female students used the discourse marker like
more than male students. In addition, students were more likely to use like than profes-
sors. Since professors are generally older than students, this finding concerning con-
versational roles may suggest that age affects discourse marker use.
Although previous research has described the underlying meanings and functions
of the two types of filler words, some limitations still exist in current literature. For
instance, past research examined discourse markers and filled pauses within one study
and discovered differences between these two categories (e.g., Fox Tree, 2006; Fox
Tree, Mayer, & Betts, 2011), but little research has been conducted on exploring the
personalities of the people who tend to use filler words. Although Mairesse and Walker
(2008) have shown that it is possible to estimate personality by examining certain
language parameters such as filled pauses (e.g., I mean, err, you know), their personal-
ity results were generated by human judges instead of original speakers and did not
show any direct correlation with filler words.
Personality traits can be assessed by self-report measures or judges’ ratings, but
little research examines the correlation between self-report personality traits and filler
words. It may be worthwhile to determine if self-reported personality is comparable to
assessed personality deduced from judges’ ratings on the use of filler words. In addi-
tion, if filler words are found to be reliable personality markers, further research using
self-report personality measures may be able to use filler word frequency to quickly
approximate personality traits in participants. Overall, the purpose of the current
research was to investigate the psychometric properties of filler words and revisit the
relationships between filler words, demographic variables, and personality traits.
The current research aimed to investigate how the frequency of filled pauses and
discourse markers used in the English language varies with two basic demographic
variables (gender and age) and personality traits. The present study focused on three
common discourse markers in the English language (I mean, you know, and like) and
two filled pauses (uh and um). The psychometric properties of these five filler words
and two categories were examined. Because most past research on filler words has
by guest on May 8, 2014jls.sagepub.comDownloaded from
Laserna et al. 331
Table 1. The Descriptions of Transcriptions From Each Study for Analysis.
Study
Number of
transcriptions
Percent
female Age (SD)
Word count
(SD) Days Description
Mehl and
Pennebaker
(2003a)
13 62 21.4 (3.50) 6,750 (2,784) 10 Analysis of participants’
reactions to September 11,
2001
Mehl and
Pennebaker
(2003b)
50 54 19.0 (1.31) 1,007 (590) 4 A study on patterns in the
natural language of college
students
Baddeley,
Pennebaker,
and Beevers
(2013)
27 63 32.5 (13.8) 4,066 (2,950) 4 An examination of linguistic
indicators of negative social
functioning with depressive
disorder
Fellows (2009) 76 51 35.2 (5.88) 4,786 (4,469) 1 A study about how preschool-
aged children and parents
use emotion language
Mehl,
Gosling, and
Pennebaker
(2006)
97 47 18.7 (0.91) 995 (526) 2 A study that examined
personality traits by using
natural language
Note. The average word count for each participant was 2,692, and the total word count from the participants was
708,217.
been based on experimental data (Tottie, 2011), the present study focused on transcrip-
tions that were transcribed from daily conversations recorded by Electronically
Activated Recorders (EARs). The EAR is an electronic device designed for sampling
natural spoken conversation during daily activities (Mehl, Pennebaker, Crow, Dabbs,
& Price, 2001). By using the EAR, the present study could examine filler words within
natural, extended interactions over the course of several days.
Method
Participants
The transcriptions of 263 participants (137 females) were included in the current study.
The participants of the transcribed conversations ranged in age from 17 to 69 years
(M = 25.1, SD = 9.38). The 263 participants were from five studies whose detailed
information is shown in Table 1.
EAR Corpus and Coding
This study used a corpus of transcriptions obtained through the EAR, which is a device
programmed to automatically take audio recordings after set intervals of time (Mehl et
al., 2001). The EAR was worn by participants for a period of 2 to 3 days while they
went about their daily lives, giving the EAR the ability to collect truly spontaneous
conversation. Any clearly audible conversations between participants and the listener
by guest on May 8, 2014jls.sagepub.comDownloaded from
332 Journal of Language and Social Psychology 33(3)
were then transcribed. Those performing the transcribing were instructed to not omit
filled pauses and discourse markers.
Procedure
The present study used the computerized text analysis program Linguistic Inquiry and
Word Count (Pennebaker, Booth, & Francis, 2007) to calculate the rates of filled
pauses and discourse markers used within each transcription as well as the total num-
ber of words spoken by an individual during a conversation. These calculations were
then used to determine the proportion of conversation devoted to filled pauses and
discourse markers. The proportions for each age and gender were then statistically
analyzed and compared.
Three of the transcription sets were from studies where participants’ personalities
were determined using the Big Five Inventory (Fellows, 2009; Mehl & Pennebaker,
2003a; Mehl & Pennebaker, 2003b). One study used the Ten-Item Personality Scale on
participants (Baddeley et al., 2013), and one study used the NEO Personality Inventory
on participants (Mehl et al., 2006). Since these three different versions of personality
scales were highly related to each other (Gosling, Rentfrow, & Swann, 2003), all per-
sonality scores were standardized for the current study and examined according to the
Big Five personality traits. Eleven participants did not complete any personality mea-
sure, resulting in a total of 252 participants included in personality analysis.
Results
The current study sought to examine three aspects of filler words. First, the psycho-
metric properties of the five filler words were examined to clarify the associations
between filler words. Second, filled pauses and discourse markers were correlated
with age and gender. Third, the two types of filler words were examined according to
personality traits.
Each of the five filler words was analyzed by its base rates, which are presented in
Table 2. A one-way within-subjects analysis of variance showed that filler word rates
were used at significantly different rates, F(4, 1,048) = 141.8, p < .001. The least sig-
nificant difference post hoc comparison indicated that participants used like more than
the other four filler words included in the study (ps < .001). Correlation analysis was
performed to determine any associations between filler words. As shown in Table 2, uh
was not related to the discourse markers I mean, you know, and like, implying that the
underlying mechanism behind certain filler words might have different concepts. The
correlations between gender, age, and each filler word are also reported in Table 2 as
additional information.
To understand the structure of the filler words, we employed an exploratory factor
analysis with the five filler words. A principal component method with a varimax rota-
tion was used. The Kaiser-Meyer-Olkin measure of sampling adequacy was signifi-
cant (Kaiser-Meyer-Olkin = .65, p < .001), indicating that these five filler words were
factorable. The scree plot suggested two factors, and the two factors together accounted
by guest on May 8, 2014jls.sagepub.comDownloaded from
Laserna et al. 333
for 60.8% of the total variance. The factor loading matrix is shown in Table 2. The first
extracted factor included the discourse markers I mean, you know, and like, whereas
the second extracted factor included the filled pause uh and um, supporting past theo-
ries of filled pauses.
Thus, the first factor referred to discourse markers, and the second factor referred
to filled pauses. According to the findings of our factor analysis, the rates of I mean,
you know, and like were summed to be the rate of discourse markers (M = 1.43; SD =
1.40), and the rates of uh and um were summed to be the rate of filled pauses (M =
0.64; SD = 0.63). Importantly, the rates of filled pauses and discourse markers were
positively correlated with each other (r = .26, p < .001), strengthening the idea that
both filled pauses and discourse markers belong within the same category. These two
categories were used in the following analyses.
The rate of discourse markers was positively associated with gender (male = 1,
female = 2; r = .20, p < .01) but negatively associated with age (r = −.50, p < .001),
suggesting that female and young participants were more likely to use discourse mark-
ers. On the contrary, the rate of filled pauses was not associated with gender (r = −.04,
p = .50) but associated with age (r = −.12, p = .05).
With these correlational findings, we became curious about the developmental
trend of these two types of filler words. We divided participants into four categories:
early college (17-19), late college (20-22), early adulthood (23-34), and adulthood (35
and older). Two 2 (gender) × 4 (age categories) between-subjects analyses of variance
were conducted separately on the two categories of filler words. The mean rates are
presented in Figure 1. With regard to discourse markers, there was a significant inter-
action effect between gender and age, F(3, 255) = 4.08, p < .01. The least significant
difference post hoc comparisons indicated that females used more discourse markers
than males in early and late college (ps < .001). The main effect on gender was signifi-
cant, F(1, 255) = 8.71, p < .01, and so was the main effect on age, F(3, 255) = 45.2,
p < .001. On the contrary, the interaction effect and the main effect of gender on filled
pause rates were not significant. Only the main effect on age on filled pause rates was
significant, F(3, 255) = 2.67, p = .05. Overall, the use of discourse markers and filled
pauses displayed a developmental trend.
Last, to examine the relationship between filler word use and personality, we cor-
related personality scores with the rates of discourse markers and filled pauses while
Table 2. Basic Psychometric Properties, Correlations on Gender and Age, and Component
Loadings for the Five Filler Words.
Mean (SD) (1) (2) (3) (4) (5) Gender Age Factor 1 Factor 2
(1) I mean 0.12 (0.18) −.05 −.24*** .67 .18
(2) you know 0.18 (0.28) −.26*** — −.16** −.11 .74 .10
(3) like 1.13 (1.17) −.32*** .48*** — −.19*** −.54*** .80 −.13
(4) uh 0.35 (0.42) −.03 .05 .05 −.15* −.01 −.13 .89
(5) um 0.29 (0.40) −.19** .22*** .36*** .21*** — −.09 −.21*** .45 .60
Note. Gender: Male = 1; Female = 2. Factor 1 is discourse markers, whereas Factor 2 is filled pauses.
***p < .001. **p < .01. *p < .05.
by guest on May 8, 2014jls.sagepub.comDownloaded from
334 Journal of Language and Social Psychology 33(3)
controlling for gender and age (df = 248). Only conscientiousness was found to be
related to discourse markers (r = .14, p = .03), which could, in theory, be attributed to
a Type I error given the number of correlations tested. None of the Big Five personal-
ity traits were related to the use of filled pauses.
Figure 1. Mean rates of discourse markers and filled pauses by gender and age per person.
Note. The sample size was 123 for early college, 36 for late college, 59 for early adulthood, and 45 for
adulthood. The discourse marker category included I mean, you know, and like. The filled pause category
included uh and um.
by guest on May 8, 2014jls.sagepub.comDownloaded from
Laserna et al. 335
Discussion
Past research has mainly discussed filled pauses and discourse markers separately and
neglected to examine the relation between these two types of filler words. The current
research sought to look at a bigger picture and analyze filled pauses and discourse
markers in relation to one another. There were several interesting findings regarding
the psychometric properties of the filler words in this study. First, two factors were
extracted from our factor analysis and were found to be related to each other. This
finding suggests that the use of filled pauses and discourse markers is not identical
despite both categories having been discussed together as filler words (e.g., Strassel,
2004). In addition to the factor analysis, the use of filled pauses was found to be asso-
ciated with age but not with gender, whereas the use of discourse markers was found
to be associated with both gender and age. This suggests that people who were young,
female, or both young and female are more likely to use discourse markers. This result
supports previous findings regarding the use of the discourse marker like (Schleef,
2005). Finally, the use of discourse markers was associated with conscientiousness,
indicating that discourse markers can potentially serve as personality markers.
The present research has practical significance because it has shown that filler
words can serve as markers for age and gender. Our results extended previous research
by demonstrating a developmental trend that indicates that the gender effect on the use
of discourse markers only emerges during early and late college. As people become
older, the gender effect disappears. This trend may be indicative of a normative life
transition into adult roles, such as when one graduates from college and enters a job
market. A career role change may be the possible factor that leads people to change
their use of filler words.
What type of people are more likely to use discourse markers or filled pauses? In
our correlational results, conscientious people used more discourse markers. The pos-
sible explanation for this association is that conscientious people are generally more
thoughtful and aware of themselves and their surroundings. When having conversa-
tions with listeners, conscientious people use discourse markers, such as I mean and
you know, to imply their desire to share or rephrase opinions to recipients. Thus, it is
expected that the use of discourse markers may be used to measure the degree to which
people have thoughts to express. As for filled pauses, their use has been considered to
be a reflection of anxiety (e.g., Christenfeld & Creager, 1996; Scherer & Scherer,
1981). However, our measure of neuroticism was not related to the use of filled pauses
in this research. The claim that speaker anxiety is related to the use of filled pauses
should be more carefully examined in future research.
Previous research has documented filler words as markers of people’s psychologi-
cal states (Erman, 2001; Fuller, 2003). In the current study, we not only clarified the
psychometric structure of the two types of filler words but also extended the work to
personality traits. When people first meet people, they usually approximate strangers’
personalities and base their opinions on what is said and how they say it. From a meth-
odological standpoint, the use of discourse markers can provide a quick behavioral
measure of personality traits. More important, we used extended conversations with
by guest on May 8, 2014jls.sagepub.comDownloaded from
336 Journal of Language and Social Psychology 33(3)
speakers to study how filler words function in daily lives. This strategy provides better
ecological validity to investigate filler word use. With an increased understanding of
why and how filler words are used in verbal communication, we anticipate that people
may one day be able to use the active interpretation of filler words to improve the qual-
ity of their communication with others.
Acknowledgments
We would like to thank Matthias Mehl, Jenna Baddeley, and Michelle Fellows, who provided
data to allow us to reanalyze it. We also thank the editor Howard Giles and our anonymous
reviewers for their valuable comments on our article.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship,
and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship,
and/or publication of this article: The research was supported in part by the Army Research
Institute (W5J9CQ-12-C-0043) and the National Science Foundation (IIS-1344257; NSCC-
0904913; BCS-1228693).
References
Baddeley, J. L., Pennebaker, J. W., & Beevers, C. G. (2013). Everyday social behavior during a
major depressive episode. Social Psychological and Personality Science, 4, 445-452.
Bortfeld, H., Leon, S. D., Bloom, J. E., Schober, M. F., & Brennan, S. E. (2001). Disfluency
rates in conversation: Effects of age, relationship, topic, role, and gender. Language and
Speech, 44, 123-147.
Brennan, S. E., & Williams, M. (1995). The feeling of another’s knowing: Prosody and filled
pauses as cues to listeners about the metacognitive states of speakers. Journal of Memory
and Language, 34, 383-398.
Christenfeld, N. (1994). Options and ums. Journal of Language and Social Psychology, 13,
192-199.
Christenfeld, N. (1995). Does it hurt to say um? Journal of Nonverbal Behavior, 19, 171-186.
Christenfeld, N., & Creager, B. (1996). Anxiety, alcohol, aphasia, and ums. Journal of
Personality and Social Psychology, 70, 451-460.
Clark, H. (1996). Using language. Computational Linguistics, 23, 625-628.
Erman, B. (2001). Pragmatic markers revisited with a focus on you know in adult and adoles-
cent talk. Journal of Pragmatics, 33, 1337-1359.
Fellows, M. D. (2009). An exploration of emotion language use by preschool-aged children
and their parents: Naturalistic and lab settings (Doctoral dissertation). The University of
Texas at Austin.
Fox Tree, J. E. (2006). Placing like in telling stories. Discourse Studies, 8, 723-743.
Fox Tree, J. E. (2007). Folk notions of um and uh, like, and you know. Text and Talk, 27, 297-314.
Fox Tree, J. E., Mayer, S. A., & Betts, T. E. (2011) Grounding in instant messaging. Journal of
Educational Computing Research, 45, 455-475.
by guest on May 8, 2014jls.sagepub.comDownloaded from
Laserna et al. 337
Fox Tree, J. E., & Schrock, J. C. (2002). Basic meanings of you know and I mean. Journal of
Pragmatics, 34, 727-747.
Fuller, J. (2003). Use of the discourse marker like in interviews. Journal of Sociolinguistics, 7,
365-377.
Gosling, S. D., Rentfrow, P. J., & Swann, W. B., Jr. (2003). A very brief measure of the Big-
Five personality domains. Journal of Research in Personality, 37, 504-528.
Liu, K., & Fox Tree, J. E. (2012). Hedges enhance memory but inhibit retelling. Psychonomic
Bulletin and Review, 19, 892-898.
Maclay, H., & Osgood, C. E. (1959). Hesitation phenomena in spontaneous English Speech.
Word, 15, 19-44.
Mairesse, F., & Walker, M. (2008). Trainable generation of Big-Five personality styles through
data-driven parameter estimation. In J. Moore, S. Teufel, J. Allan, & S. Furui (Eds.),
Proceedings of ACL-08: HLT. Proceedings of the 46th Annual Meeting of the Association
for Computational Linguistics (pp. 165-173), Columbus, OH.
Matei, M. (2011). The influence of age and gender on the selection of discourse markers in
casual conversations. Bulletin of the Transilvania University of Braşov, 4, 213-220.
Mehl, M. R., Gosling, S. D., & Pennebaker, J. W. (2006). Personality in its natural habitat:
Manifestations and implicit folk theories of personality in daily life. Journal of Personality
and Social Psychology, 90, 862-877.
Mehl, M. R., & Pennebaker, J. W. (2003a). The social dynamics of a cultural upheaval: Social
interactions surrounding September 11, 2001. Psychological Science, 14, 579-585.
Mehl, M. R., & Pennebaker, J. W. (2003b). The sounds of social life: A psychometric analysis
of students’ daily social environments and natural conversations. Journal of Personality
and Social Psychology, 84, 857-870.
Mehl, M. R., Pennebaker, J. W., Crow, D. M., Dabbs, J., & Price, J. H. (2001). The Electronically
Activated Recorder (EAR): A device for sampling naturalistic daily activities and conver-
sations. Behavior Research Methods, Instruments, and Computers, 33, 517-523.
Pennebaker, J. W., Booth, R. J., & Francis, M. E. (2007). Linguistic inquiry and word count:
LIWC2007—Operator’s manual. Retrieved from http://www.liwc.net/
Scherer, K. R., & Scherer, U. (1981). Speech behavior and personality. In J. Darby (Ed.), Speech
evaluation in psychiatry (pp. 115-135). New York, NY: Grune & Stratton.
Schleef, E. (2005). Gender, power, discipline, and context: On the sociolinguistic variation of
okay, right, like, and you know in English academic discourse. Texas Linguistic Forum,
48, 177-186.
Sharifian, F., & Malcolm, I. G. (2003). The pragmatic marker like in English teen talk: Australian
aboriginal usage. Pragmatics and Cognition, 11, 327-344.
Strassel, S. (2004). Simple metadata annotation specification V6.2 (Linguistic Data Consortium).
Retrieved from http://projects.ldc.upenn.edu/MDE/Guidelines/SimpleMDE_V6.2.pdf
Swerts, M. (1998). Filled pauses as markers of discourse structure. Journal of Pragmatics, 30,
485-496.
Tottie, G. (2011). Uh and um as sociolinguistic markers in British English. International Journal
of Corpus Linguistics, 16, 173-197.
Author Biographies
Charlyn M. Laserna is a medical student at the University of Texas at Houston Medical
School. She received her undergraduate degree at the University of Texas at Austin in 2012. Her
research interests surround language and verbal social cues.
by guest on May 8, 2014jls.sagepub.comDownloaded from
338 Journal of Language and Social Psychology 33(3)
Yi-Tai Seih currently works for the Department of Psychology at the University of Texas at
Austin as a research associate. He received his PhD at the University of Texas at Austin in 2013.
His research focuses on the interplay between language and interpersonal relationships. His
most recent research focuses on how recipients perceive complaint language.
James W. Pennebaker is a professor and chair for the Department of Psychology at the
University of Texas at Austin. His most recent research focuses on the nature of language and
social dynamics in the real world. The words people use serve as powerful reflections of their
personality and social worlds.
by guest on May 8, 2014jls.sagepub.comDownloaded from
... Lastly, neither study has considered another process all transcripts of human speech should go through before LIWC analysis (cf., Laserna et al., 2014): tagging. Manual tagging to semantically disambiguate words is a simple task for a human, but text without tagging poses analytic challenges for LIWC of which the implications for measurement precision in the context of spoken language analysis are currently unknown. ...
... Research has shown that while filled pauses remain comparable across ages, filler word use varies with both age and gender, such that older individuals tend to use less fillers than younger adults, and women tend to use more fillers than men. Laserna et al. (2014) analyzed transcripts of spontaneous speech recorded as sound bites from 263 participants between the ages of 17 and 69, focusing on filled pauses (um, uh) and three fillers, labeled discourse markers by the authors (you know, I mean, like). Like was the most frequent filler word with 1.13% of all words, compared to 0.12 for I mean and 0.18 for you know. ...
... We do not believe that this is a task-related effect. Instead, this might be an age-related effect, in line with prior work showing that filler word use decreases with age (Laserna et al., 2014). However, our number of 'likes' uttered by our younger adults is much larger than the averages reported in Laserna et al. (2014), suggesting that either our younger adults found it particularly difficult to participate in our speech task, had particular difficulty to participate in a Zoom-based study, or were in other ways less focused than our older adults. ...
Article
For the longest time, the gold standard in preparing spoken language corpora for text analysis in psychology was using human transcription. However, such standard comes at extensive cost, and creates barriers to quantitative spoken language analysis that recent advances in speech-to-text technology could address. The current study quantifies the accuracy of AI-generated transcripts compared to human-corrected transcripts across younger (n = 100) and older (n = 92) adults and two spoken language tasks. Further, it evaluates the validity of Linguistic Inquiry and Word Count (LIWC)-features extracted from these two kinds of transcripts, as well as transcripts specifically prepared for LIWC analyses via tagging. We find that overall, AI-generated transcripts are highly accurate with a word error rate of 2.50% to 3.36%, albeit being slightly less accurate for younger compared to older adults. LIWC features extracted from either transcripts are highly correlated, while the tagging procedure significantly alters filler word categories. Based on these results, automatic speech-to-text appears to be ready for psychological language research when using spoken language tasks in relatively quiet environments, unless filler words are of interest to researchers.
... Mary has her lips pursed as she looks up and says "..umm=" can be interpreted as an expression of thoughtfulness as she searches for the right words to be used in the exemplar rationale. My interpretation is based on research on um and uh, which are examples of short utterances also known as filled pauses commonly used in spontaneous speech and associated with the processing of complex thoughts (Laserna et al., 2014). Without a gap in speech, denoted by the latching symbol "=", Pearl offers her first suggestion in the form of a question to develop the rationale at Line 11, "= reduce the product?" ...
... Mary's continued search for a word, shown by the repetition of the word "umm" and "will", with pursed lips at Lines 10 and 12, suggests that she is encouraging another response from Pearl because research shows that listeners tend to understand filled pauses as an indication that speakers are unsure about what is being said, suggesting that filled pauses may be a more purposeful signal by the speaker (cf. Laserna et al., 2014). My interpretation is facilitated through Pearl's response with a speech overlap at Line 13, "and [that will] affect (.) ...
... Karina's utterance at Line 1 shows her lack of understanding of valid trends because listeners tend to understand filled pauses such as umm as an indication that speakers are unsure about what is being said, suggesting that filled pauses may be a more purposeful signal by the speaker (cf. Laserna et al., 2014). I infer at this point that Karina's lack of understanding of valid trends would give rise to negative emotions. ...
Thesis
Full-text available
In science education, science inquiry is an approach used to engage students, develop their knowledge and understanding of scientific ideas, improve scientific literacy, and model how scientists explore phenomena. There is, however, limited research on student emotions and how they influence learning experiences during science inquiry. A paucity in understanding student emotion and how it shapes learning experiences limits current science inquiry teaching practices that are responsive to student emotions. Understanding students’ unpleasant emotional experiences can support teachers in developing responsive teaching practices that address common challenges in classrooms to support student needs. To address the limitations of existing research, this study explores student emotions before and after responsive teaching practices during science inquiry within two Australian Year 10 chemistry classes. An interpretive study design and a multi-method design using a two-element conceptual framework, emotion and cogenerative dialogue, enabled me to develop theoretical understandings of students’ challenging emotional experiences as they worked on different science inquiry activities during their science inquiry project. An understanding of the unpleasant emotions associated with common challenges experienced by students during the science inquiry project provided the teachers with insights into developing responsive teaching practices to address student needs. The findings of this study provide evidence that understanding students’ unpleasant emotional experiences can support teachers in developing responsive teaching practices that address student needs. In addition, positive changes in student emotions follow the implementation of responsive teaching practices. To this end, my study contributes new knowledge on the enacted practices of how student emotions influence students’ learning experiences during science inquiry and how to use this understanding to develop responsive teaching practices that address student needs. The findings also have implications for teaching and research by identifying ways to address barriers to inquiry learning through teachers’ use of the model of emotional inquiry practice and extend emotion research in science education by using the concepts and techniques developed in my study.
... Nakonec přívětivost korelovala záporně s proměnnými negemo, anxiety, anger, swear a death. Laserna et al. (2014) zjistili, že používání slov jako OK, dobře jako značek diskursu v komunikaci souvisí se svědomitostí. ...
Book
Full-text available
This monograph deals with the computer analysis of Chinese text and the use of the LIWC program in this analysis. The first chapter describes the process of automatic text analysis and provides examples of research in Czech. The second chapter describes the LIWC program, the variables it measures, the Chinese version of the program, and its use in studying the relationship between personality and text in various languages. The third chapter contains a selection of research on Chinese texts where LIWC was used. Texts under investigation might be newspaper and literary texts, blogs, social media texts, or essays created by research participants. In the fourth chapter, I conducted a LIWC analysis of texts from messenger communication in Chinese in QQ and compared it with the personalities of the communicators measured by the National Character Survey, which measures the Big Five. Neuroticism correlated with the frequency of auxiliary verbs and negatively with the frequency of words indicating reward and words indicating positive emotions in the messages writen by participant. Extraversion correlated with the frequency of impersonal pronouns, words indicating emotions, positive emotions, social processes, and related to the willingness to take risks. Openness correlated with the frequency of words indicating food and negatively with the frequency of postpositions, words indicating disagreement, future-oriented words, and words related to health. Agreeableness correlated with the percentage of words captured by LIWC, the frequency of grammatical words, modal particles, and informal conversational words. Conscientiousness correlated with the frequency of words related to reward, indicating spatial organization, and words from the field of religion. I also observed how the text relates to the estimate of the participant’s personality, which was made by their communication partner after the messenger conversation using the same questionnaire. The final chapter discusses the possibilities of LIWC use in cross-cultural comparisons, personality prediction, and scientometric analysis.
... Non-italicized text is included for context. strings that add length to an utterance without also adding any appreciable content (cf., discourse markers, Laserna et al., 2014;Tree & Schrock, 2002). They serve to (a) indicate hesitation, (b) initiate, maintain, or yield a talk turn, (c) attract attention, (d) highlight a preceding or ensuing utterance, and (e) signal corrections (Kjellmer, 2003;Tottie, 2014). ...
... Recall that we accounted for subject heterogeneity by using subject-random effects as our focus was on within-subject effects. However, including sociodemographic variables allows us to analyze whether cognitive and affective processing depends on age, gender, or political attitude [e.g., 42,57,69]. We find that older, European, and more liberal subjects engage more in cognitive processing. ...
Article
Fake news on social media has large, negative implications for society. However, little is known about what linguistic cues make people fall for fake news and, hence, how to design effective countermeasures for social media. In this study, we seek to understand which linguistic cues make people fall for fake news. Linguistic cues (e.g., adverbs, personal pronouns, positive emotion words, negative emotion words) are important characteristics of any text and also affect how people process real vs. fake news. Specifically, we compare the role of linguistic cues across both cognitive processing (related to careful thinking) and affective processing (related to unconscious automatic evaluations). To this end, we performed a within-subject experiment where we collected neurophysiological measurements of 42 subjects while these read a sample of 40 real and fake news articles. During our experiment, we measured cognitive processing through eye fixations, and affective processing in situ through heart rate variability. We find that users engage more in cognitive processing for longer fake news articles, while affective processing is more pronounced for fake news written in analytic words. To the best of our knowledge, this is the first work studying the role of linguistic cues in fake news processing. Altogether, our findings have important implications for designing online platforms that encourage users to engage in careful thinking and thus prevent them from falling for fake news.
... The study concludes that, firstly, the apparent time data shows a statistically significant gender difference between men and women; women use ʔinnu: much more than men, which is in accordance with other pragmatic studies that have suggested that males and females differ in their communicative behavior (Coates, 1989) and that females use more hedges (Coates, 1988;Mirzapour, 2016), qualifiers, tag questions (Lakoff, 1975;Leaper and Robnett, 2011) and DMs or fillers than males (e.g. Laserna et al., 2014). On the other hand, this gender effect among adults is missing among children as the gender difference between boys and girls is statistically nonsignificant. ...
Article
Full-text available
Objectives: This study aims to assess the impact of linguistic and social factors on the compatibility of source tools in the Jordanian dialect. Method: Thirty-six sociolinguistic interviews were conducted with 36 speakers residing in Jordan's capital, Amman. These interviews were recorded using a high-sensitivity, low-disturbance digital voice recorder and were transcribed into pre-prepared Excel tables. Statistical work was conducted, considering the coding protocol and using the GoldVarb X software. The study centered on three social factors: gender, educational level, and age, and three linguistic factors: sentence type, actor definition, and actor identification. Results: The study found that complementizer agreement is not limited by any social factors but is connected to one of the linguistic factors discussed in the study, namely, nominal characteristics (1st person, 2nd person, and 3rd person). This is due to the presence of a "common context" in cases involving the 1st person and 2nd person, while this context is not evident with the 3rd person. Conclusion: The study concluded that social factors do not influence grammatical diversity, unlike phonological diversity, which can be affected by social factors.
Article
Full-text available
This study employs a corpus-based approach to examine and compare the use of two discourse markers (DMs), “you know” and “I mean”, within the context of two mediatised English political interviews. The analysis encompasses frequencies, functions, co-occurrences, and positional distributions of these DMs. The study utilizes specialized corpora from two political interview programs: CGTN’s The Point with Liu Xin and BBC’s HARDtalk. The frequency analysis reveals that “you know” is statistically more prevalent than “I mean” in both programs, reflecting the spontaneity, interactivity, and need for clarification characteristic of political interviews. Notably, the Chinese interviewer (IR) uses “you know” more extensively, possibly due to a cultural preference for ensuring mutual understanding and engaging the audience, while the British IR employs “I mean” slightly more frequently, likely reflecting a tendency to clarify or reframe statements for precision. Functionally, these DMs serve diverse purposes such as hedging, agreeing, and monitoring across various domains including interpersonal, sequential, and rhetorical. Positional analysis shows “you know” typically appearing medially and “I mean” often in initial positions. These results underscore the distinctive interviewing styles of the two IRs and the pivotal role of these DMs in fulfilling a spectrum of communicative functions. This research offers valuable insights into the interviewer’s perspective in political interviews.
Article
Celem artykułu jest opis oraz analiza frekwencyjna i statystyczna jednego z wykładników potoczności – przerywnika leksykalnego. Badania mają charakter historyczny i zostały przeprowadzone na podstawie Korpusu dawnych polskich tekstów dramatycznych (1772–1939). Analiza zorientowana jest socjopragmatycznie, to znaczy pokazuje zależność pomiędzy używaniem wyodrębnionych przerywników (560 użyć) a wiekiem, płcią i statusem nadawców. Wyróżniono trzy podstawowe funkcje pragmatyczne, jakie te wyrażenia mogą pełnić w wypowiedzi (ekspresywna, retardacyjna, fatyczna).
Article
This paper explores a lesser studied topic in Romanian linguistic research related to the acoustic features (duration, pitch, F1/F2 values) and pragmatic functions of filler particles, often referred to as “filled pauses”, in spontaneous speech. From a theoretical perspective, we align our analysis with the recent definition proposed by Belz (2023), where “a phonetic exponent which is segmentally structured, semantically empty, syntactically unconstrained, and does not show an interjectional function is classified as a filler particle”. Moreover, as a way to reinforce the idea that filler particles are, from a phonetic perspective, extra-pausal phenomena, we classified fillers in terms of their positioning relative to the silent pause (i.e., pre-pausal, post-pausal, inter-pausal and concatenated). Prior to carrying out extensive quantitative analyses of filler particles on Romanian data, in this article we proposed performing an initial qualitative exploration of fillers in connected speech based on the Ro-Phon corpus. Our results reveal that: (1) the prototypical filler particle outputs in Romanian are vocalic (/ə, ɨ /), vocalic-nasal (/əm/) and nasal (/m/), (2) the length of the pause preceding a filler particle is systematically longer than the pause following it, (3) inter-pausal filler particles display the longest average duration while concatenated fillers were the shortest in our data, (4) in terms of formant frequencies, there is a greater degree of movement along the F1 axis compared to the average F2 frequencies extracted from both vocalic and vocalic-nasal tokens, indicative of an acoustic continuum present within the central vowels /ɨ/ and /ə/, (5) all fillers display a low, flat f0 contour, with a similar frequency as that of neighbouring phones, and that (6) filler particles perform various and often cumulative discursive roles, ranging from a cognitive function (indicative of planning processes), marker of a repair (self-initiated, content-based repairs), to a discourse management function used to signal upcoming new (sensitive) information within the narrative sequence. Future studies aim at extending the data and performing in-depth quantitative analyses related to duration, frequency distribution, voice and vowel quality of filler particles in non-pathological native and non-native Romanian speech data.
Article
Empirical studies of conversational recall show that the amount of conversation that can be recalled after a delay is limited and biased in favor of one’s own contributions. What aspects of a conversational interaction shape what will and will not be recalled? This study aims to predict the contents of conversation that will be recalled based on linguistic features of what was said. Across 59 conversational dyads, we observed that two linguistic features that are hallmarks of interactive language use—disfluency (um/uh) and backchannelling (ok, yeah)—promoted recall. Two other features—disagreements between the interlocutors and use of “like“—were not predictive of recall. While self-generated material was better remembered overall, both hearing and producing disfluency and backchannels improved memory for the associated utterances. Finally, the disfluency-related memory boost was similar regardless of the number of disfluencies in the utterance. Overall, we conclude that interactional linguistic features of conversation are predictive of what is and is not recalled following conversation.
Article
Full-text available
Although several studies have documented a link between anxiety and filled pauses (um s, er s, and uh s), numerous failures make it impossible to believe that the two are linked in any simple way. This article suggests anxiety may increase um s not when it makes the speech task harder but when it causes the speaker to pay attention to the speech. Two experiments examined this idea. One manipulated evaluation apprehension, and the other manipulated self-consciousness. Both showed dramatic increases in um s. Two more studies examined the real-world implications of this approach. Alcohol, which makes speaking harder but also makes speakers care less about what they say, was found to reduce um s. The second study found that Broca’s aphasics, who produce simple speech but must deliberate over every word, produce many um s. Wernicke’s aphasics may not talk well, but do not mind, and manage with few um s.
Article
In two experiments, we investigated predictions of the collaborative theory of language use (Clark, 1996) as applied to instant messaging (IM). This theory describes how the presence and absence of different grounding constraints causes people to interact differently across different communicative media (Clark & Brennan, 1991). In Study 1, we document how IM changes as users increase in expertise. In Study 2, we compare adaptations across telephoning and IM with a focus on multitasking.
Article
The current study measures laypeople's uses of um, uh, you know, and like, including folk notions of meanings, self-assessments of use, history of discussing use, and attitudes toward the words. Unlike the prevalent idea in the popular press that these discourse markers are interchangeable speaker production flaws, respondents in this study demonstrated that people do possess folk notions of meanings and uses that dramatically distinguish markers from each other. Um and uh were thought to indicate production trouble, you know was thought to be used in checking for understanding and connecting with listeners, and like defied definition. The folk notions of um, uh, and you know accord well with researchers' ideas about the meanings of these words. The use of like may be too subtle for laypeople to articulate. Most researchers' views of like involve some kind of discrepancy between what's said and what's meant. Even if they cannot state a meaning, people do treat the different markers differently.
Article
This paper is a pragmatic, functional and discursive analysis of actual conversations. The aim of this research is to discover the extent to which the contributions of the participants in casual verbal interactions are influenced by variables such as age or gender. Casual conversation is the interactional pattern in which discourse markers could acquire the most innovative pragmatic meanings and functions due to the lack of discursive constraints that characterize this type of verbal exchange. Among the elements that generate such discursive individuality are the variables of age and gender. The latter variables could either contribute to the confirmation of the core pragmatic meanings and functions of discourse markers or they could trigger the speaker's distancing from these central functional descriptions.
Article
Although you know and I mean are frequent in spontaneous talk, researchers have not agreed on what purpose they serve. They have been thought by some to be used similarly and by others to be used differently. Similarities of uses at a surface level encouraged historical discussions of these two markers in the same breath. The current synthesis details how both the apparent multifunctionality of you know and I mean and their surface similarities can arise out of each discourse marker’s basic meaning, with you know’s basic meaning being to invite addressee inferences (Jucker, A.H., & Smith, S.W. (1998). And people just you know like ‘wow’: Discourse markers as negotiating strategies. In A. H. Jucker & Y. Ziv (Eds.), Discourse Markers: Descriptions and Theory (pp. 171–201). Philadelphia: John Benjamins), and I mean’s basic meaning being to forewarn upcoming adjustments (Schiffrin, D. (1987). Discourse Markers. Cambridge: Cambridge University Press). # 2002 Elsevier Science B.V. All rights reserved.
Article
Although social functioning deficits are believed to characterize major depressive disorder (MDD), few studies have examined the social behavior of individuals with MDD in everyday life. The current study’s aim is to assess the everyday social behavior of individuals in a current major depressive episode. Participants with current MDD (n = 29) and healthy controls (n = 28) wore the electronically activated recorder (EAR), an ambulatory assessment device, for 3–4 days. The EAR recorded 90-second sound clips from participants’ immediate environments. Participants’ conversations were transcribed and locations and activities coded. Indicators of social isolation and negative emotional expression were examined. Individuals with and without MDD spent similar amounts of time talking, laughing, and with another person. However, depressed people spent less time in groups and used more negative emotion words, particularly in reference to the self, and particularly around romantic partners. Findings suggest depressed people’s social interactions suffer in quality but not quantity.
Article
This study aims to test whether filled pauses (FPs) may highlight discourse structure. This question is tackled from the perspectives of both the speaker and the listener. More specifically, it is first investigated whether FPs are more typical in the vicinity of major discourse boundaries. Secondly, FPs are analyzed acoustically, to check whether those occurring at major discourse boundaries are segmentally and prosodically different from those at shallower breaks. Analyses of twelve spontaneous monologues (Dutch) show that phrases following major discourse boundaries more often contain FPs. Additionally, FPs after stronger breaks tend to occur phrase-initially, whereas the majority of the FPs after weak boundaries are in phrase-internal position. Also, acoustic observations reveal that FPs at major discourse boundaries are both segmentally and prosodically distinct. They also differ with respect to the distribution of neighbouring silent pauses. Finally, a general linear model reveals that discourse structure can to some extent be predicted from characteristics of the FPs.
Article
The discourse marker use of the word like (‘we hitch a ride out of there with uh this like one crazy like music major guy’) is considered by many to be superfluously sprinkled into talk, a bad habit best avoided. But a comparison of the use of like in successive tellings of stories demonstrates that like can be anticipated in advance and planned into stories. In this way, like is similar to other words and phrases tellers recycle during story telling. The anticipation of like contrasted with the uses of other discourse markers such as oh, you know, and well, which almost never re-occurred in similar locations across tellings. Um and uh did sometimes re-occur; these uses are contrasted with like. Although discourse markers are generally used on the fly to handle various issues that come up in coordinating talk as it unfolds, like can be used as an integral part of the story -a marked contrast to the prevalent idea that likes are speech tics.