Content uploaded by Kumiko Fukumura
Author content
All content in this area was uploaded by Kumiko Fukumura on Mar 11, 2014
Content may be subject to copyright.
1 23
Memory & Cognition
ISSN 0090-502X
Mem Cogn
DOI 10.3758/s13421-014-0400-7
The effect of noun phrase length on the
form of referring expressions
Hossein Karimi, Kumiko Fukumura,
Fernanda Ferreira & Martin J.Pickering
1 23
Your article is protected by copyright and all
rights are held exclusively by Psychonomic
Society, Inc.. This e-offprint is for personal
use only and shall not be self-archived
in electronic repositories. If you wish to
self-archive your article, please use the
accepted manuscript version for posting on
your own website. You may further deposit
the accepted manuscript version in any
repository, provided it is only made publicly
available 12 months after official publication
or later and provided acknowledgement is
given to the original source of publication
and a link is inserted to the published article
on Springer's website. The link must be
accompanied by the following text: "The final
publication is available at link.springer.com”.
The effect of noun phrase length on the form of referring
expressions
Hossein Karimi &Kumiko Fukumura &
Fernanda Ferreira &Martin J. Pickering
#Psychonomic Society, Inc. 2014
Abstract The length of a noun phrase has been shown to
influence choices such as syntactic role assignment (e.g.,
whether the noun phrase is realized as the subject or the
object). But does length also affect the choice between differ-
ent forms of referring expressions? Three experiments inves-
tigated the effect of antecedent length on the choice between
pronouns (e.g., he) and repeated nouns (e.g., the actor)usinga
sentence-continuation paradigm. Experiments 1and 2found
an effect of antecedent length on written continuations:
Participants used more pronouns (relative to repeated nouns)
when the antecedent was longer than when it was shorter.
Experiment 3used a spoken continuation task and replicated
the effect of antecedent length on the choice of referring
expressions. Taken together, the results suggest that longer
antecedents increase the likelihood of pronominal reference.
The results support theories arguing that length enhances the
accessibility of the associated entity through richer semantic
encoding.
Keywords Length .Language production .Referring
expressions .Accessibility
Introduction
Speakers can express the same meaning in many different
ways. For example, after saying the boy liked the girl,a
speaker can refer back to the girl with a pronoun (she)orwith
arepeatednoun(the girl). What makes people choose one
referring expression over another? Many theories of reference
assume that accessibility affects the choice of referring ex-
pressions: People produce less explicit referring expressions,
such as pronouns, more frequently when the referent is more
salient in the prior discourse and, hence, more easily retrieved
from memory (i.e., accessible; see Bock, 1982;Bock&
Warren, 1985), and they produce more explicit referring ex-
pressions, such as repeated nouns, more frequently when the
referent is less accessible (e.g., Ariel, 1990; Givón, 1983;
Gundel, Hedberg, & Zacharski, 1993). However, the question
of what sources of information affect accessibility and, hence,
choice of expressions has not been fully settled.
Previous research has identified some important factors
that influence the referent’s accessibility and, hence, the
choice of referring expressions. It has been shown, for exam-
ple, that pronouns are used more often when the antecedent is
the syntactic subject of the sentence than when it plays some
other grammatical role (Arnold, 2001;Brennan,1995;
Fletcher, 1984; Fukumura & Van Gompel, 2010,2011;
Stevenson, Crawley, & Kleinman, 1994). According to vari-
ous theoretical accounts (Brennan, 1995; Brennan, Friedman,
&Pollard,1987; Gordon, Grosz, & Gilliom, 1993;Grosz,
Joshi, & Weinstein, 1995), the syntactic subject is more ac-
cessible than other syntactic functions. Moreover, the pres-
ence of a referential competitor in the linguistic (Arnold &
Griffin, 2007) or visual (Fukumura, Van Gompel, &
Pickering, 2010) context reduces the use of pronouns, possi-
bly because similarity between referential candidates results in
semantic interference, thereby reducing the referent’saccessi-
bility (Fukumura, Hyönä, & Scholfield, 2013;Fukumura,Van
H. Karimi :F. Ferr ei ra
University of South Carolina, Columbia, SC, USA
K. Fukumura
University of Strathclyde, Glasgow, UK
M. J. Pickering
University of Edinburgh, Edinburgh, UK
H. Karimi (*)
Institute for Mind and Brain, Department of Psychology, University
of South Carolina, 1800 Gervais Street, Columbia, SC 29201, USA
e-mail: karimihussein@gmail.com
Mem Cogn
DOI 10.3758/s13421-014-0400-7
Author's personal copy
Gompel, Harley, & Pickering, 2011). Additionally, people are
more likely to use pronouns to refer to animate rather than
inanimate entities (Fukumura & Van Gompel, 2011), and
animate entities have been argued to be more accessible
(Bock, 1982; Bock & Warren, 1985).
In the present study, we investigated whether the length of
an antecedent (hereafter, antecedent length) affects subse-
quent choice of referring expressions to that antecedent (pro-
noun vs. repeated noun). According to functional-linguistic
theories of reference (e.g., Ariel, 1990; Givón, 1983), the
amount of information attached to a noun phrase (henceforth,
NP) signals the referent’s accessibility in discourse. For ex-
ample, in Ariel’s(1990) accessibility hierarchy, long definite
descriptions such as thefirstwomanselectedtobeontheteam
of an American spaceship are ranked lower (i.e., are deemed
to be less accessible) than short definite descriptions such as
the woman or she in terms of the accessibility of the referent in
discourse. According to Ariel (1990,1996), longer NPs are
typically used when the referent is less accessible in the
context because they refer to new information in discourse,
whereas shorter NPs are more common when the associated
entity is given and, hence, more accessible (Ariel, 1990,1996;
Givón, 1983,1988,1989;Gundeletal.,1993). Importantly,
Ariel assumes that the amount of information predicated of an
NP indicates how accessible the referent is—that is, the
shorter the NP, the more accessible the referent in discourse.
Thus, on the basis of this account, there should be a greater
preference for reduced referring expressions following shorter
NPs.
Another possibility, however, is that longer NPs are more
accessible than shorter ones, because extra information tends
to lead to richer memory representations. Research on mem-
ory suggests that elaborative information on words enhances
later retrieval of those words, possibly because extra informa-
tion provides additional retrieval cues (e.g., Craik & Tulving,
1975; Fisher & Craik, 1980;Marks,1987). Therefore, it may
be that longer antecedents are more easily retrieved from
memory, increasing the probability of pronominal reference.
Consistent with this possibility, Hofmeister (2011) found that
semantically richer antecedents resulted in faster reading times
in long-distance dependencies than did semantically more
impoverished antecedents. For example, in (1a), the direct
object of banned is a communist, but this phrase does not
appear in its standard (“canonical”)location.
(1) a. It was a communist who the members of the club
banned from ever entering the premises.
b. It was an alleged Venezuelan communist who the
members of the club banned from ever entering the
premises.
Efficient comprehension involves rapidly associating a
communist with banned. Hofmeister found that people read
the words immediately following banned faster in (1b) than in
(1a), presumably because semantically richer phrases are
encoded more clearly in memory and, hence, are easier to
access.
Although no study thus far has examined the effect of
antecedent length on the choice of referring expressions,
previous research has shown that the length of an NP affects
constituent order. For instance, Stallings, MacDonald, and
O’Seaghdha (1998) showed that English speakers preferen-
tially produce the prepositional object of a ditransitive sen-
tence before the direct object when the latter is long (the
manger exhibited to Jill the new line of bright summer beach
and resort fashions)(seealsoArnold,Wasow,Losongco,&
Ginstrom, 2000). Importantly, such an effect of length on
word order, often termed heavy NP shift, has been assumed
to occur because of accessibility. Researchers argue that
shorter NPs tend to precede longer NPs in English because
shorter NPs are more accessible (e.g., Arnold et al., 2000;
Stallings & MacDonald, 2011; Stallings et al., 1998), which
makes sense in light of the given-before-new ordering prefer-
ences observed more generally (E. Clark & Clark, 1978,H.H.
Clark & Haviland, 1977,Halliday,1967). Interestingly, how-
ever, Yamashita and Chang (2001) showed that, unlike
English speakers, Japanese speakers prefer to place the longer
NPs before the shorter NPs when producing both transitive
and ditransitive structures. Consistent with Hofmeister (2011),
Yamashita and Chang argued that longer NPs are more acces-
sible than shorter ones because extra linguistic material adds
more information to the referent, making it semantically richer
and, therefore, conceptually more salient. Thus, although
length may affect accessibility differently in different lan-
guages (e.g., Chang, 2009;Hawkins,1994), if the effect of
length on constituent order is indeed mediated by accessibil-
ity, we might expect length to also affect referential forms.
We thus carried out three experiments to investigate wheth-
er and how the length of a potential antecedent affects the form
of the associated referring expressions. A functional-linguistic
account (Ariel, 1990,1996; Givón, 1983,1988,1989;Gundel
et al., 1993) predicts that shorter NPs signal higher accessibil-
ity, as compared with longer NPs, and that this should cause
people to produce more pronouns when the antecedent is
shorter than when it is longer. In contrast, the semantic
richness account claims that the more information predicated
of an NP, the more conceptually accessible it becomes, be-
cause extra semantic information leads to an enriched seman-
tic representation of the referent and/or provides additional
recall cues for retrieval. As was discussed earlier, research on
memory and recall (Craik & Tulving, 1975;Fisher&Craik,
1980;Marks,1987) and filler-gap dependencies (Hofmeister,
2011) is consistent with this account. If length indeed en-
hances the accessibility of the referent, we could expect more
pronouns following a long antecedent than a short antecedent.
In all experiments, we used a free sentence continuation task
Mem Cogn
Author's personal copy
(Fukumura & Van Gompel, 2010; Garvey & Caramazza,
1974; Stevenson et al., 1994; Stevenson, Knott, Oberlander,
& McDonald, 2000). To preview the results, in all three
experiments, we found that participants produced more pro-
nouns when the antecedent was longer rather than shorter,
consistent with the semantic richness account.
Experiment 1
We examined the effect of antecedent length by manipulating
the presence or absence of a relative clause attached to a
potential antecedent. We created three preceding sentence
conditions,asin(2).Inthelong–short condition (2a), a
relative clause was attached to NP1 (the actor), but not to
NP2 (the actress), making NP1 longer than NP2. In the short–
long condition (2b), the same relative clause was attached to
NP2, making NP2 longer than NP1. Finally, in the short–short
(2c) condition, neither NP had a relative clause.
(2) a. Long–short: The actor who was frustrated and visibly
upset about the night’s disastrous performance
walked away from the actress.
b. Short–long: The actor walked away from the actress
who was frustrated and visibly upset about the night’s
disastrous performance.
c. Short–short: The actor walked away from the actress.
Participants were asked to read the sentences and then
provide a meaningful continuation for it. We were primarily
interested in how participants referred back to the entity they
chose to talk about in their continuations—with a pronoun (he
or she) or with a repeated noun (the actor or the actress)—and
how this might be affected by the presence of a relative clause
attached to the antecedent.
The functional account predicts that shorter referents are
more accessible than longer ones in discourse (Ariel, 1990;
Givón, 1983). Specifically, in her corpus analyses, Ariel
(1996) found that NPs followed by relative clauses typically
occur when the referent is less accessible in the context, which
led her to argue that long descriptions are “low accessibility
markers.”Therefore, if the form of the antecedent is indeed
taken to signal the referent’s accessibility, participants should
produce fewer pronouns (relative to repeated head nouns)
when the antecedent is longer than when it is shorter. That
is, when the antecedent is NP1, more pronouns are expected in
the short–long (2b) than in the long–short (2a) condition,
whereas when the antecedent is NP2, more pronouns are
expected in the long–short (2a) than in the short–long (2b)
condition. Alternatively, the semantic richness account (e.g.,
Hofmeister, 2011;Marks,1987; Yamashita & Chang, 2001)
predicts that longer antecedents boost the referent’saccessi-
bility because of the additional information that is predicated
of the referent. Therefore, more pronouns are expected in the
long–short (2a) than in the short–long (2b) condition for NP1
antecedents, whereas more pronouns are expected in the
short–long (2b) than in the long–short (2a) condition for
NP2 antecedents. In addition, it is possible that the presence
of a relative clause generally influences the referent’saccessi-
bility (i.e., regardless of which NP it is attached to). For
instance, the additional words might increase the distance
between the antecedent and the anaphor, which may decrease
the referent’s accessibility (Ariel, 1990;Givón,1983). As a
result, participants may generally produce fewer pronouns,
rather than repeated head nouns, when the preceding sentence
contains a relative clause anywhere (the long–short and short–
long conditions) than when both NPs are short (the short-short
condition).
Furthermore, because the participants were free to talk
about either of the entities in the preceding sentence, an
interesting question was how the length of the antecedent
would affect the choice of referent (i.e., which antecedent
the participants chose to talk about). Some previous studies
have argued that choice of referent and form of referring
expression are driven by the same underlying forces (e.g.,
Arnold, 2001,2008;Givón1988,1989); that is, language
users produce more reduced expressions such as pronouns
for the referent that is most likely to be referred to, because
the more predictable the referent is, the more accessible it is.
But other research suggests that choice and form of reference
are guided by different mechanisms (Fukumura & Van
Gompel, 2010; Kehler, Kertz, Rohde, & Elman, 2008;
Stevenson et al., 1994).
Method
Participants
Thirty-six undergraduate students studying at the University
of Edinburgh took part in the experiment in exchange for £6.
They were all native speakers of British English.
Materials and design
We constructed 42 experimental sentences such as (2). Each
sentence included two NPs of different genders (NP1 and
NP2), and a relative clause was attached to NP1, NP2, or
neither, creating three preceding sentence conditions. We
counterbalanced NP order by including three additional condi-
tions, in which the order of the NPs was reversed. This resulted
in six conditions for each experimental sentence: preceding
sentence (short–short vs. long–short vs. short–long) × NP order
(male–female vs. female–male). We also constructed 60 fillers.
The fillers did not contain relative clauses, but about half of
them contained constituents, such as prepositional phrases, that
made them appear similar to the long experimental sentences.
Mem Cogn
Author's personal copy
The 42 experimental and 60 filler sentences were distributed in
a fixed random order across six lists, subject to the constraints
that at least one filler sentence occurred between two experi-
mental sentences and that no more than two experimental
sentences of the same condition occurred consecutively. Each
experimental list contained one version of each item and seven
items from each condition, together with all 60 fillers. Six
participants were randomly assigned to each list.
Procedure
Participants were given a booklet that contained the to-be-
continued sentences and were asked to write a meaningful
continuation for each sentence. The participants were encour-
aged to produce continuations “quickly”and “with the first
thing that comes to mind,”but there was no time limit.
Participants were permitted to take a short break halfway
through. The experimental session lasted about 45–60 min.
Scoring
We scored whether the subject of the continuation referred to
NP1 or NP2 in the preceding sentence and what referring
expressions were used to refer to them. Responses were
scored as other responses if (1) the referring expression re-
ferred to neither NP1 nor NP2, (2) neither a pronoun nor a
repeated noun was produced to refer to NP1 or NP2, (3) the
referent was not the first-mentioned entity in the continuation,
(4) participants did not produce a new sentence, and (5) the
referring expression was part of a subordinate clause in the
continuation (e.g., When he/the actor asked for an explana-
tion, she/the actress didn’t provide one).
Results
Throughout this article, we analyze the effect of our manipu-
lations on the choice of referring expression (i.e., how the
participants referred to the antecedent they talked about, with
a pronoun or with a repeated noun), as well as on the choice of
referent (i.e., which antecedent the participants talked about in
the continuations). Because our dependent variable was cate-
gorical for both choice of referent and choice of referring
expression, we always used logit mixed-effects models
(Baayen, Davidson, & Bates, 2008;Jaeger,2008). For random
effects, we always included by-participants and by-items ran-
dom intercepts. We attempted fitting the maximum random
effect structure (Barr, Levy, Scheepers, & Tily, 2013), but
because the models failed to converge, as is often the case
with categorical data, we included by-participants and by-
items random slopes only if their inclusion was justified by
the model (Baayen et al., 2008). We also ran more traditional
F
1
and F
2
analyses of variance (ANOVAs) on by-participants
and by-items means, which, according to Barr et al., control
type 1 error rate well. The results were consistent with those
from the mixed effects reported below.
Choice of referring expressions
Tab le 1reports the percentage of pronominal reference rela-
tive to repeated noun reference for NP1 and NP2 references in
each preceding sentence. Participants rarely produced refer-
ring expressions other than pronouns and repeated nouns; the
ones they did generate included null pronouns (…and went
out to smoke) and modified nouns (…The frustrated actress …)
(long–short, N=2;short–long, N=1;short–short, N=2).
These responses were therefore excluded from further analy-
ses on choice of referring expressions.
We analyzed the number of pronouns and repeated nouns
as functions of antecedent position (NP1 vs. NP2) and pre-
ceding sentence. We first analyzed how the inclusion of a
relative clause modulated the effect of antecedent position
by comparing (1) the short–short condition with the long–
short condition and (2) the short–short condition with the
short–long condition. We then analyzed the effect of anteced-
ent length on pronoun use by comparing (3) the long–short
condition with the short–long condition. In all three analyses,
both antecedent position and preceding sentence were cen-
tered, so that the results could be interpreted in the same way
as in traditional ANOVAs. We collapsed across NP order,
because this was merely a counterbalancing variable.
Tab le 2provides a summary of all the coefficients for choice
of referring expression analyses.
The comparison of the short–short and the long–short
conditions revealed a significant main effect of antecedent
position, with more pronominal reference to NP1 than to
NP2, but there was no significant main effect of preceding
sentence, nor any interaction between antecedent position and
preceding sentence (short–short vs. long–short). The compar-
ison between the short–short and the short–long condition,
which included by-items random slopes for antecedent posi-
tion and preceding sentence, also revealed a significant main
effect of antecedent position, with more pronoun use for NP1
Ta b l e 1 The percentage of pronouns out of all pronouns and repeated
nouns for NP1 and NP2 referenceby preceding sentence in Experiment 1
Antecedent Position Mean
NP1 NP2
Preceding sentence Long–short 99.3%
(146/147)
74.4%
(172/231)
84.1%
Short–long 93.9%
(218/232)
80.0%
(116/145)
88.5%
Short–short 98.7%
(152/154)
76.4%
(149/195)
86.2%
Mean 96.8% 76.5%
Mem Cogn
Author's personal copy
than for NP2, but there was no significant main effect of
preceding sentence. However, the effect of antecedent posi-
tion was significantly modulatedbyprecedingsentence
(short–short vs. short–long). We followed up this interaction
by analyzing simple effects of preceding sentence (long–short
vs. short–long) for NP1 and NP2 separately. When the ante-
cedent was NP1, there was no difference between the short–
short and the short–long condition, but when the antecedent
was NP2, there were significantly more pronouns in the short–
long condition than in the short–short condition.
Finally, the comparison between the long–short and the
short–long conditions, which included by-item random slopes
for antecedent position and preceding sentence, revealed that
antecedent position had a significant influence on pronoun
use, indicating that there were more pronouns following NP1
than NP2 antecedents. There was no overall effect of preced-
ing sentence (long–short vs. short–long), however, suggesting
that participants produced similar numbers of pronouns fol-
lowing long–short and short–long sentences. Importantly,
there was a significant antecedent position × preceding sen-
tence (long–short vs. short–long) interaction, indicating that
the effect of antecedent position was larger in the long–short
than in the short–long condition. When the antecedent was
NP1, although there was a numerical tendency for more
pronouns for the long–short than for the short–long condition,
this difference did not reach significance. However, when the
antecedent was NP2, there were significantly more pronouns
in the short–long condition than in the long–short condition,
indicating that a longer antecedent was more likely to be
realized with a pronoun, as compared with a shorter
antecedent.
Choice of referent
We also analyzed whether and how the choice of referent was
affected by our manipulations. Table 3reports the percentage
of NP1, NP2, and other references for each condition.
There were slightly more other responses in the short–short
condition than in both the long–short condition (p=.08) and
the short–long condition (p=.08), but there was no difference
in other responses between the long–short and the short–long
conditions (p= .92). Given that the number of other responses
differed between the conditions, we analyzed the number of
NP1 references and NP2 references relative to all trials (in-
cluding NP1, NP2, and other). As in the analyses on the
choice of referring expression, we first compared the short–
short (baseline) condition with (1) the long–short and with (2)
the short–long conditions and then compared (3) the long–
short and the short–long conditions with respect to the num-
bers of NP1 and N2 references (out of all trials). Table 4
reports the results.
The results revealed no significant difference in NP1 refer-
ence between the short–short and the long–short conditions,
but there weresignificantly fewer NP2 references in the short–
short than in the long–short condition. There were significant-
ly fewer NP1 references in the short–short condition than in
the short–long condition and significantly more NP2 refer-
ences in the short–short condition than in the short–long
condition. The comparison between the long–short and the
short–long conditions included by-items random slopes for
preceding sentence. The results showed significantly fewer
NP1 references in the long–short condition than in the short–
long condition and significantly more NP2 references in the
Ta b l e 2 Summary of the coefficients from the analyses on choice of
referring expressions in Experiment 1
(1) Model summaries for the comparison between short–short and long–
short
Predictor βSE Z p
(Intercept) 3.29 0.35 9.24 <.001
Antecedent position −1.35 0.15 −8.48 <.001
Preceding sentence (SS vs. LS ) 0.02 0.15 0.18 .85
Antecedent position × preceding
sentence (SS vs. LS)
0.10 0.16 0.64 .51
(2) Model summaries for the comparison between short–short and short–
long
Predictor βSE Z p
(Intercept) 3.74 0.41 9.11 <.001
Antecedent position −1.72 0.23 −7.43 <.001
Preceding sentence (SS vs. SL) 0.30 0.22 1.36 .17
Antecedent position × preceding
sentence (SS vs. SL)
−0.43 0.20 −2.08 .03
Effect of preceding sentence
(SS vs. SL) for NP1
(Intercept) 5.43 0.83 6.47 <.001
Preceding sentence (SS vs. SL) −0.95 0.87 −1.08 .27
Effect of preceding sentence
(SS vs. SL) for NP2
(Intercept) 2.00 0.39 5.04 <.001
Preceding sentence (SS vs. SL) 1.09 0.52 2.07 .03
(3) Model summaries for the comparison between long–short and short–
long
Predictor βSE Z p
(Intercept) 4.76 0.62 7.61 <.001
Antecedent position 2.01 0.46 4.34 <.001
Preceding sentence (LS vs. SL) 0.25 0.47 0.54 .58
Antecedent position × preceding
sentence (LS vs. SL)
0.99 0.42 2.31 .02
Effect of preceding sentence
(LS vs. SL) for NP1
(Intercept) 6.38 1.13 5.63 <.001
Preceding sentence (LS vs. SL) −1.38 1.17 −1.17 .23
Effect of preceding sentence
(LS vs. SL) for NP2
(Intercept) 1.83 0.39 4.62 <.001
Preceding sentence (LS vs. SL) 1.33 0.54 2.46 .01
Note.SS,LS,and SL stand for short–short, long–short, and short–long
conditions, respectively
Mem Cogn
Author's personal copy
long–short condition than in the short–long condition, indi-
cating that there was a tendency to talk about the relatively
shorter, rather than the longer, antecedent.
Discussion
The results of Experiment 1revealed that participants pro-
duced more pronouns for NP2 in the short–long condition than
in the long–short condition, which indicated that people tend to
use more pronouns when referring to long rather than short
antecedents. When the antecedent was NP1, although there
was a numerical tendency toward more pronouns in the long–
short than in the short–long condition, the effect was not
significant. In addition, there were more pronouns for NP2 in
the short–long condition than in the short–short condition.
However, participants did not generally use more pronouns
when the preceding sentence was longer: for NP1 reference,
there was no significantdifference between the short–short and
the short–long condition. Instead, participants may have pref-
erentially produced more pronouns for longer antecedents.
The tendency to use pronouns more often with longer
antecedents is compatible with the semantic richness account,
which claims that additional semantic information in long NPs
makes them conceptually more accessible, and is incompati-
ble with the functional linguistic account, which claims that
longer descriptions signal low accessibility in discourse. In
addition, participants used more pronouns when the anteced-
ent was NP1 (subject) than NP2 (object), in accord with
previous research (Arnold, 2001; Brennan, 1995; Fletcher,
1984; Fukumura & Van Gompel, 2010,2011; Stevenson
et al., 1994). The preference to use pronouns to refer to NP1
antecedents was so strong that pronominal reference
approached ceiling level in the NP1 conditions (over 90%).
This might explain why the effect of antecedent length did not
reach significance in this condition.
Interestingly, there were more NP1 references in the short–
long condition than in the long–short condition, whereas there
were more NP2 references in the long–short condition than in
the short–long condition. This indicated that participants re-
ferred to the shorter antecedents more often. Such preference to
refer to relatively shorter antecedents may also explain why
there were more NP1 and fewer NP2 references in the short–
long condition than in the short–short condition and more NP2
references in the long–short than in the short–short condition.
Thus, although participants were less likely to refer to
longer antecedents, they tended to use more pronouns to refer
to them. These findings run counter to theories of reference
production that assume that choice of referent and choice of
referring expression are both driven by the same underlying
force (Arnold, 2001,2008;Givón1988,1989), because these
theories maintain that the more likely an entity is to be referred
Ta b l e 3 Percentage of NP1, NP2, and other references by preceding sentence in Experiment 1
Choice of Referent
NP1 NP2 Other
Preceding sentence Long–short 29.3% (147) 46.0% (231) 24.7% (124)
Short–long 46.1% (232) 28.9% (145) 25.0% (126)
Short–short 30.7% (154) 38.8% (195) 30.5% (153)
Mean 35.3% 37.9% 26.8%
Note. Numbers in parentheses represent frequencies
Ta b l e 4 Summary of the coefficients from the analyses on the choice of
referent in Experiment 1
(1) Model summaries for the comparison between short–short and long–
short
Predictor βSE Z p
(Intercept) −0.92 0.15 −5.82 <.001
Effect of preceding sentence
(SS vs. LS) on NP1 reference
0.09 0.16 0.56 .57
(Intercept) −0.53 0.17 −3.13 <.01
Effect of preceding sentence
(SS vs. LS) on NP2 reference
−0.34 0.15 −2.17 .03
(2) Model summary for the comparison between short–short and short–
long
Predictor βSE Z p
(Intercept) −0.92 0.15 −5.82 <.001
Effect of preceding sentence
(SS vs. SL) on NP1 reference
0.73 0.14 4.98 <.001
(Intercept) −0.53 0.17 −3.13 <.01
Effect of preceding sentence
(SS vs. SL) on NP2 reference
−0.49 0.15 −3.14 <.01
(3) Model summary for the comparison between long–short and short–
long
Predictor βSE Z p
(Intercept) −0.71 0.13 −5.36 <.001
Effect of preceding sentence
(LS vs. SL) on NP1 reference
−0.34 0.08 −4.29 <.001
(Intercept) −0.58 0.14 −3.92 <.001
Effect of preceding sentence
(LS vs. SL) on NP2 reference
0.34 0.07 4.77 <.001
Note.SS,LS,and SL stand for short–short, long–short, and short–long
conditions, respectively
Mem Cogn
Author's personal copy
to, the more accessible it is and, hence, more reduced referring
expressions should be used to refer to it.
Experiment 2
In Experiment 1, the tendency to use a pronoun to refer to the
NP modified by a relative clause was significant for NP2 but
not for NP1. This difference might be related to the fact that
pronouns were generally used much more often than repeated
nouns in all conditions, and this was especially the case for
NP1 reference, where participants hardly produced any re-
peated nouns at all (2.6%). Thus, a data set containing fewer
pronouns overall might allow us to observe an effect of NP
length on both NP1 and NP2. In Experiment 2,therefore,we
attempted to increase the proportion of repeated nouns relative
to pronouns by matching the genders of the two NPs in the
preceding sentence, as in (3). Previous studies have shown
that people tend to use fewer pronouns and more repeated
nouns to refer to one of two NPs that share gender or other
semantic features than otherwise (Arnold & Griffin, 2007;
Fukumura & Van Gompel, 2011; Fukumura et al., 2013;
Fukumura et al., 2011;Fukumuraetal.,2010).
(3) a. Long–short: The actor who was frustrated and visibly
upset about the night’s disastrous performance walked
away from the cameraman.
b. Short–long: The actor walked away from the camera-
man who was frustrated and visibly upset about the
night’s disastrous performance.
c. Short–short: The actor walked away from the
cameraman.
Method
Participants
Thirty-six participants were drawn from the same population
as in Experiment 1and compensated in the same manner.
None had participated in Experiment 1.
Materials, design, and procedure
Unlike in Experiment 1, the two NPs in the preceding
sentences were always of the same gender, as in (3). A few
other minor changes were also made to keep the sentences as
natural as possible given the gender changes (see the
Appendix). Fillers, design, and procedure were the same as
in Experiment 1.
Scoring
The criteria for scoring were the same as in Experiment 1.
Because the two NPs had the same gender, a pronoun referring
back to either of the NPs was ambiguous. We thus asked two
native speakers of British English who were blind to the
purposes of the study to score all the continuations containing
pronouns and decide whether the pronoun referred to NP1 or
NP2 or was ambiguous. Any continuation for which the
scorers did not agree about the referent of the pronoun was
also excluded. This resulted in the removal of 18
continuations.
Results
Choice of referring expression
Tab le 5reports the percentage of pronominal reference rela-
tive to repeated noun reference to NP1 and NP2 by preceding
sentence. As before, participants almost never produced other
referring expressions (long–short, N=0;short–long, N=0;
short–short, N= 1), so these (as well as other responses) were
excluded.
We analysed the choice of referring expression (pronouns
vs. repeated nouns) in the same way as in Experiment 1,by
including antecedent position (NP1 vs. NP2) and preceding
sentence (short–short vs. long–short and short–short vs.
short–long) as the fixed factors. Table 6provides a summary
of all the coefficients for choice of referring expression
analyses.
First, we compared the short–short condition with (1) the
long–short and (2) the short–long conditions in terms of
pronoun use. The comparison between the short–short and
Ta b l e 5 The percentage of pronouns out of all pronouns and repeated nouns for NP1 and NP2 reference by preceding sentence in Experiment 2
Antecedent Position Mean
NP1 NP2
Preceding sentence Long–short 92.5% (185/200) 16.3% (28/171) 57.4%
Short–long 78.8% (157/199) 37.5% (51/136) 62.0%
Short–short 96.0% (170/177) 32.9% (51/155) 66.5%
Mean 88.8% 28.1%
Mem Cogn
Author's personal copy
the long–short conditions (which included by-subjects ran-
dom slopes for antecedent position) revealed a significant
main effect of antecedent position, with more pronouns used
to refer to NP1 than to NP2. Also, there was a main effect of
preceding sentence (short–short vs. long–short), with signifi-
cantly more pronouns in the short–short condition than in the
long–short condition. There was also a significant interaction
between antecedent position and preceding sentence (short–
short vs. long–short). For NP1 reference, there was no differ-
ence between the short–short and the long–short conditions,
but for NP2 reference, there were significantly more pronouns
in the short–short condition than in the long–short condition.
The comparison between the short–short and the short–long
condition, which included by-subjects random slopes for an-
tecedent position and by-items random slopes for antecedent
position, preceding sentence (short–short vs. short–long), as
well as their interaction, also revealed a significant antecedent
position effect, with more pronominal reference to NP1 than
to NP2. In addition, there were significantly more pronouns in
the short–short condition than in the short–long condition.
There was also a significant interaction between antecedent
position and preceding sentence (short–short vs. short–long).
When the antecedent was NP1, there were more pronouns in
the short–short than in the short–long condition, but when the
antecedent was NP2, there was no significant difference be-
tween the short–short condition and the short–long condition.
Next, we examined the effect of antecedent length by
comparing (3) the long–short and the short–long conditions.
As before, there was a significant effect of antecedent posi-
tion, with more pronouns following NP1 than NP2 anteced-
ents, but there was no effect of preceding sentence.
Importantly, there was a significant antecedent position ×
preceding sentence (long–short vs. short–long) interaction,
which indicated that the effect of antecedent position was
greater in the long–short than in the short–long condition.
Simple effects further revealed that when the antecedent was
NP1, there were significantly more pronouns in the long–short
than in the short–long condition. In contrast, when the ante-
cedent was NP2, there were significantly fewer pronouns in
the long–short than in the short–long condition, indicating that
participants used more pronouns for longer antecedents.
Choice of referent
Tab le 7reports the frequencies of NP1, NP2, and other refer-
ences. There were significantly more other responses in the
short–short condition than in the long–short condition (p<.01),
but there was no difference between the short–short
condition and the short–long condition (p= .95). In addition,
there were more other responses in the short–long condition
than in the long–short condition (p< .05). Therefore, we
analyzed the number of NP1 references and that of NP2
references, relative to all trials (including other responses).
The analysis was the same as in Experiment 1.
Tab le 8reports the results of analyses on choice of
referent for Experiment 2.
Ta b l e 6 Summary of the coefficients from the analyses on the choice of
referring expressions in Experiment 2
(1) Model summaries for the comparison between short–short and long–
short
Predictor βSE Z p
(Intercept) 0.94 0.30 3.10 <.01
Antecedent position −1.88 0.15 −12.18 <.001
Preceding sentence (SS vs. LS ) 0.29 0.10 2.66 <.01
Antecedent position × preceding
sentence (SS vs. LS)
0.24 0.10 2.28 .02
Effect of preceding sentence
(SS vs. LS) for NP1
(Intercept) 4.13 0.61 6.69 <.001
Preceding sentence (SS vs. LS) 0.55 0.62 0.89 .37
Effect of preceding sentence
(SS vs. LS) for NP2
(Intercept) −2.42 0.49 −4.93 <.001
Preceding sentence (SS vs. LS) 1.49 0.38 3.86 <.001
(2) Model summaries for the comparison between short–short and short–
long
Predictor βSE Z p
(Intercept) 1.17 0.33 3.50 <.001
Antecedent position −2.17 0.18 −11.54 <.0 01
Preceding sentence (SS vs. SL) 0.34 0.14 2.34 .01
Antecedent position × preceding
sentence (SS vs. SL)
−0.51 0.14 −3.44 <.001
Effect of preceding sentence
(SS vs. SL) for NP1
(Intercept) 3.96 0.54 7.21 <.001
Preceding sentence (SS vs. SL) −1.84 0.55 −3.32 <.001
Effect of preceding sentence
(SS vs. SL) for NP2
(Intercept) −0.86 0.50 −1.72 .08
Preceding sentence (SS vs. SL) 0.36 0.38 0.93 .35
(3) Model summaries for the comparison between long-short and short-
long
Predictor βSE Z p
(Intercept) 0.90 0.30 2.94 <.01
Antecedent position 1.88 0.14 12.88 <.001
Preceding sentence (LS vs. SL) 0.13 0.13 0.98 .32
Antecedent position × preceding
sentence (LS vs. SL)
0.79 0.13 5.92 <.001
Effect of preceding sentence
(LS vs. SL) for NP1
(Intercept) 3.39 0.43 7.74 <.001
Preceding sentence (LS vs. SL) −1.62 0.39 −4.15 <.001
Effect of preceding sentence
(LS vs. SL) for NP2
(Intercept) −2.36 0.49 −4.73 <.001
Preceding sentence (LS vs. SL) 1.91 0.39 4.90 <.001
Note. SS, LS, and SL stand for short–short, long–short, and short–long,
respectively
Mem Cogn
Author's personal copy
We first compared the short–short condition with (1) the
long–short and (2) the short–long conditions in terms of choice
of referent. There was no significant difference between the
short–short and the long–short conditions for NP1 reference or
for NP2 reference. Similarly, the comparison between the
short–short and the short–longconditionrevealednosignifi-
cant difference in NP1 or NP2 reference. We then compared
(3) the long–short and the short–long conditions, which in-
cluded by-item random slopes for preceding sentence (long–
short vs. short–long). The results showed no significant differ-
ence between the long–short and the short–long conditions in
NP1 references, but there were more NP2 references in the
long–short condition than in the short–long condition.
Discussion
We obtained even clearer results concerning the effect of
antecedent length in this experiment, as compared with
Experiment 1. Participants were more likely to use pronouns
for NP1 in the long–short condition than for NP1 in the short–
long condition, whereas they were less likely to use pronouns
for NP2 in the long–short than for NP2 in the short–long
condition, indicating that antecedent length increases pronoun
use for both NP1 and NP2 antecedents. These results are in
line with the semantic richness account, which assumes that
the amount of information predicated of the antecedent in-
creases its accessibility (Hofmeister, 2011; Yamashita &
Chang, 2001). However, there were more pronouns for NP2
reference in the short–short than in the long–short condition
and more pronouns for NP1 reference in the short–short than
in the short–long condition. These results are unlikely to be
due to a general tendency to use more pronouns when the
preceding sentence is shorter, because there were no more
pronouns in the short–short condition, as compared with the
long–short condition for NP1, and similarly, the short–short
and short–long conditions did not differ in NP2 reference.
Instead, the effect may have occurred because participants
were less likely to use pronouns when referring to an anteced-
ent shorter than the referential alternative.
As in Experiment 1, there was a preference to refer to the
shorter of the two potential antecedents in a sentence, although
the effect was less pronounced: Although there were more
NP2 references in the long–short condition than in the short–
long condition, indicating a preference to refer to shorter
antecedents, there was no difference in the number of NP1
references between the long–short and short–long conditions.
Experiment 3
In the first and second experiments, we found that antecedent
length affects the use of pronouns. Experiment 3examined
Ta b l e 7 The percentages of NP1, NP2 and other references by preceding sentence in Experiment 2
Choice of Referent
NP1 NP2 Other
Preceding sentence Long–short 39.7% (200) 33.9% (171) 26.4% (133)
Short–long 39.5% (199) 27.0% (136) 33.5% (169)
Short–short 35.2% (177) 30.8% (155) 34.0% (171)
Mean 38.1% 30.6% 31.3%
Note. Numbers in brackets represent frequencies
Ta b l e 8 Summary of the coefficients from the analyses on the choice of
referent in Experiment 2
(1) Model summaries for the comparison between short–short and long–
short
Predictor βSE Z p
(Intercept) −0.78 0.21 −3.64 <.001
Effect of preceding sentence
(SS vs. LS) on NP1 reference
0.30 0.19 1.59 .10
(Intercept) −0.92 0.16 −5.69 <.001
Effect of preceding sentence
(SS vs. LS) on NP2 reference
0.16 0.14 1.16 .24
(2) Model summary for the comparison between short–short and short–
long
Predictor βSE Z p
(Intercept) −0.78 0.21 −3.64 <.001
Effect of preceding sentence
(SS vs. SL) on NP1 reference
0.29 0.24 1.20 .22
(Intercept) −0.92 0.16 −5.69 <.001
Effect of preceding sentence
(SS vs. SL) on NP2 reference
−0.20 0.14 −1.41 .15
(3) Model summary for the comparison between long–short and short–
long
Predictor βSE Z p
(Intercept) −0.54 0.13 −4.18 <.001
Effect of preceding sentence
(LS vs. SL) on NP1 reference
0.01 0.07 0.17 .86
(Intercept) −0.93 0.13 −6.70 <.001
Effect of preceding sentence
(LS vs. SL on NP2 reference
0.15 0.05 2.56 .01
Note. SS, LS, and SL stand for short–short, long–short, and short–long
conditions, respectively.
Mem Cogn
Author's personal copy
whether the effect of antecedent length could also be observed
in spoken language production. Some research suggests that
task demands can modulate language processing (Ferreira,
Ferraro, & Bailey, 2002; Ferreira, Foucart, & Engelhardt,
2013; Ferreira & Patson, 2007; Sanford & Sturt, 2002;
Swets, Desmet, Clifton, & Ferreira 2008). Since the linguistic
signal is transient in speech, the representations of the dis-
course entities might fade faster in memory, as compared with
written language processing, and this difference could poten-
tially influence the choice of referring expressions. Therefore,
in Experiment 3, we presented the preceding sentences audi-
torily and asked participants to continue the discourse orally.
The stimuli for this experiment were the ones used in
Experiment 1.
Method
Participants
Twenty-four undergraduate students were drawn from the
participant pool of the University of South Carolina. They
were all native speakers of American English and participated
in the study in exchange for course credit.
Materials and design
The materials and design were identical to those in
Experiment 1, except that the sentences were recorded by a
female native speaker of North American English at a slightly
slower than normal speech rate.
Procedure
The recorded experimental lists were programmed in the
Experiment Builder software such that each sentence was
orally played to the participants once they pressed the “space”
button on the keyboard. Immediately after the sentence was
finished, a “speak”prompt appeared on the screen and indi-
cated that they could start speaking their continuations into a
microphone. They were required to press the “space”button
again to stoprecording themselves and one more time to move
to the next sentence. All the participants were tested
individually in a quiet testing room. As in the previous exper-
iments, the participants were encouraged to respond quickly
and “with the first thing that comes to mind,”but there was no
time limit for starting to speak the continuation. In addition,
they could take a break in the middle of the experiment if they
needed to do so. The experiment took approximately 30 min
to complete.
Scoring
The scoring of this experiment was identical to that in
Experiment 1.
Results
Choice of referring expression
Tab le 9reports the percentage of pronominal reference rela-
tive to repeated noun reference to NP1 and NP2 for each
preceding sentence. As in Experiments 1and 2, other referring
expressions were extremely rare (long–short, N=2;short–
long, N=2;short–short, N= 0), so we focused on the number
of pronouns relative to repeated nouns in our analyses.
Tab le 10 contains a summary of the coefficients for choice
of referring expression analyses. As before, we included an-
tecedent position (NP1 vs. NP2) and preceding sentence
(short–short vs. long–short and short–short vs. short–long)
as the fixed factors. The comparison between the short–short
and the long–short conditions (1) revealed a significant effect
of antecedent position, with more pronoun use for NP1 than
for NP2, and also an effect of preceding sentence (short–short
vs. long–short), with more pronoun use in the long–short
condition than in the short–short condition, but there was no
significant interaction between antecedent position and pre-
ceding sentence (short–short vs. long–short). The comparison
between the short–short and the short–long conditions (2) also
revealed a significant effect of antecedent position, with more
pronominal reference to NP1 than to NP2, and a significant
effect of preceding sentence (short–short vs. short–long),
showing that there were significantly more pronouns in the
short–long condition than in the short–short condition.
Additionally, there was a significant interaction between
Ta b l e 9 Percentages of pronouns (out of all pronouns and repeated nouns) for NP1 and NP2 reference by preceding sentence in Experiment 3
Antecedent Position Mean
NP1 NP2
Preceding sentence Long–short 64.5% (71/110) 13.3% (17/127) 37.1%
Short–long 48.6% (56/115) 38.0% (38/100) 43.7%
Short–short 50.0% (44/88) 11.2% (15/133) 26.6%
Mean 54.6% 19.4%
Mem Cogn
Author's personal copy
antecedent position and preceding sentence (short–short vs.
short–long). When the antecedent was NP1, there was no
significant difference between the short–short and the short–
long conditions, but when the antecedent was NP2, there were
significantly more pronouns in the short–long condition than
in the short–short condition. The comparison between the
long–short and the short–long conditions (3) revealed a main
effect of antecedent position, with more pronouns following
NP1 than NP2 antecedents, but there was no main effect of
preceding sentence (long–short vs. short–long). Most
important and in line with the results of the previous experi-
ments, we found a significant antecedent position × preceding
sentence (long–short vs. short–long) interaction, indicating
that that the effect of antecedent position was larger in the
long–short than in the short–long condition. Simple effects
revealed that when NP1 was the antecedent, there were more
pronouns in the long–short than in the short–long condition.
In contrast, when the antecedent was NP2, there were fewer
pronouns in the long–short than in the short–long condition.
Thus, there were more pronoun references to longer
antecedents.
Choice of referent
Tab le 11 reports choice of referent by preceding sentence.
There was no significant difference in the number of other
responses between the short–short condition and the long–
short condition (p=.25)orbetweentheshort–short condition
and the short–long condition (p= .35). However, there were
significantly more other responses in the short–long condition
than in the long–short condition (p< .05). We thus analyzed
the numbers of NP1 and NP2 references relative to all trials
(NP1, NP2, and other references). We also analyzed the
choice of referent, as in Experiments 1and 2. Table 12 reports
the results of these analyses. The comparisons between the
short–short and the long–short condition (1) revealed that
there were significantly fewer NP1 references in the short–
short condition than in the long–short condition, but there was
no significant difference in NP2 reference. The comparisons
between the short–short and the short–long condition (2)
found fewer NP1 references in the short–short condition than
in the short–long condition, whereas there were significantly
more NP2 references in the short–short condition than in the
short–long condition. There were significantly more NP2
references in the long–short condition than in the short–long
condition (3), but no difference between these two conditions
for NP1 reference.
Discussion
Experiment 3replicated the effect of length in speech; partic-
ipants were more likely to use pronouns to refer to longer than
to refer to shorter antecedents. Specifically, similar to
Experiment 2, participants were more likely to use a pronoun
to refer to NP1 in the long–short condition than in the short–
long condition and more likely to use a pronoun to refer to
NP2 in the short–long condition than in the long–short con-
dition. The advantage of longer antecedents was also found
between the short–short and the short–long conditions; there
were significantly more pronouns for NP2 in the short–long
condition than in the short–short condition. This effect was
unlikely to be due to the length of the preceding sentence,
Ta b l e 1 0 Summary of the coefficients from the analyses on the choice of
referring expressions in Experiment 3
(1) Model summaries for the comparison between short–short and long–
short
Predictor βSE Z p
(Intercept) −0.86 0.52 −1.64 .09
Antecedent position −1.27 0.14 −8.90 <.001
Preceding sentence (SS vs. LS) −0.26 0.12 −2.21 .02
Antecedent position × preceding
sentence (SS vs. LS)
0.19 0.12 1.64 .10
(2) Model summaries for the comparison between short–short and short–
long
Predictor βSE Z p
(Intercept) −0.88 0.54 −1.61 .10
Antecedent position −1.35 0.15 −8.94 <.001
Preceding sentence (SS vs. SL) −0.46 0.13 −3.43 <.001
Antecedent position × preceding
sentence (SS vs. SL)
−0.61 0.13 −4.58 <.001
Effect of preceding sentence
(SS vs. SL) for NP1
(Intercept) 0.30 0.59 0.50 .61
Preceding sentence (SS vs. SL) −0.19 0.39 −0.49 .61
Effect of preceding sentence
(SS vs. SL) for NP2
(Intercept) −2.89 0.63 −4.58 <.001
Preceding sentence (SS vs. SL) 2.30 0.49 4.70 <.001
(3) Model summaries for the comparison between long-short and short-
long
Predictor βSE Z p
(Intercept) −0.60 0.52 −1.15 .250
Antecedent position 1.19 0.17 6.92 <.001
Preceding sentence (LS vs. SL) −0.16 0.14 −1.11 .265
Antecedent position × preceding
sentence (LS vs. SL)
0.89 0.15 5.74 <.001
Effect of preceding sentence
(LS vs. SL) for NP1
(Intercept) 1.36 0.61 2.21 .02
Preceding sentence (LS vs. SL) −1.35 0.37 −3.60 <.001
Effect of preceding sentence
(LS vs. SL) for NP2
(Intercept) −2.81 0.65 −4.26 <.001
Preceding sentence (LS vs. SL) 2.21 0.48 4.56 <.001
Note. SS, LS, and SL stand for short–short, long–short, and short–long,
respectively
Mem Cogn
Author's personal copy
because there were no significant differences in NP1 reference
between these two conditions. Instead, the effect is most
straightforwardly explained by the semantic richness account:
Longer antecedents were more salient, and participants were
more likely to use pronouns to refer to them.
Antecedent length also affected choice of referent.
Participants were more likely to refer to NP2 in the long–short
than in the short–long condition, although the two conditions
did not differ in NP1 reference. In addition, more NP1 refer-
ences and fewer NP2 references in the short–long condition
than in the short–short condition indicate that participants
tended to refer to the relatively shorter antecedents, consistent
with the findings from Experiments 1and 2. The only result
that goes against this interpretation is the comparison between
the short–short and long–short conditions for NP1 reference,
where we found more NP1 references in the long–short con-
dition. However, neither Experiments 1nor 2revealed such an
effect, which suggests the effect is rather weak.
General discussion
Across experiments, there was a clear and strong effect of
antecedent position, with NP1 (the sentence subject) being
considerably more likely to be realized with a pronoun than
NP2 (the sentence object). This effect has repeatedly been
shown in previous studies and is attributed to the greater
prominence associated with the first-mentioned entity and
the syntactic subject of a sentence (Arnold, 2001; Brennan,
1995; Fletcher, 1984; Fukumura & Van Gompel, 2010,2011;
Stevenson et al, 1994). More important, all experiments con-
sistently showed an interaction between antecedent length and
antecedent position. Specifically, the advantage of NP1 over
NP2 antecedents for pronominal reference increased when
NP1 was longer than NP2, whereas it decreased when NP2
was longer than NP1, indicating that longer antecedents re-
ceived more pronoun referring expressions. These results are
in line with theories of language processing that maintain that
length increases the accessibility of the associated NP (e.g.,
Almor, 1999,2004;Hofmeister,2011; Yamashita & Chang,
2001) and run counter to the functionalist view of accessibility
which assumes that a longer NP might be perceived as less
“given”by comprehenders and, therefore, should be rendered
less accessible (Ariel, 1990,1996;Givón,1988,1989;Gundel
et al., 1993). Although Ariel’s corpus analyses found that NPs
that are modified by relative clauses tend to be produced when
the referent is less accessible in the prior discourse context, we
found that long antecedents attenuate anaphoric forms, sug-
gesting that length increases accessibility for subsequent ref-
erence. Therefore, longer antecedents do not seem to signal
low referent accessibility, contrary to what Ariel has proposed.
Ta b l e 1 1 Percentages of NP1, NP2 and other references by preceding sentence in Experiment 3
Choice of Referent
NP1 NP2 Other
Preceding sentence Long–short 33.0% (110) 38.0% (127) 29.0% (97)
Short–long 34.4% (115) 30.0% (100) 35.6% (119)
Short–short 26.2% (88) 39.6% (133) 34.2% (115)
Mean 31.1% 35.9% 33.0%
Note. Numbers in brackets represent frequencies
Ta b l e 1 2 Summary of the coefficients from the analyses on the choice of
referent in Experiment 3
(1) Model summaries for the comparison between short–short and long–
short
Predictor βSE Z p
(Intercept) −1.16 0.19 −6.11 <.001
Effect of preceding sentence
(SS vs. LS) on NP1 reference
0.34 0.17 1.95 .05
(Intercept) −0.46 0.16 −2.81 <.01
Effect of preceding sentence
(SS vs. LS) on NP2 reference
−0.08 0.16 −0.51 .61
(2) Model summary for the comparison between short–short and short–
long
Predictor βSE Z p
(Intercept) −1.16 0.19 −6.11 <.001
Effect of preceding sentence
(SS vs. SL) on NP1 reference
0.42 0.17 2.44 .01
(Intercept) −0.46 0.16 −2.81 <.01
Effect of preceding sentence
(SS vs. SL) on NP2 reference
−0.47 0.16 −2.83 <.01
(3) Model summary for the comparison between long–short and short–
long
Predictor βSE Z p
(Intercept) −0.89 0.15 −5.77 <.001
Effect of preceding sentence
(LS vs. SL) on NP1 reference
−0.03 0.07 −0.53 .59
(Intercept) −0.65 0.13 −4.73 <.001
Effect of preceding sentence
(LS vs. SL) on NP2 reference
0.15 0.06 2.30 .02
Note. SS, LS, and SL stand for short–short, long–short, and short–long
conditions, respectively
Mem Cogn
Author's personal copy
But the question then is: Why does length enhance acces-
sibility? One possibility, as Hofmeister (2011) pointed out, is
that the extra information predicated of longer NPs causes
them to be encoded more firmly in memory and, therefore,
allows them to be retrieved faster, by providing additional
retrieval cues when language users reaccess the referent.
This possibility is consistent with theories proposing that
processing of an NP whose representation depends on some
other modifying information requires reactivating that NP so
that the new information can be efficiently incorporated into
the discourse representation (Hofmeister, 2011;Lewis&
Vasishth, 2005). In our study, the head noun of the long
antecedent has likely been repeatedly reactivated in the pro-
cess of incorporating the words included in the attached
relative clause, facilitating subsequent retrieval (i.e., higher
accessibility).
Another possibility is that since length necessarily adds
more information to the associated antecedent, it renders it
more predicable (Keil, 1979). Simply put, predicability refers
to the number of possible conceptual pathways for retrieving a
certain concept from memory (Bock, 1982;Bock&Warren,
1985). For example, an animate entity such as an actor can
move, stop, fall, think, sleep, and eat. However, an inanimate
entity such as a car can move, stop, and fall but cannot think,
sleep, or eat. Consequently, “actor”is connected to more con-
cepts than “car,”making it possible to predicate more ideas of
“actor”and, therefore, making it more conceptually accessible.
It could be the case that length acts in a similar fashion. That is,
the extra information attached to the long entity necessarily
connects it to more concepts, rendering it more accessible.
Regardless of the precise mechanism underlying the effect
of length, an important question is why more accessible
antecedents are realized with less marked referring expres-
sions. One possible explanation of this inverse relationship
between accessibility and explicitness of referring expressions
is that subsequent retrieval of an accessible referent is easier
than that of a less accessible referent. Fukumura et al. (2011;
Fukumura & Van Gompel, 2010) thus suggested that when
referring to less accessible referents, speakers need to reacti-
vate more information about the referent, which affects the
activation of concepts associated with more explicit referring
expressions. In contrast, when referring to more accessible
antecedents, less information needs to be activated to retrieve
the referent, which leads to the production of less explicit
referring expressions.
Another potential explanation is that the language process-
ing system might have a bias toward keeping a constant ratio
between information to be relayed and amount of linguistic
signal, a view proposed by Jaeger (2010) and dubbed the
“uniform information density”(UID) hypothesis. Originally,
it aimed at explaining optional that realization inEnglish (e.g.,
Ferreira & Dell, 2000). Take the following sentences, for
example:
(4) My boss confirmed/thought (that) I was absolutely crazy.
According to UID, that is more likely to be included in (4)
when the main clause verb is confirmed rather than thought,
because the verb confirm is statistically less likely to take a
complement clause.(This pattern occurs because confirm can
take both complement clauses and direct objects, whereas
think almost always takes only complement clauses;
Garnsey, Pearlmutter, Myers, & Lotocky, 1997)Therefore,
speakers are more likely to produce that after confirmed,as
compared with thought as a way of preventing a sudden
disturbance in the “information-to-linguistic signal”balance,
thus achieving a more uniform distribution of information
over words. We might therefore wonder whether participants
produced more pronouns following longer antecedents in our
study because longer antecedents conveyed more information
about the referent, and therefore, the use of a pronoun, which
encoded less information than repeated nouns, was more
efficient from an information-theoretic point of view.
However, UID is primarily based on statistical predictabil-
ity: Jaeger (2010)arguedthat“speakers should be more likely
to produce pronouns (e.g. she) instead of full NPs (e.g. the
girl) when reference to the expression’s referent is probable in
that context”(p.48). Whether speakers are more likely to
choose pronouns over repeated NPs when the referent is more
likely to be referred to is controversial, however (Arnold,
2001; Fukumura & Van Gompel, 2010; Stevenson et al.,
1994). In the present study, whereas pronouns were more
frequently used to refer to longer antecedents, participants
were more likely to refer to shorter than to longer antecedents
in their continuations. Thus, UID is not likely to be the
underlying force behind the effects of length. How can this
tendency to refer to relatively shorter antecedents be ex-
plained? Can this tendency be linked to the results for the
choice of referring expression? Although we can only specu-
late on what factors might be behind the findings related to
choice of referent, one possibility is that the more information
is predicated of an antecedent, the more specific it becomes
and, as a result, the harder itbecomes to add more information.
This is because the new information that is added to a long
antecedent should semantically fit the information already
predicated of it. However, because short antecedents (in our
case, bare nouns such as “the actor”) have no extra informa-
tion, the participants had more freedom in describing them
(because almost any new information would fit). Thus, talking
about a longer antecedent is perhaps more difficult than
talking about a shorter antecedent. Note that, under this view,
both choice of referent and choice of referring expression are
guided by the general tendency to minimize processing effort
(Jaeger, 2010;Zipf,1949).
Another possibility is that speakers prefer to refer to the
shorter NP in order to balance the amount of information
associated with the discourse entities. That is, speakers may
Mem Cogn
Author's personal copy
not decide to refer to a particular antecedent merely because it
is accessible. Instead, they choose what to refer to depending
on whether it is communicatively informative or semantically
sensible. In our experiments, participants tended to refer more
often to the NPs that did not include a relative clause, possibly
because there was little description of those characters, as
compared with the characters described with a relative clause.
In other words, participants may have produced reference in
order to fill the gap in their knowledge (cf. Simner &
Pickering, 2005). But because of the relatively limited infor-
mation about those characters, participants used fewer pro-
nouns to refer to them through the mechanisms discussed
earlier. In support of this idea, Brennan (1995) found that
participants typically first make a character salient in the
discourse by referring to it in subject position and with a fuller
form of reference before referring to that same character with a
pronoun.
An alternative explanation of our results for choice of
referent and choice of referring expression comes from the
connectionist account proposed by Chang (2002,2009).
Under this account, the notion of accessibility is not restricted
to conceptual factors such as animacy (Bock, 1982;Bock&
Warren, 1985) and can be extended to include lexical factors.
Therefore, the choice between different forms of referring
expressions may be driven by the conceptual accessibility of
the antecedent, as well as the lexical accessibility of the
anaphoric expression itself. Assuming that lexical accessibil-
ity is affected by competition among different alternatives
(Chang, 2002,2009), there may have been fewer linguistic
forms that can be used to refer to shorter antecedents (“he,”
“the actor”) than longer ones (“he,”“the actor,”“the frustrated
actor,”“the upset actor”,“the frustrated and upset actor,”etc.).
If so, there would be less competition between referring forms
for short antecedents than for long antecedents.This may have
made it easier to refer to shorter antecedents, as compared with
longer ones, and may have led to more pronouns for longer
antecedents: Under stronger competition, pronouns may be
more likely to win over other alternative forms of reference,
due to their ease of production.
However, if the lexical competition account (Chang, 2002,
2009) is correct, we would expect to have found more vari-
ability in the form of referring expressions for longer anteced-
ents. For example, participants would have used more modi-
fied NPs (e.g., “the frustrated actor,”“the frustrated and upset
actress”) to refer to the longer antecedents. Examination of our
data revealed that such modified referring expressions were
almost nonexistent. Thus, the lexical competition account is
not fully supported by our data. In fact, Fukumura et al. (2013)
recently showed that factors that affect pronoun use are dif-
ferent from those that affect lexical competition. Although
pronoun use can be affected by competing representations of
different discourse entities (Arnold & Griffin, 2007;
Fukumura et al., 2013; Fukumura & Van Gompel, 2011;
Fukumura et al., 2011), the competition is assumed to occur
at a nonlinguistic level (between the nonlexicalized represen-
tations), so the idea that lexical competition drives pronoun
use goes against current theories of pronoun production.
In conclusion, our study demonstrates that antecedent
length has a major effect on form of reference: Language users
were more likely to use a pronoun when the antecedent was
elaborated on with a modifier than when it was a simple NP.
This finding is consistent with the proposal that the additional
information increases the referent’s prominence and, thus,
supports the view that semantic enrichment enhances
accessibility.
Appendix
Experimental sentences for Experiment 1,2and 3. The words
to the left of the slash belong to Experiment 1and 3, and those
to the right are for Experiment 2.
1. The actor who was frustrated and visibly upset about the
night's disastrous performance walked away from the
actress/cameraman.
2. The bridesmaid who had enjoyed the wedding ceremony
and was getting ready to leave photographed the
choirboy/choirgirl.
3. The kitchenmaid who had a chronic psychological ill-
ness and had recently been taking medication mistreated
the milkman/housewife.
4. The cowboy who was unpredictable and terribly dis-
tressed because of a recent bitter argument ignored the
barmaid/waiter.
5. The waitress who had time off work and was traveling in
Africa missed the waiter/barmaid.
6. The monk who was getting seriously worried about lack
of formalities at the cathedral supported the nun/
clergyman.
7. The boy who was extremely unpopular at school be-
cause of uncontrollable impulsiveness approached the
girl/choirboy.
8. The gunman who had formerly served as an officer in the
American army shot the lady/pilot.
9. The mermaid who was adventurous and was travelling
to discover the unknown world encountered the sailor/
goddess.
10. The stewardess who had experienced many long flights
and could foresee the hazards informed the pilot/woman.
11. The stableman who was fussy and was disgusted by the
shed’s filthy conditions helped the maid/shepherd.
12. The butler who was determined to provide an unforget-
table experience at the Christmas celebrations talked to
the duchess/duke.
Mem Cogn
Author's personal copy
13. The postman who came from a different culture and was
feeling increasingly insecure insulted the housewife/
Dutchman.
14. The businesswoman who was rich and successful and
was going to open a restaurant met the gentleman/
saleswoman.
15. The countess who was struggling to accumulate wealth
for the impending retirement years blackmailed the
duke/noblewoman.
16. The god who was confused and had been distracted by
the irreparable situation misunderstood the goddess/
sorcerer.
17. The fisherman who was made fully aware of the volatile
market conditions negotiated with the saleswoman/
gentleman.
18. The housemaid who was tall and good-looking and
was wearing fashionable clothes followed the
shepherd/lady.
19. The wizard who was an ardent patriot opposing foreign
interference in the kingdom trapped the witch/sailor.
20. The governess who was lively and energetic and was
walking in the park poked the schoolboy/schoolgirl.
21. The spokeswoman who had learned many valuable ar-
gument skills and was quite eloquent debated with the
congressman/maid.
22. The ballerina who was outgoing and was very well
connected in the society befriended the boxer/
cheerleader.
23. The man who was walking on the cliff and looking at the
shore below spotted the woman/schoolboy.
24. The camerawoman who had filmed/was involved in
many famous movies and had won several awards wel-
comed the cameraman/actress.
25. The friar who was enthusiastic and truly proud of the
singing in the church spoke to the choirgirl/
congressman.
26. The policeman who had been traveling to police stations
as an undercover investigator fell in love with/sought
help from the policewoman/detective.
27. The colonel who was becoming increasingly powerless
and had failed to prevent the war betrayed the empress/
prince.
28. The godmother who was wise and highly considerate of
the diverse problems in society wrote to the clergyman/
policewoman.
29. The huntsman who had discovered the cause of the
repeated fires in the forest called the shepherdess/
horseman.
30. The baroness who was deeply scared by the recent
armed uprising in the country visited the bishop/
empress.
31. The chairwoman who was trying to find a way to solve
the tax problem argued with the lord/duchess.
32. The deliveryman who was impatient and terribly irate
about the mistaken receipt phoned the salesgirl/
milkman.
33. The sportsman who had drunk a lot of whiskey and had
taken some drugs kissed/patted the cheerleader/boxer.
34. The sportswoman who was a backpacker and was
climbing a mountain in the Alps rescued the
Dutchman/girl.
35. The princess who was deeply happy and relieved after
hearing the final verdict hugged the prince/shepherdess.
36. The nanny who had not been in a relationship for ages
and was lonely married/consulted with the garbageman/
salesgirl.
37. The baron who was completely panicked and was inca-
pable of overcoming the crisis defied the duchess/
bishop.
38. The horsewoman who was an inexperienced fighter and
had not been in many battles attacked the horseman/nun.
39. The nobleman who was attempting to make a huge
fortune by buying the land misled the noblewoman/lord.
40. The mayoress who had been cunning and appallingly
dishonest in the recent election campaign bribed the
governor/congresswoman.
41. The headmaster who was lazy and disorganized and had
forgotten about the appointment bumped into the
schoolgirl/governor.
42. The heroine who was hiding behind a tree and was
holding a big mace killed the knight/witch.
References
Almor, A. (1999). Noun-phrase anaphora and focus: The informational
load hypothesis. Psychological Review, 106, 748–765.
Almor, A. (2004). A computational investigation of reference in produc-
tion and comprehension. In J. Trueswell & M. Tanenhaus (Eds.),
Approaches to studying world-situated language use: Bridging the
language-as-product and language-as-action traditions (pp. 285–
301). Cambridge, MA: MIT Press.
Ariel, M. (1990). Accessing noun-phrase antecedents. London:
Routledge.
Ariel, M. (1996). Referring expressions and the +/- coreference distinc-
tion. In J. K. G. T. Fretheim (Ed.), Reference and referent
accessibility (pp. 13–35). Amsterdam: John Benjamins.
Arnold, J. E. (2001). The effect of thematic roles on pronoun use and
frequency of reference continuation. Discourse Processes, 31, 137–
162.
Arnold, J. E. (2008). Reference production: Production-internal and
addressee-oriented processes. Language and Cognitive Processes,
23, 495–527.
Arnold, J. E., & Griffin, Z. (2007). The effect of additional characters on
choice of referring expression: Everyone counts. Journal of Memory
and Language, 56, 521–536.
Arnold, J. E., Wasow, T., Losongco, A., & Ginstrom, R. (2000).
Heaviness vs Newness: The effect of structural complexity and
discourse status on constituent ordering. Language, 76(1), 28–55.
Mem Cogn
Author's personal copy
Barr, D. J., Levy, R., Scheepers, C., & Tily, H. J. (2013). Random effects
structure for confirmatory hypothesis testing: Keep it maximal.
Journal of Memory and Language, 68(3), 255–278.
Baayen, R. H., Davidson, D. J., & Bates, D. M. (2008). Mixed-effects
modeling with crossed random effects for subjects and items.
Journal of Memory and Language, 59, 390–412.
Bock, J. K. (1982). Toward a cognitive psychology of syntax: informa-
tion processing contributions to sentence formulation.
Psychological Review, 89, 1–47.
Bock, J. K., & Warren, R. K. (1985). Conceptual accessibility and
syntactic structure in sentence formulation. Cognition, 21(1), 47–67.
Brennan, S. E. (1995). Centering attention in discourse. Language and
Cognitive Processes, 10, 137–167.
Brennan, S. E., Friedman, M. W., & Pollard, C. J. (1987). A centering
approach to pronouns. In Proceedings of the 25th annual meeting of
the association for computational linguistics (pp. 155–162).
Cambridge, MA: Association for Computational Linguistics.
Chang, F. (2002). Symbolically speaking: A connectionist model of
sentence production. Cognitive Science, 26,609–651.
Chang, F. (2009). Learning to order words: A connectionist model of
heavy NP shift and accessibility effects in Japanese and English.
Journal of Memory and Language, 61, 374–397.
Clark, E., & Clark, H. (1978). Universals, relativity and language pro-
cessing. In J. Greenberg (Ed.), Universals of human language
(Method and Theory, Vol. 1, pp. 225–277). Stanford: Stanford
University Press.
Clark, H. H., & Haviland, S. E. (1977). Comprehension and the given-
new contract. In R. O. Feedle (Ed.), Discourse production and
comprehension (Vol. 1, pp. 1–40). Norwood, NJ: Ablex.
Craik, F. I. M., & Tulving, E. (1975). Depth of processing and the
retention of words in episodic memory. Journal of Experimental
Psychology: General, 104, 268–294.
Ferreira, V. S., & Dell, G. S. (2000). The effect of ambiguity and lexical
availability on syntactic and lexical production. Cognitive
Psychology, 40, 296–340.
Ferreira, F., Ferraro, V., & Bailey, K. G. D. (2002). Good enough
representations in language comprehension. Current Directions in
Psychological Science, 11, 11–15.
Ferreira, F., Foucart, A., & Engelhardt, P. E. (2013). Language processing
in the visual world: Effects of preview, visual complexity, and
prediction. Journal of Memory and Language, 69(3), 165–182.
Ferreira, F., & Patson, N. (2007). The good enough approach to language
comprehension. Language and Linguistics Compass, 1, 71–83.
Fisher, R. P., & Craik, F. I. M. (1980). The effects of elaboration on
recognition memory. Memory & Cognition, 8, 400–404.
Fletcher, C. R. (1984). Markedness and topic continuity in discourse
processing. Journal of Verbal Learning and Verbal Behavior, 23,
487–493.
Fukumura, K., Hyönä, J., & Scholfield, M. (2013). Gender affects se-
mantic competition: the effect of gender in a non-gender marking
language. Journal of Experimental Psychology Learning Memory
and Cognition, 39, 1012–1021.
Fukumura, K., & Van Gompel, R. P. G. (2010). Choosing anaphoric
expressions: Do people take into account likelihood of reference?
Journal of Memory and Language, 62(1), 52–66.
Fukumura, K., & Van Gompel, R. P. G. (2011). The effects of animacy in
the choice of referring expressions. Language and Cognitive
Processes, 26, 1472–1504.
Fukumura, K., Van Gompel, R. P. G., Harley, T., & Pickering, M. J.
(2011). How does similarity-based interference affect the choice of
referring expression? Journal of Memory and Language, 65, 331–
344.
Fukumura, K., Van Gompel, R. P. G., & Pickering, M. J. (2010). The use
of visual context during the production of referring expressions.
Quarterly Journal of Experimental Psychology, 63, 1700–1715.
Garnsey, S. M., Pearlmutter, N. J., Meyers, E., & Lotocky, M. A. (1997).
The contributions of verb bias and plausibility to the comprehension
of temporarily ambiguous sentences. Journal of Memory and
Language, 37, 58–93.
Garvey, C., & Caramazza, A. (1974). Implicit causality in verbs.
Linguistic Inquiry, 5(3), 459–464.
Givón, T. (1983). Topic continuity in discourse: An introduction. In T.
Givón (Ed.), Topic continuity in discourse: A quantitative cross-
language study (pp. 1–41). Amsterdam: John Benjamins.
Givón, T. (1988). The pragmatics of word order: Predictability, impor-
tance and attention. In M. Hammond, E. A. Moravcsik, & J. R.
Wirth (Eds.), Studies in syntactic typology (Vol. 17, pp. 243–284).
Amsterdam: John Benjamins.
Givón, T. (1989). Mind, code, and context: Essays in pragmatics.
Hillsdale, N.J.: L. Erlbaum Associates.
Gordon, P. C., Grosz, B. J., & Gilliom, L. A. (1993). Pronouns, names,
and the centering of attention in discourse. Cognitive Science, 17,
311–347.
Grosz, B. J., Joshi, A. K., & Weinstein, S. (1995). Centering: A frame-
work for modelling the local coherence of discourse. Computational
Linguistics, 21, 203–225.
Gundel, J. K., Hedberg, N., & Zacharski, R. (1993). Cognitive status and
the form of anaphoric expressions in discourse. Language, 69, 274–
307.
Halliday, M. (1967). Notes on transitivity and theme in English, Part II.
JournalofLinguistics,3,199–244.
Hawkins, J. A. (1994). A performance theory of order and constituency.
Cambridge, UK: Cambridge University Press.
Hofmeister, P. (2011). Representational complexity and memory retrieval
in language comprehension. Language and Cognitive Processes,
26(3), 376–405.
Jaeger, T. F. (2008). Categorical data analysis: Away from ANOVAs
(transformation or not) and towards logit mixed models. Journal
of Memory and Language, 59, 434–446.
Jaeger, T. F. (2010). Redundancy and reduction: Speakers manage syn-
tactic information density. Cognitive Psychology, 61, 23–62.
Kehler, A., Kertz, L., Rohde, H., & Elman, J. L. (2008). Coherence and
coreference revisited. Journal of Semantics, 25, 1–44.
Keil, F. C. (1979). Semantic and conceptual development: An ontological
perspective. Cambridge, MA: Harvard University Press.
Lewis, R., & Vasishth, S. (2005). An activation-based model of
sentence processing as skilled memory retrieval. Cognitive
Science, 29, 1–45.
Marks, W. (1987). Retrieval constraints on associative elaborations.
Journal of Experimental Psychology: Learning, Memory, and
Cognition, 13(2), 301–309.
Sanford, A. J., & Sturt, P. (2002). Depth of processing in language
comprehension: not noticing the evidence. Trends in Cognitive
Science, 6, 382–386.
Simner,J.,&Pickering,M.J.(2005).Planningcausesandcon-
sequences in discourse. Journal of Memory and Language,
52, 226–239.
Stallings, L. M., & MacDonald, M. C. (2011). It’s not just the
“Heavy NP”: Relative phrase length modulates the produc-
tion of heavy-NP shift. Journal of Psycholinguistic Research,
40, 177–187.
Stallings, L. M., MacDonald, M. C., & O’Seaghdha, P. G. (1998). Phrasal
order constraints in sentence production: Phrase length and verb
dispos ition in heavy-NP shift. Journal of Memory and Language,
39, 392–417.
Stevenson, R. J., Crawley, R.A., & Kleinman, D. (1994). Thematic roles,
focus and the representation of events. Language and Cognitive
Processes, 9, 519–548.
Stevenson, R. J., Knott, A., Oberlander, J., & McDonald, S. (2000).
Interpreting pronouns and connectives: Interactions among
Mem Cogn
Author's personal copy
focusing, thematic roles and coherence relations. Language and
Cognitive Processes, 15, 225–262.
Swets,B.T.,Desmet,C.,Clifton,J.,&Ferreira,F.(2008).
Underspecification of syntactic ambiguities: evidence from self-
paced reading. Memory & Cognition, 36, 201–217.
Yamashita, H., & Chang, F. (2001). “Long before short”prefer-
ence in the production of a head-final language. Cognition,
81(2), 45–55.
Zipf, G. K. (1949). Human behavior and the principle of least effort: An
introduction to human ecology. Addison-Wesley.
Mem Cogn
Author's personal copy