ArticlePDF Available

Abstract

Recent evidence shows that distractors that signal high compared to low reward availability elicit stronger attentional capture, even when this is detrimental for task-performance. This suggests that simply correlating stimuli with reward administration, rather than their instrumental relationship with obtaining reward, produces value-driven attentional capture. However, in previous studies, reward delivery was never response independent, as only correct responses were rewarded, nor was it completely task-irrelevant, as the distractor signaled the magnitude of reward that could be earned on that trial. In two experiments, we ensured that associative reward learning was completely response independent by letting participants perform a task at fixation, while high and low rewards were automatically administered following the presentation of task-irrelevant colored stimuli in the periphery (Experiment 1) or at fixation (Experiment 2). In a following non-reward test phase, using the additional singleton paradigm, the previously reward signaling stimuli were presented as distractors to assess truly task-irrelevant value driven attentional capture. The results showed that high compared to low reward-value associated distractors impaired performance, and thus captured attention more strongly. This suggests that genuine Pavlovian conditioning of stimulus-reward contingencies is sufficient to obtain value-driven attentional capture. Furthermore, value-driven attentional capture can occur following associative reward learning of temporally and spatially task-irrelevant distractors that signal the magnitude of available reward (Experiment 1), and is independent of training spatial shifts of attention towards the reward signaling stimuli (Experiment 2). This confirms and strengthens the idea that Pavlovian reward learning underlies value driven attentional capture.
Pavlovian reward learning underlies value driven
attentional capture
Berno Bucker
1
&Jan Theeuwes
1
#The Author(s) 2016. This article is published with open access at Springerlink.com
Abstract Recent evidence shows that distractors that signal
high compared to low reward availability elicit stronger atten-
tional capture, even when this is detrimental for task-perfor-
mance. This suggests that simply correlating stimuli with re-
ward administration, rather than their instrumental relation-
ship with obtaining reward, produces value-driven attentional
capture. However, in previous studies, reward delivery was
never response independent, as only correct responses were
rewarded, nor was it completely task-irrelevant, as the
distractor signaled the magnitude of reward that could be
earned on that trial. In two experiments, we ensured that as-
sociative reward learning was completely response indepen-
dent by letting participants perform a task at fixation, while
high and low rewards were automatically administered fol-
lowing the presentation of task-irrelevant colored stimuli in
the periphery (Experiment 1) or at fixation (Experiment 2).
In a following non-reward test phase, using the additional
singleton paradigm, the previously reward signaling stimuli
were presented as distractors to assess truly task-irrelevant
value driven attentional capture. The results showed that high
compared to low reward-value associated distractors impaired
performance, and thus captured attention more strongly. This
suggests that genuine Pavlovian conditioning of stimulus-
reward contingencies is sufficient to obtain value-driven at-
tentional capture. Furthermore, value-driven attentional cap-
ture can occur following associative reward learning of tem-
porally and spatially task-irrelevant distractors that signal the
magnitude of available reward (Experiment 1), and is inde-
pendent of training spatial shifts of attention towards the re-
ward signaling stimuli (Experiment 2). This confirms and
strengthens the idea that Pavlovian reward learning underlies
value driven attentional capture.
Keywords Visual attention .Associative reward learning .
Pavlovian conditioning .Attentional capture
Introduction
Visual selective attention is crucial for efficient everyday be-
havior, by enhancing the representation of stimuli that are
relevant and suppressing the representation of stimuli that
are potentially distracting. For example, when walking on
the sidewalk, you attend fellow pedestrians and traffic signs
while ignoring bypassing cars and irrelevant billboards.
However, if there is a colored money bill lying on the pave-
ment, it is likely to grab your attention. The probability that a
certain stimulus is selected amongst competing stimuli is par-
tially dependent on its perceptual features (e.g., luminance or
color-contrast), and the control settings of an observer to at-
tend to specific features or objects. Classically, the automati-
cally stimulus-driven factors are referred to as bottom-up, and
the voluntarily goal-driven factors are referred to as top-down
(for reviews see Corbetta & Shulman 2002; Desimone &
Duncan, 1995; Itti & Koch, 2001;Theeuwes,2010).
However, there is more to the money bill lying on the
pavement, as the learned value that the bill has acquired over
years of life experience makes it likely that it captures your
attention. This happens, although money on the pavement
does not Bstick out^per se in a bottom up fashion, and despite
that we are not constantly searching for bills in a top down
manner when we are walking on the sidewalk. A recent body
*Berno Bucker
berno.bucker@vu.nl
1
Department of Experimental and Applied Psychology, Vrije
Universiteit Amsterdam, Van der Boechorststraat 1, 1081
BT Amsterdam, The Netherlands
Atten Percept Psychophys
DOI 10.3758/s13414-016-1241-1
of literature has shown that learned reward associations can
evoke attentional biases that cannot be explained in terms of
bottom-up and top-down factors (for reviews see Anderson,
2013,2015; Awh, Belopolsky, and Theeuwes 2012; Chelazzi,
Perlato, Santandrea, & Della Libera, 2013; Chun, Golomb, &
Turk-Browne, 2011; Kristjánsson & Campana 2010,Le
Pelley, Mitchell, Beesley, George, & Wills, 2016). Awh
et al. (2012) argued that the attentional priority map should
be extended beyond the integration of bottom-up and top-
down factors, and include selection historyas a third factor
driving visual selective attention. Selection history includes
attentional effects due to previous selection (i.e., priming)
and reward learning, and comprises that attentional selection
is influenced by the significance that certain stimuli have
gained over time through experience. This suggests that the
deployment of attention is flexibly adapted depending on the
context to maximize behavioral outcomes. As rewards in the
environment are crucial for guiding optimal behavior, it is not
surprising that reward signaling stimuli have a significant im-
pact on the allocation of selective attention.
In experimental settings, it is well known that the prospect
of reward can increase the motivation and effort to deliver
optimal performance, thereby recruiting cognitive control pro-
cesses that allow, amongst other effects, for more efficient
deployment of attention (for reviews see Botvinick &
Braver, 2015; Pessoa, 2009; Pessoa & Engelmann, 2010).
For example, when reward is signaled at the start of a trial or
block, performance is enhanced when relatively high reward
can be obtained, usually characterized by a reduction in reac-
tion time and/or improved accuracy (e.g., Bucker &
Theeuwes, 2014; Sawaki, Luck, & Raymond, 2015; Small
et al., 2005). Furthermore, when the delivery of relatively high
reward is directly coupled to a stimulus or stimulus feature,
that stimulus (feature) enjoys increased attentional priority,
thereby eliciting improved behavioral performance (e.g.,
Bucker & Theeuwes, 2016; Kiss, Driver, & Eimer, 2009;
Krebs, Boehler, Egner, & Woldorff, 2011; Munneke,
Hoppenbrouwers, & Theeuwes, 2015). Hence, when reward
is signaled in advance, or is congruent with the current task-
demands (e.g., coupled to the spatial location or a visual fea-
ture of the target), it allows for optimal preparation and stra-
tegic adjustment of attentional processes in order to achieve
more efficient goal-directed behavior. This intuitively makes
sense, as the best strategy to maximize reward income under
these circumstances is to integrate and prioritize the reward
signal in the process of attentional selection.
However, it has also been shown that learned reward asso-
ciations can trigger attentional prioritization that goes against
the goal-directed state of the observer, and can be detrimental
for task performance (e.g., Anderson, Laurent, & Yantis
2011a,2011b; Della Libera & Chelazzi, 2006,2009;Failing
& Theeuwes, 2014, Hickey, Chelazzi, & Theeuwes, 2010a,
2010b). For example, Hickey et al. (2010a,2010b)showed
that, when high and low rewards were distributed at random,
attention was biased towards the color thatwas high rewarded
on the previous trial, not only when the target was presented in
the high reward color on the current trial, but also when the
distractor was presented in the high reward color on the cur-
rent trial. This suggests that high reward stimuli automatically
capture attention, and elicit an attentional bias towards fea-
tures associated with high reward, even when selection of
those features runs against the current goals of the observer
and is detrimental for task performance and monetary payout.
In addition, several studies have shown that reward associ-
ated stimuli are prioritized, even when reward features be-
come completely task-irrelevant in a context where the actual
rewards are no longer delivered (e.g., Anderson et al. 2011a,
2011b; Bucker, Silvis, Donk, & Theeuwes, 2015; Della Libera
& Chelazzi, 2006;2009; Failing & Theeuwes, 2014;
MacLean, Diaz, & Giesbrecht, 2016). Typically, these studies
make use of (1) an initial training phase in which target fea-
tures are associated with high and low reward-value, and (2) a
test phase in which the previously reward signaling target
features become distractor features and rewards are no longer
delivered. For example, in Anderson et al. (2011a,2011b),
during an initial training phase, participants had to search for
a red or green circle amongst multiple colored circles, and
discriminate the orientation of a line segment within the target
circle. For successful selection of the colored target circle,
followed by a correct manual response, participants earned
either a high or low reward depending on the color-reward
contingencies (e.g., red gives a high chance on high reward
and a low chance on low reward, and green gives a high
chance on low reward and a low chance on high reward).
Then, during the following test phase, participants had to
search for an odd-shaped target amongst several distractors
that had the same shape and report the orientation of a line
segment within the odd-shaped target (i.e., the additional
singleton paradigm, Theeuwes, 1992). Rewards could no lon-
ger be earned, but one of the distractors could occasionally
appear in the previously high or low reward signaling color.
Even though participants were instructed to ignore any color
information, the distractor slowed search significantly when it
was presented in the color previously associated with the high
reward compared to the color previously associated with the
low reward (Anderson et al., 2011a) or compared to when
none of the reward colors were present (Anderson et al.,
2011b). This suggests, that so-called value driven attentional
capture (see Anderson, 2013) occurs even when the high
reward-value associated stimulus (feature) is task-irrelevant,
physically non-salient and no longer predictive for reward
delivery.
Recently it has been argued that value-driven attentional
capture as described above could be the result of stimulus
response learning rather than the mere statistical regularities
between the presence of a stimulus and reward delivery (see
Atten Percept Psychophys
Le Pelley et al., 2016). By repeatedly reinforcing attentional
orienting towards high reward features in a training phase, this
learned attentional response becomes a selection bias that per-
sists during a following test phase, even if shifting attention
towards the same stimulus features is now detrimental for task
performance. In other words, it is possible that the observed
attentional capture by the previously reward signaling stimuli
during the testphase is a carryover of attentional orienting that
was initially trained during the training phase. This would
suggest that instrumental rather than Pavlovian conditioning
underlies the effects of value driven attentional capture. To
investigate the associative learning mechanism underlying
value driven attentional capture, Le Pelley and colleagues
(Le Pelley, Pearson, Griffiths, & Beesley, 2015)designeda
variation ofthe additionalsingleton paradigm in which high or
low rewards were delivered for fast and correct target re-
sponses depending on the color of a task-irrelevant distractor.
Crucially, attentional orienting towards the distractor stimulus
needed to be suppressed in order to give a correct response in
time and receive that trials reward. In fact, orienting towards
the reward signaling distractor item resulted in the omission of
reward. The authors reasoned that if instrumental learning
underlies value-driven attentional capture, behavior should
be more efficient on high compared to low reward trials as
suppressing attentional orienting towards the reward item is
reinforced, making it easier to perform the target discrimina-
tion. However, if Pavlovian learning underlies value-driven
attentional capture, behavior should be less efficient on high
compared to low reward trials, as the reward signaling
distractor automatically captures attention, making it more
difficult to perform the target discrimination. The results of
Le Pelley et al. (2015) showed the high compared to low
reward signaling distractors captured attention more strongly,
and similar findings have been reported for other behavioral
(Mine & Saiki, 2015) and oculomotor tasks (Bucker,
Belopolsky, & Theeuwes, 2015; Failing, Nissens, Pearson,
Le Pelley, & Theeuwes, 2015; McCoy & Theeuwes 2016;
Pearson, Donkin, Tran, Most, Le Pelley, 2015). This suggests
that Pavlovian rather than instrumental reward learning under-
lies the effects of value-driven attentional capture.
However, in all previous studies, reward administration
was response dependent, as only fast and correct responses
were followed by actual reward delivery. Furthermore, the
stimulus that signaled the availability of reward was presented
within the stimulus display, and the only way for participants
to find out how much reward they could earn on any given
trial was to attend the color of the reward signaling distractor.
In all previous studies, except for Mine and Saiki (2015),
which utilized a design with a training and test phase,
value-driven attentional capture was assessed while the re-
ward associated distractor was present to predict the reward
outcome of a trial. This implies that the reward associated
distractor was never completely task-irrelevant as it conveyed
the informational value of the reward magnitude that was
available (i.e., signaling how much reward could be earned
on a particular trial). To rule out any influence of task-rele-
vance, we therefore separated the associative reward learning
phase from the test phase in which value-driven attentional
capture was assessed in the present study. Although Mine
and Saiki (2015) also used a separate training phase to asso-
ciate distractor stimuli with differential reward-value, the
distractor stimuli always appeared in the target stimulus dis-
play at the time a response was required. Furthermore, only
correct responses were rewarded, making associative reward
learning response dependent. Thus, in all previous studies
trying to uncover the learning mechanism underlying value-
driven attentional capture associative reward learning was
never truly Pavlovian, and the reward associated distractors
were never completely separated from the task at hand. In the
present study, we aimed to specifically address whether the
mere co-occurrence of a stimulus and reward administration,
without any instrumental relationship in obtaining reward,
could lead to value-driven attentional capture.
In order to examine whether genuine Pavlovian reward
learning underlies value-driven attentional capture, we con-
ducted two experiments. In both experiments, a Pavlovian
reward conditioning phase was separated from a non-reward
test phase using the additional singleton paradigm. This en-
suredthattherewasnomotivationwhatsoevertoattendto-
wards the reward-value associated stimuli in order to check
how much reward could be earned on any given trial.
Crucially, during associative reward learning in the reward
conditioning phase, reward administration was also complete-
ly task- and response-independent. Participants fixated the
central fixation cross and had to respond to a 45° angular
degrees rotation of this fixation cross. While fixating the cen-
tral fixation cross, circles or diamonds (one of which was
presented in color) appeared in the periphery and were, inde-
pendent of the response of the participant, followed either by
high or low reward administration. The reward signaling
distractors and reward feedback were never presented simul-
taneously with a change of fixation, such that associative re-
ward learning was separated from the fixation change task in
time and that the delivery of reward could not coincidentally
be attributed to any key press or performance on the assigned
task. In Experiment 1, stimuli following high and low reward
administration were presented in the periphery, such that the
location where reward learning occurred was spatially sepa-
rate from the task at fixation. In Experiment 2, the stimulus
display contained only one colored stimulus, which was pre-
sented at the focus of attention at fixation, to prevent spatial
orienting towards the reward signaling stimuli.
The same non-reward test phase followed the reward
conditioning phase in both experiments, to examine value-
driven attentional capture. During the non-reward test
phase, rewards were no longer administered, and participants
Atten Percept Psychophys
had to perform a discrimination task on a bar stimulus that was
presented ina gray shape singleton amongst several non-target
shapes. Crucially, one of the distractor shapes was sometimes
presented inthe high or low reward-value associated color. We
expected that the colored singletons that previously signaled
reward would attract attention and slowsearch relativeto trials
without a colored singleton distractor. Furthermore, if
Pavlovian learning underlies value driven attentional capture,
performance should be impaired on trials in which a high
compared to low reward-value associated distractor stimulus
was present. However, if associative reward learning involves
an instrumental aspect and is dependent on the association of a
response with reward delivery, we expect to find no perfor-
mance differences, between high and low reward-value
distractor trials in the non-reward test phase. Altogether,
these experiments should give clear insight in whether asso-
ciative reward learning through pure Pavlovian conditioning
elicits value-driven attentional capture.
Experiment 1
During the reward conditioning phase, participants performed
a task at fixation, while completely task-irrelevant stimuli
were presented in the periphery (see Fig. 1). One of these
stimuli was presented in a color that could be associated with
the delivery of either a high or a low reward. Crucially, the
stimuli in the periphery and reward administration were
completely task irrelevant and not associated with any re-
sponse. Thus, through Pavlovian reward learning, we associ-
ated specifically colored stimuli with either high or low re-
ward-value. In the non-reward test phase, rewards were no
longer administered, and participants performed a discrimina-
tion task on a bar stimulus presented in a gray shape singleton
amongst several non-target shapes (see Fig. 2). Crucially, one
of the distractor shapes was sometimes presented in one of the
previously reward signaling colors. We examined value-
driven attentional capture by comparing performance between
trials in which a high and low reward-value associated
distractor was present.
Methods
Participants
For Experiment 1, a group of 24 participants [7 males, 2031
years of age, mean = 23.5 years, standard deviation (SD) = 2.7
years] was tested at the Vrije Universiteit Amsterdam. All
participants reported having normal or corrected-to-normal
vision, and gave written informed consent before participa-
tion. Participants earned 5.28 reward during the reward con-
ditioning phase, and were paid at a rate of 8.00 per hour to
compensate for participation. As the experiment lasted ap-
proximately 40 min, participants earned around 10.00 in
total. All research was approved by the Vrije Universiteit
Faculty of Psychology ethics board, and was conducted ac-
cording to the principles of the Declaration of Helsinki.
Apparatus
All participants were tested in a sound-attenuated, dimly-lit
room, with their head resting on a chinrest at a viewing dis-
tance of 75 cm. A computer with a 3.2 GHz Intel Core pro-
cessor running OpenSesame (Mathôt, Schreij, & Theeuwes,
2012) generated the stimuli on a 22-inch screen (resolution
1680 × 1050, refreshing at 120 Hz). The necessary response
data were acquired through the standard keyboard and all
auditory stimuli were presented through headphones.
Stimuli
Reward conditioning phase
Throughout the reward conditioning phase, a white (CIE: x=
.255, y= .437, 67.11 cd/m
2
) fixation cross (.30° x .30°) was
presented on a black (<1 cd/m
2
) background at the center of
the screen (see Fig. 1). On fixation-change trials (25 % of the
trials) the fixation cross (B+^) rotated 45° angular degrees
(Bx^) for a brief period, but the fixation cross never disap-
peared from the screen. The reward stimulus display was
based on the additional singleton paradigm display
(Theeuwes, 1992) but did not contain target or distractor line
elements. Accordingly, six shapes, randomly one diamond
(2.95° x 2.95°) amongst five circles (r= 1.45°) or one circle
amongst five diamonds, were presented at equal distances on
an imaginary circle (r= 5.90°). Fivestimuli were presented in
gray (CIE: x= .308, y= .335, 22.1 cd/m
2
), and one color
singleton was presented in either blue (CIE: x= .190, y=
.197, 22.5 cd/m
2
)oryellow(CIE:x=.412,y= .513, 22.6
cd/m
2
). The shape singleton was never presented in color. The
locations of the shape and color singleton were assigned ran-
domly on each trial. The reward feedback display contained
the written text B+1 ct^or B+10 ct^(~.5° x 3.0°) presented in
white (CIE: x= .255, y= .437, 67.11 cd/m
2
), 0.50° visual
degrees above the fixation cross at the center of the screen.
Simultaneously with the visually presented feedback, the
sound of one or more dropping coins was played in low and
high reward trials, respectively.
The reward stimulus display was presented for 1200 ms
followed by a 200-ms interval during which the fixation cross
was shown. Then the reward feedback display was presented
for 800 ms. On no fixation change trials, the fixation cross was
presented for a random inter-trial interval of 400800 ms after
which the next reward stimulus display was shown. On
Atten Percept Psychophys
fixation change trials the reward feedback display was follow-
ed by a random 4001200ms interval, after which the fixation
cross rotated 45° angular degrees. When this change occurred
participants had to press the BX^button on the keyboard with-
in a 1400 ms response window. Immediately following the
button press, or after 1400 ms, the fixation cross rotated back
to normal and the standard 400800 ms inter-trial interval was
followed by the presentation of the next reward stimulus
display.
Non-reward test phase
For the non-reward test phase, we utilized the additional
singleton task (Theeuwes, 1992). Accordingly, six shapes,
randomly one diamond (2.9 x 2.95°) amongst five circles
(r= 1.45°) or one circle amongst five diamonds, were present-
ed at equal distances on an imaginary circle (r=5.90°)(see
Fig. 2). All stimuli were presented in gray (CIE: x=.308,y=
.335, 22.1 cd/m
2
), or all were presented in gray except for the
color singleton, which was presented in either blue (CIE: x=
.190, y= .197, 22.5 cd/m
2
) or yellow (CIE: x = .412, y = .513,
22.6 cd/m
2
). The shape singleton was never colored and al-
ways presented in gray. All shapes contained a gray (CIE: x=
.308, y= .335, 22.1 cd/m
2
) horizontal or vertical line element
(0.50° x 3 pixels). At all times a white (CIE: x=.255,y=.437;
67.11 cd/m
2
) fixation cross (.30° x .30°) was presented at the
center of the screen. Feedback consisted of the written words
(~.5° x 2.0°), Bcorrect^,Bincorrect^or Btoo slow^presented in
white (CIE: x=.255,y= .437; 67.11 cd/m
2
). Visual feedback
for incorrect and too slow response trials was accompanied by
a pure tone (sine wave) of 400 Hz and 800 Hz, respectively.
The stimulus display was presented for a maximum period
of 1400 ms, and disappeared as soon as a response was given.
The stimulus display was directly followedby the visual feed-
back for 500 ms, occasionally accompanied by the auditory
feedback that was played for 200 ms. All trials were separated
by a random 12001600 ms inter-trial interval.
Procedure and design
Participants were instructed to remain fixated at the central
fixation cross at all times, and to respond as quickly and ac-
curately as possible. In the reward conditioning phase, the task
for participants was to press the BX^keyboard button as soon
as the fixation cross rotated 45° angular degrees. They were
instructed that other stimuli would appear in the periphery but
that these were not related to their task and could be ignored.
Furthermore, they were told that they would automatically
receive rewards while doing the fixation task, and that reward
administration was independent of the assigned task and cor-
responding response. The reward conditioning phase
consisted of three separate blocks with 32 trials each. Within
1200 ms
400-1200 ms
800 ms
200 ms
0-1400 ms
1200 ms
800 ms
200 ms
400-800 ms
400-800 ms
x
Fixation change
No fixation change
a
b
+ 1 ct
High reward (+10 ct)
Low reward (+ 1 ct)
+ 10 ct
Fig. 1 a,bSchematic representation of the trial sequence and timing of
the reward conditioning phase in Experiment 1.ahigh reward stimulus
display (blue circle) and high reward feedback display indicating a 10
cent win, followed by a fixation change trial (fixation changed from B+^
to Bx^) on which participants had to press the keyboard BX^button within
1400 ms. bA low reward stimulus display (yellow diamond) and low
reward feedback display indicating a 1 cent win, followed by a no fixation
change trial on which no response was required
Atten Percept Psychophys
each block the high and low reward stimulus display were
both shown 16 times in random order. The color-reward con-
tingencies (i.e., blue and yellow for high and low rewards)
were counterbalanced across participants. The fixation cross
changed eight times per block (25 % of the time) randomly in
between trials. In between blocks of the reward conditioning
phase, participants were shown on how many occasions they
missed the fixation change and their total reward received. In
total, 96 reward displays were shown and 24 fixation changes
occurred during the reward conditioning phase.
At the start of the non-reward test phase, participants were
explicitly told that rewards were no longer delivered.
Furthermore, they were told that the fixation change task
was finished, but that fixation needed to be maintained. The
task in the non-reward test phase was to discriminate the ori-
entation of the target line within the odd-shaped figure (i.e.,
the shape singleton), and to press the BZ^or BM^button as
fast and accurate as possible for horizontal or vertical line
elements respectively. The test phase consisted of eight blocks
with 36 trials each. Within each block there were 12 high
reward-value distractor trials, 12 low reward-value distractor
trials and 12 no distractor trials. The shape singleton contained
a horizontal or vertical target line equally often within each
block. In between blocks, participants were informed about
their mean response time and mean accuracy in that particular
block and their overall accuracy. In total, 288 trials were com-
pleted in the non-reward test phase. Including breaks in
between blocks, all participants were able to finish the exper-
iment within approximately 40 min.
Statistical analyses
Accuracy was calculated as the percentage correct of all trials.
All correct responses that were made within the 1400 ms re-
sponse window were included in the accuracy analysis. For
the reaction time analysis, only correct trials were analyzed.
Furthermore, trials on which reaction time deviated more than
2.5 SD from the participants individual overall mean correct
reaction time (2.0 % of the correct reaction time data), were
excluded from the analysis.
In order to investigate possible speed accuracy tradeoffs on
an individual level, and in order to compare distractor condi-
tion differences in speed and/or accuracy between
Experiments 1 and 2, we calculated the inverse efficiency
score (IES), first proposed by Townsend and Ashby (1978,
see also Townsend and Ashby 1983). IES (reaction time /
No distractorHigh reward-value Low reward-value
correct
0-1400 ms
500 ms
1200-1600 ms
ZM
a
b
Fig. 2 a,bSchematic representation of the trial sequence and timing of the
non-reward test phase. aThe stimulus display of the additional singleton
paradigm with a high reward-value color singleton distractor (blue circle),
and a diamond shape singleton target containing a horizontal line element
for which a BZ^key press within 1400 ms resulted in the correct feedback
screen. bOverview of the three different distractor type conditions
Atten Percept Psychophys
proportion of correct responses) reflects the average energy
consumed by the system over time, and can properly be used
as a measure of performance in addition to reaction time and
accuracy (Bruyer & Brysbaert, 2011). In order to examine the
effects of reward on attention, IES has been used before in
Shomstein & Johnson (2013).
To investigate whether Pavlovian reward conditioning
could lead to value-driven attentional capture, a repeated mea-
sures ANOVA with distractor type (high reward-value/low
reward-value/no distractor) as factor was performed on reac-
tion time, accuracy and IES. The effect of distractor presence
(high and low reward-value distractor trials compared to no
distractor trials) and the effect of value driven attentional cap-
ture (high compared to low reward-value distractor trials)
were investigated using two-tailed paired samples t-tests.
Exclusions
We excluded participants from the analyses if performance in
one or more conditions of the non-reward test phase was at
or below chance level. The cumulative binomial distribution
indicated that performance significantly (P< .041) deviated
from chance (50 %) if participants responded correctly on 56
or more trials out of a total of 96 trials per condition. Thus,
participants who scored below 58 % correct in one or more
conditions were excluded from the analysis. In Experiment 1,
all participants scored above this criterion.
Results
Reward conditioning
Out of 24 subjects, one subject missed the fixation change
once and one subject showed a false alarm in the reward
conditioning phase. As reward was independent of the fixa-
tion task and any response, all participants earned 5.28 extra
reward.
Value-driven attentional capture
To investigate whether associative reward learning through
Pavlovian conditioning elicited value-driven attentional cap-
ture in the non-reward test phase, we compared mean reaction
time, accuracy and IES between the high reward-value, low
reward-value and no distractor condition in the test phase (see
Fig. 3).
Reaction time
A repeated measures ANOVA on mean reaction time with
distractor type (high reward-value/low reward-value/no
distractor) as factor showed a significant main effect, F(2,
46) = 33.823, P<.001,η
p2
= .595. However, a planned paired
samples two-tailed t-test showed that reaction times the high
reward-value (mean = 772 ms) and the low reward-value
(mean = 768 ms) distractor condition, although numerically,
did not significantly differ (t< 1). Subsequent planned paired
samples two-tailed t-tests showed that reaction times were
significantly shorter in the no distractor condition (mean =
733), compared to both the high t(23) = 6.455, SE = 5.992,
P<.001,η
p2
= .644, and low, t(23) = 7.251, SE = 4.897, P<
.001, η
p2
= .696, reward-value distractor condition. These re-
sults show that participants significantly slowed their re-
sponses when one of the reward-value distractors was present
compared to the condition in which no distractor was present,
although no significant slowing was observed when high
compared to low reward-value distractors were present.
However, participants performed significantly less accurately
in the high compared to the low reward-value distractor
condition.
Accuracy
A repeated measures ANOVA on mean accuracy with
distractor type (high reward-value/low reward-value/no
distractor) as factor showed a significant main effect, F(2,
46) = 11.565, P<.001,η
p2
= .335. Crucially, as already
indicated, a subsequent paired samples two-tailed t-tests
showed that accuracy was significantly worse in the high
(mean = 80 %) compared to low (mean = 82 %) reward-
value distractor condition, t(23) = 2.172, SE = .839, P=
.040, η
p2
= .170 . Furthermore, performance was better in
the no distractor condition (mean = 85 %) compared to both
the high, t(23) = 4.563, SE = 1.065, P<.001,η
p2
= .475, and
low reward-value distractor condition, t(23) = 2.676, SE =
1.135, P=.013,η
p2
=.237.
Efficiency
In order to confirm a genuine performance deficit in the high
compared to the low reward-value distractor condition, we
made use of the IES. A repeated measures ANOVA on IES
with distractor type (high reward-value/low reward-value/no
distractor) as factor showed a significant main effect, F(2, 46)
= 26.760, P<.001,η
p2
= .528. Crucially IES was higher in the
high (mean = 984 ms) compared to low (mean = 953 ms)
reward-value distractor condition, t(23) = 2.259, SE =
13.883, p=.034,η
p2
= .182, confirming that search perfor-
mance was significantly less efficient when a high compared
to a low reward-value distractor was present. Furthermore,
paired samples two-tailed t-tests showed that search perfor-
mance was more efficient in the no distractor condition (mean
= 878 ms) compared to both the high, t(23) = 7.345, SE =
14.451, P<.001,η
p2
= .701, and low, t(23) = 4.592, SE =
Atten Percept Psychophys
16.282, P< .001, η
p2
= .478, reward-value distractor
condition.
Discussion
Altogether, the results of Experiment 1show that participants
respond faster, more accurately and more efficiently on trials
in which no distractor is presented compared to trials in which
a reward-value-associated distractor is present. Crucially, per-
formance was significantly impaired when high compared to
low reward-value associated distractors were presented, illus-
trated by a significant decline in accuracy and search efficien-
cy. These results suggests that associative reward learning of
the stimulus-reward contingencies can occur completely tem-
porally and spatially independent of the assigned task, such
that value-driven attentional capture is observed in a following
non-reward test phase. This confirms and strengthens the idea
that Pavlovian reward learning underlies value-driven atten-
tional capture.
Experiment 2
As the rewardsignaling color singletons werepresented in the
periphery in the reward conditioning phase of Experiment 1,it
is possible that learning the color-reward contingencies in-
volved orienting of attention towards them. Even though re-
ward administration was completely task- and response-inde-
pendent, it is thus possible that associative reward learning in
Experiment 1still involved an instrumental aspect, as atten-
tion might have been captured by the color singleton in the
periphery. Indeed, it has been shown that color singletons can
capture attention in an automatic fashion, even when it is not
relevant for the task (Theeuwes, 1992,2010). If such automat-
ic instrumental conditioning took place during reward condi-
tioning, it is possible that training this instrumental response
was responsible for the value driven attentional capture effect
that was observed in the non-reward test phase. This is of
specific importance, as the shift of attention towards the re-
ward signaling color singletons in the reward conditioning
phase matches the attentional response towards the reward-
value associated distractors in the non-reward test phase that
constitutes value-driven attentional capture.
In order to show that true Pavlovian reward learning un-
derlies value-driven attentional capture, we ensured that no
automatic shifts of attention towards the reward signaling col-
or singleton could occur during associative reward learning in
Experiment 2. Therefore, we presented a single colored stim-
ulus at the focus of attention at fixation, such that there was no
need for a spatial shift of attention towards the periphery,
hereby eliminating every instrumental aspect of associative
reward learning.
Method
In general, the method of Experiment 2was highly similar to
that of Experiment 1.
Participants
For Experiment 2, a new group of 26 participants (11 males,
2034 years of age, mean = 25.0 years, SD = 3.4 years) was
tested at the Vrije Universiteit Amsterdam. All participants
reported having normal or corrected-to-normal vision, and
gave written informed consent before participation.
Participants earned 5.28 reward during the reward condi-
tioning phase, and were paid at a rate of 8.00 per hour to
compensate for participation. As the experiment lasted ap-
proximately 40 min, participants earned around 10.00 in
total. All research was approved by the Vrije Universiteit
Faculty of Psychology ethics board, and was conducted ac-
cording to the principles of the Declaration of Helsinki.
820
800
780
760
740
720
High reward-value Low reward-value No distractor
Distractor type
Reaction time (ms)
n.s. ***
***
90
88
86
80
78
76
84
82
High reward-value Low reward-value No distractor
Distractor type
Accuracy (% correct)
*
*
***
1050
1000
950
800
750
900
850
High reward-value Low reward-value No distractor
Distractor type
Inverse Efficiency Score (ms)
*
***
***
abc
Fig. 3 acResults Experiment 1.aMean reaction time per condition. b
Mean accuracy per condition. cMean inverse efficiency score (reaction
time/percentage correct responses) per condition. Note that accuracy is
significantly lower and that the inverse efficiency score is significantly
higher (and performance thus worse) in the high compared to the low
reward-value distractor condition. Error bars indicate Standard errors
(SE) of the means
Atten Percept Psychophys
Apparatus
Experiment 2and Experiment 1were conducted with the
same equipment under the same conditions.
Stimuli
Reward conditioning phase
The stimuli and timings used in the reward conditioning phase
of Experiment 2(see Fig. 4) were very similar to those used in
Experiment 1. However, instead of presenting six shapes (in-
cluding one color singleton) around fixation in Experiment 1,
only one colored stimulus was presented at fixation, exactly in
the middle of the screen, in Experiment 2.Bypresentingthe
colored stimuli in the reward conditioning phase at the focus
of attention at fixation, nospatial attentional orienting towards
the reward signaling stimuli occurred. As in Experiment 1,
one of the colors (blue or yellow) was consistently coupled
to high reward (10 cents) administration and the other color
was consistently coupled to low reward (1 cent)
administration.
Non-reward test phase
The stimuli and timings used in the non-reward test phase
were identical to those used in Experiment 1(see Fig. 2).
Statistical analyses
The data in Experiment 2were handled identical to those in
Experiment 1. In Experiment 2, due to a deviation of more
than 2.5 SD from the participants individual overall mean
correct reaction time, 2.4 % of the correct reaction time data
were excluded from the reaction time analysis.
Exclusions
As in Experiment 1, we excluded participants from the anal-
yses if performance in one or more conditions of the non-
reward test phase was at or below chance level (50 %). In
Experiment 2, two participants scored at or below chance level
(i.e., below 58 % correct) in one or more conditions and were
therefore excluded from the analyses.
Results
Reward conditioning
Out of 24 subjects, one subject missed the fixation change
once and one subject missed the fixation change four times
out of a total of 24 fixation changes. Furthermore, one subject
showed one false alarm and two other subjects showed two
false alarms during the reward conditioning phase. As reward
was independent of the fixation task and any response, all
participants earned 5.28 extra reward.
Value-driven attentional capture
To investigate whether value-driven attentional capture follow-
ing Pavlovian learning is independent of training attentional
orienting towards reward signaling stimuli, we presented the
reward signaling stimuli in the reward conditioning phase at
fixation and compared mean reaction time, accuracy and IES
between the high reward-value, low reward-value and no
distractor condition in the non-reward test phase (see Fig. 5).
Reaction time
A repeated measures ANOVA on mean reaction time with
distractor type (high reward-value/low reward-value/no
distractor) as factor showed a significant main effect, F(2,
46) = 33.949, P<.001,η
p2
= .596. Crucially, planned paired
samples two-tailed t-test showed that participants responded
significantly slower in the high (mean = 729 ms) compared to
the low (mean = 720 ms) reward-value distractor condition,
t(23) = 2.278, SE = 3.758, P=.032,η
p2
= .184. Subsequent
planned paired samples two-tailed t-tests showed that reaction
times were significantly shorter in the no distractor condition
(mean = 682), compared to both the high t(23) = 6.856, SE =
6.802, P< .001, η
p2
= .671, and low, t(23) = 5.465, SE =
6.967, P<.001,η
p2
= .565, reward-value distractor condition.
Accuracy
A repeated measures ANOVA on mean accuracy with
distractor type (high reward-value/low reward-value/no
distractor) as factor showed a significant main effect, F(2,
46) = 14.869, P<.001,η
p2
= .393. A subsequent paired
samples two-tailed t-tests showed that, although participants
were less accurate in the high (81 %) compared to the low
(82 %) reward-value distractor condition, accuracy did not
differ significantly, t(23) = 1.440, SE = .663, P=.163,η
p2
=
.083. Further planned paired samples two-tailed t-tests
showed that performance was better in the no distractor con-
dition (mean = 87 %) compared to both the high, t(23) =
4.121, SE = 1.401, P<.001,η
p2
= .425, and low reward-
value distractor condition, t(23) = 3.985, SE = 1.209, P<
.001, η
p2
=.408.
Efficiency
As in Experiment 2, in order to confirm a genuine performance
deficit in the high compared to the low reward-value distractor
Atten Percept Psychophys
condition, and to take possible speed-accuracy tradeoffs at an
individual level into account, we calculated the IES (reaction
time / proportion of correct responses). A repeated measures
ANOVA on IES with distractor type (high reward-value/low
reward-value/no distractor) as factor showed a significant main
effect, F(2, 46) = 28.697, P< .001, η
p2
= .555. Crucially, the
IES was higher in the high (mean = 924 ms) compared to low
(mean = 898 ms) reward-value distractor condition, t(23) =
2.608, SE = 10.007, p= .016, η
p2
= .228, confirming that
performance was less efficient when a high compared to a
low reward-value distractor was present. Furthermore, paired
samples two-tailed t-tests showed that search performance was
more efficient in the no distractor condition (mean = 789 ms)
compared to both the high, t(23) = 5.823, SE = 21.642, P<
.001, η
p2
= .596, and low, t(23) = 5.291, SE = 18.885, p< .001,
η
p2
= .549, reward-value distractor condition. These results
show that search was significantly less efficient when one of
the reward-value distractors was present compared to the con-
dition in which no reward-value associated distractor was pres-
ent, and, crucially, that a significant impairment in search per-
formance was observed when high compared to low reward-
value distractors were present.
1200 ms
400-1200 ms
800 ms
200 ms
0-1400 ms
1200 ms
800 ms
200 ms
400-800 ms
400-800 ms
x
Fixation change
No fixation change
a
b
+ 1 ct
High reward (+10 ct)
Low reward (+ 1 ct)
+ 10 ct
Fig. 4 a,bSchematic representation of the trial sequence and timing of
the reward conditioning phase in Experiment 2.ahigh reward stimulus
display containing only one colored stimulus (blue circle), followed by a
high reward feedback display indicating a 10 cent win, followed by a
fixation change trial (fixation changed from B+^to Bx^)onwhich
participants had to press the keyboard BX^button within 1400 ms. bA
low reward stimulus display containing only one colored stimulus (yellow
diamond),followed by a low reward feedback display indicating a 1 cent
win, followed by a no fixation change trial on which no response was
required
760
740
720
700
680
660
High reward-value Low reward-value No distractor
Distractor t
y
pe
Reaction time (ms)
*
***
***
90
88
86
80
78
76
84
82
High reward-value Low reward-value No distractor
Distractor t
y
pe
n.s.
***
***
High reward-value Low reward-value No distractor
Distractor t
y
pe
*
***
***
ab c
Accuracy (% correct)
1050
1000
950
800
750
900
850
Inverse Efficiency Score (ms)
Fig. 5 acResults Experiment 2.aMean reaction time per condition. b
Mean accuracy per condition. cMean inverse efficiency score (reaction
time/percentage correct responses) per condition. Note that reaction time
is significantly slower and that the inverse efficiency score is significantly
higher (and performance thus worse) in the high compared to the low
reward-value distractor condition. Error bars indicate SE of the means
Atten Percept Psychophys
Discussion
Altogether, the results of Experiment 2show that high and low
reward-value associated distractors significantly slow search
and impair search efficiency compared to a no distractor con-
dition. Crucially, participants were significantly slower, and
showed less efficient search behavior, when the high com-
pared to the low reward-value distractor was presented in the
periphery in the non-reward test phase. This suggests that a
possible initial automatic shift of attention to a peripherally
presented color singleton during reward learning, as might
have taken place during Experiment 1, did not play a role in
associative reward learning. The fact that learning or training
to shift attention towards reward signaling stimuli is not re-
quired in order for observers to later show value-driven atten-
tional capture (i.e., shifting attention towards high and low
reward-value associated stimuli), emphasizes the Pavlovian
nature of associative reward learning underlying value-
driven attentional capture.
General discussion
In two experiments, we examined whether true Pavlovian
conditioning would be sufficient to obtain value-driven atten-
tional capture. More specifically, we investigated whether the
mere co-occurrence between the presentation of colored stim-
uli and the administration of high and low rewards was able to
elicit value driven attentional capture in a subsequent test
phase during which rewards were no longer delivered. In the
reward conditioning phase, we ensured that associative reward
learning was completely task- and response-independent by
letting participants perform a task at fixation, while the re-
wards were automatically administered following the presen-
tation of task-irrelevant colored stimuli. In Experiment 1,the
reward signaling stimuli were presented in the periphery, such
that the reward structure was both separated in time and in
place from the assigned task during associative reward learn-
ing. In Experiment 2, the reward signaling stimuli were pre-
sented at fixation, to eliminate the possible instrumental aspect
of shifting attention towards the reward signaling stimuli dur-
ing associative reward learning.
The results show that performance in the non-reward test
phase was impaired on trials in which a high compared to low
reward-value associated distractor was presented. This sug-
gests that associative reward learning through Pavlovian con-
ditioning imbued the reward signaling items with value. In
other words, Pavlovian learning of value signals in the reward
conditioning phase elicited value-driven attentional capture in
the non-reward test phase. Furthermore, the results show that
performance was impaired on trials in which one of the
reward-value associated distractors was present compared to
trials in which no distractor was present, which is in line with
previous studies showing value-driven attentional capture
(Anderson et al., 2011a,2011b) and the original findings re-
garding the additional singleton task demonstrating attentional
capture by physically salient singletons (Theeuwes, 1992).
Together, the results of these two experiments provide clear
evidence that value-driven attentional capture occurs follow-
ing associative reward learning through pure Pavlovian
conditioning.
The findings of the present study are in line with previous
studies suggesting that Pavlovian learning of stimulus-reward
contingencies rather than instrumental reward learning under-
lies value-driven attentional capture (Le Pelley et al., 2015;
Pearson et al., 2015; Failing et al., 2015). In these previous
studies, reward administration was not contingent on
responding to the stimuli that signaled the magnitude of the
available reward. In fact, these experiments were explicitly
designed in such a way that instrumental reward learning
and Pavlovian reward learning would produce opposite re-
sults. Likewise, the present study was explicitly designed to
examine whether pure Pavlovian conditioning during an asso-
ciative reward learning phase could produce value-driven at-
tentional capture in a non-reward test phase. However, in prior
studies using a training and test phase such as in Anderson
et al. (2011a,2011b), it cannot be distinguished whether
value-driven attentional capture occurred as a consequence
of instrumental or Pavlovian reward learning, as the reward-
value associated stimuli enjoyed increased attentional priority
because of both their selection and reward history (see Awh
et al., 2012). Typically, participants learned to attend and se-
lect reward associated target stimuli during a training phase,
which then captured attention when they appeared as
distractors in a non-reward test phase. Using such a design,
associative reward learning can be due to (1) learning to attend
the reward-value associated stimuli during training (i.e., selec-
tion history), or (2) the mere co-occurrence of those stimuli
with reward administration (i.e., reward history). Thus, when
attentional selection is congruent with the value signal, one
cannot distinguish whether instrumental associative reward
learning by means of a trained attentional response, or
Pavlovian associative reward learning as a result of the regu-
larities between the presentation of specific stimuli and reward
delivery, underlies value-driven attentional capture. However,
the present results suggest that reward history, independent of
selection history, is already sufficient to elicit value-driven
attentional capture. Nevertheless, future research is needed
to examine whether training an attentional response that is
congruent with the value signal elicits, for example, stronger
capture or capture that is more resistant to extinction, com-
pared to value-driven attentional capture that underlies pure
Pavlovian conditioning.
Compared to all previous studies, the present study is
unique because the reward signaling stimuli in Experiment 2
were presented at the focus of attention. By presenting the
Atten Percept Psychophys
reward signaling stimuli at the focus of attention at fixation, it
is unlikely that a shift of spatial attention towards them oc-
curred, as participants were already fixated at fixation for the
assigned task. By preventing any form of attentional orienting
towards the reward signaling stimuli during the reward condi-
tioning phase, we eliminated all possible instrumental aspects
of associative reward learning. This was not the case in previ-
ous studies (Failing et al. 2015; Le Pelley et al., 2015;Mine&
Saiki, 2015; Pearson et al., 2015), as the reward associated
distractor was never presented at fixation before. In the behav-
ioral study by Mine and Saiki (2015), the reward signaling
distractors in the training phase were always presented in the
periphery, leaving the option that participants covertly or
overtly shifted attention towards them during reward learning.
Although overt shifts of attention towards the reward signal-
ing distractor stimuli resulted in reward omission in the ocu-
lomotor paradigms of Le Pelleyet al. (2015) and Pearson et al.
(2015), the reward predictive stimuli were physically salient,
and therefore might have automatically captured covert atten-
tion. As the authors argue (see, Le Pelley et al., 2015), this
provides circumstances under which a form of instrumental
conditioning (i.e., superstitious conditioning, see Skinner,
1948) could favorably promote covert attentional shifts to-
wards the high compared to the low reward signaling
distractor in the future. If the greater likelihood of covert at-
tentional shifts towards the high reward signaling distractor
translates into making more overt shifts towards this
distractor, it can account for the observed oculomotor value-
driven attentional capture effect. Similarly, even though being
physically non-salient, it can be argued that the distractors in
Failing et al. (2015) possibly also captured attention covertly.
As participants were explicitly told before the experiment
what the high and low reward-value associated colors were,
the distractors explicitly conveyed the informational value of
signaling the reward magnitude that could be earned on a
particular trial. Therefore it is possible that participants covert-
ly oriented their attention towards the reward associated
distractor stimuli. Along the same line of reasoning used in
Le Pelley et al. (2015), associative reward learning in Failing
et al. (2015) thus possibly involved superstitious instrumental
conditioning. Here, by preventing covert shifts of spatial at-
tention towards the reward signaling stimuli in the reward
conditioning phase of Experiment 2, the present results un-
equivocally provide evidence that value-driven attentional
capture can occur as a result of pure Pavlovian associative
reward learning. This is crucial as it suggests that associative
reward learning can be independent of training a covert or
overt attentional orienting response towards the reward pre-
dictive stimuli, which implies that associative learning of the
signal value of reward associated stimuli underlies value driv-
en attentional capture.
Given that associative reward learning was completely in-
dependent of the assigned task, the results suggest that
learning stimulus-reward contingencies from the environment
can occur rather automatically. In a recent study, in which
assessing the effect of value-driven attentional capture was
interwoven with associative reward learning, Pearson et al.
(2015) demonstrated that value-driven attentional capture
was automatic and impenetrable for strategic attentional con-
trol. Participants were explicitly informed that looking at the
distractor that signaled the magnitude of reward that could be
earned on that trial resulted in an omission of that reward. The
results showed that the explicit instructions could not mini-
mize or counteract value-driven attentional capture, and the
size of the attentional capture effect was not different for par-
ticipants who did or did not receive explicit information about
the stimulus-reward contingencies. In the present study, ex-
plicit information regarding the stimulus reward-
contingencies were omitted on purpose, but, nevertheless,
the results show that the color-reward contingencies were
learned during the reward conditioning phase. In fact, during
associative reward learning, participants were even instructed
to ignore all stimuli, except for the fixation cross at the center
of the screen. Thus, while participants were focused on wheth-
er the fixation cross would change, high and low rewards were
automatically administered following the presentation of the
reward signaling stimuli. Furthermore, to prevent coincidental
instrumental learning, we ensured that the fixation cross never
changed at the time the reward stimulus display or reward
feedback was presented, thereby avoiding that the required
response for a fixation change was mistakenly interpreted to
result in reward delivery. This suggests that there was no in-
centive whatsoever for participants to pay attention to the
reward signaling stimuli, as the reward signaling stimuli were
completely task-irrelevant, and the reward structure was
completely independent of any response. This is unique for
the present work as in previous studies (Failing et al. 2015;Le
Pelley et al., 2015;Mine&Saiki,2015; Pearson et al., 2015)
rewards were delivered only following a correct response.
Although it is reasonable to assume that participants may have
attended the reward-value associated stimuli in Experiment 2,
as they were presented at fixation, the reward signaling stimuli
never appeared simultaneously with a fixation change on
which a response was required. Furthermore, the reward-
value associated stimuli in Experiment 1never appeared on
a task-relevant spatial location, thereby providing absolutely
no incentive to pay attention to them. Nevertheless, the results
show that high compared to low reward-value associated
distractors captured attention more strongly in the non-
reward test phase that followed Pavlovian conditioning.
This implies that the stimulus-reward contingencies were
learned automatically during the reward conditioning phase
of both experiments, despite the fact that the reward signaling
stimuli served as temporally (and spatially) task-irrelevant
distractors that had to be ignored. Thus, the present study
shows that the value signal can be learned even if every
Atten Percept Psychophys
incentive to pay attention to the reward structure is removed,
while Pearson and colleagues showed that providing explicit
information about the reward structure could not counteract
the automatic effect of value-driven attentional capture.
Together, these studies provide converging evidence that
Pavlovian reward learning and value-driven attentional cap-
ture are automatic and occur independently of strategic atten-
tional control.
To summarize, we show in two experiments that high com-
pared to low reward-value associated stimuli capture attention
more strongly in a non-reward test phase, after Pavlovian
learning of stimulus-reward contingencies in a preceding re-
ward conditioning phase. The present results suggest that
value-driven attentional capture can occur following associa-
tive reward learning of temporally and spatially task-irrelevant
distractors that signal the magnitude of available reward
(Experiment 1) and that value-driven attentional capture is
independent of training spatial shifts of attention towards the
reward signaling stimuli (Experiment 2). The present study is
the first to show associative reward learning through pure
Pavlovian conditioning, as the reward signaling stimuli and
reward administration were completely independent of the
assigned task and any response during associative reward
learning. This confirms and strengthens the idea that
Pavlovian reward learning underlies value-driven attentional
capture.
Acknowledgments This research was supported by a European
Research Council (ERC) advanced grant [ERC-2012-AdG-323413] to
Jan Theeuwes.
Open Access This article is distributed under the terms of the Creative
Commons Attribution 4.0 International License (http://
creativecommons.org/licenses/by/4.0/), which permits unrestricted use,
distribution, and reproduction in any medium, provided you give appro-
priate credit to the original author(s) and the source, provide a link to the
Creative Commons license, and indicate if changes were made.
References
Anderson, B. A. (2013). Avalue-driven mechanism of attentional selec-
tion. JournalofVision,13(3), 7.
Anderson, B. A. (2015). The attention habit: how reward learning shapes
attentional selection. Annals of the New York Academy of Sciences,
1369, 2439.
Anderson, B. A., Laurent, P. A., & Yantis, S. (2011a). Value-driven at-
tentional capture. Proceedings of the National Academy of Sciences
oftheUnitedStatesofAmerica,108(25), 1036710371.
Anderson, B. A., Laurent, P. A., & Yantis, S. (2011b). Learned value
magnifies salience-based attentional capture. PLoS One, 6(e27926).
Awh, E., Belopolsky, A. V., & Theeuwes, J. (2012). Top-down versus
bottom-up attentional control: A failed theoretical dichotomy.
Trends in Cognitive Sciences, 16(8), 437443.
Botvinick, M., & Braver, T. (2015). Motivation and cognitive control:
From behavior to neural mechanism. Psychology, 66(1), 83113.
Bruyer, R., & Brysbaert, M. (2011). Combining speed and accuracy in
cognitive psychology: Is the inverse efficiency score (IES) a better
dependent variable than the mean reaction time (RT) and the per-
centage of errors (PE)? Psychologica Belgica, 51(1), 513.
Bucker, B., Belopolsky, A. V., & Theeuwes, J. (2015). Distractors that
signal reward attract the eyes. Visual Cognition, 23, 124.
Bucker, B., Silvis, J. D., Donk, M., & Theeuwes, J. (2015). Reward
modulates oculomotor competition between differently valued stim-
uli. Vision Research, 108, 103112.
Bucker, B., & Theeuwes, J. (2014). The effect of reward on orienting and
reorienting in exogenous cuing. Cognitive, Affective, & Behavioral
Neuroscience, 14(2), 635646.
Bucker, B., & Theeuwes, J. (2016). Appetitive and aversive outcome
associations modulate exogenous cueing. Attention, Perception &
Psychophysics, 78, 2253. doi:10.3758/s13414-016-1107-6
Chelazzi, L., Perlato, A., Santandrea, E., & Della Libera, C. (2013).
Rewards teach visual selective attention. Vision Research, 85, 58
72.
Chun, M. M., Golomb, J. D., & Turk-Browne, N. B. (2011). A taxonomy
of external and internal attention. Annual Review of Psychology, 62,
73101.
Corbetta, M., & Shulman, G. L. (2002). Control of goal-directed and
stimulus-driven attention in the brain. Nature Reviews
Neuroscience, 3(3), 201215.
Della Libera, C., & Chelazzi, L. (2006). Visual selective attention and the
effects of monetary rewards. Psychological Science, 17(3), 222
227.
Della Libera, C., & Chelazzi, L. (2009). Learning to attend and to ignore
is a matter of gains and losses. Psychological Science, 20(6), 778
784.
Desimone, R., & Duncan, J. (1995). Neural mechanisms of selective
visual attention. Annual Review of Neuroscience, 18(1), 193222.
Failing, M., Nissens, T., Pearson, D., Le Pelley, M., & Theeuwes, J.
(2015). Oculomotor capture by stimuli that signal the availability
of reward. Journal of Neurophysiology, 114(4), 23162327.
Failing, M. F., & Theeuwes, J. (2014). Exogenous visual orienting by
reward. Journal of Vision, 14(5), 6.
Hickey, C., Chelazzi, L., & Theeuwes, J. (2010a). Reward changes sa-
lience in human vision via the anterior cingulate. The Journal of
Neuroscience, 30(33), 1109611103.
Hickey, C., Chelazzi, L., & Theeuwes, J. (2010b). Reward guides vision
when it's your thing: Trait reward-seeking in reward-mediated visual
priming. PLoS One, 5(11), e14087.
Itti, L., & Koch, C. (2001). Computational modelling of visual attention.
Nature Reviews Neuroscience, 2(3), 194203.
Kiss, M., Driver, J., & Eimer, M. (2009). Reward priority of visual target
singletons modulates event-related potential signatures of attentional
selection. Psychological Science, 20(2), 245251.
Krebs, R. M., Boehler, C. N., Egner, T., & Woldorff, M. G. (2011). The
neural underpinnings of how reward associations can both guide and
misguide attention. The Journal of Neuroscience, 31(26), 9752
9759.
Kristjánsson, Á., & Campana, G. (2010). Where perception meets mem-
ory: A review of repetition priming in visual search tasks. Attention,
Perception & Psychophysics, 72(1), 518.
Le Pelley, M. E., Mitchell, C. J., Beesley, T., George, D. N., & Wills, A.J.
(2016). Attention and associative learning in humans: An integrative
review. Psychonomic Bulletin & Review, 142(10), 11111140.
Le Pelley, M. E., Pearson, D., Griffiths, O., & Beesley, T. (2015). When
goals conflict with values: Counterproductiveattentional and oculo-
motor capture by reward-related stimuli. Journal of Experimental
Psychology: General, 144(1), 158.
MacLean, M. H., Diaz, G. K., & Giesbrecht, B. (2016). Irrelevant learned
reward associations disrupt voluntary spatial attention. Attention,
Perception & Psychophysics, 78(7), 22412252.
Mathôt, S., Schreij, D., & Theeuwes, J. (2012). OpenSesame: An open-
source, graphical experiment builder for the social sciences.
Behavior Research Methods, 44, 314324.
Atten Percept Psychophys
McCoy, B., & Theeuwes, J. (2016). The effects of reward on oculomotor
control. JournalofNeurophysiology.doi:10.1152/jn.00498.2016
Mine, C., & Saiki, J. (2015). Task-irrelevant stimulus-reward association
induces value-driven attentional capture. Attention, Perception &
Psychophysics, 77(6), 18961907.
Munneke, J., Hoppenbrouwers, S. S., & Theeuwes, J. (2015). Reward can
modulate attentional capture, independent of top-down set.
Attention, Perception & Psychophysics, 77(8), 25402548.
Pearson, D., Donkin, C., Tran, S. C., Most, S. B., & Le Pelley, M. E.
(2015). Cognitive control and counterproductive oculomotor cap-
ture by reward-related stimuli. Visual Cognition, 23(12), 4166.
Pessoa, L. (2009). How do emotion and motivation direct executive con-
trol? Trends in Cognitive Sciences, 13(4), 160166.
Pessoa, L., & Engelmann, J. B. (2010). Embedding reward signals into
perception and cognition. Frontiers in Neuroscience, 4, 17.
Sawaki, R., Luck, S. J., & Raymond, J. E. (2015). How attention changes
in response to incentives. Journal of Cognitive Neuroscience,
27(11), 22292239.
Shomstein, S., & Johnson, J. (2013). Shaping attention with reward ef-
fects of reward on space-and object-based selection. Psychological
Science, 24(12), 23692378.
Skinner, B. F. (1948). 'Superstition' in the pigeon. Journal of
Experimental Psychology, 38(2), 168.
Small, D. M., Gitelman, D., Simmons, K., Bloise, S. M., Parrish, T., &
Mesulam, M. M. (2005). Monetary incentives enhance processing in
brain regions mediating top-down control of attention. Cerebral
Cortex, 15(12), 18551865.
Theeuwes,J. (1992). Perceptual selectivity for color and form. Perception
& Psychophysics, 51(6), 599606.
Theeuwes, J. (2010). Topdown and bottomup control of visual selec-
tion. Acta Psychologica, 135(2), 7799.
Townsend, J. T., & Ashby, F. G. (1978). Methods ofmodeling capacity in
simple processing systems. In J. Castellan & F. Restle (Eds.),
Cognitive theory (Vol. 3, pp. 200239). Hillsdale: Erlbaum.
Townsend, J. T., & Ashby, F. G. (1983). Stochastic modeling of elemen-
tary psychological processes. Cambridge: Cambridge University
Press.
Atten Percept Psychophys
... The effect of feature-reward associations in the previous study has been explained by reinforcement of the attentional selection of the feature associated with reward (an Attentional Habit; Anderson, 2016;Failing & Theeuwes, 2018). Similar training-test paradigms have found that attentional biases toward high-value distractors occur even when the feature related to reward never requires a response (Bucker & Theeuwes, 2017Mine & Saiki, 2015). These findings suggest that reward-related attentional biases rely merely on the feature-reward relationship and may be better explained by a Pavlovian attentional bias. ...
... As recently suggested by other authors, this feature of response independence may be particularly relevant for measuring signtracking (Basel & Lazarov, 2023;Colaizzi et al., 2020;Heck et al., 2025). Therefore, we propose that designs in which the reward is completely response-independent (Bucker & Theeuwes, 2017;Pearson & Le Pelley, 2020) might be even better suited for assessing attentional sign-tracking. ...
Article
Full-text available
Particular features of the stimuli that predict significant outcomes tend to capture our attention in a rather automatic and inflexible way. This form of attention has been described as a Pavlovian bias that mimics the phenomenon of sign-tracking described in animals, where reward-predictive cues become motivational magnets. In humans, Value-Modulated Attentional Capture (VMAC) refers to a phenomenon where distractors that signal high-value outcomes receive higher attentional priority. VMAC is particularly difficult to extinguish, showing a similar persistence often described in animal sign-tracking. In the present study, we evaluated to what extent VMAC would persist using a more specific extinction procedure than previous research, where instead of removing the possibility of obtaining rewards, the different discriminant stimuli that signal reward equate its value. Furthermore, we manipulated between experiments whether the high-value distractor predicted high-reward and high-punishment contingent to response accuracy (mimicking previous research; Experiment 1) or only high-reward (Experiment 2), and also explored the association of VMAC and its persistence with measures of emotional impulsivity employed in past research. Our results show that when both rewards and punishments are possible, VMAC does not extinguish after an extensive extinction stage, nor is it associated with measures of emotional impulsivity. When punishments were removed, we showed that VMAC gradually extinguished both in response times and accuracy and that the persistence of VMAC was significantly associated with positive urgency. We discussed these results on the potential of punishments to qualitatively alter learning and response strategies employed by participants.
... This aligns with the proposed distinction between the influences of statistical learning and reward history (Kim & Anderson, 2019). While statistical learning influences attention through habit-driven mechanism (Jiang, 2018), reward history likely modulates attention through Pavlovian mechanisms (Bucker & Theeuwes, 2017). Another possibility for the absence of reward effects on specific locations may be due to their short-lived nature, potentially diminished by the separation of training and test sessions across days in our study. ...
Article
Attention is rapidly directed to stimuli associated with rewards in past experience, independent of current task goals and physical salience of stimuli. However, despite the robust attentional priority given to reward-associated features, studies often indicate negligible priority toward previously rewarded locations. Here, we propose a relational account of value-driven attention, a mechanism that relies on spatial relationship between items to achieve value-guided selections. In three experiments (N = 124), participants were trained to associate specific locations with rewards (e.g., high-reward: top-left; low-reward: top-right). They then performed an orientation-discrimination task where the target's absolute location (top-left or top-right) or spatial relationship ("left of" or "right of") had previously predicted reward. Performance was superior when the target's spatial relationship matched high-reward than low-reward, irrespective of absolute locations. Conversely, the impact of reward was absent when the target matched the absolute location but not the spatial relationship associated with high reward. Our findings challenge the default assumption of location specificity in value-driven attention, demonstrating a generalizable mechanism that humans adopted to integrate value and spatial information into priority maps for adaptive behavior.
... ST attributes the incentive value to cues rather than drugs, and even when the reward is lost (e.g., food), it cannot stop the attention to cues. For example, raccoons may become fixated on biting coins (US) and miss out on food (CS) rewards (Breland and Breland, 1961;Bucker and Theeuwes, 2016). Attention control deficits may be related to this behavioral pattern. ...
Article
Full-text available
Compulsion stands as a central symptom of drug addiction; however, only a small fraction of individuals who use drugs exhibit compulsive characteristics. Differences observed in Sign-trackers (ST) and Goal-trackers (GT) during Pavlovian conditioning may shed light on individual variances in drug addiction. Here, we focus on the behavioral attributes, formation processes, and neural mechanisms underlying ST and how they drive addiction toward compulsivity in humans. We will explore addiction from three interconnected levels: individual personality traits, social factors, and neurobiology. Furthermore, we distinguish between the processes of sensitization and habituation within ST. These nuanced distinctions across various aspects of addiction will contribute to our understanding of the addiction development process and the formulation of targeted preventive strategies.
... Note however that there are relevant differences between these phenomena and the mechanisms underlying them. In the VMAC, stimuli predicting reward are attended to even when irrelevant to the task, so the participant's active engagement or response to these stimuli is not required or even punished (Le Pearson et al., 2015; see also, e.g., Bucker & Theeuwes, 2017;Failing & Theeuwes, 2017). Given that it seems to be the mere association with reward what makes these stimuli potent in drawing attention, it has been suggested that Pavlovian associations are responsible for the effect (Le . ...
Article
Full-text available
Previous research has shown that cues that are good predictors of relevant outcomes receive more attention than nonpredictive cues. This attentional bias is thought to stem from the different predictive value of cues. However, because successful performance requires more attention to predictive cues, the bias may be a lingering effect of previous attention to cues (i.e., a selection history effect) instead. Two experiments assessed the contribution of predictive value and selection history to the bias produced by learned predictiveness. In a first task, participants responded to pairs of cues, only one of which predicted the correct response. A second task was superficially very similar, but the correct response was determined randomly on each trial and participants responded based on some physical characteristic of a target stimulus in each compound. Hence, in this latter task, participants had to pay more attention to the target stimuli, but these stimuli were not consistently associated with a specific response. Results revealed no differences in the attentional bias toward the relevant stimuli in the two tasks, suggesting that the bias induced by learned predictiveness is a consequence of deploying more attention to predictive stimuli during training. Thus, predictiveness may not bias attention by itself, adding nothing over and above the effect expected by selection history.
... According to this idea, high-value stimuli are more salient than low-value stimuli and therefore more likely to capture a receiver's attention. This is true regardless of whether the stimulus value is intrinsic or learned, as research shows that stimuli previously associated with money capture attention, even if there is no current monetary incentive associated with a task [41,42]. This finding extends to neutral faces that have previously been associated with a high monetary reward and to abstract stimuli that have been associated with social rewards such as emojis [43][44][45]. ...
Article
Full-text available
The mechanisms that underpin human social behaviour are poorly understood, in part because natural social behaviour is challenging to study. The task of linking the mechanisms thought to drive social behaviour to specific social behaviours in a manner that maintains ecological validity poses an even greater challenge. Here we report evidence that the subjective value people assign to genuine smiles, as measured in the laboratory, determines their responsiveness to genuine smiles encountered in a naturalistic social interaction. Specifically, participants (university undergraduates; age 17 to 36) who valued genuine smiles to a greater degree also showed stronger attention capture effects to neutral faces that were previously associated with genuine smiles and faster reciprocity of a social partner’s smiles in a real social interaction. Additionally, the faster participants responded to the partner’s genuine smiles the higher the partner’s ratings of interaction quality were after the interaction. These data suggest that individual differences in subjective value of genuine smiles, measured in the lab, is one element that underpins responsiveness to natural genuine smiles and subsequent social outcomes.
... To integrate accuracy scores with RTs , inverse efficiency scores (IES) were calculated (Bruyer & Brysbaert, 2011;Bucker & Theeuwes, 2017) for all aforementioned conditions by dividing RTs (in seconds) by the proportion of correct (PC) responses (IES = RT/PC). The IES is a measure of the average energy consumed by the system over time (Bruyer & Brysbaert, 2011;Townsend & Ashby, 1983), i.e. could be used to measure the cost of task load. ...
Article
Full-text available
Problematic internet users (PIUs) tend to have poorer attentional control, impaired working memory (WM), and poorer executive functions. Based on previous results using verbal and visuospatial WM tasks, there might be a difference in the processing efficiency of various types of information. Visuospatial processing skills might be enhanced in PIUs, however, studies considering a possible association with verbal processing skills are scarce. Here we investigated how performance on verbal and visuospatial tasks are associated with measures of PIU. Participants (N=51) completed a dot-finding and a 1-back single task with both verbal and visuospatial stimuli, and a dual task that combined the two single tasks. Thus, we manipulated task Difficulty (Single vs Dual) and Modality (Verbal vs Visuospatial). We used the Problematic Internet Use Questionnaire (PIUQ) to measure PIU. People scoring higher (compared to those scoring lower) on control disorder and obsession subscales of PIUQ performed worse on the verbal and better on the visuospatial condition. Higher PIUQ total scores correlated with more difficulty on the single-task and lower difficulty with dual-task performance. In conclusion, although participants more prone to PIU were better in task switching, higher levels of PIU may be associated with worse verbal processing and increased distractibility.
... In the selection history literature, it has been demonstrated that experiences in previous trials can influence attentional guidance in the present trial. For example, reward-predictive features have been shown to bias attention more effectively than features that predict fewer or no rewards (Anderson et al., 2011;Anderson & Halpern, 2017;Bucker & Theeuwes, 2017;Le Pelley et al., 2015;Pearson et al., 2016). The importance of attention has been proposed in theories of associative learning (Esber & Haselgrove, 2011), in which attention is directed toward predictive (Le Pelley, 2010;Mackintosh, 1975) or uncertain stimuli (Pearce & Hall, 1980). ...
Article
Full-text available
Foraging refers to behavior that exploits the current environment for resources and induces exploration for a better environment. Visual foraging tasks have been used to study human behavior during visual searches. Participants searched for target stimuli among the distractors and either acquired or lost points when they clicked on a target or distractor. In the current study, we investigated the influence of blocking feature learning in visual foraging. For this purpose, we divided participants into control and blocking groups. The blocking group completed three phases: in the first phase, stimulus colors predicted rewards; in the second phase, stimulus color and orientation predicted rewards; in the third phase, only stimulus orientation predicted rewards. The control group completed either the second and third phases (Experiments 1 and 2) or all three phases (Experiment 3) with different reward combinations: color in Phase 1, color and orientation in Phase 2, and orientation in Phase 3. When the learning of a stimulus feature was blocked, the participants made less accurate responses. This suggests that the learning of task-relevant information was disturbed, and the blocked feature was not selected to after blocking. Additional analyses showed that the performance deviated slightly from the optimal performance; however, the extent of the deviation was not affected by blocking, implying that two distinct decision-making processes were involved in visual foraging. Our findings highlight the impact of blocking feature learning on visual foraging performance and reveal its distinct influence on multiple decision-making processes in this task.
... and bottom-up factors like a stimulus' physical features and salience (Theeuwes, 1992;Itti, 2005). Here, we attempt to systematically manipulate the extent to which certain stimuli automatically capture participants' attention by utilizing a related but separate construct, that of value-driven attention (Chelazzi et al., 2013;Bucker and Theeuwes, 2017). If successful, this procedure has potential implications for a field of research focusing on the etiology of social (in addition of other types of) anxiety disorder as well as its therapy. ...
Article
Full-text available
Attentional biases toward threatening faces have repeatedly been studied in the context of social anxiety, with etiological theories suggesting exacerbated biases as a possible cause for the latter. To counteract these postulated effects, research has focused on the concept of attentional bias manipulation (ABM), in which spatial contingencies between succeeding stimuli are traditionally employed in training paradigms designed to deliberately shift automatic attention processes away from threat-related stimuli. The ABM research field has been faced with various methodological challenges, such as inconsistent results, low reliabilities of dependent variables and a high susceptibility to moderating factors. We aimed to combine several recent approaches to address these issues. Drawing upon theories of value-driven attention, we explored reward-based contingencies in a Dot Probe task to improve the training’s efficacy, combined with neurophysiological measures for greater reliability compared to reaction times, while evaluating the moderating effect of explicitness in the instruction. In a healthy sample (N = 60) and within a single session, we found a general attentional bias toward angry faces present across all conditions as indicated by the N2pc, which was, however, marked by large intrinsic lateralization effects, with submeasures exhibiting opposing polarities. This prompted us to explore an alternative, intrahemispheric calculation method. The new N2pc variant showed the attentional bias to have disappeared at the end of the training session within the explicit instruction group. Reliabilities of the main dependent variables were varied from excellent to questionable, which, together with the exploratory nature of the analysis, leaves this result as preliminary.
Preprint
Full-text available
Particular features of the stimuli that predict significant outcomes tend to capture our attention in a rather automatic and inflexible way. This form of attention has been described as a Pavlovian bias that mimics the phenomenon of sign-tracking described in animals, where reward-predictive cues become motivational magnets. In humans, Value-Modulated Attentional Capture (VMAC) refers to a phenomenon where distractors that signal high-value outcomes receive higher attentional priority. VMAC is particularly difficult to extinguish, showing a similar persistence often described in animal sign-tracking. In the present study, we evaluated to what extent VMAC would persist using a more specific extinction procedure than previous research, where instead of removing the possibility of obtaining rewards, the different discriminant stimuli that signal reward equate its value. Furthermore, we manipulated between experiments whether the high-value distractor predicted high-reward and high-punishment contingent to response accuracy (mimicking previous research; Experiment 1) or only high-reward (Experiment 2), and also explored the association of VMAC and its persistence with measures of emotional impulsivity employed in past research. Our results show that when both rewards and punishments are possible, VMAC does not extinguish after an extensive extinction stage, nor is it associated with measures of emotional impulsivity. When punishments were removed, we showed that VMAC gradually extinguished both in response times and accuracy and that the persistence of VMAC was significantly associated with positive urgency. We discussed these results on the potential of punishments to qualitatively alter learning and response strategies employed by participants.
Article
Previous nonhuman studies have reported that sign-tracking to a conditioned stimulus (CS) is increased when the intertrial interval (ITI) duration is increased. Separate studies indicate that individual differences in sign-tracking (vs. goal-tracking) at a fixed ITI (and CS duration) is predictive of the conditioned reinforcer efficacy of the CS. The present study evaluates, for the first time, if increasing the ITI increases rats' sign-tracking and the conditioned reinforcing efficacy of the CS. Forty-five female rats were randomly assigned to one of three groups that completed appetitive Pavlovian training with ITIs of 14, 24, or 96s. Subsequently, they completed tests of conditioned reinforcement. Replicating previous findings, longer ITIs increased sign-tracking to a lever-CS and, extending the literature, conditioned reinforcer efficacy of that CS was highest at the longest ITI used during Pavlovian training. Implications for behavioral interventions using conditioned reinforcement are discussed.
Article
Full-text available
This article presents a comprehensive survey of research concerning interactions between associative learning and attention in humans. Four main findings are described. First, attention is biased toward stimuli that predict their consequences reliably (learned predictiveness). This finding is consistent with the approach taken by Mackintosh (1975) in his attentional model of associative learning in nonhuman animals. Second, the strength of this attentional bias is modulated by the value of the outcome (learned value). That is, predictors of high-value outcomes receive especially high levels of attention. Third, the related but opposing idea that uncertainty may result in increased attention to stimuli (Pearce & Hall, 1980), receives less support. This suggests that hybrid models of associative learning, incorporating the mechanisms of both the Mackintosh and Pearce-Hall theories, may not be required to explain data from human participants. Rather, a simpler model, in which attention to stimuli is determined by how strongly they are associated with significant outcomes, goes a long way to account for the data on human attentional learning. The last main finding, and an exciting area for future research and theorizing, is that learned predictiveness and learned value modulate both deliberate attentional focus, and more automatic attentional capture. The automatic influence of learning on attention does not appear to fit the traditional view of attention as being either goal-directed or stimulus-driven. Rather, it suggests a new kind of “derived” attention.
Article
Full-text available
In two experiments, we utilized an exogenous cueing task in which different-colored abrupt-onset cues were associated with an appetitive (gain of 10 cents), aversive (loss of 5 cents), or neutral (no gain or loss) outcome. Reward delivery did not depend on performance, but instead the specific exogenous cues were always followed by their corresponding outcome in a classical-conditioning-like manner. Compared to neutral cues and independent of cue-target delay, the results of Experiment 1 showed that appetitive cues strengthened attentional capture, whereas aversive cues reduced attentional capture. The data revealed that both appetitive and aversive cues initially facilitated responding at the validly cued location. At the long cue-target delays, however, this facilitation effect at the validly cued location remained present for gain-associated cues while it reversed for loss-associated cues. The results of Experiment 2 confirmed these findings by showing that both neutral and aversive cues initially facilitated responding at the cued location and that, at long cue-target delays, aversive cues elicited stronger reorienting away from the cued location as compared to neutral cues. Together these findings indicate that all abrupt-onset cues initially capture attention independent of their outcome association. Yet, if time passes, attention remains lingering at the location of gain-associated cues, whereas attention is released and reoriented away from the location of loss-associated cues. Altogether, we show that associating the color of an abrupt-onset cue with an appetitive or aversive outcome can modulate attentional deployment following exogenous cueing.
Article
Full-text available
It is well known that eye movement patterns are influenced by both goal- and salience-driven factors. Recent studies however have demonstrated that objects that are non-salient and task-irrelevant can still capture our eyes if moving our eyes to those objects has previously produced reward. Here we demonstrate that training such an association between eye movements to an object and delivery of reward is not needed. Instead, an object that merely signals the availability of reward captures the eyes even when it is physically non-salient and never relevant for the task. Furthermore we show that oculomotor capture by reward is more reliably observed in saccades with short latencies. We conclude that a stimulus signaling high reward has the ability to capture the eyes independently of bottom-up physical salience or top-down task-relevance and that the effect of reward affects early selection processes. Copyright © 2015, Journal of Neurophysiology.
Article
Full-text available
The traditional distinction between exogenous and endogenous attentional control has recently been enriched with an additional mode of control, termed "selection history." Recent findings have indicated, for instance, that previously rewarded or punished stimuli capture more attention than their physical attributes would predict. As such, the value that is associated with certain stimuli modulates attentional capture. This particular influence has also been shown for endogenous attention. Although recent leads have emerged, elucidating the influences of reward on exogenous and endogenous attention, it remains unclear to what extent exogenous attention is modulated by reward when endogenous attention is already deployed. We used a Posner cueing task in which exogenous and endogenous cues were presented to guide attention. Crucially, the exogenous cue also indicated the reward value. That is, the color of the exogenous cue indicated how much reward could be obtained on a given trial. The results showed main effects of endogenous and exogenous attention (i.e., speeded reaction times when either cue was valid, as compared to when it was invalid). Crucially, an interaction between exogenous cue validity and reward level was observed, indicating that reward-based associative-learning processes rapidly influence attentional capture, even when endogenous attention has been actively deployed.
Article
Full-text available
Although the performance of simple cognitive tasks can be enhanced if an incentive is provided, the mechanisms enabling such motivational control are not known. This study sought to uncover how mechanisms of attention and readiness are altered by reward-associated incentive stimuli. We measured EEG/ERP activity as human adults viewed a high- or low-incentive cue, experienced a short preparation interval, and then performed a simple visual search task to gain the predicted reward. Search performance was faster with high versus low incentives, and this was accompanied by distinct incentive-related EEG/ERP patterns at each phase of the task (incentive, preparation, and search). First, and most surprisingly, attention to high but not low incentive cues was actively suppressed, as indexed by a PD component in response to the incentive display. During the subsequent preparation interval, neural oscillations in the alpha frequency range were reduced after high-incentive cues, indicating heightened visual readiness. Finally, attentional orienting to the target in the search array was deployed with relatively little effort on high-incentive trials, as indexed by a reduced N2pc component. These results reveal the chain of events by which the brain's executive control mechanisms respond to incentives by altering the operation of multiple processing systems to produce optimal performance.
Article
The present study examines the extent to which distractors that signal the availability of monetary reward on a given trial affect eye movements. We used a novel eye movement task in which observers had to follow a target around the screen while ignoring distractors presented at varying locations. We examined the effects of reward magnitude and distractor location on a host of oculomotor properties, including saccade latency, amplitude, landing position, curvature, and erroneous saccades towards the distractor. We found consistent effects of reward magnitude on classic oculomotor phenomena such as the remote distractor effect, the global effect, and oculomotor capture by the distractor. We also show that a distractor in the visual hemifield opposite to the target had a larger effect on oculomotor control than an equidistant distractor in the same hemifield as the target. Bayesian hierarchical drift diffusion modelling revealed large differences in drift rate depending on the reward value, location and visual hemifield of the distractor stimulus. Our findings suggest that high reward distractors not only capture the eyes, but affect a multitude of oculomotor properties associated with oculomotor inhibition and control.
Article
Attention can be guided involuntarily by physical salience and by non-salient, previously learned reward associations that are currently task-irrelevant. Attention can be guided voluntarily by current goals and expectations. The current study examined, in two experiments, whether irrelevant reward associations could disrupt current, goal-driven, voluntary attention. In a letter-search task, attention was directed voluntarily (i.e., cued) on half the trials by a cue stimulus indicating the hemifield in which the target letter would appear with 100 % accuracy. On the other half of the trials, a cue stimulus was presented, but it did not provide information about the target hemifield (i.e., uncued). On both cued and uncued trials, attention could be involuntarily captured by the presence of a task-irrelevant, and physically non-salient, color, either within the cued or the uncued hemifield. Importantly, one week prior to the letter search task, the irrelevant color had served as a target feature that was predictive of reward in a separate training task. Target identification accuracy was better on cued compared to uncued trials. However, this effect was reduced when the irrelevant, and physically non-salient, reward-associated feature was present in the uncued hemifield. This effect was not observed in a second, control experiment in which the irrelevant color was not predictive of reward during training. Our results indicate that involuntary, value-driven capture can disrupt the voluntary control of spatial attention.
Article
There is growing consensus that reward plays an important role in the control of attention. Until recently, reward was thought to influence attention indirectly by modulating task-specific motivation and its effects on voluntary control over selection. Such an account was consistent with the goal-directed (endogenous) versus stimulus-driven (exogenous) framework that had long dominated the field of attention research. Now, a different perspective is emerging. Demonstrations that previously reward-associated stimuli can automatically capture attention even when physically inconspicuous and task-irrelevant challenge previously held assumptions about attentional control. The idea that attentional selection can be value driven, reflecting a distinct and previously unrecognized control mechanism, has gained traction. Since these early demonstrations, the influence of reward learning on attention has rapidly become an area of intense investigation, sparking many new insights. The result is an emerging picture of how the reward system of the brain automatically biases information processing. Here, I review the progress that has been made in this area, synthesizing a wealth of recent evidence to provide an integrated, up-to-date account of value-driven attention and some of its broader implications.
Article
We review evidence for partially segregated networks of brain areas that carry out different attentional functions. One system, which includes parts of the intraparietal cortex and superior frontal cortex, is involved in preparing and applying goal-directed (top-down) selection for stimuli and responses. This system is also modulated by the detection of stimuli. The other system, which includes the temporoparietal cortex and inferior frontal cortex, and is largely lateralized to the right hemisphere, is not involved in top-down selection. Instead, this system is specialized for the detection of behaviourally relevant stimuli, particularly when they are salient or unexpected. This ventral frontoparietal network works as a 'circuit breaker' for the dorsal system, directing attention to salient events. Both attentional systems interact during normal vision, and both are disrupted in unilateral spatial neglect.