ArticlePDF Available

Hierarchical Encoding in Visual Working Memory: Ensemble Statistics Bias Memory for Individual Items

Authors:

Abstract and Figures

Influential models of visual working memory treat each item to be stored as an independent unit and assume that there are no interactions between items. However, real-world displays have structure that provides higher-order constraints on the items to be remembered. Even in the case of a display of simple colored circles, observers can compute statistics, such as mean circle size, to obtain an overall summary of the display. We examined the influence of such an ensemble statistic on visual working memory. We report evidence that the remembered size of each individual item in a display is biased toward the mean size of the set of items in the same color and the mean size of all items in the display. This suggests that visual working memory is constructive, encoding displays at multiple levels of abstraction and integrating across these levels, rather than maintaining a veridical representation of each item independently.
Content may be subject to copyright.
Psychological Science
22(3) 384 –392
© The Author(s) 2011
Reprints and permission:
sagepub.com/journalsPermissions.nav
DOI: 10.1177/0956797610397956
http://pss.sagepub.com
Observers can quickly and accurately compute ensemble sta-
tistics about a display, such as the mean size (Ariely, 2001;
Chong & Treisman, 2003), orientation (Parkes, Lund, Angelucci,
Solomon, & Morgan, 2001), and location (Alvarez & Oliva,
2008) of the items; the mean expression of a set of faces
(Haberman & Whitney, 2007); and even higher-level spatial
layout statistics (Alvarez & Oliva, 2009). However, little work
has explored why observers compute these statistics and, in
particular, whether the encoding of these higher-order statis-
tics might play a role in how observers represent the individual
items from such displays in memory.
Nearly all studies of visual working memory use displays
consisting of simple stimuli and items that have been chosen
randomly.1 These displays are, as best as possible, prevented
from having any overarching structure or gist. Thus, influen-
tial models of visual working memory tend to treat each item
as an independent unit and assume that items do not influence
each others’ representations (Alvarez & Cavanagh, 2004;
Bays, Catalao, & Husain, 2009; Luck & Vogel, 1997; Rouder
et al., 2008; Wilken & Ma, 2004; Zhang & Luck, 2008;
although see Lin & Luck, 2008, and Johnson, Spencer, Luck,
& Schöner, 2009).
We propose that, contrary to the assumptions of previous
models of visual working memory, ensemble statistics allow
observers to encode such working memory displays more
efficiently: In a process paralleling how people encode real
scenes (Lampinen, Copeland, & Neuschatz, 2001; Oliva,
2005), observers might encode the “gist” of simple working
memory displays (ensemble statistics such as mean size) in
addition to information about specific items (their individual
information). Such hierarchical encoding would allow observ-
ers to represent information about every item in the display
simultaneously, significantly improving the fidelity of their
memory representations compared with encoding only three
or four individual items.
To test this hypothesis, we used the ensemble statistic of
mean size and the grouping principle of common color, both
of which are known to be automatically and effortlessly com-
puted and could act as a form of higher-order structure in
visual displays (Chong & Treisman, 2005b). Our results dem-
onstrate a form of hierarchical encoding in visual working
memory: The remembered size of individual items was biased
toward the mean size of items of the same color and the mean
size of all items in the display. This suggests that, contrary to
Corresponding Author:
Timothy F. Brady, Department of Brain and Cognitive Sciences, 46-4078,
Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge,
MA 02139
E-mail: tfbrady@mit.edu
Hierarchical Encoding in Visual Working
Memory: Ensemble Statistics Bias Memory
for Individual Items
Timothy F. Brady1 and George A. Alvarez2
1Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, and 2Department of Psychology, Harvard University
Abstract
Influential models of visual working memory treat each item to be stored as an independent unit and assume that there are no
interactions between items. However, real-world displays have structure that provides higher-order constraints on the items
to be remembered. Even in the case of a display of simple colored circles, observers can compute statistics, such as mean circle
size, to obtain an overall summary of the display. We examined the influence of such an ensemble statistic on visual working
memory. We report evidence that the remembered size of each individual item in a display is biased toward the mean size
of the set of items in the same color and the mean size of all items in the display. This suggests that visual working memory
is constructive, encoding displays at multiple levels of abstraction and integrating across these levels, rather than maintaining a
veridical representation of each item independently.
Keywords
working memory, constructive memory, ensemble statistics, summary statistics
Received 5/21/10; Revision accepted 10/16/10
Research Article
Ensemble Statistics Bias Visual Working Memory 385
existing models of visual working memory, items are not
recalled as independent units; instead, an item’s reported size
is constructed by combining information about that specific
item with information about the set of items at multiple levels
of abstraction.
Experiment 1: Ensemble Statistics
Bias Size Memory
We examined whether the ensemble statistics of a display
would bias memory for individual items when observers
attempted to remember the size of multiple colored circles. We
hypothesized that in the case of displays with both small red
circles and large blue circles, observers would tend to report
the size of a particular circle as larger if it was blue than if it
was red. Such a size bias would suggest that observers had
taken into account the size of the items in each color set.
Method
Observers. Twenty-one observers were recruited and run
using Amazon Mechanical Turk (https://www.mturk.com). All
were from the United States, gave informed consent, and were
paid $0.40 for approximately 3 min of their time.
Procedure. All observers were presented with the same 30
displays consisting of three red, three blue, and three green
circles of varying size (see Fig. 1) and were told to remember
the size of all of the red and blue circles, but to ignore the green
circles. We included the green distractor items in the displays
because we believed they would encourage observers to
encode the items by color, rather than to select all of the items
into memory at once (Halberda, Sires, & Feigenson, 2006;
Huang, Treisman, & Pashler, 2007). The order of the 30 dis-
plays was randomized across observers. Each display appeared
for 1.5 s and was followed by a 1-s blank, after which a single
randomly sized circle reappeared in black at the location that a
red or blue circle had occupied. Observers had to slide the
computer mouse up or down to resize this new black circle to
the size of the red or blue circle they had previously seen at
that location; they then clicked to lock in their answer and start
the next trial.
Stimuli. The nine circles appeared on a gray background that
measured 600 × 400 pixels. Each circle was positioned at a
random location within an invisible 6 × 4 grid; jitter of ±10
pixels was added to the circles’ locations to prevent collineari-
ties. The size and resolution of observers’ computer monitors
were not controlled. However, all observers attested to the fact
that the displays were visible in their entirety. Moreover, the
critical comparisons are within subjects, and individual differ-
ences in absolute size of the displays are factored out by focus-
ing on within-subjects comparisons between conditions.
Circle sizes were drawn from a separate normal distribution
for each color. The mean diameter for the circles of a given
color was chosen uniformly on each trial from the interval (15
pixels, 95 pixels), and the diameter of each individual circle
was then chosen from a normal distribution with this mean and
a standard deviation equal to one eighth of this mean. Thus, on
a given trial, the three red circles could be sampled from
around 35 pixels, the blue circles from around 80 pixels, and
the green circles from around 20 pixels. However, which color
set was largest and which was smallest was chosen randomly
on each trial; thus, on the next trial, it could be the green cir-
cles that were largest and the blue circles that were smallest.
So that we could directly test the hypothesized bias in
reported size, we generated 15 matched pairs of displays. First,
15 displays were generated as described; then, another 15
were created by switching the color of the to-be-tested item to
the other nondistractor color (and making the reverse switch
for another circle, so that there would still be three red circles
and three blue circles in the display). Thus, the displays in
Fig. 1. Example pair of matched displays from Experiment 1. Observers had to remember the size of the red and blue
circles and ignore the green circles. After each trial, they were tested on the size of a single circle using a recall procedure.
The two displays in a matched pair had the same items, but the colors of the tested item (here, the circle second from the
left on the bottom) and another item (the circle at the bottom left) were swapped between the displays. Note that the size
of the circles is not to scale in order to more clearly show the properties of the displays.
386 Brady, Alvarez
each of the pairs were matched in the size of all of the circles
present and differed only in the color of two circles, including
the circle that would later be tested. The 30 displays were ran-
domly interleaved, with the constraint that paired displays
could not appear one after the other. By comparing reported
size when the tested item was one color with reported size
when it was another color, we were able to directly test the
hypothesis that observers’ memory for size is biased toward
the mean size of all items in the tested item’s color set.
Results
Overall accuracy. We first assessed whether observers were
able to accurately perform the size memory task by comparing
their performance with an empirical measure of chance perfor-
mance obtained by randomly pairing a given observer’s
responses with the correct answers from different trials (mean
difference by chance = 30.5 pixels, SEM = 0.78 pixels).
Observers’ average error was 16.4 pixels (SEM = 1.7 pixels), a
level of performance that was significantly better than our
measure of chance, p < 109.
Bias from same-colored circles. To test our main hypothesis,
we examined whether observers’ size estimates tended to be
biased toward the size of the circles with the same color as the
tested circle. We divided each matched pair on the basis of
which of the pair contained a tested item the same color as the
circles that were smaller on average and which contained a
tested item the same color as the circles that were larger on
average. We then divided reported sizes on the latter trials by
reported sizes on the former trials. A ratio of 1.0 would indi-
cate that observers were not biased. However, if observers’
size estimates were biased toward the mean size of the circles
in the same color as the tested item, this ratio would be greater
than 1.0 (see Fig. 2a).
On average, the reported size of the tested circle was 1.11
times greater (SEM = 0.03) on trials with the larger same-
colored circles than on trials with the smaller same-colored
circles (see Fig. 2b). This ratio was significantly greater than 1.0,
t(20) = 4.17, p = .0004. In addition, the direction of the effect
was highly consistent across observers, with 19 of the 21
observers having a ratio above 1.0. The maximum possible
bias was 1.6, because the larger same-colored circles were on
average 1.6 times larger than their matched counterparts.
Thus, the observers reported a size 18% of the way between
the correct size and the mean size of the same-colored circles.
This effect was a result of memory and not a perceptual bias,
because in a version of the experiment with a precue indicat-
ing which item would be tested, observers (N = 22) reported
the size accurately (mean error = 6.4 pixels, SEM = 0.5 pixels)
and with no bias toward the mean size of the same-colored
circles (bias = 1.00, SEM = 0.01; for details, see Perceptual
Effects in the Supplemental Material available online).
Model: optimal integration across different levels of
abstraction. One interpretation of the data is that observers
represented the displays at multiple levels of abstraction and
integrated across these levels when retrieving the size of the
Mean Size
of Red Dots
Mean Size
of Blue Dots
Presented
Size
Reported
Size
Tested
Item
Presented
Blue
Presented
Red
Reported Size When Presented Blue
Reported Size When Presented Red
Bias =
a b
0.90
0.95
1.00
1.05
1.10
1.15
1 2a
Experiment
2b
Bias
Toward
Mean
Bias
Away
From
Mean 0.85
Fig. 2. Bias in size estimates in the three experiments. The illustration in (a) shows how bias was calculated from a pair of matched displays. We
measured whether observers reported different sizes for the tested circle when it was red versus when it was blue (the circle was in fact the same
size in both presentations). In this example, the blue circles were larger than the red circles on average, and the reported sizes of the tested circle are
indicative of bias. Which color was used for the larger circles was counterbalanced across trials in the actual experiment. Because bias was always
calculated by dividing the size reported for tested items presented in the color of the larger circles by the size reported for tested items presented
in the color of the smaller circles, a ratio greater than 1.0 indicated bias toward the mean size of the same-colored circles. The graph (b) shows
participants’ mean bias in Experiment 1, in which color was task relevant, and in Experiments 2a and 2b, in which color was task irrelevant. Error bars
represent ±1 SEM.
Ensemble Statistics Bias Visual Working Memory 387
tested circle or when initially encoding its size. To more
directly test this idea, we formalized how observers might rep-
resent a display hierarchically using a probabilistic model (for
similar models, see Hemmer & Steyvers, 2009; Huttenlocher,
Hedges, & Vevea, 2000). The model had three levels of
abstraction, representing particular circles, all circles of a
given color, and all circles in the entire display. In the model,
observers encoded a noisy sample of the size of each individ-
ual circle, and the size of each circle was itself considered a
noisy sample from the expected size of the circles of that color,
which was itself considered a sample of the expected size of
the circles in a given display. We asked what observers ought
to report as their best guess about the size of the tested circle
(assuming normal distributions at each level).
The intuition this model represents is fairly straightfor-
ward: If the red circles in a particular display are all quite
large, but the observer encodes a fairly small size for one of
them, it is more likely that this circle is a large circle the
observer accidentally encoded as too small than that it is a
small circle the observer accidentally encoded as too large.
Thus, in general, the model suggests that the optimal way to
minimize errors in responses is to be biased slightly (either
when encoding the circles or when retrieving their size) toward
the mean of both the set of circles of the same color as the
tested circle and the overall mean of the display. Figure 3
presents predictions of this model, along with a corresponding
representation of the behavioral data from Experiment 1 (for
information on model implementation, see Optimal Observer
Model in the Supplemental Material available online). Note
that in both the observers’ data and the model predictions, the
slopes of the lines indicate a bias toward reporting all circles
as less extreme in size then they really were, and also note that
the plotted points indicate a bias toward reporting a size simi-
lar to the size of the same-colored circles.
The model had a single free parameter, which indicated
how noisy the encoding of a given circle was (the standard
deviation of the normal distribution from which the encoded
size was sampled) and thus how biased toward the means
observers’ size estimates ought to be. We set this parameter to
25 pixels (the estimated standard deviation of errors collaps-
ing across all observers and displays), rather than maximizing
the fit to the data. Although not strictly independent of the data
being fit, this method of choosing the value for the parameter
is not based on the measures we used to assess the model.
In general, the model provided a strong fit to the data as
assessed by two different metrics: First, the model predicted
the difference between the correct answer and reported answer
for each display, ignoring the paired structure of the displays (r =
.89, p < .0001). Second, the model predicted the difference in
reported size between matched displays (r = .82, p < .001),
even though the tested circle was actually the same size within
each such pair. Any model of working memory that treats items
0 20 40 60 80 100 120
0
20
40
60
80
100
120
a b
Correct Size (pixels)
Reported Size (pixels)
0
20
40
60
80
100
120
Reported Size (pixels)
0 20 40 60 80 100 120
Correct Size (pixels)
Larger Same-Colored Circles Smaller Same-Colored Circles
Fig. 3. Comparison of the behavioral data from Experiment 1 and predictions of the optimal integration model described in the text. The
graph in (a) shows the data from Experiment 1, averaged across observers so that each display is represented by a single point. The graph in
(b) shows the predictions from the model, which integrates information from multiple levels of abstraction (with SD = 25 pixels). Data and
predictions are shown separately for displays in which the tested circle was the same color as the circles that were smaller on average and
displays in which the tested circle was the same color as the circles that were larger on average.
388 Brady, Alvarez
as independent (e.g., most slot and resource models, including
the mathematical model presented by Zhang & Luck, 2008)
cannot predict a systematic difference between these trials.
Discussion
We found that observers are biased by the ensemble statistics
of a display when representing display items in visual working
memory. In the case of displays with circles of several differ-
ent colors, observers’ reports of the size of a given circle are
biased by the size of the other circles of the same color. This
effect is not accounted for by perceptual biases or location
noise or by incorrectly reporting the wrong item from the dis-
play (see Potential Reports of the Incorrect Item in the Supple-
mental Material), is not a result of guessing on the basis of the
mean size of the circles of that color (see Comparison to an
Across-Trial Guessing Model in the Supplemental Material),
and is compatible with a simple Bayesian model in which
observers integrate information at multiple levels of abstrac-
tion to form a final hypothesis about the size of the tested item.
Experiments 2a and 2b: Attention
to Color Is Required
In Experiment 1, the color of the items was task relevant. In
fact, because observers have difficulty attending to more than
a single color at a time (Huang et al., 2007), observers in
Experiment 1 likely had to separately encode the sizes of the
red circles and the sizes of the blue circles, and this might have
increased the salience of color grouping. Salience of color as a
grouping dimension may have been a crucial part of why
observers used the mean size of the circles that were the same
color as the tested circle in guiding their memory retrieval.
Thus, in Experiments 2a and 2b, we removed the green circles
from the displays and asked observers to simply remember the
sizes of all of the circles. This allowed us to evaluate the auto-
maticity of the biases we observed in Experiment 1—for
example, the extent to which they depend on attentional selec-
tion and strategy. In addition, Experiments 2a and 2b provided
a control for low-level factors that could have influenced the
results of Experiment 1.
Method
Twenty-five new observers completed Experiment 2a, and 20
different observers completed Experiment 2b.
The methods and 30 displays used in Experiment 2a were
the same as in Experiment 1 except that the green circles used
as distractor items were not present in the displays. Experiment
2b was identical to Experiment 2a except that the circles were
shown for only 350 ms rather than 1.5 s in order to decrease
observers’ performance to the same level as in Experiment 1.
Results
Experiment 2a. Observers’ performance in Experiment 2a
was very good. The average error was 10.2 pixels (SEM = 0.60
pixels), which was significantly less than our empirical mea-
sure of chance (29 pixels, SEM = 0.28 pixels) p < 1019, and
significantly less than the error of subjects in Experiment 1,
t(44) = 3.73, p < .001.
Observers in Experiment 2a displayed no bias as a function
of color. The mean bias of 0.99 (SEM = 0.01) was not signifi-
cantly different from 1.0, t(24) = –0.86, p = 0.39 (see Fig. 2b).
This result is compatible with the idea that the size of the
same-colored circles does not bias observers’ estimates of the
tested circle’s size when color is not task relevant.
However, the observers in Experiment 2a had significantly
lower error rates than the observers in Experiment 1. Thus, it
is possible that the observers in Experiment 2a did not display
a bias because they were able to encode all of the circles accu-
rately as individuals. To initially examine this possibility, we
compared the least accurate 50% of observers in Experiment 2a
with the most accurate 50% of observers in Experiment 1. The
error rates reversed (Experiment 1: mean error = 10 pixels;
Experiment 2a: mean error = 13 pixels), yet the bias remained
present only in Experiment 1 (Experiment 1: 1.07; Experiment 2a:
1.00). This provided preliminary evidence that the difference
in accuracy between the two experiments did not drive the dif-
ference in bias.
Experiment 2b: overall accuracy and bias. Experiment 2b
experimentally addressed the concern that the lack of bias
among observers in Experiment 2a was driven by their high
performance level. In Experiment 2b, display time was
reduced from 1.5 s to 350 ms to increase the error rate while
maintaining the task irrelevance of color. Observers in Experi-
ment 2b had an error rate of 15.9 pixels on average (SEM =
2.25 pixels). This was significantly less than our empirical
measure of chance (31.3 pixels, SEM = 1.13 pixels), p < 109,
but not significantly less than the error rate of subjects in
Experiment 1, t(39) = –0.17, p = .86. Thus, Experiment 1 and
Experiment 2b were equated on error rate. However, observ-
er’s reports of the size of the tested circles in Experiment 2b
were still not biased toward the size of the same-colored cir-
cles (M = 1.00, SEM = 0.01).
Optimal integration model. We applied the same model
used in Experiment 1 to the data from Experiment 2b, but with
only two levels (no grouping by color): information about the
particular circle and information about all the circles in the dis-
play. This model once again provided a strong fit to the experi-
mental data (Fig. 4). Because the model did not use color
information, it predicted exactly the same performance for both
trials within each matched pair. This prediction is in line with
Ensemble Statistics Bias Visual Working Memory 389
the observed bias of 1.00 in the experimental data. Further-
more, the model predicted the overall bias toward the mean
size of the circles in the display, and its predictions correlated
with the errors people made across all trials, r = .53, p = .002.
Discussion
In Experiments 2a and 2b, in which color was not task rele-
vant, observers did not display a bias toward the mean size of
the same-colored circles, even when the experiment was
equated with Experiment 1 on difficulty. However, in both
Experiment 2a and Experiment 2b, observers’ estimates were
still biased toward the mean size of the circles in the display
overall. The data are compatible with a Bayesian model in
which observers treat all items as coming from a single group,
rather than breaking the items into separate groups by color.
Furthermore, the results of these experiments help rule out
possible confounds in Experiment 1, such as the possibility
that location noise caused swapping of items in memory, as the
displays used in Experiments 2a and 2b were exactly the same
as those used in Experiment 1 except for the absence of irrel-
evant green circles. We have also run Experiments 1 and 2a as
separate conditions in a single within-subjects experiment and
replicated the finding of a bias only when displays included
the green circles (see Replication and a Within-Subject Exper-
iment in the Supplemental Material).
General Discussion
We found that observers are biased by the ensemble statistics of
a display when representing items in visual working memory.
When asked to report the size of an individual circle, observers
tended to report it as larger if the other items in the same color
were large and smaller if the other items in the same color were
small. This bias was reliable across observers and was pre-
dicted by a simple Bayesian model that encodes a display at
multiple levels of abstraction. Taken together, our findings sug-
gest that items in visual working memory are not represented
independently and, more broadly, that visual working memory
is susceptible to the very same hallmarks of constructive mem-
ory that are typical of long-term memory (Bartlett, 1932).
Representation of ensemble statistics
It is well established that the visual system can efficiently
compute ensemble statistics (e.g., Alvarez & Oliva, 2009; Ariely,
2001; Chong & Treisman, 2003) and does so even when this is
not required by the task, causing, for example, a false belief
that an item with a set’s mean value of an attribute was present
(de Fockert & Wolfenstein, 2009; Haberman & Whitney,
2009). However, less work has explored why the visual system
represents ensemble statistics. One benefit of ensemble repre-
sentations is that they can be highly accurate, even when the
0 20 40 60 80 100 120
0
20
40
60
80
100
120
Correct Size (pixels)
Reported Size (pixels)
0
20
40
60
80
100
120
Reported Size (pixels)
0 20 40 60 80 100 120
Correct Size (pixels)
a b
Larger Same-Colored Circles Smaller Same-Colored Circles
Fig. 4. Comparison of the behavioral data from Experiment 2b and predictions of the optimal integration model described in the text. The
graph in (a) shows the data from Experiment 2b, averaged across observers so that each display is represented by a single point. The graph in
(b) shows the predictions from the model, which integrates information from multiple levels of abstraction (with SD = 25 pixels). Data and
predictions are shown separately for displays in which the tested circle was the same color as the circles that were smaller on average and
displays in which the tested circle was the same color as the circles that were larger on average.
390 Brady, Alvarez
local measurements constituting them are very noisy (Alvarez
& Oliva, 2008, 2009). Another possible benefit of ensemble
representations is that they can be used to identify outliers in a
display (Rosenholtz & Alvarez, 2007), and thus potentially
guide attention to items that cannot be incorporated in the
summary for the rest of the group (Brady & Tenenbaum, 2010;
Haberman & Whitney, 2009). The current work suggests a
new use of ensemble statistics: Such statistics can increase the
accuracy with which items are stored in visual working mem-
ory, reducing uncertainty about the size of individual items by
optimally combining item-level information with ensemble
statistics at multiple levels of abstraction.
It is interesting that observers in our experiments used the
mean size of the circles of specific colors to reconstruct the dis-
plays only when color was task relevant, despite the fact that
using this statistic would improve memory for the individual
items in all conditions. This could suggest that the units over
which such ensemble statistics are computed is limited by selec-
tive attention (e.g., Chong & Treisman, 2005a). Turk-Browne,
Jungé, and Scholl (2005) suggested that statistical learning, a
form of learning about sequential dependencies, may happen
automatically, but that the particular sets over which the statis-
tics are computed may be controlled by selective attention. This
hypothesis is compatible with our current findings: When
observers did not attend to the colored sets as separate units,
they may not have computed separate summary statistics for the
two colored sets (alternatively, separate summary statistics may
have been encoded, but not used in reconstructing the circle
sizes). However, when observers attended to color, the ensem-
ble statistics for the two colors seem to have been computed in
parallel, as found by Chong and Treisman (2005b).
Dependence between items in
visual working memory
Our results demonstrate a case of nonindependence between
items in visual working memory: We found that items are repre-
sented not just individually, but also as a group or ensemble.
Although previous experiments did not directly address such
hierarchical effects, nonindependence between items in visual
working memory has been observed previously. For example,
Huang and Sekuler (2010) found that observers exhibit bias in
reporting the spatial frequency of Gabor patches, tending to
report spatial frequencies as though they have been pulled
toward the spatial frequencies of previously presented Gabor
patches. In addition, Jiang, Olson, and Chun (2000) have shown
that changing the spatial context of an item influences memory
for that item (see also Vidal, Gauchou, Tallon-Baudry, &
O’Regan, 2005). This suggests that an item is not represented
independently of its spatial context in working memory.
Similarly, several studies (Brady & Tenenbaum, 2010;
Sanocki, Sellers, Mittelstadt, & Sulman, 2010; Victor &
Conte, 2004) have shown that observers can take advantage of
perceptual regularities in working memory displays to remem-
ber more individual items from those displays. Brady and
Tenenbaum (2010) investigated checkerboard-like displays
and conceptualized their findings in terms of hierarchical
encoding, in which the gist of the display is encoded in addi-
tion to specific information about a small number of items that
are least consistent with the gist. This hypothesis is compatible
with the model we presented here for simpler displays, accord-
ing to which observers encode ensemble information as well
as information about specific items.
This dependence between items in memory is not predicted
or explained by influential models of visual working memory.
Current theories model visual working memory as a flexible
resource that is quantized into slots (Zhang & Luck, 2008) or
continuously divisible (Alvarez & Cavanagh, 2004; Bays &
Husain, 2008; Wilken & Ma, 2004). According to these mod-
els, fewer items can be remembered with higher precision
because they receive more memory resources. However, these
models assume that items are stored independently, and there-
fore cannot account for the dependence between items in
memory observed in the current study. Expanding these mod-
els to account for the current results will require specification
of whether abstract levels of representation compete for the
same resources as item-level representations (e.g., Feigenson,
2008), or whether there are essentially separate resources for
ensemble representations and item-level representations (e.g.,
Brady & Tenenbaum, 2010).
Role of long-term memory in
visual working memory
In addition to work demonstrating dependencies between
stored representations of items and hierarchical encoding of a
particular display, there is a significant amount of previous
work showing that the representation of items in visual work-
ing memory depends on information in long-term memory
(e.g., Brady, Konkle, & Alvarez, 2009). For instance, Konkle
and Oliva (2007) and Hemmer and Steyvers (2009) have shown
that knowledge of the size of an object in the real world biases
the remembered size of that object in a display after a short
delay. Hemmer and Steyvers (2009) provided a model of this
effect as due to Bayesian inference in a constructive memory
framework, and their model is similar to the one we proposed
here for the on-line representation of the displays in our experi-
ments. Convergence between a model for using ensemble
information from the current display and a model for integrat-
ing information from the current display with information from
long-term memory suggests a promising future direction for
understanding the use of higher-order information in memory.
Conclusion
We found that observers are biased by the ensemble statistics of
a display when representing items from that display in visual
working memory. Rather than storing items independently,
observers seem to construct the size of an individual item using
information from multiple levels of abstraction. Thus, despite
Ensemble Statistics Bias Visual Working Memory 391
the active maintenance processes involved in visual working
memory, it appears to be susceptible to the very same hallmarks
of constructive memory that are typical of retrieval from long-
term memory and scene recognition (Bartlett, 1932; Lampinen
et al., 2001). Cognitive and neural models of visual working
memory need to be expanded to account for such constructive,
hierarchical encoding processes.
Acknowledgments
For helpful conversation and comments on earlier drafts, we thank
Talia Konkle, Adena Schachner, Josh Tenenbaum, and Aude Oliva.
Declaration of Conflicting Interests
The authors declared that they had no conflicts of interest with
respect to their authorship or the publication of this article.
Funding
This research was supported by National Institute of Mental Health
Grant R03-MH086743 to G.A.A. and by a National Science Foundation
graduate research fellowship to T.F.B.
Supplemental Material
Additional supporting information may be found at http://pss.sagepub
.com/content/by/supplemental-data
Note
1. See Hollingworth (2008) for studies that employed real-world
scenes.
References
Alvarez, G.A., & Cavanagh, P. (2004). The capacity of visual short-
term memory is set both by visual information load and by num-
ber of objects. Psychological Science, 15, 106–111.
Alvarez, G.A., & Oliva, A. (2008). The representation of simple
ensemble visual features outside the focus of attention. Psycho-
logical Science, 19, 392–398.
Alvarez, G.A., & Oliva, A. (2009). Spatial ensemble statistics:
Efficient codes that can be represented with reduced attention.
Proceedings of the National Academy of Sciences, USA, 106,
7345–7350.
Ariely, D. (2001). Seeing sets: Representation by statistical proper-
ties. Psychological Science, 12, 157–162.
Bartlett, F. (1932). Remembering: A study in experimental and social
psychology. New York, NY: Macmillan.
Bays, P.M., Catalao, R.F.G., & Husain, M. (2009). The precision of
visual working memory is set by allocation of a shared resource.
Journal of Vision, 9(10), Article 7. Retrieved from http://journal
ofvision.org/9/10/7/
Bays, P.M., & Husain, M. (2008). Dynamic shifts of limited working
memory resources in human vision. Science, 321, 851.
Brady, T.F., Konkle, T., & Alvarez, G.A. (2009). Compression in
visual short-term memory: Using statistical regularities to form
more efficient memory representations. Journal of Experimental
Psychology: General, 138, 487–502.
Brady, T.F., & Tenenbaum, J.B. (2010). Encoding higher-order struc-
ture in visual working memory: A probabilistic model. In S. Ohlsson
& R. Catrambone (Eds.), Proceedings of the 32nd Annual Con-
ference of the Cognitive Science Society (pp. 411–416). Portland,
OR: Cognitive Science Society.
Chong, S.C., & Treisman, A. (2003). Representation of statistical
properties. Vision Research, 43, 393–404.
Chong, S.C., & Treisman, A. (2005a). Attentional spread in the statistical
processing of visual displays. Perception & Psychophysics, 67, 1–13.
Chong, S.C., & Treisman, A. (2005b). Statistical processing: Com-
puting the average size in perceptual groups. Vision Research,
45, 891–900.
de Fockert, J., & Wolfenstein, C. (2009). Rapid extraction of mean
identity from sets of faces. Quarterly Journal of Experimental
Psychology, 62, 1716–1722.
Feigenson, L. (2008). Parallel non-verbal enumeration is constrained
by a set-based limit. Cognition, 107, 1–18.
Haberman, J., & Whitney, D. (2007). Rapid extraction of mean emotion
and gender from sets of faces. Current Biology, 17, R751–R753.
Haberman, J., & Whitney, D. (2009). The visual system ignores
outliers when extracting a summary representation [Abstract].
Journal of Vision, 9(8), Article 804. Retrieved from http://www
.journalofvision.org/content/9/8/804
Halberda, J., Sires, S.F., & Feigenson, L. (2006). Multiple spatially
overlapping sets can be enumerated in parallel. Psychological
Science, 17, 572–576.
Hemmer, P., & Steyvers, M. (2009). Integrating episodic memories
and prior knowledge at multiple levels of abstraction. Psycho-
nomic Bulletin & Review, 16, 80–87.
Hollingworth, A. (2008). Visual memory for natural scenes. In S.J.
Luck & A. Hollingworth (Eds.), Visual memory (pp. 123–162).
New York, NY: Oxford University Press.
Huang, J., & Sekuler, R. (2010). Distortions in recall from visual
memory: Two classes of attractors at work. Journal of Vision,
10(2), Article 24. Retrieved from http://www.journalofvision.org/
content/10/2/24.full
Huang, L., Treisman, A., & Pashler, H. (2007). Characterizing the
limits of human visual awareness. Science, 317, 823–825.
Huttenlocher, J., Hedges, L.V., & Vevea, J.L. (2000). Why do catego-
ries affect stimulus judgment? Journal of Experimental Psychol-
ogy: General, 129, 220–241.
Jiang, Y., Olson, I.R., & Chun, M.M. (2000). Organization of visual
short-term memory. Journal of Experimental Psychology: Learn-
ing, Memory, and Cognition, 26, 683–702.
Johnson, J.S., Spencer, J., Luck, S., & Schöner, G. (2009). A dynamic
neural field model of visual working memory and change detec-
tion. Psychological Science, 20, 568–577.
Konkle, T., & Oliva, A. (2007). Normative representation of objects:
Evidence for an ecological bias in object perception and memory.
In D.S. McNamara & J.G. Trafton (Eds.), Proceedings of the 29th
Annual Meeting of the Cognitive Science Society (pp. 407–413).
Austin, TX: Cognitive Science Society.
Lampinen, J.M., Copeland, S., & Neuschatz, J.S. (2001). Recollec-
tions of things schematic: Room schemas revisited. Journal of
392 Brady, Alvarez
Experimental Psychology: Learning, Memory, and Cognition,
27, 1211–1222.
Lin, P.-H., & Luck, S.J. (2008). The influence of similarity on
visual working memory representations. Visual Cognition, 17,
356–372.
Luck, S.J., & Vogel, E.K. (1997). The capacity of visual working
memory for features and conjunctions. Nature, 390, 279–281.
Oliva, A. (2005). Gist of the scene. In L. Itti, G. Rees, & J.K. Tsotsos
(Eds.), The encyclopedia of neurobiology of attention (pp. 251–
256). San Diego, CA: Elsevier.
Parkes, L., Lund, J., Angelucci, A., Solomon, J.A., & Morgan, M.
(2001). Compulsory averaging of crowded orientation signals in
human vision. Nature Neuroscience, 4, 739–744.
Rosenholtz, R., & Alvarez, G.A. (2007) How and why we per-
ceive sets: What does modeling tell us? [Abstract]. Perception,
36(ECVP Abstract Supplement), 8. Retrieved from http://www
.perceptionweb.com/ecvp/ecvp07.pdf
Rouder, J.N., Morey, R.D., Cowan, N., Zwilling, C.E., Morey, C.C.,
& Pratte, M.S. (2008). An assessment of fixed-capacity models of
visual working memory. Proceedings of the National Academy of
Sciences, USA, 105, 5975–5979.
Sanocki, T., Sellers, E., Mittelstadt, J., & Sulman, N. (2010). How
high is visual short term memory capacity for object layout?
Attention, Perception, & Psychophysics, 72, 1097–1109.
Turk-Browne, N.B., Jungé, J.A., & Scholl, B.J. (2005). The automa-
ticity of visual statistical learning. Journal of Experimental Psy-
chology: General, 134, 552–564.
Victor, J.D., & Conte, M.M. (2004). Visual working memory for
image statistics. Vision Research, 44, 541–556.
Vidal, J.R., Gauchou, H.L., Tallon-Baudry, C., & O’Regan, J.K.
(2005). Relational information in visual short-term memory: The
structural gist. Journal of Vision, 5(3), Article 8. Retrieved from
http://journalofvision.org/5/3/8/
Wilken, P., & Ma, W.J. (2004). A detection theory account of change
detection. Journal of Vision, 4(12), Article 11. Retrieved from
http://www.journalofvision.org/content/4/12/11/
Zhang, W., & Luck, S.J. (2008). Discrete fixed-resolution representa-
tions in visual working memory. Nature, 452, 233–235.
... These studies have unveiled interactions among WM representations (Bae & Luck, 2017;Czoschke et al., 2020;Lively et al., 2021;Utochkin & Brady, 2020). In light of these interactions, the hierarchical encoding theory has been proposed, which posits that WM operates on multitiered displays integrating individual objects and ensembles (Brady & Alvarez, 2011;Robinson & Brady, 2023;Utochkin & Brady, 2020). ...
... Consequently, extensive investigations have explored ensemble coding in relation to fundamental physical attributes such as sizes (Ariely, 2001;Baek & Chong, 2020), colors (Demeyere et al., 2008;Webster et al., 2014), and orientations (Solomon, 2010), as well as social traits such as expressions (Elias et al., 2017) and directions of biological motion . Within the realm of WM, this phenomenon also garners support from both behavioral findings (Brady & Alvarez, 2011;Brady & Alvarez, 2015;Son & Chong, 2023;Son et al., 2020;Utochkin & Brady, 2020) and neural evidence (Oh et al., 2019). ...
... We performed a priori power analysis using G * Power 3.1 (Faul et al., 2007) to estimate the appropriate number of participants. In a previous study (Brady & Alvarez, 2011), which directly compared two conditions similar to our "increase-mean" and "decrease-mean" conditions, Experiment 1; t(20) = 4.17, p = .0004, the effect size Cohen's d was calculated to be 0. 91 ðt= ffiffi ffi n p Þ. Therefore, we anticipated a large effect size (Cohen's d = 0.8) for the effect of paired t tests in our experimental design. ...
Article
Full-text available
Manipulation plays a critical role in working memory, wherein understanding how items are represented during manipulation is a fundamental question. Previous studies on manipulation have primarily assumed independent representations by default (independent hypothesis). Here, we propose the ensemble hypothesis to challenge this conventional notion, suggesting that items are represented as ensembles undergoing updating during manipulation. To test these hypotheses, we focused on working memory updating in accordance with new information by conducting three delayed-estimation tasks under addition, removal, and replacement scenarios (Study 1). A critical manipulation involved systematically manipulating the mean orientation of all memory stimuli, either increasing (clockwise) or decreasing (counterclockwise) after the updating process. Following the independent hypothesis, memory errors would be similar under both conditions. Conversely, considering the biasing effect of the ensemble on individual representations, the ensemble hypothesis predicts that memories of individual items would be updated, aligning with the ensemble’s change direction. Namely, memory errors would be more positive in the increase-mean condition compared to the decrease-mean condition. Our results supported the ensemble hypothesis. Furthermore, to investigate the mechanisms underlying ensemble computations in updating scenarios, we conducted three ensemble tasks (Study 2) with similar designs to Study 1 and developed a computational model to quantify the contributions of each memory item. The results consistently demonstrated that addition involved complete updating, while removal led to incomplete updating. Across these three research parts, we propose that items are represented as dynamic ensembles during working memory updating processes. Furthermore, we elucidate the computational principles underlying ensembles throughout this process.
... Another possible mechanism at play in the current experiments are context-based effects. In line with recent proposals (e.g., Bays et al., 2024;Brady & Alvarez, 2011), it is possible that the reappearing items reactivated the ensemble of all memorized items. Such reactivation may have helped participants to 'fill in' the to-be-probed item. ...
Article
Full-text available
Adaptive behavior necessitates the prioritization of the most relevant information in the environment (external) and in memory (internal). Internal prioritization is known to guide the selection of external sensory input, but the reverse may also be possible: Does the environment guide the prioritization of memorized material? Here, we addressed whether reappearing sensory input could facilitate the prioritization of other nonreappearing memorized items held in visual working memory (VWM). Participants (total n = 96) memorized three orientations. Crucially some, but not all, items maintained in VWM were made available again in the environment. These reappearing items never had to be reproduced later. Experiment 1 showed that the reappearance of all but one memory item benefited accuracy and speed to the same extent as a spatial retro cue. This shows that reappearing items allow for the dynamic prioritization of another nonreappearing memorized item. What aspects of the reappearing sensory input drive this effect? Experiments 2–4 demonstrated that prioritization was facilitated most if reappearing items matched VWM content in terms of both location and orientation. Sensory input fully matching VWM is possibly processed more efficiently and/or protects against interference, ultimately leading to stronger prioritization of other memory content. We propose that the link between sensory processing and VWM is bidirectional: internal representations guide the processing of sensory input, which in turn facilitates the prioritization of other VWM content to subserve adaptive behavior. All data and analysis scripts are available online ( https://osf.io/qzvkc/ ).
... Lee and Schlegelmilch (2025) characterized their rule-based models as more theoretically grounded than our representationfocused approach. Recent advances in human perception (Funamizu et al., 2016;Petzschner et al., 2015;Weiss et al., 2002) and memory (Brady & Alvarez, 2011Brady et al., 2024) research have revealed the dynamic and probabilistic nature of human representational systems, suggesting the need to integrate these insights into generalization theories. These insights have been empirically demonstrated to influence generalization behavior across numerous studies (for a review, see Zaman, Chalkia, et al., 2021). ...
Article
Full-text available
Lee and Schlegelmilch (2025) reanalyzed data from Zaman, Yu, and Verheyen (2023), arguing that the role of perception in generalization is overemphasized and that higher level cognitive processes (in the form of a similarity-based rule) provide a better account. In this reply, we make the argument that their reanalysis contains flaws and inconsistencies and present additional evidence for consideration. While acknowledging the role of higher level cognition in generalization, we maintain that the available evidence supports stimulus perception and memory as important predictors. We view this exchange as a reminder that stimulus representation and higher level cognition exist on a spectrum, with their relative importance varying across contexts. Consequently, more research into the interactions between these processes is necessary. We argue that rather than adhering to existing theoretical frameworks, researchers should continue to challenge our understanding of human generalization by incorporating new evidence and insights into existing models and comparing them across diverse contexts. This approach will foster a more comprehensive and nuanced understanding of human generalization processes.
... Inter-individual and intra-individual differences during encoding, retrieval, or mechanisms acting on these processes may introduce substantial biases in memory that can deviate considerably from the objective representation of stimuli. 61,62 Consequently, the exclusive reliance on perceptual data to determine perceptual distance may fall short in capturing the complexities and variations introduced by memory processes. This situation calls for further investigation and consideration in the study of generalization behaviors. ...
Article
Full-text available
Generalization theories traditionally overlook how our mental representations dynamically change in the process of transferring learned knowledge to new contexts. We integrated perceptual and generalization theories into a computational model using data from 80 participants who underwent Pavlovian fear conditioning experiments. The model analyzed continuous measures of perception and fear generalization to understand their relationship. Our findings revealed large individual variations in perceptual processes that directly influence generalization patterns. By examining how perceptual and generalization mechanisms work together, we uncovered their combined role in producing generalization behavior. This research illuminates the probabilistic perceptual foundations underlying individual differences in generalization, emphasizing the crucial integration between perceptual and generalization processes. Understanding this relationship enhances our knowledge of generalization behavior and has potential implications for various cognitive domains including categorization, motor learning, language processing, and face recognition—all of which rely on generalization as a fundamental cognitive process.
... For example, work has suggested individuals have better WM performance when attempting to encode everything in the display instead of encoding only a subset of the information (Bengson & Luck, 2016;Wang et al., 2020; but see Atkinson et al., 2018;Cusack et al., 2009;Linke et al., 2011). Moreover, WM performance may reflect differences in how individuals make use of ensemble encoding and chunking strategies (Brady & Alvarez, 2011;Haberman et al., 2015;Nassar et al., 2018). Future work is necessary to examine how WM ability relates to a wider range of strategies. ...
Article
Full-text available
Visual working memory is highly limited. Fortunately, we can use various strategies to improve performance. For example, we can make use of grouping cues, such as an illusory shape (Kanizsa figure), to optimize encoding and storage of information. However, it remains unclear how strategy use varies across individuals. In the current study, we examined whether working memory ability is related to individual differences in the use of grouping strategy. We manipulated whether the items formed Kanizsa figures or were randomly oriented in an orientation change detection task. We explicitly informed participants of the manipulation. Moreover, we independently assessed visual working memory capacity in a colour change detection task. In Experiment 1, there was better memory performance for Kanizsa figures versus randomly oriented items. Importantly, we found that individuals with higher working memory capacity showed a larger Kanizsa benefit. In Experiment 2, we still found a memory benefit for Kanizsa figures when we sequentially presented the inducers. This suggests that participants are able to use more abstract rules to strategically encode items. Furthermore, we again found that high working memory capacity corresponds to a larger Kanizsa benefit, although this finding was less robust across multiple analysis methods. Taken together, these findings show that individual differences in working memory ability predict effective use of grouping strategy. Moreover, these individual differences may partly reflect grouping based on high-level knowledge. Supplementary information The online version contains supplementary material available at 10.3758/s13414-025-03013-w.
Article
Ensemble perception enables the visual system to function effectively when the number of stimuli in the environment exceeds its capacity. Ensemble representations not only help the limited capacity of visual representations, but they also facilitate the detection and representation of items deviating from the group (i.e., the outlier). This study focuses on how attentional mechanisms modulate outlier processing. In three experiments, we presented participants with an ensemble that was formed by circle stimuli in varying sizes, and the outlier item was distinct in terms of its location. We measured outlier localization performance while manipulating attentional orienting via a spatial cueing paradigm. In Experiment 1, a valid, invalid, or neutral cue was presented before or after the display. Facilitation of outlier localization was most pronounced in the valid precue condition. Experiment 2 included a task to actively engage ensemble perception in addition to outlier localization, and cue validity effect was observed as in Experiment 1. Experiment 3A was carried to directly compare the top-down and bottom-up influences on outlier processing by presenting two spatial outliers—one target and another distractor outlier. The target outlier identity was previously determined and was identical across trials. In Experiment 3B, the target was in red, making it salient among the remaining items. In the invalid trials, where the distractor outlier was cued, responses were closer to the distractor item indicating that outlier processing is cue driven. These experiments overall demonstrate that automaticity of outlier processing can be overridden by cue-driven processes.
Article
Full-text available
Visual working memory (VWM) is critical for temporarily storing and manipulating visual information but is limited in capacity. Previous research suggests that perceptual organization, such as grouping identical objects, may alleviate VWM resource constraints, yet the underlying mechanisms remain unclear. This study investigated whether identical orientation stimuli reduce VWM resource consumption and whether this reduction occurs during encoding or maintenance phase. Using the contralateral delay activity (CDA), an event-related potential (ERP) component that indexes the number of items stored in VWM, we examined three memory conditions: all-same (i.e., four identical orientations), partial-same (i.e., two pairs of identical orientations), and all-different (i.e., four different orientations) conditions. Behavioral results showed improved VWM performance as the number of identical orientations increased, with the highest performance in the all-same condition. ERP results revealed no significant differences in early CDA amplitudes across conditions, suggesting that identical stimuli do not reduce VWM resource consumption during early maintenance phase. However, late CDA amplitudes were significantly reduced in the all-same condition compared to the partial-same and all-different conditions, indicating that the benefits of identical objects emerge during the late maintenance phase. Notably, partially identical stimuli did not reduce VWM load, highlighting the complexity of integrating partially redundant information. These findings highlight the distinct contributions of early consolidation and maintenance phases in optimizing VWM through perceptual grouping and provide evidence that the facilitative effect of identical objects relies on their complete repetition and late-stage cognitive processes. Supplementary Information The online version contains supplementary material available at 10.1038/s41598-025-00433-4.
Article
Full-text available
In 2 experiments, the authors examined the effects of schemas on the subjective experience of remembering. Participants entered a room that was set up to look like a graduate student's office under intentional or incidental learning conditions. They later took a recognition memory test that included making remember-know judgments. In Experiment 1, they were tested during the same session; in Experiment 2 they were tested either during the same session or after a 48-hr delay. Consistent with the authors' predictions, memory for atypical objects was especially likely to be experienced in the remember sense. In addition, false remember judgments rose dramatically after the 48-hr delay, especially for participants in the incidental learning condition. Results are discussed in terms of schema theory, fuzzy-trace theory, and the distinctiveness heuristic.
Chapter
Full-text available
This chapter reviews research examining visual memory for complex, natural environments. It is divided into two main sections. The first concerns the use of visual memory to construct online representations of natural scenes (i.e., the representation produced as one is actively viewing a scene). The second concerns longer-term scene memory stored after a scene is no longer in view. © 2008 by Steven J. Luck and Andrew Hollingworth. All rights reserved.
Article
One set of properties that may emerge when attention is distributed over an array of similar items is the general statistics of the set. We investigated mean size perception using three different methods: a change detection "flicker" task, implicit priming, and a dual task paradigm. In the change detection task, participants discriminated a change of size from a change of location of an array of circles of different sizes. The changes either did or did not result in a mean change of size. Performance was more efficient when the mean size changed than when it did not, suggesting that the mean is perceptually represented. In the implicit priming task, we used a same-different size judgment on two circles, preceded by a prime display containing 12 circles of two different sizes. One or both of the target circles could match either the mean size (that was never presented), or one of the two primed sizes, or a size that was 20% larger or smaller than a presented size. The priming benefit was as large when the targets matched the mean size of the prime display as when they matched a size that was actually presented. To manipulate the deployment of attention, we used a dual task paradigm in which the secondary task required either global or focused attention. Thresholds for judging the mean size of circles in the array were lower when the concurrent task required global compared to local attention, even when the secondary tasks were closely matched in difficulty. The results support the proposal that we preserve statistical information from sets of similar objects rather than representing all the detailed information and that this information is best extracted when attention is globally deployed. Means of sets may be one ingredient of a schematic representation that could explain our coherent perception of a visual scene.
Chapter
ABSTRACT Studies in scene ,perception ,have ,shown ,that observers,recognize ,a real-world ,scene ,at a ,single glance. During this expeditious process of seeing, the visual system ,forms ,a spatial ,representation ,of the outside world,that is rich enough,to grasp the meaning of the scene, recognizing a few objects and other salient information in the image, to facilitate object detection and the deployment ,of attention. This representation refers to the gist of a scene, which includes all levels of processing, from low-level features (e.g., color, spatial frequencies) to intermediate image properties (e.g., surface, volume) and high-level information (e.g., objects, activation of semantic knowledge). Therefore, gist can be studied,at both,perceptual,and conceptual levels. I. WHAT IS THE “GIST OF A SCENE”?
Article
There has been a recent surge in the study of ensemble coding, the phenomenon in which the visual system represents a set of similar objects using summary statistics (Alvarez & Oliva, 2008; Ariely, 2001; Chong & Triesman, 2003; Haberman & Whitney, 2007; 2008). These studies have revealed that humans are sensitive to scene statistics across multiple levels of visual analysis, including high-level objects such as faces. However, the limits of ensemble coding remain undefined. Here we use the method-of adjustment to explore this issue, allowing observers to adjust a test face to match the mean expression of a set of faces. In Experiment 1, observers viewed sets of 12 faces that contained 2 emotional outliers (i.e. expressions that deviated substantially from the mean expression of the other 10 faces) for 250 ms. When adjusting the subsequent test face to the mean expression, all observers preferentially adjusted to the local mean of the set (mean expression with outliers excluded) rather than the global mean (mean expression with outliers included). In Experiment 2, we modeled the performance of an observer who implemented a ‘sampling’ strategy; that is, an observer who picked N faces from the set and perfectly averaged them. The sampling hypothesis was first put forth by Myczek and Simons (2008) as a potentially more parsimonious explanation for ensemble coding. However, our sets contained outliers, the presence of which would be a detriment to a model that sampled. The results show that a sampling model could not explain human performance. Thus, observers effectively filtered the outliers, extracting the local mean from the set in only 250 ms.
Book
In 1932, Cambridge University Press published Remembering, by psychologist, Frederic Bartlett. The landmark book described fascinating studies of memory and presented the theory of schema which informs much of cognitive science and psychology today. In Bartlett's most famous experiment, he had subjects read a Native American story about ghosts and had them retell the tale later. Because their background was so different from the cultural context of the story, the subjects changed details in the story that they could not understand. Based on observations like these, Bartlett developed his claim that memory is a process of reconstruction, and that this construction is in important ways a social act. His concerns about the social psychology of memory and the cultural context of remembering were long neglected but are finding an interested and responsive audience today. Now reissued in paperback, Remembering has a new Introduction by Walter Kintsch of the University of Colorado, Boulder.
Article
It is widely accepted that our visual systems are tuned to the statistics of input from the natural world, which suggests that our visual short-term memory may also take advantage of statistical regularities through efficient coding schemes. Previous work on VSTM capacity has typically used patches of color or simple features which are drawn from a uniform distribution, and estimated the capacity of VSTM for simple color patches to be ∼ 4 items (Luck & Vogel, 1997), and even fewer for more complex objects (Alvarez & Cavanagh, 2004). Here, we introduce covariance information between colors, and ask if VSTM can take advantage of the shared statistics to form a more efficient representation of the displays. We presented observers with displays of eight objects, presented in pairs around the fixation point, and then probed a single object in an eight-alternative forced-choice test. The displays were constructed so that each of the eight possible colors appeared in every display, but the color they were next to was not random - each color had a high probability pair (e.g. red appeared with green 80% of time). In information theoretic terms, the displays with statistical regularities have lower entropy compared with uniform displays, and thus require less information to encode. We found that observers could successfully remember 5.5 colors on these displays, significantly higher than the 3.5 colors remembered when the displays were changed to be uniformly distributed in the last block. These results show that capacity estimates, measured in number of objects, actually increased when the displays had some statistical regularities, and that VSTM capacity is not a fixed number of items. We suggest that quantifying capacity in number of objects fails to capture factors such as object complexity or statistical information, and that information theoretic measures are better suited to characterizing VSTM.
Article
After an initial consideration of psychological experimentation, the author describes a long series of experiments in the fields of perception, imagination, and remembering, using material which approximated that found in everyday life. The work on perceiving utilized chiefly geometrical diagrams; and that on imagination, ink-blots. The results in these two cases revealed the influence of the subjects' attitudes and indicated their tendency to introduce previously learned material. In the experiments on remembering two methods were used, one the method of repeated reproduction by a given subject and the other the method of serial reproduction where the material reproduced by one subject became the learning material for a second subject whose recall constituted the learning material for a third subject, etc. This latter series of experiments showed that proper names and titles are very unstable in recall, that there is a bias toward the concrete, that individualizing aspects of the material (stories) tend to be lost, and that abbreviations and rationalizations occur. Throughout the book emphasis is placed on the social determinants of the manner and matter of recall, a point of view which is supported in the anthropological material cited. "Remembering is not the re-excitation of innumerable fixed, lifeless and fragmentary traces. It is an imaginative reconstruction, or construction, built out of the relation of our attitude towards a whole active mass of organized past reactions or experience, and to a little outstanding detail which commonly appears in image or in language form." (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Article
An apple can be held in your hand if you are eating it, or be perceived in a fruit bowl from a few meters away. While the apple's real-world size is constant, the visual angle it subtends in your visual field varies based on your distance to the apple. Given the range of visual angles that an object can subtend in the visual field, is there a visual angle that is preferred? The current experiments show evidence supporting the idea of a privileged visual size for object representation, termed the normative size. In Experiment 1, observers adjusted the visual size of objects on a monitor, selecting the subjectively 'right' size to see the object. Inter-observer selections were strikingly consistent, and showed a correlation between the size selected on the screen and the actual physical size of the objects in the real world. These size selections were taken as estimates of the normative size for each object. In Experiments 2 and 3, using a size memory task and a change detection task, we found evidence that both long-term and short-term memory for the visual size of objects is biased towards the normative size. Altogether the results support the claims that perception of objects is sensitive to a normative size and that object memory is biased toward this perceptual norm.