ArticlePDF Available

Multidimensional letter similarity: A reply to Mewhort and Dow

Authors:
Perception &Psychophysics
1979, Vol. 26 (6), 501-502
Notes
and Comment
Multidimensional letter similarity:
Areply to Mewhort and
Dow
GROVER C. GILMORE
Case Western Reserve University
Cleveland, Ohio 44106
and
HARRY
HERSH
Digital Equipment Corporation
Maynard, Massachusetts 01754
Gilmore, Hersh, Caramazza, and Griffin
(1979)
reported aconfusion matrix for uppercase letters pre-
sented on the screen of a cathode-ray tube. Mewhort
and Dow (1979) recently have questioned the validity
of this matrix and the indices of letter similarity de-
rived from it. They apparently raised their objection
because of the low correlation between the diagonal
entries of the matrices reported by Gilmore et al.
(1979)
and Townsend (1971), who generated a con-
fusion matrix using a conventional tachistoscope.
There are several reasons why this low correlation
could occur. The most obvious explanation, which
the authors mention and dismiss, is the difference in
type fonts used in the studies. Another possible
source of the difference is that the values reported
in one of the studies may be unreliable or confounded
by another variable. Since the confusion matrix
reported by Gilmore et al. (1979) was based on eight
times as many stimulus trials as that reported by
Townsend (1971), one might question the reliability
of Townsend's matrix. However, Mewhort and Dow
(1979)
choose to argue that the method used by
Gilmore et al. (1979) in presenting the stimuli is the
source of the difference and, furthermore, that this
confounding factor leaves the value of the confusion
matrix questionable.
The argument presented by Mewhort and Dow
(1979)
to support this conclusion seems to be a con-
vincing one. The weakness of the presentation, how-
ever, is that it is based on inferences about the
stimulus presentation method and not facts. The
authors infer that a low-intensity level was employed,
since Gilmore et al.
(1979)
were able to achieve a
51010
level of accuracy with a mean stimulus duration
of 33 msec. This first inference is, indeed, correct.
We regret that the stimulus luminance level was not
reported, but we were not able to obtain an instru-
ment which could reliably measure the luminance.
Reporting the luminance level, however, would not
have forestalled the major objections raised by
Mewhort and Dow (1979). First of all, they question
the stability of the equipment for displaying dots at
a low-intensity level. Rather than rely on speculation,
we checked this point with the manufacturer
of
the
equipment, Digitial Equipment Corporation,
and
were assured that there is no reason to suspect a
stability problem (Rupp, Note 1). The second objec-
tion raised by Mewhort and Dow
(1979)
concerns the
dot-brightening algorithm used in generating the
letters. They contend that it may have been very slow
and that letters containing few dots may have been
brighter than those composed of more dots. The
algorithm employed has been described by Schwartz
(1978) and is not considered slow, since a dot is
refreshed approximately every 25
usee.
Furthermore,
while the displayed dots were continuously refreshed,
possible brightness differences among letters com-
posed of an unequal number of dots were controlled
for by presenting an appropriate number
of
extra
dots at a non visible portion
of
the screen. That is
to say, the same number
of
dots were generated
and brightened regardless of the letter displayed.
This should have been an effective brightness control.
Indeed, when our subjects were questioned, none of
them reported noting any brightness differences
among the letters. The subjects did note that some
letters, such as I and T, were easier to identify
than others, but they attributed this to the fact that
there were few letters shaped as these letters.
Since the brightness
of
the letters was controlled,
how can one then account for the high correlation
(r = - .873) between the number of dots in a letter
and its identification accuracy? One reasonable inter-
pretation is that the letters with few dots are simple
figures composed
of
relatively few features. In a
limited-capacity system, simple letters would have an
advantage over more complex ones at an encoding
level. Another interpretation, suggested by our sub-
jects, is that the simpler letters are relatively unique
in their shape, thereby placing lessprocessingdemands
on the observer. Based on an inspection of the letters
used by Gilmore et al. (1979, Figure 1), one can see
that the set of letters potentially confusable with the
letter L is much smaller than the set potentially con-
fusable with the letter M.
This conclusion is supported by a recent study by
Podgorny and Garner (1979). In an investigation
of
inter- and intraletter similarity, they have reported
that in a "same-different" task there are significant
differences in
"same"
reaction times among letters.
For instance, it takes much longer for a subject to
respond
"same"
to a pair
of
Ms than to a pair
Copyright 1979 Psychonomic Society, Inc. 501 0031-5117/79/120501-02$00.45/0
502
GILMORE
AND
HERSH
of
Ls. Podgorny and Garner (1979) also have inter-
preted these differences as due to either more ef-
ficient encoding
of
certain letters, such as L, or a low
level
of
confusability
of
these letters with the other
letters
of
the alphabet.
The major point raised by Mewhort and Dow
(1979) was that the confusion matrix reported by
Gilmore et al. (1979) lacks validity because
of
a con-
founding
of
letter structure with brightness. Yet,
as we have stated, the procedures and equipment
used in generating the
letters
controlled for potentially
confounding brightness differences among the letters.
We must conclude therefore that the differences in
level
of
identification accuracy among the letters
reliably reflects true processing differences and that
the confusion matrix can be used by investigators
as a reliable and valid instrument.
REFERENCE
NOTE
I.
Rupp, C. Personal communication, August 1979.
REFERENCES
GILMORE,
G.
C.,
HERSH,
H.,
CARAMAZZA,
A., &
GRIFFIN,
J.
Multidimensional letter similarity derived from recognition
errors. Perception &Psychophysics, 1979,25,425-431.
MEWHORT,
D. J. K., &Dow,
M.
L. Multidimensional letter simi-
larity: A
confound
with brightness? Perception &Psycho-
physics, 1979, 26, 325-326.
PODGORNY,
P.,
&
GARNER,
W. R. Reaction time as a measure
of inter- and intraobject similarity: Letters of the alphabet.
Perception &Psychophysics, 1979,26,37-52.
SCHWARTZ,
S.
P.
Tachistoscope simulation package. Behavior
Research Methods &Instrumentation, 1978, 10,773-778.
TOWNSEND,
J. T. Theoretical analysis of an alphabetic confusion
matrix. Perception &Psychophysics, 1971, 2,45-53.
(Received and accepted for publication October 15, 1979.)
Article
Full-text available
Reviews studies in which a single letter was visually presented under adverse conditions and the S's task was to identify the letter. The typical results for such studies were that (a) certain pairs of letters were more often confused than other pairs of letters; (b) certain letters were more easily recognized than others; and (c) confusion errors for a letter pair were often asymmetric, the number of errors differing depending on which letter of the pair was presented as the stimulus. A geometric model incorporating the properties of distance and spatial density is presented to account for these results. The present application of the distance–density model assumes that each letter is constructed in a typical 5 × 7 dot matrix. According to the model, (a) pairs of letters that are in close geometric space are more often confused than pairs of letters that are distant; (b) letters that are in less spatially dense regions are more easily recognized than letters that are in more spatially dense regions; and (c) asymmetric confusion errors result when one member of a letter pair is in a denser region than the other member of the letter pair. (59 ref)
Article
Full-text available
This study examined the ability of an asymmetric multidimensional scaling program (DEDICOM) to reveal information about letter-perception processes. To demonstrate its potential, we applied it to the controversy concerning local-to-global versus global-to-local letter perception. These two theories lead to different predictions about stimulus confusion asymmetries. Since DEDICOM is capable of recovering the structure of asymmetric or directional patterns, it should reveal whether a stimulus-response confusion matrix contains patterns of asymmetry more consistent with one or the other perceptual theory. This was tested using two data sets. The first (from Lupker, 1979) revealed an additive hierarchy of asymmetry strongly consistent with global-tolocal processing, although unexpected additional structure and reliable anomalies indicated the need for a more refined theoretical account. The second (a full alphabetic confusion matrix combining data from Gilmore et al., 1979; Loomis, 1982; and Towasend, 1971) revealed five distinct patterns, each consisting of transformations attributable to the failure to detect specific local letter features. This solution strengthened support for local-to-global processing, in sharp contrast to the first analysis. Possible reasons for this divergence are discussed, including differences in the stimuli, exposure durations, and a hypothetical two-stage process of perception. Despite their differences, both solutions demonstrated how asymmetric scaling can reveal structure in asymmetries, which is relevant to perceptual theory and which would have been difficult to recover by other means.
Article
Full-text available
A study was undertaken to acquire a confusion matrix of the entire upper-case English alphabet with a simple nonserifed font under tachistoscopic conditions. This was accomplished with two experimental conditions, one with blank poststimulus field and one with noisy poststimulus field, for six Ss run 650 trials each. Three mathematical models of recognition, two based on the concept of a finite number of sensory states and one being the choice model, were compared in their ability to predict the confusion matrix after their parameters were estimated from functions of the data. In order to ascertain the facility with which estimates of similarity among the letters could lead to a psychological space containing the letters, ηij, the similarity parameter of the choice model was input to an ordinally based multidimensional scaling program. Finally, correlation coefficients were computed among parameters of the models, the scaled space, and a crude measure of physical similarity. Briefly, the results were: (1) the finite-state model that assumed stimulus similarity (the overlap activation model) and the choice model predicted the confusion-matrix entries about equally well in terms of a sum-of-squared deviations criterion and better than the all-or-none activation model, which assumed only a perfect perception or random-guessing state following a stimulus presentation; (2) the parts of the confusion matrix that fit best varied with the particular model, and this finding was related to the models; (3) the best scaling result in terms of a goodness-of-fit measure was obtained with the blank poststimulus field condition, with a technique allowing different distances for tied similarity values, and with the Euclidean as opposed to the city-block metric; and (4) there was agreement among the models in terms of the way in which the models reflected sensory and response bias structure in the data, and in the way in which a single model measured these attributes across experimental conditions, as well as agreement among similarity ami distance measures with physical Similarity.
Article
Full-text available
In order to provide a reliable measure of the similarity of uppercase English letters, a confusion matrix based on 1,200 presentations of each letter was established. To facilitate an analysis of the perceived structural characteristics, the confusion matrix was decomposed according to Luce’s choice model into a symmetrical similarity matrix and a response bias vector. The underlying structure of the similarity matrix was assessed with both a hierarchical clustering and a multidimensional scaling procedure. This data is offered to investigators of visual information processing as a valuable tool for controlling not only the overall similarity of the letters in a study, but also their similarity on individual feature dimensions.
Article
Interobject visual similarity is often measured by the time that subjects require to say that two objects are not identical, with long RTs indicating high similarity. In Experiment 1, using a complete set of uppercase alphabetic stimuli, we show: (1) RTs correlate strongly with direct ratings of visual similarity. (2) Stimuli that are similar as measured by RT or by direct judgment have the same profile of similarity to all other stimuli in the set as measured by correlational analysis. (3) The order in which the stimuli are compared has only a small effect on measured similarity. All of these results indicate desirable properties of any measure of similarity.Intraobject similarity is a concept pertinent to the relation of an object to itself. In Experiments 1 and 2 we show: (1) All letters are not equally similar to themselves, since RT for the “same” response is different for different letters. (2) The relation between RT and intraobject similarity is opposite to that for interobject similarity, with short RTs indicating high intraobject similarity. (3) Even though intraobject similarities differ, each letter is more similar to itself than to any other letter in the set. We discuss the implications of these results in terms of assumed proximity constraints underlying similarity data.
Article
A system of FORTRAN-callable subroutines is described that enable a PDP-11/20 computer to function as a multichannel tachistoscope. Features of the system include multiple display buffering, dynamic buffer allocation, control of character size and density, provisions for user-generated characters and symbols, and routines for recording of both vocal and buttonpress responses.