ArticlePDF Available

Psychophysical and Physiological Evidence for a Precedence Effect in the Median Sagittal Plane

Authors:

Abstract and Figures

A listener in a room is exposed to multiple versions of any acoustical event, coming from many different directions in space. The precedence effect is thought to discount the reflected sounds in the computation of location, so that a listener perceives the source near its true location. According to most auditory theories, the precedence effect is mediated by binaural differences. This report presents evidence that the precedence effect operates in the median sagittal plane, where binaural differences are virtually absent and where spectral cues provide information regarding the location of sounds. Parallel studies were conducted in psychophysics by measuring human listeners' performance, and in neurophysiology by measuring responses of single neurons in the inferior colliculus of cats. In both experiments the precedence effect was found to operate similarly in the azimuthal and sagittal planes. It is concluded that precedence is mediated by binaurally based and spectrally based localization cues in the azimuthal and sagittal planes, respectively. Thus, models that attribute the precedence effect entirely to processes that involve binaural differences are no longer viable.
Content may be subject to copyright.
77:2223-2226, 1997. J Neurophysiol
Ruth Y. Litovsky, Brad Rakerd, Tom C. T. Yin and William M. Hartmann
You might find this additional information useful...
18 articles, 3 of which you can access free at: This article cites http://jn.physiology.org/cgi/content/full/77/4/2223#BIBL
4 other HighWire hosted articles: This article has been cited by
[PDF] [Full Text] [Abstract] , September 1, 1998; 80 (3): 1285-1301. J Neurophysiol
R. Y. Litovsky and T. C. T. Yin I. Correlates of Psychophysics
Physiological Studies of the Precedence Effect in the Inferior Colliculus of the Cat.
[PDF] [Full Text] [Abstract] , September 1, 2001; 86 (3): 1333-1350. J Neurophysiol
B. J. Mickey and J. C. Middlebrooks
Localization
Responses of Auditory Cortical Neurons to Pairs of Sounds: Correlates of Fusion and
[PDF] [Full Text] [Abstract] , October 1, 2003; 90 (4): 2149-2162. J Neurophysiol
D. J. Tollin and T. C.T. Yin Psychophysical Investigation of an Auditory Spatial Illusion in Cats: The Precedence Effect
[PDF] [Abstract] , November 1, 2008; 70 (8): 1558-1570. Atten Percept Psychophys
L. D. SANDERS, A. S. JOH, R. E. KEEN and R. L. FREYMAN One sound or two? Object-related negativity indexes echo perception
on the following topics: http://highwire.stanford.edu/lists/artbytopic.dtlcan be found at Medline items on this article's topics
Physiology .. Humans
Physiology .. Cats
Veterinary Science .. Inferior Colliculus
including high-resolution figures, can be found at: Updated information and services http://jn.physiology.org/cgi/content/full/77/4/2223
can be found at: Journal of Neurophysiologyabout Additional material and information http://www.the-aps.org/publications/jn
This information is current as of January 28, 2009 .
http://www.the-aps.org/.Physiological Society. ISSN: 0022-3077, ESSN: 1522-1598. Visit our website at
by the American Physiological Society, 9650 Rockville Pike, Bethesda MD 20814-3991. Copyright © 2005 by the American
publishes original articles on the function of the nervous system. It is published 12 times a year (monthly)Journal of Neurophysiology
on January 28, 2009 jn.physiology.orgDownloaded from
RAPID COMMUNICATION
Psychophysical and Physiological Evidence for a Precedence Effect in
the Median Sagittal Plane
RUTH Y. LITOVSKY,
1
BRAD RAKERD,
2
TOM C. T. YIN,
1
AND WILLIAM M. HARTMANN
3
1
Department of Neurophysiology, University of Wisconsin-Madison, Madison, Wisconsin 53706; and
2
Department of
Audiology and Speech Sciences and
3
Department of Physics and Astronomy, Michigan State University, East Lansing,
Michigan 48824
Litovsky, Ruth Y., Brad Rakerd, Tom C. T. Yin, and William
is symmetrical with respect to the two ears. Although micro-
M. Hartmann. Psychophysical and physiological evidence for a
phone measurements normally find some binaural differ-
precedence effect in the median sagittal plane. J. Neurophysiol.
ences for sources in the sagittal plane (Searle et al. 1975;
77: 2223–2226, 1997. A listener in a room is exposed to multiple
Wightman and Kistler 1989), the preponderance of evidence
versions of any acoustical event, coming from many different direc-
shows that these differences are not reliable enough to serve
tions in space. The precedence effect is thought to discount the
as localization cues (Asano et al. 1990; Hebrank and Wright
reflected sounds in the computation of location, so that a listener
1974; Middlebrooks and Green 1991). Instead, sources in
perceives the source near its true location. According to most audi-
the median sagittal plane are localized on the basis of spectral
tory theories, the precedence effect is mediated by binaural differ-
shape cues, peaks and valleys introduced by direction-depen-
ences. This report presents evidence that the precedence effect
operates in the median sagittal plane, where binaural differences
dent filtering performed by the external ears, head, and torso
are virtually absent and where spectral cues provide information
(Blauert 1983; Gardner and Gardner 1973; Roffler and But-
regarding the location of sounds. Parallel studies were conducted
ler 1968).
in psychophysics by measuring human listeners’ performance, and
The present study is a search for a precedence effect in the
in neurophysiology by measuring responses of single neurons in
median sagittal plane, a precedence effect that is mediated by
the inferior colliculus of cats. In both experiments the precedence
spectral shape cues. Early evidence that such an effect likely
effect was found to operate similarly in the azimuthal and sagittal
exists was found in a front-back competition experiment by
planes. It is concluded that precedence is mediated by binaurally
Blauert (1971). The present search was conducted on two
based and spectrally based localization cues in the azimuthal and
levels: one psychophysical in humans, the other physiologi-
sagittal planes, respectively. Thus,models that attribute the prece-
cal in cats.
dence effect entirely to processes that involve binaural differences
are no longer viable.
METHODS
Psychophysics
INTRODUCTION
The psychophysical experiments were performed in a 32-m
3
A room plays havoc with sound. The waves emitted by
anechoic room with the use of loudspeakers to simulate direct
a source are reflected and rereflected many times by the
sounds and reflections. There were five matched speakers: directly
room surfaces. Therefore a listener in a room is exposed to
in front at 0
7
, behind, 90
7
to the left, 90
7
to the right, and overhead.
multiple versions of any acoustical event, coming from many
On each experimental trial there were eight pairs of leading and
lagging clicks (each 0.025 ms in duration) repeated every 110 ms.
different directions in space. The auditory system can cope
Such a click train allows the precedence effect to build to a maxi-
with this sonic clutter because of the precedence effect, a
mum (Freyman et al. 1991). Trials were presented in blocks con-
remarkable neural process that fuses the direct sound and
sisting of trials in the azimuthal plane (left, front, and right sources )
its reflections into a single image (Haas 1951; Wallach et
or the sagittal plane (front, overhead, and behind sources). All
al. 1949). The precedence effect also discounts the reflected
permutations of leading source location, lagging source location,
sound in the computation of location so that a listener per-
and eight values of interclick delay (ICD ) ranging from 0 to 10
ceives the source near its true location. The standard theoreti-
ms were tested. The experiments measured localization in a compe-
cal model for this effect is an extension of the binaural model
tition experiment, so that after the stimulus was presented the sub-
for localization, a neural coincidence detector that operates
ject had to decide which of the three loudspeakers in the plane
on the difference in arrival time of signals at left and right
was closest to the location of the sound image. Eight subjects each
completed a total of 10 blocks for each plane, with blocks for the
ears (Jeffress 1948). The extension postulates an inhibitory
different planes randomly interspersed.
response generated by the leading sound (Franssen 1963;
Lindemann 1986). Physiological evidence for the coinci-
dence detector has been found in the medial superior olive Physiology
(Goldberg and Brown 1969), and psychophysical evidence
Physiological experiments paralleled the psychophysical ones by
for the extension to precedence has been found in headphone
using click sources in a free-field, anechoic chamber and comparing
experiments (Zurek 1980).
azimuthal and sagittal plane responses. Extracellular recordings
The binaural difference model successfully describes
were made in 38 neurons (characteristic frequencies ranging from
many aspects of localization, but it does not account for the
500 to 24,000 Hz) in the central nucleus of the inferior colliculus
(ICC) of barbiturate-anesthetized cats. The animal’s head was in
localization of sources in the median sagittal plane, which
22230022-3077/97 $5.00 Copyright
q
1997 The American Physiological Society
J561-6RC/ 9k0f$$ap24 08-27-97 15:17:05 neupal LP-Neurophys
on January 28, 2009 jn.physiology.orgDownloaded from
R. Y. LITOVSKY, B. RAKERD, T.C.T. YIN, AND W. M. HARTMANN2224
the center of semicircular arrays (radius of 1.2 m) of loudspeakers
Physiology
positioned along the azimuthal and sagittal planes at 15
7
intervals.
Positive angles refer to sounds in the contralateral hemifield in
In the physiological experiments for each neuron, we first
azimuth and above the interaural line in elevation. The precedence
measured the receptive field properties with the use of single
effect was probed by delivering clicks from two different speakers
clicks (0.1 ms in duration) at a level
Ç
10–15 dB above
with varying ICDs, with 50 trials for every condition. We measured
threshold (Fig. 2A). To compare the degree of suppression
precedence by the degree of suppression of the response to the
for stimuli on the azimuthal and sagittal planes, we always
lagging click as a function of the presence of a leading click at
chose the speaker directly in front, which lies at the intersec-
different ICDs.
tion of the two planes, to be the lagging source; and to
control for possible influences of response rate, we chose
RESULTS
the locations of the two leading sources such that the re-
sponses to a single click were approximately equal. In Fig.
Psychophysics 2 the leading sources were placed at
/
90
7
azimuth and
/
75
7
elevation. The dot rasters in Fig. 2Bshow that, for leadingFigure 1 shows the percentage of responses that matched
the leading click location at each ICD. If the precedence clicks in the azimuthal plane, at long ICDs (
ú
40 ms), there
is a response at a latency of
Ç
16 ms to the leading clickeffect is operating, this percentage will be high. Open sym-
bols show the results in the azimuthal plane, where binaural and a later response at about the same latency following the
lagging click. The response to the lagging click graduallydifference cues are present. These results are in good agree-
ment with previous studies of precedence (Yost and Soder- diminishes as the ICD is shortened and disappears for ICDs
õ
31 ms. Figure 2Cshows nearly identical behavior whenquist 1984; Zurek 1980) . When ICDs were
õ
1 ms, the
precedence effectwas incomplete;leading andlagging clicks the leading click was in the sagittal plane. Figure 2Dshows
little difference in the suppression of the response to theboth affected the perceived location to some degree, an effect
that has been called summing localization (Warncke 1941). lagging click in the two different planes when plotted as
normalized recovery curves.At ICDs of 1.0 and 2.0 ms, precedence with binaural differ-
ences was maximal, although still not entirely complete; To quantify the extent of suppression, from the recovery
curves we measured the ICD at which the lagging responsesubjects chose the leading click location on as many as 95%
of all trials. As the ICD increased to 5.0 ms, the lagging was suppressed by 50%. The similarity in the values of half-
maximal suppression shown in Fig. 2Dis typical of thatclick began to be audible, and it was chosen as the location
of the sound source on some trials, indicating that the prece- observed in most cells in thepopulation, although the overall
shapes of the recovery curves in Fig. 2Dare more similardence effect became weaker. The filled symbols in Fig. 1
show the results for the sagittal plane experiment. They show than usually seen. Figure 3 shows a scatter plot of the ICDs
for half-maximal suppression along the azimuthal and sagit-that a spectrally mediated precedence effect exists, and that
its dependence on ICD is comparable with that of the binau- tal planes for 38 cells. There are two striking features of the
data shown in Fig. 3. First, there is considerable variabilityral differenceprecedence effect. The only appreciable differ-
ence is that the spectrally mediated effect was somewhat in the degree of suppression between different cells in the
ICC, ranging from
Ç
0to
ú
100 ms. Such variability hasweaker at its maximum.
FIG
. 1. Mean responses for 8 subjects. For both source
planes, the plot shows the percentage of trials in which the
sound image appeared at the position of the leading source.
High percentages indicate a strong precedence effect. Similar
functions of the interclick delay (ICD) occur for both planes,
although the precedence effect is stronger when binaural dif-
ferences are present (open symbols).
J561-6RC/ 9k0f$$ap24 08-27-97 15:17:05 neupal LP-Neurophys
on January 28, 2009 jn.physiology.orgDownloaded from
PRECEDENCE IN THE SAGITTAL PLANE 2225
FIG
. 2. Physiological evidence for precedence in the azimuthal and sagittal planes for 1 neuron. A: azimuthal and sagittal
rate functions. At each location, 50 repetitions of clicks were presented with a period of 300 ms. The number of spikes per
stimulus is plotted against azimuthal and sagittal locations. Arrows: leading source location for Band C. B and C: dot rasters
showing the responses of the same neuron at ICDs ranging from 1 ms ( bottom) to 101 ms (top ) along the azimuthal (left)
and sagittal (right) planes. D: recovery functions for the lagging responses shown in Band C, normalized by the response
to the same stimulus in absence of the leading stimulus.
previously beenseen in theICC under other conditions (Car- ally. Although the dominance of the leading source in local-
ney and Yin 1989; Yin 1994). Second, for each cell there ization is maximal
Ç
2 ms, suppression of the lag as an
is a strong correlation (r
Å
0.8) between its suppressive independent auditory event can extend to 50 ms, as in echo
effect in the two planes. Thus for any given cell in the ICC suppression in concert halls (Kuttruff 1979). Localization
the degrees of suppression of the lagging click to a leading dominance of the lead was the only behavioral measure
click in the sagittal and azimuthal planes are comparable. probed by the psychophysical experiments above.
In summary, the experiments reported here show that the
precedence effect operates similarly in the azimuthal and
DISCUSSION
sagittal planes. Because the degree of suppression varies
The variability in suppression delays mirrors the range of with changes in relative location of the leading and lagging
sources in both planes (Litovsky and Yin 1994), precedencedelays at which the precedence effect is observed behavior-
J561-6RC/ 9k0f$$ap24 08-27-97 15:17:05 neupal LP-Neurophys
on January 28, 2009 jn.physiology.orgDownloaded from
R. Y. LITOVSKY, B. RAKERD, T.C.T. YIN, AND W. M. HARTMANN2226
FIG
. 3. Correlation of 50% maximal response along the
azimuthal vs. sagittal planes. Each asterisk represents the
values for 1 neuron.
superior olivary complex to dichotic tonal stimuli: some physiological
is mediated by binaurally based and spectrally based local-
mechanisms of sound localization. J. Neurophysiol. 32: 613 636, 1969.
ization cues, in the azimuthal and sagittal planes, respec-
F
RANSSEN
,N.V.Stereophony. Eindhoven, The Netherlands: Philips Techni-
tively. Therefore models that attribute the precedence effect
cal Library, 1963. (English translation 1964.)
entirely to processes that involve binaural differences are no
F
REYMAN
, R. L., C
LIFTON
, R. K.,
AND
L
ITOVSKY
, R. Y. Dynamic processes
longer viable. However, precedence is not identical in all
in the precedence effect. J. Acoust. Soc. Am. 90: 874–884, 1991.
planes; the psychophysical localization experiments show
H
AAS
, H. On the influence of a single echo on the intelligibility of speech.
that it is stronger in the azimuthal plane. This difference
Acustica 1: 49–58, 1951.
might be related to the fact that neurons in the ICC show
H
EBRANK
,J.H.
AND
W
RIGHT
, D. Spectral cues used in the localization of
stronger firing rate variation with source position in the azi-
sound sources on the median plane. J. Acoust. Soc. Am. 56: 1829 1834,
1974.
muthal plane compared with the sagittal plane. Despite the
J
EFFRESS
, L. A. A place theory of sound localization. J. Comp. Physiol.
quantitative differences, precedence appears to operate simi-
Psychol. 61: 468–486, 1948.
larly whether the localization cues are binaural or spectral.
K
UTTRUFF
,H.Room Acoustics (2nd ed.) . London: Applied Science Publish-
ers, 1979.
L
INDEMANN
, W. Extension of a binaural cross-correlation model by contra-
This research was supported by National Institute of Deafness and Other lateral inhibition. I. Simulation of lateralization for stationary signals. J.
Communications Disorders Grant DC-00181 to Michigan State University Acoust. Soc. Am. 80: 1608–1622, 1986.
and Grant DC-00116 to the University of Wisconsin. L
ITOVSKY
,R.Y.
AND
Y
IN
, T.C.T. Physiological correlates of the precedence
Present addresses of R. Litovsky: Dept. of Biomedical Engineering, Bos- effect: free-field recordings in the inferior colliculus of the cat (Abstract).
ton University, 44 Cummington St., Boston, 02215; and Eaton Peabody Assoc. Res. Otolaryngol. 85, 1994.
Laboratory, Massachusetts Eye and Ear Infirmary, Boston, MA 02215. M
IDDLEBROOKS
,J.C.
AND
G
REEN
, D. M. Sound localization by human
Address for reprint requests: R. Litovsky, Dept. of Biomedical Engi- listeners. Annu. Rev. Psychol. 42: 135–159, 1991.
neering, Boston University, 44 Cummington St., Boston, MA 02215. R
OFFLER
,S.K.
AND
B
UTLER
, R. A. Factors that influence the localization
of sounds in the equatorial plane. J. Acoust. Soc. Am. 43: 1255 1259,
Received 16 July 1996; accepted in final form 5 December 1996. 1968.
S
EARLE
, C. L., B
RAIDA
, L. D., C
UDDY
, D. R.,
AND
D
AVIS
, M. F. Binaural
pinna disparity: another auditory localization cue. J. Acoust. Soc. Am.
REFERENCES 57: 448–455, 1975.
W
ALLACH
, H., N
EWMAN
, E. B.,
AND
R
OSENZWEIG
, M. R. The precedence
A
SANO
, F., S
UZUKI
, Y.,
AND
S
ONE
, T. Role of spectral cues in median plane effect in sound localization. Am. J. Psychol. 57: 315–336, 1949.
localization. J. Acoust. Soc. Am. 88: 159–168, 1990. W
ARNCKE
, H. The fundamentals of room-related stereophonic reproduction
B
LAUERT
, J. Localization and the law of the first wavefront in the median in sound films. Akust. Z. 6: 174–188, 1941.
plane. J. Acoust. Soc. Am. 50: 466–470, 1971. W
IGHTMAN
,F.L.
AND
K
ISTLER
, D. J. Headphone simulation of free-field
B
LAUERT
,J.Spatial Hearing. Cambridge, MA: MIT Press, 1983. listening. II. Psychophysical validation. J. Acoust. Soc. Am. 85: 868
C
ARNEY
,L.H.
AND
Y
IN
, T.C.T. Responses of low-frequency cells in the 878, 1989.
inferior colliculus to interaural time differences of clicks: excitatory and Y
IN
, T.C.T. Physiological correlates of the precedence effect and summing
inhibitory components. J. Neurophysiol. 62: 144–161, 1989. localization. J. Neurosci. 14: 5170–5186, 1994.
G
ARDNER
,M.B.
AND
G
ARDNER
, R. S. Problem of localization in the medial Y
OST
,W.A.
AND
S
ODERQUIST
, D. R. The precedence effect: revisited. J.
plane: effect of pinnae cavity occlusion. J. Acoust. Soc. Am. 53: 400 Acoust. Soc. Am. 76: 1377 1383, 1984.
408, 1973. Z
UREK
, P. M. The precedence effect and its possible role in the avoidance
of interaural ambiguities. J. Acoust. Soc. Am. 67: 952–964, 1980.G
OLDBERG
,J.M.
AND
B
ROWN
, P. B. Responses of binaural neurons of dog
J561-6RC/ 9k0f$$ap24 08-27-97 15:17:05 neupal LP-Neurophys
on January 28, 2009 jn.physiology.orgDownloaded from
... In all such applications, a self-adapting model is required which can blindly extract the acoustic parameters of the room without any prior information and adjust its system parameters accordingly. Unfortunately, few datasets are available representing the neurophysiological mechanisms that are responsible for the automatic adaptation of the dynamic component of the precedence effect according to the acoustic conditions, although the authors in 25 agree that this effect is partially achieved via feedback from the higher auditory systems to peripheral auditory systems through the centrifugal pathways. We will therefore utilize this feedback concept in our proposed model to automatically adjust the parameters according to the dynamic acoustic conditions. ...
... A feedback signal is sent to the gammatone filter bank to adjust its number of channels (N) according to the detected conditions mimicking the process of. 25 It is emphasized again, that this automation process is only applicable for five rooms with two active sources at separation angles in the range {15°:15°:90°}. ...
... In all such applications, a self-adapting model is required which can blindly extract the acoustic parameters of the room without any prior information and adjust its system parameters accordingly. Unfortunately, few datasets are available representing the neurophysiological mechanisms that are responsible for the automatic adaptation of the dynamic component of the precedence effect according to the acoustic conditions, although the authors in [25] agree that this effect is partially achieved via feedback from the higher auditory systems to peripheral auditory systems through the centrifugal pathways. We will therefore utilize this feedback concept in our proposed model to automatically adjust the parameters according to the dynamic acoustic conditions. ...
... The system carries out this classification process at the output of our proposed model after every 2.5 seconds (this duration depends on the requirement that how fast the system should adjust to the new conditions) by choosing the classifier trained at the current value of N and adapt itself according to the current acoustic conditions. A feedback signal is sent to the gammatone filter bank to adjust its number of channels (N) according to the detected conditions mimicking the process of [25]. It is emphasized again, that this automation process is only applicable for five rooms with two active sources at separation angles in the range {15 0 :15 0 :90 0 }. ...
Article
Full-text available
Source separation algorithms based on spatial cues generally face two major problems. The first one is their general performance degradation in reverberant environments and the second is their inability to differentiate closely located sources due to similarity of their spatial cues. The latter problem gets amplified in highly reverberant environments as reverberations have a distorting effect on spatial cues. In this paper, we have proposed a separation algorithm, in which inside an enclosure, the distortions due to reverberations in a spatial cue based source separation algorithm namely model-based expectation-maximization source separation and localization (MESSL) are minimized by using the Precedence effect. The Precedence effect acts as a gatekeeper which restricts the reverberations entering the separation system resulting in its improved separation performance. And this effect is automatically transformed into the Clifton effect to deal with the dynamic acoustic conditions. Our proposed algorithm has shown improved performance over MESSL in all kinds of reverberant conditions including closely located sources. On average, 22.55% improvement in SDR (signal to distortion ratio) and 15% in PESQ (perceptual evaluation of speech quality) is observed by using the Clifton effect to tackle dynamic reverberant conditions.
... However, in the median-sagittal plane the percept appears to be much weaker and results have been some-what contradictory. Early studies found evidence of a median-sagittal "vertical" precedence [6] [7] while later studies investigating so-called "3-D" and "immersive" audio have noted a general lack of the effect [8] [9]. ...
... The precedence effect has been characterized as "binaurally-mediated" across transverse planes and "spectrally-mediated" in the median-sagittal plane [6]. Previous works have established precedencebased outcomes to be robust across the horizontalazimuthal frontal planes where clear binaural cues are present. ...
Conference Paper
Full-text available
This investigation was a continuation of AES143 #9832 and AES147 #10066 where reliable auditory precedence in the elevated, ear-level, and lowered horizontal planes was examined. This experiment altered and eliminated spectral influences that govern detection of elevation and presented two different horizontal and vertical ICTDs during a precedence-suppression task. A robust precedence effect was elicited via ear-level horizontal plane loudspeakers. In contrast, leading signal identification was minimal in the vertical condition and no systematic influence of the leading elevated and lowered median plane loudspeakers was witnessed suggesting that precedence was not active in the vertical condition. Observed influences that might have been generated by the lead-lag signal in the vertical plane was not consistent with known precedence paradigms.
... 24 Siehe z. B. Litovsky et al. (1997) und Tregonning und Martin (2015). 25 Siehe z. ...
Chapter
Ansätze zur Schallfeldsynthese zielen darauf ab, ein gewünschtes Schallfeld zu erzeugen. Konventionell sind diese Ansätze physikalisch motiviert. Akustische Eigenschaften definieren das gewünschte Schallfeld. Auf die technische Umsetzung folgt eine wahrnehmungsbezogene Bewertung des Hörerlebnisses. Die psychoakustische Schallfeldsynthese, wie sie in diesem Kapitel vorgestellt wird, verfolgt ein anderes Paradigma. Hier bestimmen die psychoakustischen Eigenschaften das gewünschte Schallfeld. Dieses Paradigma erlaubt die Implementierung von Hörschwellen und Integrationszeiten des Gehörs in die Herleitung des Schallfeldsynthesekerns. Dies ermöglicht eine unhörbare Reduzierung der zeitlichen, räumlichen und spektralen Auflösung. Das Ergebnis ist ein natürliches, räumliches Hörerlebnis und eine präzise Schallquellenlokalisation für mehrere Hörer bei vergleichsweise geringem Rechenaufwand und unhörbaren Synthesefehlern.
... 24 Siehe z. B. Litovsky et al. (1997) und Tregonning und Martin (2015). 25 Siehe z. ...
... Litovsky et al. investigated whether the PE occurs in the median plane from both psychophysical and physiological perspectives. They concluded that the PE appears to operate similarly in both the horizontal and median planes, but it should be noted that the physiological experiments dealt with cats [7]. In their psychophysical experiment, auditory stimuli were presented through loudspeakers at azimuthal angles of À90 (left), 0 (front), and +90 (right) in the horizontal plane, or loudspeakers at elevation angles of 0 (front), 90 (top), and 180 (rear) in the median plane. ...
Article
Full-text available
The precedence effect (PE) of a direct sound (leading sound) and reflections (lagging sounds) and related phenomena in the median plane have been studied by some researchers. However, it is still not clear how the lagging sound arriving from the median plane affects the leading sound image, especially for short time delays that are relevant to early reflections in architectural spaces. This study investigates the conditions where the PE and the related phenomena such as the summing localization (SL) and split of the sound images (SSI) are observed in the median plane for short time delays below 10 ms. A psychophysical experiment with a mapping method is used to simultaneously observe the PE, SL, and SSI. The experimental results suggest that, for the time delay ms, a lagging sound from the median plane leads to a perception of single sound image but it has a more prominent influence on the perceived sound direction than that from the horizontal plane. Furthermore, for the range of time delay, the lagging sound from the median plane has a more prominent effect on the sound image direction as the time delay increases.
... In contrast to the horizontal plane, asynchrony effects in the perception of height play a minor role. Although some studies claim the existence of a vertical precedence effect, e.g., [LRYH97], recent findings suggest that fused auditory images, elicit if delayed reflections from the same sagittal plane superimpose with the direct sound, are due to backward masking [EOBW18]. The Precedence Effect affecting Lateralization ...
Thesis
Full-text available
The perception of sound in rooms is influenced by the room acoustics. Depending on geometrical properties and texture of the room, a direct sound is followed by multiple reflections. For standard surrounding audio reproduction systems, the influence of reflections on the perception is well studied. Recent developments allow more particular constellations and compact loudspeaker arrays with highly pronounced variable directivity patterns that excite wall reflections from a single point in the room to spatialize auditory events. However, their prediction in space mostly fails when standard localization models are used. This is because the underlying psychoacoustic principles are different from those known for standard spatialization systems. This doctoral thesis investigates perceptions elicited by the sound field of a directional sound source in a room. Starting from auditory events evoked by a few precisely controlled sound instances examined in the laboratory, the aim of this work is to understand what perceptions are formed by the interaction of direct sound and its reflections. This bottom-up approach allows the development of models of perception building upon the measurements from the different stages of experimental complexity.
... See e.g.Litovsky et al. (1997) andTregonning and Martin (2015). 25 See e.g.Fastl (2010);Blauert (2008);Ziemer (2018). ...
Chapter
Sound field synthesis approaches aim at creating a desired sound field. Conventionally, these approaches are physically-motivated. Acoustical properties define the desired sound field. A perceptual evaluations of the listening experience follows the technical implementation. Psychoacoustic sound field synthesis, as introduced in this chapter, has another paradigm. Here, psychoacoustical properties determine the desired sound field. This allows for implementation of auditory thresholds and integration times in the derivation of the sound field synthesis core. This allows for inaudible reduction of temporal, spatial, and spectral resolution. The result is a natural, spatial listening experience and a precise source localization for multiple listeners with comparably low computational efforts and inaudible synthesis errors.
Thesis
Full-text available
Every city sound has a great impact on its inhabitants' everyday life. Streets, squares, parks, and even buildings and businesses, present a sonic footprint which characterises every city at a certain period. Despite this, in the field of acoustics these sounds are usually analysed as noises to be measured and reduced, and the sonic characterisation of a city is usually limited to the production of a map depicting the levels of the main acoustic pollution agents. This fact may have caused the study of city sounds to be deemed secondary and may also have led to prioritising the resolution of conflicts brought about by undesired noises. This sound map aims at creating a practical tool that collects all the most distinctive soundscapes of Malaga so that they can be listened to by people from all over the world, become part of the city's cultural heritage and be archived and catalogued for their conservation. Besides, its contrast with noise maps, a sound map allows for the characterisation of the territory from a different perspective, in which the identity of the depicted area is defined by all its sounds. All the recordings are being made with binaural microphones, so the sounds produce a more immersive experience when using headphones. These microphones were built for this project, and a series of HRTF measurements was obtained and applied to different audio signals for the realization of a psychoacoustic test, in order to assess the spatiality provided by the system. Another series of audio samples was generated from the MIT’s HRTFs, and both results have been compared.
Chapter
Localization of sounds in the horizontal plane depends on interaural differences established by the sound source location, environment, and anatomy of the listener. Important interaural differences are the interaural level difference (ILD) and the interaural time difference (ITD). All the anatomical effects responsible for the interaural differences in humans are captured, with varying accuracy, by the spherical-head model. The cones of confusion of the spherical-head model approximately apply to real-world human sound localization, and they are best represented by the lateral-polar system of angular coordinates which can be related to the spherical-polar system. The sensitivity of human listeners to interaural differences can be studied directly using headphone presentation or practically using spatially distributed real sound sources. The difference between these two experimental modes corresponds to the perceptual difference between lateralization (heard within the head) and localization (heard as external). Both lateralization and localization experiments indicate that the effective ITD representation is the ITD itself and not the interaural phase difference. According to the duplex model, ITDs are the dominant cues for localization of low-frequency sounds, whereas ILDs are dominant for high. ITDs themselves appear to be encoded according to a place process (Jeffress model) at the high end of the ITD frequency range and by an opponency process at the low end, although this distinction is an area of current research. Models such as the position variable model attempt to account for the combined effects of the ITD and ILD. The central nervous system preferentially weights interaural differences according to arrival time. The precedence effect is an important example of this preferential weighting.
Article
Full-text available
In keeping with our promise earlier in this review, we summarize here the process by which we believe spatial cues are used for localizing a sound source in a free-field listening situation. We believe it entails two parallel processes: 1. The azimuth of the source is determined using differences in interaural time or interaural intensity, whichever is present. Wightman and colleagues (1989) believe the low-frequency temporal information is dominant if both are present. 2. The elevation of the source is determined from spectral shape cues. The received sound spectrum, as modified by the pinna, is in effect compared with a stored set of directional transfer functions. These are actually the spectra of a nearly flat source heard at various elevations. The elevation that corresponds to the best-matching transfer function is selected as the locus of the sound. Pinnae are similar enough between people that certain general rules (e.g. Blauert's boosted bands or Butler's covert peaks) can describe this process. Head motion is probably not a critical part of the localization process, except in cases where time permits a very detailed assessment of location, in which case one tries to localize the source by turning the head toward the putative location. Sound localization is only moderately more precise when the listener points directly toward the source. The process is not analogous to localizing a visual source on the fovea of the retina. Thus, head motion provides only a moderate increase in localization accuracy. Finally, current evidence does not support the view that auditory motion perception is anything more than detection of changes in static location over time.
Article
Full-text available
1. We studied extracellular responses of low-frequency cells in the central nucleus of the inferior colliculus (ICC) to interaural time differences (ITDs) of clicks and compared their responses to ITDs of noise and tones. Most cells that displayed sensitivity to ITDs of clicks responded cyclically as a function of ITD with central peaks and troughs at the same ITDs as in response to noise. The positions of these peaks and troughs also matched those predicted from tonal ITD curves. Thus over the range of physiologically relevant ITDs, the binaural cells in the ICC showed similar sensitivity to ITDs of tones, noise, and clicks. 2. The transient nature of the response to a click allowed association of individual discharges with either the ipsilateral or contralateral stimulus when the binaural stimulus included a large ITD. We studied the influence of the click presented to one side on responses to the click presented to the other side. By examining responses to clicks with large ITDs, ranging from 2 to 3 up to 200 ms, we could identify both excitatory and inhibitory components in response to binaural clicks. 3. For many cells, there was evidence for a short-lasting excitation arising from one or both inputs of the binaural stimulus. Inhibitory interactions could also be demonstrated over a large range of ITDs. Long-lasting, late inhibitory components arose from both contralateral and ipsilateral inputs. In 87% of cells that were driven by the contralateral input, a late inhibitory component originating from the ipsilateral side was detected. In all cells that were driven by the ipsilateral side, a late inhibitory contralateral component was detected. This late inhibition of the excitatory response to one side by a leading stimulus to the other side could be evoked even when the leading stimulus was not effective in evoking an excitatory response. 4. Some cells also exhibited an early inhibitory component that preceded the excitation. An early contralateral inhibition was detected in 44% of cells that were driven by the ipsilateral input, whereas an early ipsilateral component was detected in 17% of cells driven by the contralateral input. 5. We confirmed hypotheses about the laterality and time course of the inhibitory and excitatory components by introducing interaural level differences (ILDs) into the binaural clicks and thus varying the strengths of the different components. 6. Inhibitory components may play a role in shaping the sensitivity of individual cells to ITDs of stimuli other than clicks; they were also apparent in responses to noise.(ABSTRACT TRUNCATED AT 400 WORDS)
Article
Three experiments were conducted to investigate the dependence of echo suppression on the auditory stimulation just prior to a test stimulus. Subjects sat in an anechoic chamber between two loudspeakers, one which presented the ‘‘lead’’ sound, and the other the delayed ‘‘lag’’ sound. In the first experiment, subjects reported whether or not they heard an echo coming from the vicinity of the lag loudspeaker during a test click pair. In seven of nine listeners, perception of the lagging sound was strongly diminished by the presence of a train of ‘‘conditioning’’ clicks presented just before the test click. Echo threshold increased (subjects were less sensitive to echoes) as the number of clicks in the train increased from 3 to 17. For a fixed number of clicks, the effect was essentially independent of click rate (from 1/s through 50/s) and duration of the train (from 0.5 through 8 s). A second experiment demonstrated a similar buildup of echo suppression with white noise bursts, regardless of whether the bursts in the conditioning train were repeated samples of frozen noise, or were independent samples of noise. Using an objective procedure for measuring echo threshold, the third experiment demonstrated that both lead and lag stimuli must be presented during the conditioning train in order to produce the buildup of suppression. When only the lead sound was presented during the conditioning train, the perceptibility of the lag sound during the test burst appeared to be enhanced.
Article
The precedence effect, as investigated by Wallach e t a l. [Am. J. Psychol. 6 2 , 324–336 (1949)] was studied in three experiments. Experiment I was a replication of the original work of Wallach e t a l. Although the first click pair appears to dominate the perception of the position of the lateral image, the effect of the first click pair does not appear to ‘‘offset’’ or ‘‘cancel’’ the effect of the second click pair in terms of producing a lateral image at midline. The data are consistent with Zurek’s [J. Acoust. Soc. Am. 6 7 , 952–964 (1980)] proposal that the binaural system is less sensitive to the interaural temporal difference of the second click pair. Experiment II indicated that the effect of the first click pair on lateral judgments still dominates that of the second click pair when the images are judged to be off midline. In all of these studies, the variability of the data is quite high. Experiment III showed that the first click pair also led to a larger change in masked thresholds (masking‐level differences, MLDs) than does the second click pair. These data reconfirm the use of two‐click stimuli for demonstrations of the precedence effect and they describe some of the limitations of the procedure and the generalities of the effect.
Article
Physical measurements of the transfer function from a free−field sound source to a microphone in the subject’s ear canal indicate that there are two independent localization cues generated by the pinna. For sound sources in the vertical median plane, there is a systematic change in the frequency response as a function of elevation angle, and a disparity between the left−ear and right−ear responses which also changes systematically with elevation angle. Independent psychophysical measurements indicate that these pinna cues are detectable by subjects, and both are used by subjects in vertical localization tasks. Subject Classification: 65.62, 65.75.
Article
The role of spectral cues in the sound source to ear transfer function in median plane sound localization is investigated in this paper. At first, transfer functions were measured and analyzed. Then, these transfer functions were used in experiments where sounds from a source on the median plane were simulated and presented to subjects through headphones. In these simulation experiments, the transfer functions were smoothed by ARMA models with different degrees of simplification to investigate the role of microscopic and macroscopic patterns in the transfer functions for median plane localization. The results of the study are summarized as follows: (1) For front-rear judgment, information derived from microscopic peaks and dips in the low-frequency region (below 2 kHz) and the macroscopic patterns in the high-frequency region seems to be utilized; (2) for judgment of elevation angle, major cues exist in the high-frequency region above 5 kHz. The information in macroscopic patterns is utilized instead of that in small peaks and dips.
Article
Listeners reported the apparent spatial positions of wideband noise bursts that were presented either by loudspeakers in free field or by headphones. The headphone stimuli were digitally processed with the aim of duplicating, at a listener's eardrums, the waveforms that were produced by the free-field stimuli. The processing algorithms were based on each subject's free-field-to-eardrum transfer functions that had been measured at 144 free-field source locations. The headphone stimuli were localized by eight subjects in virtually the same positions as the corresponding free-field stimuli. However, with headphone stimuli, there were more front-back confusions, and source elevation seemed slightly less well defined. One subject's difficulty with elevation judgments, which was observed both with free-field and with headphone stimuli, was traced to distorted features of the free-field-to-eardrum transfer function.
Article
Running interaural cross correlation is a basic assumption to model the performance of the binaural auditory system. Although this concept is particularly suited to simulate psychoacoustic localization phenomena, there exist some localization effects which cannot be explained by pure cross correlation. In this paper a model of interaural cross correlation is extended by a "contralateral-inhibition mechanism" and by "monaural detectors" in order to simulate a wide range of psychoacoustic lateralization data. The extended model explains lateralization of pure tones with interaural time differences as well as with interaural level differences. Multiple images are predicted for tones with characteristic combinations of interaural signal parameters and for noise signals with different degrees of interaural cross correlation. The model is also capable of simulating dynamic lateralization phenomena, such as the "law of the first wave front" which is dealt with in a companion paper [Lindemann, J. Acoust. Soc. Am. 80, 1623-1630 (1986)]. The present paper is restricted to a comparison of the model predictions for stationary signals with the results of dichotic listening experiments.
Article
The spectral cues used for median plane localization are described by 3 experiments. First, the frequency spectrum necessary for localization is measured by noting the accuracy of subjects localizing low and high pass filtered white noise. Second, several high pass, low pass, bandpass, and bandstop filters are associated with the subjective impression of direction by observing what directions are most frequently perceived by subjects localizing white noise colored by each filter. Third, the frequency responses of several artificial ears are measured for different angles of median plane sound incidence. Results show that sound spectra from 4 to 16 kHz are necessary for localization. Frontal cues are a 1 octave notch, with a lower frequency cutoff between 4 and 10 kHz and increased energy above 13 kHz. The 'above' cue is a 1/4 octave peak between 7 and 9 kHz, with a high frequency cutoff at 10 kHz. The 'behind' cue is a small peak from 10 to 12 kHz. Increases in frontal elevation are signaled by an increase in the lower cutoff frequency of the 1 octave notch. This notch appears to be generated by time delayed reflections off the posterior concha wall interfering with sound directly entering the external auditory canal.
Article
Localization of sound sourcesoutside the median plane is influenced primarily by differences in head shadow and arrival time of the signal at the two ears of the observer. For sources located within this plane, localization is influenced primarily by the irregularities of the pinna. By progressively occluding these cavities, it is shown that localization ability decreases with increasing occlusion, that it is better for signals in the anterior than in the posterior sector of the median plane, and that high‐frequency signal content is more important than the low. A number of hypotheses regarding localization in the median plane are noted.