Article

Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited.

Kresge Hearing Research Institute, University of Michigan, Ann Arbor 48109-0506, USA.
The Journal of the Acoustical Society of America (impact factor: 1.55). 06/2002; 111(5 Pt 1):2219-36.
Source: PubMed

ABSTRACT The virtual auditory space technique was used to quantify the relative strengths of interaural time difference (ITD), interaural level difference (ILD), and spectral cues in determining the perceived lateral angle of wideband, low-pass, and high-pass noise bursts. Listeners reported the apparent locations of virtual targets that were presented over headphones and filtered with listeners' own directional transfer functions. The stimuli were manipulated by delaying or attenuating the signal to one ear (by up to 600 micros or 20 dB) or by altering the spectral cues at one or both ears. Listener weighting of the manipulated cues was determined by examining the resulting localization response biases. In accordance with the Duplex Theory defined for pure-tones, listeners gave high weight to ITD and low weight to ILD for low-pass stimuli, and high weight to ILD for high-pass stimuli. Most (but not all) listeners gave low weight to ITD for high-pass stimuli. This weight could be increased by amplitude-modulating the stimuli or reduced by lengthening stimulus onsets. For wideband stimuli, the ITD weight was greater than or equal to that given to ILD. Manipulations of monaural spectral cues and the interaural level spectrum had little influence on lateral angle judgements.

0 0
 · 
1 Bookmark
 · 
36 Views
  • Source
    Article: Objective Measurement of Perceived Auditory Quality in Multichannel Audio Compression Coding Systems*
    [show abstract] [hide abstract]
    ABSTRACT: Objective quality assessment methods have been used widely for the evaluation of audio coding systems. However, even though many different competing multichannel audio com-pression coding systems are being developed, most current quality assessment methods only predict results for monaural or stereo signals. A prediction method is introduced that can be used for multichannel audio compression coding systems. The method introduces three variables as measures of the degradations in spatial quality: interaural time difference (ITD) distortion, interaural level difference (ILD) distortion, and interaural cross-correlation coef-ficient (IACC) distortion. Simultaneously ten model output variables proposed in ITU-R BS.1387-1 are extracted from binaural signals that are synthesized using head-related transfer functions. The prediction model is trained and verified using results from subjective listening tests of multichannel audio compression coding systems that were performed by participants in the MPEG audio group. This new model, using the three interaural and ten nonspatial statistics, shows encouraging results in the prediction of perceived quality. 0 INTRODUCTION Low-bit-rate audio coding technology now is being used in multichannel audio compression technologies that manipulate the spatial impressions of the listener. As the number of competing compression coding systems in-creases, reliable quality assessment becomes important for evaluating these systems. Because a good predictive or objective assessment model would enable easy compari-son of the different compression schemes, numerous ob-jective quality assessment methods, such as those de-scribed in [1]–[7], have been proposed. Thanks to the efforts of the participants in the International Telecommu-nication Union Radiocommunication Sector (ITU-R) to combine those methods and develop a single best method, ITU-R Recommendation BS.1387-1 [8] has been estab-lished and is being used widely. However, because its scope is restricted to evaluating degradations caused by known coding artifacts, the method described in [8] cannot predict the perceived quality of newly developed audio coding technologies that import novel coding schemes to accomplish extremely high efficiency in compression with intermediate sound quality [9]. Moreover, the recommen-dation cannot be used to objectively assess multichannel audio coding systems because it was designed for monau-ral and stereo sounds only [9]. Two recent models for the objective assessment of the quality of multichannel sound sources have been proposed [10], [11]. However, to date no satisfactory predictions of the perceptual quality of newly developed low-bit-rate multichannel coding systems have been reported. In this paper a prediction model is introduced that can be used for the objective quality assessment of multichannel audio compression coding systems, focusing on recently intro-duced efficient-compression coding technologies. An adequate predictive model for the perceived quality of multichannel sound must satisfy the following condi-tions. First, the listening environment for the multichannel audio reproduction system must be modeled. Second, not only timbral degradations but also spatial degradations, such as sound localization errors, must be quantified. Lastly, the model must be trained and verified with reli-able judgments of sound quality taken from listening tests using a large ensemble of different kinds of degradations in spatial and timbral quality.
    Journal of the Audio Engineering Society 01/2008; 56(1/2):3-17. · 0.43 Impact Factor
  • Source
    Article: Improvements of sound localization abilities by the facial ruff of the barn owl (Tyto alba) as demonstrated by virtual ruff removal.
    [show abstract] [hide abstract]
    ABSTRACT: When sound arrives at the eardrum it has already been filtered by the body, head, and outer ear. This process is mathematically described by the head-related transfer functions (HRTFs), which are characteristic for the spatial position of a sound source and for the individual ear. HRTFs in the barn owl (Tyto alba) are also shaped by the facial ruff, a specialization that alters interaural time differences (ITD), interaural intensity differences (ILD), and the frequency spectrum of the incoming sound to improve sound localization. Here we created novel stimuli to simulate the removal of the barn owl's ruff in a virtual acoustic environment, thus creating a situation similar to passive listening in other animals, and used these stimuli in behavioral tests. HRTFs were recorded from an owl before and after removal of the ruff feathers. Normal and ruff-removed conditions were created by filtering broadband noise with the HRTFs. Under normal virtual conditions, no differences in azimuthal head-turning behavior between individualized and non-individualized HRTFs were observed. The owls were able to respond differently to stimuli from the back than to stimuli from the front having the same ITD. By contrast, such a discrimination was not possible after the virtual removal of the ruff. Elevational head-turn angles were (slightly) smaller with non-individualized than with individualized HRTFs. The removal of the ruff resulted in a large decrease in elevational head-turning amplitudes. The facial ruff a) improves azimuthal sound localization by increasing the ITD range and b) improves elevational sound localization in the frontal field by introducing a shift of iso-ILD lines out of the midsagittal plane, which causes ILDs to increase with increasing stimulus elevation. The changes at the behavioral level could be related to the changes in the binaural physical parameters that occurred after the virtual removal of the ruff. These data provide new insights into the function of external hearing structures and open up the possibility to apply the results on autonomous agents, creation of virtual auditory environments for humans, or in hearing aids.
    PLoS ONE 01/2009; 4(11):e7721. · 4.09 Impact Factor

Full-text (2 Sources)

View
0 Downloads
Available from
14 Jun 2013

Keywords

apparent locations
 
Duplex Theory
 
ears
 
high-pass noise bursts
 
high-pass stimuli
 
interaural level difference
 
interaural level spectrum
 
interaural time difference
 
ITD weight
 
lateral angle judgements
 
lengthening stimulus onsets
 
Listener weighting
 
low weight
 
manipulated cues
 
monaural spectral cues
 
perceived lateral angle
 
resulting localization response biases
 
spectral cues
 
virtual auditory space technique
 
virtual targets