# Reproduction of Virtual Sound Sources Moving at Supersonic Speeds in Wave Field Synthesis


Audio Engineering Society

Convention Paper

Presented at the 125th Convention

2008 October 2–5 San Francisco, CA, USA

The papers at this Convention have been selected on the basis of a submitted abstract and extended precis that have been peer reviewed by at least two qualified anonymous reviewers. This convention paper has been reproduced from the author's advance manuscript, without editing, corrections, or consideration by the Review Board. The AES takes no responsibility for the contents. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA; also see www.aes.org. All rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society.


Jens Ahrens and Sascha Spors

Deutsche Telekom Laboratories, Technische Universität Berlin, Ernst-Reuter-Platz 7, 10587 Berlin, Germany

Correspondence should be addressed to Jens Ahrens (jens.ahrens@telekom.de)

ABSTRACT

In conventional implementations of wave field synthesis, moving sources are reproduced as sequences of stationary positions. As reported in the literature, this process introduces various artifacts. It has been shown recently that these artifacts can be reduced when the physical properties of the wave field of moving virtual sources are explicitly considered. However, the findings were only applied to virtual sources moving at subsonic speeds. In this paper we extend the published approach to the reproduction of virtual sound sources moving at supersonic speeds. The properties of the actually reproduced sound field are investigated via numerical simulations.

1. INTRODUCTION

For several decades, the audio community has addressed the problem of physically recreating a given wave field. Independent of the chosen approach, two rendering techniques exist: data-based and model-based reproduction [1]. The former aims at perfectly reproducing a captured sound field and is not treated in this paper. We concentrate on the latter, where a sound scene is composed of a number of virtual sound sources derived from analytical spatial source models. For stationary virtual scenes, accurate reproduction techniques exist. However, the reproduction of dynamic scenes involves certain peculiarities. This is mostly due to the fact that the speed of sound in air is constant. When a source moves, the propagation speed of the emitted wave field is not affected. However, the emitted wave field differs from that of a static source in various ways. For example, for sources moving slower than the speed of sound, the sound waves emitted in the direction of motion experience an increase in frequency, whereas sound waves emitted opposite to the direction of motion experience a decrease in frequency. The entirety of these alterations is known as the Doppler effect [2].
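As a brief illustration of the frequency shifts described above, the following sketch evaluates the classical Doppler relation $f = f_s / (1 - M \cos\theta)$ for a uniformly moving subsonic source and a static receiver, where $\theta$ is the angle between the direction of motion and the direction from the emission position to the receiver. The speed of sound of 343 m/s, the function name, and the example values are assumptions made for illustration and are not taken from this paper.

```python
import math

C = 343.0  # assumed speed of sound in air [m/s]


def doppler_frequency(f_s: float, v: float, theta: float, c: float = C) -> float:
    """Frequency observed by a static receiver for a subsonic source.

    theta is the angle (in radians) between the source velocity and the
    direction from the emission position to the receiver.
    """
    mach = v / c
    if not 0.0 <= mach < 1.0:
        raise ValueError("this relation is used here for subsonic sources only")
    return f_s / (1.0 - mach * math.cos(theta))


if __name__ == "__main__":
    f_s, v = 500.0, 120.0  # illustrative values
    print("ahead of the source :", doppler_frequency(f_s, v, 0.0))
    print("abeam of the source :", doppler_frequency(f_s, v, math.pi / 2))
    print("behind the source   :", doppler_frequency(f_s, v, math.pi))
```

The frequency is raised ahead of the source, unchanged abeam, and lowered behind it, which matches the qualitative description in the text.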

Typical implementations of sound field reproduction systems do not take the Doppler effect into account. Dynamic virtual sound scenes are rather reproduced as a sequence of stationary snapshots. Thus, not only the virtual source but also its entire wave field is moved from one time instant to the next.

This concatenation leads to Doppler-like frequency shifts. However, these frequency shifts occur due to warping of the time axis rather than due to the constant speed of sound, a circumstance which introduces artifacts. Furthermore, this approach is limited to the reproduction of virtual sources moving slower than the speed of sound. The artifacts have recently been discussed in the literature in the context of wave field synthesis [3]. We are not aware of a corresponding publication focusing on alternative sound field reproduction methods. See [4, 5] for a treatment of moving virtual sources in binaural (HRTF-based) reproduction.

Various alternative implementations of the conventional approach of concatenating stationary source positions as outlined above are in use, both frame-based and sample-by-sample. Most notably, in [3] it is proposed to incorporate the retarded time of a moving source (see section 2) into the driving function of a stationary source. Results presented there show that this strategy still leaves prominent artifacts.

As shown by the authors in [6], the mentioned artifacts occurring in conventional implementations can be avoided when the physical properties of the wave field of moving sound sources are taken into account a priori. However, the approach in [6] was exclusively applied to virtual sources moving slower than the speed of sound. In this paper, we extend this approach to the reproduction of virtual sources moving at supersonic speeds. Our work can also be regarded as an extension of the approach presented in [7], which focuses on the reproduction of the frequency content present in sonic booms of aircraft but does not physically reproduce the actual wave front.

Note that the considerations presented in this paper are of relevance only for sound field reproduction approaches which employ time delays in deriving the loudspeaker driving signals.

Fig. 1: The coordinate system and geometry used in this paper. The dots • denote the positions of the secondary sources used for wave field synthesis. The grey-shaded area denotes the listening area. [Figure labels: axes x and y; secondary source array along y = y0 with spacing ∆x; secondary source position x0 with normal n; distance r; angle α.]

2. THE WAVE FIELD OF A MOVING SOURCE

The fundamental prerequisite for model-based sound field reproduction is the knowledge of the sound field that is to be recreated. In this section, we derive analytical expressions of the sound field of a moving sound source. For simplicity, we assume a monopole source. However, the presented approach also allows for the treatment of arbitrary source types. The derivation below follows [8, 9].

The time-domain free-field Green's function of a stationary sound source at position $\mathbf{x}_s$, i.e. its spatio-temporal impulse response, is denoted by $g(\mathbf{x} - \mathbf{x}_s, t)$. See figure 1 for a sketch of the coordinate system. The time-domain Green's function of a moving sound source is then $g\big(\mathbf{x} - \mathbf{x}_s(\tilde{t}(\mathbf{x},t)),\, t - \tilde{t}(\mathbf{x},t)\big)$, whereby $\tilde{t}(\mathbf{x},t)$ denotes the time instant when the impulse was emitted (see figure 2). $g\big(\mathbf{x} - \mathbf{x}_s(\tilde{t}(\mathbf{x},t)),\, t - \tilde{t}(\mathbf{x},t)\big)$ is referred to as the retarded Green's function [8]. $\tilde{t}(\mathbf{x},t)$ depends on the location $\mathbf{x}$ of the receiver and the time $t$ that the receiver experiences.

Assume a monochromatic harmonic source oscillating at angular frequency $\omega_s$. Its source function $s_0(\tilde{t})$ reads in complex notation

$$s_0(\tilde{t}) = a_0 \cdot e^{j \omega_s \tilde{t}}. \tag{1}$$

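Fig. 2: Derivation of the Green's function of a moving sound source. [Figure labels: axes x and y; receiver position $\mathbf{x}$ at distance $|\mathbf{x}|$ from the origin; current source position $\mathbf{x}_s(t)$; emission position $\mathbf{x}_s(\tilde{t}(\mathbf{x},t))$ at distance $|\mathbf{x} - \mathbf{x}_s(\tilde{t}(\mathbf{x},t))|$ from the receiver; source velocity v.]

In order to obtain the wave field produced by a moving source with spatio-temporal impulse response $g\big(\mathbf{x} - \mathbf{x}_s(\tilde{t}(\mathbf{x},t)),\, t - \tilde{t}(\mathbf{x},t)\big)$ driven by the signal $s_0(\tilde{t})$, we model $s_0(\tilde{t})$ as a dense sequence of weighted Dirac pulses. Each Dirac pulse of the sequence multiplied by $g\big(\mathbf{x} - \mathbf{x}_s(\tilde{t}(\mathbf{x},t)),\, t - \tilde{t}(\mathbf{x},t)\big)$ yields the wave field created by the respective Dirac pulse. To obtain the wave field emitted due to the entire sequence of Dirac pulses, we integrate over $\tilde{t}$ as

$$s(\mathbf{x},t) = \int_{-\infty}^{\infty} s_0(\tilde{t}) \cdot g\big(\mathbf{x} - \mathbf{x}_s(\tilde{t}),\, t - \tilde{t}\big)\, d\tilde{t}, \tag{2}$$

whereby we temporarily altered the nomenclature for convenience ($\tilde{t} = \tilde{t}(\mathbf{x},t)$).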

Assuming a moving monopole sound source, its Green's function explicitly reads

$$g\big(\mathbf{x} - \mathbf{x}_s(\tilde{t}(\mathbf{x},t)),\, t - \tilde{t}(\mathbf{x},t)\big) = \frac{1}{4\pi}\, \frac{\delta\!\left(t - \tilde{t}(\mathbf{x},t) - \frac{|\mathbf{x} - \mathbf{x}_s(\tilde{t}(\mathbf{x},t))|}{c}\right)}{|\mathbf{x} - \mathbf{x}_s(\tilde{t}(\mathbf{x},t))|}. \tag{3}$$

Note that

$$\tau(\mathbf{x},t) = \frac{|\mathbf{x} - \mathbf{x}_s(\tilde{t}(\mathbf{x},t))|}{c} \tag{4}$$

is referred to as retarded time [8]. It denotes the duration of sound propagation from the source to the receiver. In the remainder of this paper, $M = \frac{v}{c}$ denotes the Mach number, with $v$ being the speed of the sound source.

For convenience, we assume the virtual source to move uniformly along the x-axis in positive x-direction (cf. figure 2). As outlined in [6], arbitrary trajectories can be approximated by assuming a piece-wise uniform motion and an appropriate translation and rotation of the coordinate system. At time $t = 0$ the source is located at position $x_s(0)$.

For this particular source trajectory, the integral in equation (2) can be solved via the substitution

$$u = \tilde{t}(\mathbf{x},t) + \tau(\mathbf{x},t) \tag{5}$$

and the exploitation of the sifting property of the delta function [10]. It turns out that the integral has different solutions for $M < 1$, $M = 1$, and $M > 1$. In the following sections, we present solutions to the integral in (2) for subsonic ($M < 1$) as well as supersonic ($M > 1$) sound sources and briefly comment on the case of sources moving at the speed of sound ($M = 1$).

2.1. Sound sources moving at subsonic speeds

For $M < 1$, the integral boundaries in (2) can be kept and the solution, i.e. the sound field $s_{M<1}(\mathbf{x},t)$ of a source moving at a speed $v < c$, then reads

$$s_{M<1}(\mathbf{x},t) = \frac{1}{4\pi} \cdot \frac{s_0(\tilde{t}(\mathbf{x},t))}{\Psi(\mathbf{x},t)}, \tag{6}$$

whereby

$$\tilde{t}(\mathbf{x},t) = t - \frac{M \Phi(\mathbf{x},t) + \Psi(\mathbf{x},t)}{c\,(1 - M^2)},$$

$$\Psi(\mathbf{x},t) = \sqrt{\Phi^2(\mathbf{x},t) + y^2 (1 - M^2)},$$

$$\Phi(\mathbf{x},t) = x - vt - x_s(0).$$

Fig. 3: Simulated wave fields of a source oscillating monochromatically at $f_s = 500$ Hz and moving along the x-axis in positive x-direction at $v = 120$ m/s. Due to the employment of the complex notation for time domain signals (see equation (1)), only the real part $\Re\{\cdot\}$ of the considered wave field is depicted. The wave fields have been scaled to have comparable levels. The values of the sound pressure are clipped as indicated by the colorbars. (a) $\Re\{s(\mathbf{x},t_0)\}$. (b) $\Re\{p_\mathrm{WFS}(\mathbf{x},t_0)\}$. The loudspeaker array indicated by the dotted line is situated symmetrically around the y-axis at $y_0 = 1$ m and its overall length is 8 m. The loudspeakers are positioned at intervals of $\Delta x = 0.1$ m. Tapering is applied. [Axes: x → [m], y → [m].]

A snapshot of the wave field of a moving sound source described by equation (6) is depicted in figure 3(a).

For $M = 0$, i.e. a static source, equation (6) reads

$$s_{M=0}(\mathbf{x},t) = \frac{1}{4\pi} \cdot \frac{s_0(t - \tau)}{|\mathbf{x} - \mathbf{x}_s|}, \tag{7}$$

which corresponds to the familiar expression for the sound field of a static harmonic monopole sound source [6].
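A minimal sketch of how equation (6) might be evaluated numerically is given below. It also checks the defining property of the emission time, namely that the propagation time $t - \tilde{t}$ multiplied by $c$ equals the distance between the receiver and the emission position. The value of the speed of sound, the function name, and the sample points are assumptions made for illustration.

```python
import cmath
import math

C = 343.0  # assumed speed of sound [m/s]


def subsonic_field(x, y, t, v, f_s, xs0=0.0, a0=1.0, c=C):
    """Evaluate equation (6); returns (complex pressure, emission time)."""
    mach = v / c
    if not 0.0 <= mach < 1.0:
        raise ValueError("equation (6) holds for M < 1 only")
    phi = x - v * t - xs0
    psi = math.sqrt(phi**2 + y**2 * (1.0 - mach**2))
    t_emit = t - (mach * phi + psi) / (c * (1.0 - mach**2))
    s0 = a0 * cmath.exp(1j * 2.0 * math.pi * f_s * t_emit)  # equation (1)
    return s0 / (4.0 * math.pi * psi), t_emit


if __name__ == "__main__":
    x, y, t, v = 1.0, 2.0, 0.01, 120.0  # illustrative values
    pressure, t_emit = subsonic_field(x, y, t, v, f_s=500.0)
    # Distance from the emission position (on the x-axis) to the receiver.
    distance = math.hypot(x - v * t_emit, y)
    print("travel distance mismatch [m]:", C * (t - t_emit) - distance)
    print("real part of the pressure   :", pressure.real)
```

For $v = 0$ the function reduces to the static case of equation (7), with $\Psi$ equal to the source-receiver distance.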

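2.2. Sound sources moving at supersonic speeds

For sound sources moving at supersonic speeds, the integral in (2) has to be split into a sum of two integrals after the substitution (5), reading

$$s_{M>1}(\mathbf{x},t) = \int_{u_1}^{\infty} (\cdot)\, du + \int_{u_2}^{\infty} (\cdot)\, du, \tag{8}$$

whereby

$$u_{1,2} = \frac{1}{v} \left( \pm (x_s(0) - x) + y \sqrt{M^2 - 1} \right).$$

$(\cdot)$ denotes the argument of the integral in (2). The solution yields the wave field $s_{M>1}(\mathbf{x},t)$ of a monopole sound source moving at a supersonic speed $v$, reading

$$s_{M>1}(\mathbf{x},t) = \begin{cases} s_1(\mathbf{x},t) + s_2(\mathbf{x},t) & \text{for } \Phi(\mathbf{x},t)^2 + y^2 (1 - M^2) \geq 0 \text{ and } x_s(0) + vt \geq x \\ 0 & \text{elsewhere,} \end{cases} \tag{9}$$

with

$$s_{1,2}(\mathbf{x},t) = \frac{1}{4\pi}\, \frac{s_0(\tilde{t}_{1,2}(\mathbf{x},t))}{\Psi(\mathbf{x},t)},$$

$$\tilde{t}_{1,2}(\mathbf{x},t) = t - \frac{M \Phi(\mathbf{x},t) \pm \Psi(\mathbf{x},t)}{c\,(1 - M^2)}.$$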

The most prominent property of the wave field of a supersonic source is the formation of the so-called Mach cone, a conical sound pressure front following the moving source (see figure 4(a)). Note that the Mach cone is a direct consequence of causality. For the receiver this has two implications: (1) the receiver does not receive any sound wave before the arrival of the Mach cone; (2) after the arrival of the Mach cone, the receiver is exposed to a superposition of the wave field $s_1(\mathbf{x},t)$, which the source radiates in backward direction, and the wave field $s_2(\mathbf{x},t)$, which the source had radiated in forward direction before the arrival of the Mach cone. $s_1(\mathbf{x},t)$ carries a frequency-shifted version of the emitted signal propagating in the direction opposite to the source motion (figure 4(b)); $s_2(\mathbf{x},t)$ carries a time-reversed version of the emitted signal following the source (figure 4(c)). The latter is generally also shifted in frequency.

Fig. 4: Wave field of a source traveling at 600 m/s ($M \approx 1.7$). (a) Wave field $s_{M>1}(\mathbf{x},t)$ of a supersonic source. (b) Backward travelling component $s_1(\mathbf{x},t)$. (c) Forward travelling component $s_2(\mathbf{x},t)$. [Axes: x → [m], y → [m].]
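The two conditions of equation (9) and the two components $s_1$ and $s_2$ can be sketched in code as follows. This is an illustrative evaluation under assumptions: the speed of sound of 343 m/s, the function name, and the choice to exclude the cone surface itself (where $\Psi = 0$ and the amplitude is singular) are not prescribed by the paper.

```python
import cmath
import math

C = 343.0  # assumed speed of sound [m/s]


def supersonic_field(x, y, t, v, f_s, xs0=0.0, a0=1.0, c=C):
    """Evaluate the components of equation (9); returns (s1, s2).

    Both components are zero ahead of the source and outside the Mach cone.
    """
    mach = v / c
    if mach <= 1.0:
        raise ValueError("equation (9) holds for M > 1 only")
    phi = x - v * t - xs0
    radicand = phi**2 + y**2 * (1.0 - mach**2)
    # Conditions of equation (9). The cone surface (radicand == 0) is
    # excluded here because the amplitude 1/Psi is singular there.
    if radicand <= 0.0 or xs0 + v * t < x:
        return 0j, 0j
    psi = math.sqrt(radicand)
    omega = 2.0 * math.pi * f_s
    components = []
    for sign in (+1.0, -1.0):  # "+" yields s1, "-" yields s2
        t_emit = t - (mach * phi + sign * psi) / (c * (1.0 - mach**2))
        s0 = a0 * cmath.exp(1j * omega * t_emit)  # equation (1)
        components.append(s0 / (4.0 * math.pi * psi))
    return components[0], components[1]


if __name__ == "__main__":
    v, f_s = 600.0, 500.0
    print("ahead of the source:", supersonic_field(5.0, 1.0, 0.0, v, f_s))
    print("outside the cone   :", supersonic_field(-1.0, 5.0, 0.0, v, f_s))
    s1, s2 = supersonic_field(-5.0, 1.0, 0.0, v, f_s)
    print("inside the cone    :", s1 + s2)
```

Both components share the amplitude factor $1/(4\pi\Psi)$ and differ only in their emission times, which is why a receiver inside the cone hears two superimposed versions of the source signal.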

2.3. Sound sources moving at the speed of sound

The integral in (2) can also be solved for $M = 1$. In that case, the lower integral boundary is finite and the upper boundary is infinite. The result then resembles the circumstances for $M > 1$, i.e. the receiver is not exposed to the source's wave field at all times. Rather, the source moves at the leading edge of the sound waves it emits. The wave field cannot surpass the source. The leading edge of the wave field is termed sound barrier.

Unlike for $M > 1$, the resulting wave field is not composed of two different components. It contains only one single component carrying the frequency-shifted input signal.

Informal listening suggests that the human ear cannot be assumed to be aware of the details of the properties of the wave field of a transonic source (a source moving exactly at the speed of sound). We therefore do not present an explicit treatment here. For convenience, we propose to assume that the wave field of a transonic source is perceptually indistinguishable from the wave field $s_1(\mathbf{x},t)$ of a source moving at a speed slightly faster than the speed of sound $c$.

3. WAVE FIELD SYNTHESIS

In this section, we demonstrate how a moving virtual sound source can be reproduced using the findings derived in section 2. As an example, we use wave field synthesis (WFS) employing a linear array of secondary sources (loudspeakers).

The theoretical basis of WFS employing linear secondary source arrays is given by the two-dimensional Rayleigh I integral [11, 12]. It states that a linear distribution of monopole line sources is capable of reproducing a desired wave field (a virtual source) in one of the half planes defined by the secondary source distribution. The wave field in the other half plane (where the virtual source is situated) is a mirrored copy of the desired wave field. For convenience, the secondary source array is assumed to be parallel to the x-axis at $y = y_0$ as depicted in figures 1 and 3(b). The listening area is chosen to be at $y > y_0$.

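The two-dimensional Rayleigh I integral determines the sound pressure $p_\mathrm{WFS}(\mathbf{x},t)$ created by such a setup, reading

$$p_\mathrm{WFS}(\mathbf{x},t) = \int_{-\infty}^{\infty} \underbrace{-\frac{\partial}{\partial \mathbf{n}} s(\mathbf{x},t) \Big|_{\mathbf{x} = \mathbf{x}_0}}_{d(\mathbf{x}_0,t)} *_t\, g(\mathbf{x},t)\, dx_0. \tag{10}$$

$s(\mathbf{x},t)$ denotes the sound field of the virtual source and $\frac{\partial}{\partial \mathbf{n}}$ the gradient in the direction normal to the secondary source distribution (see also figure 1). The asterisk $*_t$ denotes convolution with respect to time.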

The driving function $d(\mathbf{x}_0,t)$ for a loudspeaker at position $\mathbf{x}_0$ is thus obtained by evaluating the gradient of the desired virtual sound field in the direction normal to the loudspeaker distribution at the position of the respective loudspeaker.

Due to the fact that the physical requirements cannot be perfectly fulfilled in practical implementations, the virtual source's wave field is not perfectly reproduced in the receiver's half-space. Equation (10) requires an infinitely long continuous distribution of secondary sources, whereas practical implementations can only employ a finite number of discrete loudspeakers. The array thus has a finite length. Furthermore, equation (10) requires secondary line sources which are positioned perpendicular to the receiver plane [12]. Practical implementations typically employ loudspeakers with closed cabinets as secondary sources. These are more accurately described by point sources rather than line sources. This fact is known as secondary source mismatch and has to be compensated for as

$$d_\mathrm{corr}(\mathbf{x},t) = f(t) *_t d(\mathbf{x},t). \tag{11}$$

$f(t)$ is a filter with frequency response $F(\omega) = 2\sqrt{2\pi j k\, d_\mathrm{ref}}$, the asterisk $*_t$ denotes convolution with respect to time, and $d_\mathrm{ref}$ denotes the reference distance from the secondary source array to which the amplitude of the reproduced wave field is referenced. See [12] for a thorough treatment of the properties of WFS.

For convenience, we do not explicitly compensate for the secondary source mismatch in the analytical expressions for the driving functions. However, in the simulations this compensation is performed.

3.1. Driving function for subsonic sources

For a virtual harmonic monopole sound source of angular frequency $\omega_s$ moving uniformly along the x-axis as described in section 2, the driving function $d(\mathbf{x},t)$ derived from (6) and (10) reads [6]

$$d_\mathrm{sub}(\mathbf{x},t) = \frac{y\,(1 - M^2)}{\Psi(\mathbf{x},t)} \times \left( \frac{1}{\Psi(\mathbf{x},t)} + \frac{j \omega_s}{c\,(1 - M^2)} \right) \times s(\mathbf{x},t). \tag{12}$$

Note that $d_\mathrm{sub}(\mathbf{x},t)$ in equation (12) implicitly includes static virtual sources.

The wave field reproduced by a linear WFS array driven by equation (12) is depicted in figure 3(b). The overall length of the loudspeaker array is 8 m. The virtual source moves at a speed $v = 120$ m/s along the x-axis in positive x-direction ($M \approx \frac{1}{3}$).

3.2. Driving function for supersonic sources

The driving function for supersonic sources derived from (9) and (10) reads

$$\begin{aligned} d_\mathrm{sup}(\mathbf{x},t) &= d_1(\mathbf{x},t) + d_2(\mathbf{x},t) \\ &= \frac{y\,(1 - M^2)}{\Psi(\mathbf{x},t)} \times \left( \frac{1}{\Psi(\mathbf{x},t)} + \frac{j \omega_s}{c\,(1 - M^2)} \right) \times s_1(\mathbf{x},t) \\ &\quad + \frac{y\,(1 - M^2)}{\Psi(\mathbf{x},t)} \times \left( \frac{1}{\Psi(\mathbf{x},t)} - \frac{j \omega_s}{c\,(1 - M^2)} \right) \times s_2(\mathbf{x},t). \end{aligned} \tag{13}$$
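As a plausibility check of equation (12), the sketch below compares the closed-form driving function with a numerical approximation of $-\partial s / \partial y$ obtained from equation (6) by a central difference. It assumes that the normal of the secondary source distribution points in positive y-direction, a speed of sound of 343 m/s, and $a_0 = 1$; the function names and the sample point are hypothetical.

```python
import cmath
import math

C = 343.0  # assumed speed of sound [m/s]


def field(x, y, t, v, f_s, xs0=0.0, c=C):
    """Subsonic wave field of equation (6) with a0 = 1."""
    mach = v / c
    phi = x - v * t - xs0
    psi = math.sqrt(phi**2 + y**2 * (1.0 - mach**2))
    t_emit = t - (mach * phi + psi) / (c * (1.0 - mach**2))
    return cmath.exp(1j * 2.0 * math.pi * f_s * t_emit) / (4.0 * math.pi * psi)


def driving_function_sub(x, y, t, v, f_s, xs0=0.0, c=C):
    """Closed-form driving function of equation (12)."""
    mach = v / c
    phi = x - v * t - xs0
    psi = math.sqrt(phi**2 + y**2 * (1.0 - mach**2))
    omega = 2.0 * math.pi * f_s
    factor = y * (1.0 - mach**2) / psi
    bracket = 1.0 / psi + 1j * omega / (c * (1.0 - mach**2))
    return factor * bracket * field(x, y, t, v, f_s, xs0, c)


def driving_function_numeric(x, y, t, v, f_s, h=1e-6):
    """Negative y-derivative of the field via a central difference."""
    upper = field(x, y + h, t, v, f_s)
    lower = field(x, y - h, t, v, f_s)
    return -(upper - lower) / (2.0 * h)


if __name__ == "__main__":
    args = (0.5, 1.0, 0.002, 120.0, 500.0)  # x, y0, t, v, f_s (illustrative)
    exact = driving_function_sub(*args)
    approx = driving_function_numeric(*args)
    print("relative deviation:", abs(exact - approx) / abs(exact))
```

The same check can be repeated for each component of equation (13) by replacing the emission time with $\tilde{t}_{1,2}$, which changes the sign of the term containing $j\omega_s$ for $s_2$.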

3.3. Driving function for transonic sources

As outlined in section 2.3, we propose to reproduce $s_1(\mathbf{x},t)$ of a virtual source moving slightly faster than the speed of sound in order to approximate a transonic source. The appropriate driving function is then $d_1(\mathbf{x},t)$.

4. RESULTS

In this section, we present a number of simulations in order to analyze the properties of the proposed approach with focus on the case of $M > 1$. The case of $M < 1$ is thoroughly treated in [6].

Fig. 5: Spectrograms illustrating artifacts apparent in the reproduced wave field of a subsonic source ($v = 40$ m/s). The virtual source passes the receiver at $t \approx 1$ s. (a) Real source. The emitted frequency is 500 Hz. (b) Truncation artifacts. The length of the array is 40 m. The emitted frequency is 500 Hz. (c) Spatial aliasing. The desired signal is the S-shaped one in the middle. The emitted frequency is 4000 Hz. [Axes: t → [s], f → [Hz]; color scale in dB from −80 to −20.]

We assume a linear array of secondary monopole sources. The secondary sources are placed at an interval of $\Delta x = 0.1$ m throughout the simulations. The loudspeaker array is situated parallel to the x-axis and symmetrically around the y-axis at $y_0 = 1$ m. Its overall length is 14 m except where stated explicitly.

As inherent to WFS, the reproduced wave field only approximates the desired one for $y > y_0$. Due to the fact that we assume secondary monopole sources, the reproduced wave field on the other side of the loudspeaker array (where $y < y_0$) is a mirrored version.

4.1. Artifacts apparent in the reproduced wave field

As outlined in [6], the reproduced wave field suffers from two major artifacts: (1) echo-like artifacts due to spatial truncation of the secondary source array, and (2) spatial aliasing when the frequency content of the reproduced wave field is above the spatial aliasing frequency. Figure 5 shows spectrograms of the reproduced wave field observed at $\mathbf{x}_R = [0\ \ {-4}]^T$. The loudspeaker array is similar to the one used in the simulations in figure 3, i.e. it is situated symmetrically around the y-axis at $y_0 = 1$ m and its overall length is 8 m. The loudspeakers are positioned at intervals of $\Delta x = 0.1$ m.

In figure 5(b), a pre-echo and a post-echo in addition to the desired signal are apparent. The shorter the array, the closer in time to the desired signal the echoes occur. These truncation artifacts can be significantly reduced by the application of tapering (i.e. an attenuation of the secondary sources towards the very ends of the array) [11, 6].

Figure 5(c) depicts the spectrogram of a virtual source reproduced above the spatial aliasing frequency. For the given array with a loudspeaker spacing of $\Delta x = 0.1$ m, the spatial aliasing frequency is approximately 1700 Hz [13].

Finally, another artifact resulting from spatial truncation of the secondary source distribution is an incorrect amplitude envelope of the receiver signal. This circumstance can be observed when comparing e.g. figures 5(a) and 5(b). At the very ends of the depicted time window, the receiver signal due to the real source is significantly higher in amplitude than the receiver signal due to the virtual source. In the center of the plot, i.e. when the source is behind the secondary sources from the receiver's point of view, the amplitude due to the virtual source is similar to that due to the real source.

Fig. 6: Simulated wave fields of a source oscillating monochromatically at $f_s = 500$ Hz and moving along the x-axis in positive x-direction at $v = 600$ m/s ($M \approx 1.7$). Due to the employment of the complex notation for time domain signals (see equation (1)), only the real part $\Re\{\cdot\}$ of the considered wave field is depicted. The wave fields have been scaled to have comparable levels. The values of the sound pressure are clipped as indicated by the colorbars. The loudspeaker array in figures 6(b)-6(d) is indicated by the dotted line. It is situated symmetrically around the y-axis at $y_0 = 1$ m and its overall length is 14 m. The loudspeakers are positioned at intervals of $\Delta x = 0.1$ m. (a) $\Re\{s(\mathbf{x},t_0)\}$. (b) $\Re\{p_\mathrm{WFS}(\mathbf{x},t_0)\}$. No limitation of the temporal bandwidth. Strong aliasing artifacts are apparent (see text). (c) $\Re\{p_\mathrm{WFS}(\mathbf{x},t_0)\}$, $f_\mathrm{max} = 3000$ Hz. (d) $\Re\{p_\mathrm{WFS}(\mathbf{x},t_0)\}$, $f_\mathrm{max} = 2000$ Hz. [Axes: x → [m], y → [m].]
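The spatial aliasing frequency of approximately 1700 Hz quoted for a loudspeaker spacing of 0.1 m is consistent with the commonly used worst-case rule of thumb $f_\mathrm{al} \approx c / (2 \Delta x)$ for linear arrays. The sketch below evaluates this rule; whether [13] uses exactly this expression is not confirmed here, and the speed of sound of 343 m/s and the function name are assumptions.

```python
def spatial_aliasing_frequency(spacing_m: float, c: float = 343.0) -> float:
    """Rule-of-thumb aliasing frequency c / (2 * spacing) in Hz.

    This is a worst-case estimate for a linear array; the actual onset of
    audible aliasing depends on source and receiver positions.
    """
    if spacing_m <= 0.0:
        raise ValueError("loudspeaker spacing must be positive")
    return c / (2.0 * spacing_m)


if __name__ == "__main__":
    for spacing in (0.05, 0.1, 0.2):  # illustrative spacings in meters
        print(spacing, "m ->", spatial_aliasing_frequency(spacing), "Hz")
```

Halving the loudspeaker spacing doubles the estimate, which is why the 4000 Hz source of figure 5(c) lies well above the aliasing frequency of the simulated array.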

4.2. Direct application of the driving function for M > 1

Figure 6(b) shows a simulation of a WFS system reproducing the wave field depicted in figure 6(a). The virtual source moves at $v = 600$ m/s, i.e. $M \approx 1.7$. Due to the omnidirectionality of the secondary sources, the reproduced wave field in figure 6(b) is symmetric with respect to the secondary source contour. Note that strong artifacts are apparent. It can be shown that these artifacts occur due to temporal as well as spatial aliasing.

This can be verified by analyzing the instantaneous frequencies $f_1(t)$ and $f_2(t)$ of the reproduced wave field components $s_1(\mathbf{x},t)$ and $s_2(\mathbf{x},t)$ (see figure 7(a)). It can be seen that $f_1(t)$ and $f_2(t)$ are infinite at the singularity of the Mach cone, i.e. at the moment of the arrival of the Mach cone. After the arrival they decrease quickly to moderate values. The former means that $f_1(t)$ and $f_2(t)$ will exceed any limit imposed on a reproduction system due to discrete treatment of time and discretization of the secondary source distribution.

Fig. 7: Details of the wave field of a source of $v = 600$ m/s ($M \approx 1.7$) oscillating at $f_s = 500$ Hz observed at $\mathbf{x}_1 = [1\ \ 1]^T$. The Mach cone arrives at $t \approx 4 \cdot 10^{-3}$ s. (a) $f_{1,2}(\mathbf{x}_1,t)$. Negative frequencies indicate time reversal of the input signal. (b) $\frac{1}{\Psi(\mathbf{x}_1,t)}$. [Horizontal axis: t → [s], scaled by $10^{-3}$.]
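The behavior of the instantaneous frequencies can be reproduced with a short calculation. Differentiating the phase $\omega_s \tilde{t}_{1,2}(\mathbf{x},t)$ of the components in equation (9) with respect to $t$ gives $f_{1,2} = f_s \,(1 \pm M \Phi / \Psi) / (1 - M^2)$. This closed form is a derivation added here for illustration and does not appear in the paper; the speed of sound of 343 m/s, the initial source position $x_s(0) = 0$, and the function name are likewise assumptions.

```python
import math

C = 343.0  # assumed speed of sound [m/s]


def instantaneous_frequencies(x, y, t, v, f_s, xs0=0.0, c=C):
    """Return (f1, f2) in Hz inside the Mach cone, or None elsewhere.

    A negative value indicates time reversal of the source signal.
    """
    mach = v / c
    if mach <= 1.0:
        raise ValueError("intended for supersonic sources (M > 1)")
    phi = x - v * t - xs0
    radicand = phi**2 + y**2 * (1.0 - mach**2)
    if radicand <= 0.0 or xs0 + v * t < x:
        return None  # the Mach cone has not yet arrived at (x, y)
    ratio = mach * phi / math.sqrt(radicand)
    f1 = f_s * (1.0 + ratio) / (1.0 - mach**2)
    f2 = f_s * (1.0 - ratio) / (1.0 - mach**2)
    return f1, f2


if __name__ == "__main__":
    # Receiver position and source parameters as in figure 7.
    for t in (4.0e-3, 4.06e-3, 5.0e-3, 9.0e-3):
        print(t, instantaneous_frequencies(1.0, 1.0, t, 600.0, 500.0))
```

Under these assumptions, both magnitudes grow without bound as $\Psi \to 0$ at the Mach cone, $f_1$ stays positive and tends towards $f_s / (1 + M)$ long after the passage, and $f_2$ stays negative, in line with the description of figure 7(a).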

4.3. Modified driving function

In order to prevent temporal aliasing in digital systems due to the discretization of time, it is desirable to limit the bandwidth of the temporal spectrum of the driving function. Typical bandwidths in digital systems are 22050 Hz for systems using a temporal sampling frequency of 44100 Hz and 24000 Hz for systems using a temporal sampling frequency of 48000 Hz.

In order to prevent, or at least reduce, spatial aliasing of the WFS system under consideration, it is desirable to further limit the bandwidth of the temporal spectrum of the driving function to values on the order of the spatial aliasing frequency, which is typically a few thousand hertz. Recall that the critical frequency above which spatial aliasing occurs in the given secondary source array is approximately 1700 Hz (see section 4.1).

A simple means to limit the bandwidth is to fade in the driving signal from the moment when its temporal frequency has dropped below a given threshold. This strategy also avoids the circumstance that the amplitude of the driving signal is infinite at the moment of arrival of the Mach cone; real-world implementations of WFS systems cannot reproduce arbitrarily high amplitudes. Figure 7(b) depicts the factor $\Psi(\mathbf{x},t)^{-1}$, which determines the amplitude of the wave field around the Mach cone.

The simulations in figures 6(c) and 6(d) show the reproduced wave field when the driving function is faded in after the instantaneous frequency of the driving function has dropped below 3000 Hz (figure 6(c)) and 2000 Hz (figure 6(d)), respectively. The aliasing artifacts are significantly reduced.

Note that the shorter the fade-in of the driving function, the better the impulsive property of the Mach cone is preserved. However, shorter fade-ins result in stronger spatial aliasing since they impose more high-frequency content onto the signal.

Finally, it has to be considered that spatial aliasing is not necessarily audible under all circumstances.
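One possible realization of the fade-in strategy is sketched below. For a given secondary source position, it finds the first time sample at which the larger of the two instantaneous frequency magnitudes has dropped below the threshold and then applies a raised-cosine ramp. The paper does not specify the fade shape or duration; the raised-cosine window, the 1 ms fade length, the sampling rate, the speed of sound, the closed-form frequency expression, and all function names are assumptions made for illustration.

```python
import math

C = 343.0  # assumed speed of sound [m/s]


def max_instantaneous_frequency(x, y, t, v, f_s, xs0=0.0, c=C):
    """Largest magnitude of f1 and f2 at (x, y) for a supersonic source.

    Returns infinity before the arrival of the Mach cone so that the gain
    computed below stays at zero there.
    """
    mach = v / c
    phi = x - v * t - xs0
    radicand = phi**2 + y**2 * (1.0 - mach**2)
    if radicand <= 0.0 or xs0 + v * t < x:
        return math.inf
    ratio = mach * abs(phi) / math.sqrt(radicand)
    return f_s * (1.0 + ratio) / (mach**2 - 1.0)


def fade_in_gains(x, y, times, v, f_s, f_max, fade_s):
    """Gain per (ascending) time sample: 0 until the frequency limit is met,
    then a raised-cosine ramp of duration fade_s up to 1."""
    start = next(
        (t for t in times
         if max_instantaneous_frequency(x, y, t, v, f_s) < f_max),
        None,
    )
    gains = []
    for t in times:
        if start is None or t < start:
            gains.append(0.0)
        elif t >= start + fade_s:
            gains.append(1.0)
        else:
            gains.append(0.5 * (1.0 - math.cos(math.pi * (t - start) / fade_s)))
    return gains


if __name__ == "__main__":
    fs_audio = 44100  # assumed sampling rate [Hz]
    times = [n / fs_audio for n in range(600)]
    gains = fade_in_gains(1.0, 1.0, times, v=600.0, f_s=500.0,
                          f_max=2000.0, fade_s=1e-3)
    first_active = next(t for t, g in zip(times, gains) if g > 0.0)
    print("fade-in starts at about", first_active, "s")
```

The gains would be multiplied sample by sample with the driving signal of the respective loudspeaker. A shorter `fade_s` preserves more of the impulsive character of the Mach cone at the cost of more high-frequency content, which is the trade-off described above.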

5. PERCEPTUAL ASPECTS

Informal listening suggests that the human auditory system is not aware of all the properties of the wave field of supersonic sources. Especially the fact that the wave field contains a component carrying a time-reversed version of the source's input signal is confusing. Depending on the specific situation, it might be preferable to exclusively reproduce $s_1(\mathbf{x},t)$, i.e. the component of the wave field carrying the non-reversed input signal.

Furthermore, only the localization when exposed to $s_1(\mathbf{x},t)$ is plausible, since $s_1(\mathbf{x},t)$ assures localization of the source in its appropriate location (however with some bias due to the retarded time $\tau$). Exposure of the receiver to $s_2(\mathbf{x},t)$ suggests localization of the source in the direction where it "comes from". This also seems unnatural. Finally, the exposure of the receiver to a superposition of $s_1(\mathbf{x},t)$ and $s_2(\mathbf{x},t)$ suggests the localization of two individual sources.

6. CONCLUSIONS

An approach to the reproduction of the wave field of virtual sound sources moving at supersonic speeds was presented. The approach constitutes an extension to a treatment of the reproduction of the wave field of virtual sound sources moving at subsonic speeds previously published by the authors. It was shown that the reproduced wave field suffers from spatial aliasing artifacts due to the fact that the instantaneous frequency of the virtual sound field is infinite at the moment of arrival of the Mach cone. As a workaround, it was proposed to fade in the driving signal for a given secondary source right after the instantaneous frequency of the driving signal has dropped below a desired threshold. A short fade-in preserves the impulsive quality of the Mach cone.

In order to optimize the reproduction of the sound field of supersonic virtual sources, it is necessary to perform perceptual experiments investigating which properties of the virtual wave field have to be reproduced in order to evoke a plausible perception both in terms of frequency content and localization.

ACKNOWLEDGEMENTS

We thank Holger Waubke of the Austrian Academy of Sciences for providing us with the notes of his lecture on theoretical acoustics [9].

7. REFERENCES

[1] R. Rabenstein and S. Spors. Multichannel sound field reproduction. In Benesty, J., Sondhi, M., Huang, Y. (Eds.), Springer Handbook on Speech Processing and Speech Communication, Springer Verlag, 2007.

[2] C. Doppler. Über das farbige Licht der Doppelsterne und einiger anderer Gestirne des Himmels. In Abhandlungen der königlichen böhmischen Gesellschaft der Wissenschaften, 2, pp. 465–482, 1842.

[3] A. Franck, A. Gräfe, T. Korn, and M. Strauß. Reproduction of moving virtual sound sources by wave field synthesis: An analysis of artifacts. 32nd Int. Conference of the AES, Hillerød, Denmark, Sept. 2007.

[4] H. Strauss. Simulation instationärer Schallfelder für virtuelle auditive Umgebungen. Fortschrittberichte VDI 10/652, VDI Verlag, Düsseldorf, 2000.

[5] Y. Iwaya and Y. Suzuki. Rendering moving sound with the doppler effect in sound space. Applied Acoustics, Technical note, 68:916–922, 2007.

[6] J. Ahrens and S. Spors. Reproduction of moving virtual sound sources with special attention to the doppler effect. In 124th Convention of the AES, Amsterdam, The Netherlands, May 17–20 2008.

[7] N. Epain and E. Friot. Indoor sonic boom reproduction using ANC. In Proceedings of Active, Williamsburg, Virginia, Sep. 20–22 2004.

[8] J.D. Jackson. Classical Electrodynamics. Wiley, New York, 1975.

[9] H. Waubke. Aufgabenstellung zur Seminararbeit zur Vorlesung "Theoretische Akustik". IEM Graz, 2003.

[10] B. Girod, R. Rabenstein, and A. Stenger. Signals and Systems. J. Wiley & Sons, 2001.

[11] E.W. Start. Direct sound enhancement by wave field synthesis. PhD thesis, Delft University of Technology, 1997.

[12] S. Spors, R. Rabenstein, and J. Ahrens. The theory of wave field synthesis revisited. In 124th Convention of the AES, Amsterdam, The Netherlands, May 17–20 2008.

[13] S. Spors and R. Rabenstein. Spatial aliasing artifacts produced by linear and circular loudspeaker arrays used for wave field synthesis. In 120th Convention of the AES, Paris, France, May 20–23 2006.