Proceedings of the 23rd International Conference on Digital Audio Effects (DAFx-20), Vienna, Austria, September 8–12, 2020
DELAY NETWORK ARCHITECTURES FOR ROOM AND COUPLED SPACE MODELING
Orchisama Das, Jonathan S. Abel, Elliot K. Canfield-Dafilou
Center for Computer Research in Music and Acoustics
Stanford University
Stanford, CA, USA
[orchi|abel|kermit]@ccrma.stanford.edu
ABSTRACT
Feedback delay network reverberators have decay filters associated with each delay line to model the frequency dependent reverberation time (T60) of a space. The decay filters are typically designed such that all delay lines independently produce the same T60 frequency response. However, in real rooms, there are multiple, concurrent T60 responses that depend on the geometry and physical properties of the materials present in the rooms. In this paper, we propose the Grouped Feedback Delay Network (GFDN), where groups of delay lines share different target T60s. We use the GFDN to simulate coupled rooms, where one room is significantly larger than the other. We also simulate rooms with different materials, with unique decay filters associated with each delay line group, designed to represent the T60 characteristics of a particular material. The T60 filters are designed to emulate the materials’ absorption characteristics with minimal computation. We discuss the design of the mixing matrix to control inter- and intra-group mixing, and show how the amount of mixing affects the behavior of the room modes. Finally, we discuss the inclusion of air absorption filters on each delay line and physically motivated room resizing techniques with the GFDN.
1. INTRODUCTION
Feedback delay networks (FDNs) are efficient structures for synthesizing room impulse responses (RIRs). RIRs consist of a set of sparse early reflections which increase in density over time, building toward late reverberation where the impulse density is high and statistically Gaussian. Feedback delay networks are composed of delay lines in parallel, which are connected through a feedback matrix (or mixing matrix), which is unitary to conserve system energy [1]. Jot proposed adding shelf filters to the delay lines to yield a desired frequency dependent T60 [2, 3]. Since then, FDNs have become one of the most popular structures for synthesizing reverberation due to the relative efficiency of the approach. Recent research on FDNs has focused on mixing matrix design to increase echo density [4], modal analysis [5, 6], time-varying FDNs [7], directional FDNs [8], and reverberation time control by accurate design of the decay filters [9, 10].
In this paper, we propose a new delay network architecture for physically informed room modeling. We also provide an alternative to the design technique proposed in [10], where a 10-band graphic equalizer consisting of cascaded second order peak-notch IIR filters was fit to a desired T60 response. Our argument is that the T60 response of a room depends on the physical configuration of the room, and room modes at the same frequency need not share the same T60. The T60 response of each material depends on its frequency dependent absorption coefficients and its volume to surface area ratio, according to the Sabine theory of late-field reverberation [11]. We propose a new architecture, called the grouped feedback delay network (GFDN), where groups of delay lines have the same target T60 response associated with them. The corresponding decay filters are designed to be low-order filters consisting of cascaded shelf and resonant biquad filters. Low order filters significantly reduce the computation relative to [10], where each delay line has an IIR filter of order 20. The interaction among the different delay line groups is controlled by a block mixing matrix. As applications of the proposed GFDN, we model coupled rooms and a single room composed of different materials, and we propose an efficient means of incorporating air absorption and a physically accurate method to resize the modeled rooms.

Figure 1: GFDN block diagram. (Diagram labels: input u(n), output y(n), direct gain d, input gains b1 and b2, delay lines z^{-τ1} and z^{-τ2}, decay filters g1(z) and g2(z), output gains c1 and c2, and the block mixing matrix [M11 M12; M21 M22], with signal widths 1, N1, and N2.)

Copyright: © 2020 Orchisama Das, Jonathan S. Abel, and Elliot K. Canfield-Dafilou. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
In §2, we introduce the structure of the GFDN and discuss block mixing matrix design. In §3, we use the GFDN to simulate the impulse response of a large room coupled with a smaller room, such as box seating in an opera hall. Two different sets of decay filters, associated with two delay line groups, are used, and the mixing matrix is parameterized by a coupling coefficient, which effectively controls the amount of coupling between the two rooms. In §4, a single room made of different materials is modeled with the GFDN. Delay line groups have different T60 filters associated with each material. T60 filter design according to material absorption characteristics is discussed in §4.1. The amount of mixing controls the behavior of the GFDN modes. Modal analysis as a function of the mixing matrix shows how the T60 characteristics of different materials interact as the occupancy of a room changes [6]. In §5, we build upon the method proposed in [12] and discuss efficient room resizing with the GFDN by taking into consideration air absorption, delay line lengths, and T60 filters. The paper is concluded in §6.
2. GROUPED FEEDBACK DELAY NETWORKS
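A standard feedback delay network consists of $N$ delay lines of length $\tau_i$ seconds, $i = 1, 2, \ldots, N$, each with its associated decay filter, $g_i(z)$, connected through an $N \times N$ feedback matrix, $\mathbf{M}$. For a frequency dependent $T_{60}(z)$, the decay filter gains are related to the delay line length as

$$g_i(z) = 0.001^{\tau_i / T_{60}(z)}. \tag{1}$$

That is, a signal recirculating through delay line $i$ is attenuated by 60 dB after $T_{60}$ seconds. The same $T_{60}(z)$ is used to design the decay filters in all $N$ delay lines. In the proposed grouped feedback delay network architecture, we use a different $T_{60}(z)$ for each set of delay lines. In Fig. 1, a GFDN with two sets of delay lines is shown. For a total of $N$ delay lines, $N_1$ delay lines have a decay response, $T_{60_1}(z)$, and $N_2$ delay lines have a decay response, $T_{60_2}(z)$, such that $N_1 + N_2 = N$. The two groups of decay filter gains, $\mathbf{g}_1(z)$ and $\mathbf{g}_2(z)$, are calculated according to the different $T_{60}(z)$s. The mixing matrix $\mathbf{M}$ is now an $N \times N$ block matrix made of the submatrices $\mathbf{M}_{ij} \in \mathbb{R}^{N_i \times N_j}$, $i, j = 1, 2$. With $\mathbf{c}_i, \mathbf{b}_i, \mathbf{g}_i \in \mathbb{C}^{N_i \times 1}$ and $\boldsymbol{\tau}_i \in \mathbb{R}^{N_i \times 1}$, the transfer function of Fig. 1, $H(z)$, can be written as

$$H(z) = \frac{Y(z)}{U(z)} = d + \begin{pmatrix} \mathbf{c}_1^T & \mathbf{c}_2^T \end{pmatrix} \begin{pmatrix} \mathbf{g}_1(z) z^{-\boldsymbol{\tau}_1} & \mathbf{0} \\ \mathbf{0} & \mathbf{g}_2(z) z^{-\boldsymbol{\tau}_2} \end{pmatrix} \left( \mathbf{I} - \begin{pmatrix} \mathbf{g}_1(z) z^{-\boldsymbol{\tau}_1} & \mathbf{0} \\ \mathbf{0} & \mathbf{g}_2(z) z^{-\boldsymbol{\tau}_2} \end{pmatrix} \begin{pmatrix} \mathbf{M}_{11} & \mathbf{M}_{12} \\ \mathbf{M}_{21} & \mathbf{M}_{22} \end{pmatrix} \right)^{-1} \begin{pmatrix} \mathbf{b}_1 \\ \mathbf{b}_2 \end{pmatrix}, \tag{2}$$

where $\mathbf{g}_i(z) z^{-\boldsymbol{\tau}_i}$ denotes the $N_i \times N_i$ diagonal matrix of filtered delays for group $i$.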
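The relation in (1) can be illustrated with a minimal Python sketch that evaluates the target per-pass gain of each delay line at two frequencies. The delay line lengths and the group names are hypothetical values chosen for illustration; the DC and Nyquist T60 targets are borrowed from the coupled-room evaluation in this paper. The sketch only evaluates target gains and does not design the filters themselves.

```python
import math

def decay_gain(delay_s, t60_s):
    """Per-pass gain of a delay line of `delay_s` seconds for a target T60, as in (1)."""
    return 0.001 ** (delay_s / t60_s)

def gain_db(gain):
    """Convert a linear gain to decibels."""
    return 20.0 * math.log10(gain)

# Hypothetical delay line lengths (seconds) for two groups.
group_delays = {"group1": [0.011, 0.013], "group2": [0.017, 0.019]}
# Target T60 (seconds) at DC and Nyquist for each group.
group_t60 = {"group1": {"dc": 1.0, "nyquist": 0.2},
             "group2": {"dc": 3.0, "nyquist": 1.0}}

# Target gain of every delay line at both frequencies.
target_gains = {
    name: [{band: decay_gain(tau, t60) for band, t60 in group_t60[name].items()}
           for tau in delays]
    for name, delays in group_delays.items()
}

for name, per_line in target_gains.items():
    for tau, gains in zip(group_delays[name], per_line):
        print(name, tau, {band: round(gain_db(g), 3) for band, g in gains.items()})
```

Each group uses its own T60 target, so delay lines of similar length in different groups end up with different gains, which is the defining property of the grouped design.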
The mixing matrix determines the amount of coupling between the various delay lines. This property controls the rate at which the echo density increases. A room with many objects and complex geometry will mix faster than an empty room with simple geometry. The mixing matrix can be designed to have a desired mixing time according to the method in [6], where the Kronecker product of $\log_2(N)$ copies of a $2 \times 2$ rotation/reflection matrix (parameterized by an angle $\theta$) gives an $N \times N$ orthonormal matrix, $\mathbf{M}(\theta)$:

$$\mathbf{R}(\theta) = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}, \qquad \mathbf{M}_{N \times N}(\theta) = \mathbf{R}(\theta) \otimes \mathbf{R}(\theta) \otimes \cdots \otimes \mathbf{R}(\theta). \tag{3}$$

A well-diffused room with fast mixing time can be achieved by a scaled Hadamard mixing matrix ($\theta = \pi/4$). Similarly, a “room” with no mixing and no increase in echo density can be synthesized by an identity mixing matrix ($\theta = 0$). The parameter $\theta$ can be chosen to give a desired mixing time, where $\theta = \pi/4$ yields the maximum amount of mixing and smaller positive values give less mixing.
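The construction in (3) can be sketched in a few lines of Python. This is a minimal illustration using nested lists; the function names are hypothetical, and the sign convention of the rotation matrix (negative sine in the lower-left entry) is an assumption, since either placement gives an orthonormal matrix with the limiting cases described above.

```python
import math

def rotation(theta):
    """2 x 2 rotation matrix R(theta); sign placement is an assumed convention."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s], [-s, c]]

def kron(a, b):
    """Kronecker product of two matrices stored as nested lists."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]

def mixing_matrix(theta, n):
    """N x N mixing matrix built from log2(n) Kronecker factors of R(theta)."""
    m = rotation(theta)
    while len(m) < n:
        m = kron(m, rotation(theta))
    if len(m) != n:
        raise ValueError("n must be a power of two >= 2")
    return m

def gram(m):
    """Return M^T M, which should be the identity for an orthonormal matrix."""
    n = len(m)
    return [[sum(m[k][i] * m[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

no_mixing = mixing_matrix(0.0, 4)             # identity
full_mixing = mixing_matrix(math.pi / 4, 4)   # scaled Hadamard-type matrix
partial_mixing = mixing_matrix(math.pi / 16, 4)
```

For $\theta = \pi/4$ every entry has magnitude $1/\sqrt{N}$, so each delay line feeds all others equally, while small $\theta$ keeps most of the energy on the diagonal.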
In the GFDN, we can choose different, independent $\theta$ values for each delay line group (the diagonal submatrices $\mathbf{M}_{11}$ and $\mathbf{M}_{22}$). The off-diagonal submatrices ($\mathbf{M}_{12}$ and $\mathbf{M}_{21}$) then control how strongly coupled the groups are to each other. This gives us independent control over the intra- and inter-group mixing characteristics. The design of these submatrices is described in the following section.

Figure 2: Coupled rooms. (Diagram labels: rooms R1 and R2 with volumes V1 and V2, and S.)
3. COUPLED ROOMS
Two or more rooms can be coupled through an acoustically transparent aperture. If the acoustic source is present in the smaller room with a shorter decay time, the sound will travel to the larger room and spill back into the first room. Such a configuration is shown in Fig. 2. The resulting impulse response will have a non-exponential decay. The first part of the decay has a steeper slope due to the short decay time of the first room, whereas the latter part has a gentler slope representing the longer decay time of the second room. This is known as the double-slope effect (DSE). The physics of sound propagation in coupled rooms was studied in [13]. The effect of the volume ratio, absorption ratio, and aperture size on the double slope profile was studied in [14]. Coupled spaces are ubiquitous in the real world. They are found in concert halls, opera halls, and churches [15], where columns, arches, domes, etc., divide the space into two or more subspaces with different absorption properties.
3.1. Coupled mixing matrix design
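The mixing matrix is crucial in simulating coupled rooms since it controls diffusion within each room and among the rooms. A method for mixing matrix design in coupled rooms has been suggested in [16], where a sign-agnostic Procrustes method is used to convert an arbitrary matrix to its nearest orthonormal form. Here, we take a different approach. The diagonal submatrices that represent mixing in rooms 1 and 2, respectively, can be characterized by two mixing angles, $\theta_1$ and $\theta_2$, depending on the occupancy of the rooms. The off-diagonal matrices represent the coupling between the two rooms, and can be represented by matrices $\mathbf{R}_{12}$, $\mathbf{R}_{21}$, multiplied by a scalar, $\alpha$, which represents the amount of coupling:

$$\mathbf{M} = \begin{pmatrix} \mathbf{M}(\theta_1) & \alpha \mathbf{R}_{12} \\ \alpha \mathbf{R}_{21} & \mathbf{M}(\theta_2) \end{pmatrix}. \tag{4}$$

This coupled mixing matrix is required to be orthonormal by design. Using this criterion, i.e., $\mathbf{M}^T \mathbf{M} = \mathbf{I}$, we arrive at the following constraints:
1. $\mathbf{R}_{12}$ and $\mathbf{R}_{21}$ need to be orthonormal.
2. $\mathbf{M}(\theta_1)^T \mathbf{R}_{12} + \mathbf{R}_{21}^T \mathbf{M}(\theta_2) = \mathbf{0} \Rightarrow \mathbf{R}_{21} = -\mathbf{M}(\theta_2) \mathbf{R}_{12}^T \mathbf{M}(\theta_1)$.
3. $\mathbf{M}$ needs to be scaled by $\frac{1}{\sqrt{1 + \alpha^2}}$.

Let $\mathbf{R}_{12} = \mathbf{M}(\theta_1)^{\frac{1}{2}} \mathbf{M}(\theta_2)^{\frac{1}{2}} = \mathbf{M}(\frac{\theta_1}{2}) \mathbf{M}(\frac{\theta_2}{2})$. Therefore, $\mathbf{R}_{21} = -\mathbf{M}(\frac{\theta_2}{2}) \mathbf{M}(\frac{\theta_1}{2})$. Now, the orthonormal mixing matrix is

$$\mathbf{M} = \frac{1}{\sqrt{1 + \alpha^2}} \begin{pmatrix} \mathbf{M}(\theta_1) & \alpha \mathbf{M}(\frac{\theta_1}{2}) \mathbf{M}(\frac{\theta_2}{2}) \\ -\alpha \mathbf{M}(\frac{\theta_2}{2}) \mathbf{M}(\frac{\theta_1}{2}) & \mathbf{M}(\theta_2) \end{pmatrix}. \tag{5}$$

Let $\frac{1}{\sqrt{1 + \alpha^2}} = \cos\phi$ and $\frac{\alpha}{\sqrt{1 + \alpha^2}} = \sin\phi$; then our mixing matrix is characterized by a coupling angle, $\phi \in [0, \frac{\pi}{4}]$ radians. When $\phi = 0$, we get minimum coupling (block-diagonal $\mathbf{M}$), and when $\phi = \frac{\pi}{4}$, we get maximum coupling between the two rooms. The final parameterized coupled mixing matrix is

$$\mathbf{M}(\theta_1, \theta_2, \phi) = \begin{pmatrix} \cos\phi \, \mathbf{M}(\theta_1) & \sin\phi \, \mathbf{M}(\frac{\theta_1}{2}) \mathbf{M}(\frac{\theta_2}{2}) \\ -\sin\phi \, \mathbf{M}(\frac{\theta_2}{2}) \mathbf{M}(\frac{\theta_1}{2}) & \cos\phi \, \mathbf{M}(\theta_2) \end{pmatrix}. \tag{6}$$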
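As a numerical check of the parameterization in (6), the following Python sketch assembles the coupled matrix for two equally sized groups and lets the reader verify that it is orthonormal. The helper names are hypothetical, the rotation sign convention is an assumption, and equal group sizes are assumed so that the products of half-angle matrices are well defined.

```python
import math

def rotation(theta):
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s], [-s, c]]  # assumed sign convention

def kron(a, b):
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]

def mixing_matrix(theta, n):
    """Kronecker power of rotation(theta); n must be a power of two >= 2."""
    m = rotation(theta)
    while len(m) < n:
        m = kron(m, rotation(theta))
    if len(m) != n:
        raise ValueError("n must be a power of two >= 2")
    return m

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

def coupled_mixing_matrix(theta1, theta2, phi, n_per_group):
    """Block matrix of (6) for two groups of n_per_group delay lines each."""
    m1 = mixing_matrix(theta1, n_per_group)
    m2 = mixing_matrix(theta2, n_per_group)
    half1 = mixing_matrix(theta1 / 2, n_per_group)
    half2 = mixing_matrix(theta2 / 2, n_per_group)
    r12 = matmul(half1, half2)   # M(theta1/2) M(theta2/2)
    r21 = matmul(half2, half1)   # M(theta2/2) M(theta1/2), negated below
    c, s = math.cos(phi), math.sin(phi)
    top = [[c * x for x in m1[i]] + [s * x for x in r12[i]]
           for i in range(n_per_group)]
    bottom = [[-s * x for x in r21[i]] + [c * x for x in m2[i]]
              for i in range(n_per_group)]
    return top + bottom

coupled = coupled_mixing_matrix(math.pi / 4, math.pi / 8, math.pi / 6, 4)
gram_matrix = matmul(transpose(coupled), coupled)   # should be the identity
decoupled = coupled_mixing_matrix(math.pi / 4, math.pi / 8, 0.0, 4)
```

Setting `phi` to zero leaves the off-diagonal blocks empty, so the two groups recirculate independently.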
3.2. Evaluation
To simulate coupled rooms, R1 and R2, we design an 8 delay line GFDN, with 4 delay lines each representing the smaller and larger room, with the source placed in R2 and listener placed in R1. The source and listener locations are determined by the $\mathbf{b}_1, \mathbf{b}_2$ and $\mathbf{c}_1, \mathbf{c}_2$ coefficients, respectively. The T60 filters of the two rooms are first order low shelf filters parameterized by the DC and Nyquist gains and the transition frequency. The smaller room, R1 (in blue), has a shorter decay time, with a T60 of 1 s at DC and 0.2 s at Nyquist, and $f_T = 1$ kHz. The larger room, R2 (in red), has a T60 of 3 s at DC and 1 s at Nyquist, and $f_T = 4$ kHz. The decay filters, $g_1(z)$ and $g_2(z)$, calculated according to (1), are shown in Fig. 3a.

The impulse responses of the coupled GFDN¹ as a function of linearly spaced coupling angles (normalized by $\frac{\pi}{4}$) are shown in Fig. 3b. As expected, when $\phi = 0$, the rooms are decoupled and the GFDN gives zero output. Increasing $\phi$ increases diffusion between the two rooms, giving denser reverb. The normalized echo density (NED) [17], which is a perceptual measure of reverberation that helps quantify when early decay switches to late reverb, is plotted in black. The NED plots show that denser reverberation is achieved more quickly as $\phi$ increases. The effect of the smaller room dominates in the coupled room RIRs as $\phi$ increases. This appears to go against the finding in [14], where subjects of a listening test perceived more reverberance as the coupling coefficient increased. However, in our case the source is in the smaller room whereas the listener is in the bigger room. So, the perceived reverberance will decrease as the coupling coefficient increases.
¹ All sound examples are available at https://ccrma.stanford.edu/~orchi/FDN/GFDN/GFDN.html

Figure 3: Coupled rooms modeled with the GFDN. (a) Desired T60 response in seconds (left) and delay line T60 filter gain in dB (right) against frequency in Hz, for Room 1 (small) and Room 2 (big). (b) Impulse responses (amplitude against time in ms) of the larger room, the coupled rooms, and the smaller room for coupling coefficients 0, 0.25, 0.5, 0.75, and 1. Normalized echo density (NED) in black.

Additionally, we calculate the slopes of the two-stage decay of the synthesized RIR. We do this by fitting the sum of two decaying exponentials and a constant to the energy envelope of the synthesized RIR,

$$h_{env}(t) = \gamma_0 + \gamma_1 \exp\left(-\frac{t}{T_1}\right) + \gamma_2 \exp\left(-\frac{t}{T_2}\right). \tag{7}$$
We use MATLAB’s fmincon to find the decay time constants $T_1$, $T_2$, and update the amplitudes $\gamma_0, \gamma_1, \gamma_2$ using weighted least squares (with more weight on the tail). The two-stage decay of the RIR, with the fitted curve, can be seen in Fig. 4a. The ratio of the T60s calculated from $T_1$ and $T_2$, as a function of the coupling coefficient, is shown in Fig. 4b. This is known as the decay ratio [14]. A larger decay ratio indicates more coupling.
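The linear half of this fit can be sketched as follows. This is not the authors' MATLAB code: it is a minimal Python illustration of the weighted least squares update of the amplitudes in (7) with $T_1$ and $T_2$ held fixed, applied to a synthetic, noise-free envelope. The nonlinear search over $T_1$ and $T_2$ is omitted, and the weights, sample times, and parameter values are all hypothetical.

```python
import math

def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x

def fit_amplitudes(times, envelope, t1, t2, weights):
    """Weighted least squares for (gamma0, gamma1, gamma2) with T1, T2 fixed."""
    basis = [[1.0, math.exp(-t / t1), math.exp(-t / t2)] for t in times]
    normal = [[sum(w * row[i] * row[j] for w, row in zip(weights, basis))
               for j in range(3)] for i in range(3)]
    rhs = [sum(w * row[i] * y for w, row, y in zip(weights, basis, envelope))
           for i in range(3)]
    return solve_linear(normal, rhs)

# Synthetic envelope with known parameters (hypothetical values).
t1, t2 = 0.05, 0.3
true_gammas = (1e-3, 1.0, 0.05)
times = [0.01 * i for i in range(200)]
envelope = [true_gammas[0] + true_gammas[1] * math.exp(-t / t1)
            + true_gammas[2] * math.exp(-t / t2) for t in times]
weights = [1.0 + t for t in times]   # more weight on the tail

gammas = fit_amplitudes(times, envelope, t1, t2, weights)
decay_ratio = t2 / t1   # equals the ratio of the two T60s
```

Because the T60 of each exponential is proportional to its time constant, the decay ratio reduces to $T_2 / T_1$ regardless of the proportionality factor.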
Figure 4: Top (a): two-stage decay in the coupled GFDN impulse response for $\phi = \frac{\pi}{4}$ (magnitude in dB against time in s). The red line indicates the energy envelope, the yellow line is the curve fit, and the black dotted lines are the two-stage decay fits. Bottom (b): decay ratio (ratio of the T60s of the two rooms) vs. normalized coupling angle, $\phi / (\pi/4)$.
4. SINGLE ROOM WITH DIFFERENT MATERIALS
As described in [11], the acoustic energy density of a room, $w(t)$, with volume $V$ and absorbing surface area $A$, decays exponentially as a function of time:

$$w(t) = w_0 e^{-t/\tau}, \qquad \tau = \frac{V}{g c A}, \qquad T_{60} = -2 \log(0.001) \, \tau, \tag{8}$$

where $c$ is the speed of sound, $g$ is a geometric constant, and $\log$ denotes the natural logarithm. A room is typically constructed of several absorbing materials, each with its unique frequency dependent absorption, $S(\omega)$, and surface area, $a$. The T60 of the room, and of the individual materials, is given by

$$T_{60_{room}}(\omega) = -2 \log(0.001) \frac{V}{g c \sum_i a_i S_i(\omega)}, \qquad T_{60_{mat_i}}(\omega) = -2 \log(0.001) \frac{V}{g c \, a_i S_i(\omega)}. \tag{9}$$

Material   125 Hz   250 Hz   500 Hz   1000 Hz   2000 Hz   4000 Hz
Plywood    0.28     0.22     0.17     0.22      0.10      0.11
Glass      0.35     0.25     0.18     0.12      0.07      0.04
Carpet     0.02     0.06     0.14     0.37      0.66      0.65
Air        0.10     0.30     0.60     1.00      1.90      5.80

Table 1: Absorption coefficients of different materials as a function of frequency. The absorption coefficients of air are the ANSI standard values at 20 °C and 30–50% humidity.
To model such a room with the GFDN, we associate different T60 filters with each group of delay lines, corresponding to the different materials present in the room. Groups of delay lines that share the same T60 filter represent surfaces in the room that are made of the same material. The T60 corresponding to a material depends on the volume to surface area ratio, $\frac{V}{a}$, and the material’s absorption characteristics, $S(\omega)$. This is unlike standard FDNs, where a single T60 filter representing the room’s reverberation time is associated with all delay lines. The grouped design yields significant computational savings, because the T60 filters associated with most materials can be represented by simple low order IIR filters, whereas a single T60 filter based on the room geometry (9) would require a very high-order filter. A physical motivation behind this design choice is that in real rooms, multiple modes at the same frequency can have different decay rates that depend on the properties of, and distances between, the surfaces from which the acoustic waves are reflected.
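To make (9) concrete, the sketch below evaluates the per-material and whole-room T60 in each octave band. It is a minimal Python illustration under stated assumptions: the speed of sound (343 m/s) and the geometric constant ($g = 1/4$, which reproduces the classical Sabine formula) are assumed values not given in the text, and the volume and surface areas are those of the cubical room evaluated in this paper. The absorption coefficients are taken from Table 1.

```python
import math

SPEED_OF_SOUND = 343.0      # m/s, assumed
GEOMETRIC_CONSTANT = 0.25   # assumed value of g (classical Sabine formula)

BANDS_HZ = [125, 250, 500, 1000, 2000, 4000]
ABSORPTION = {   # Table 1
    "plywood": [0.28, 0.22, 0.17, 0.22, 0.10, 0.11],
    "glass":   [0.35, 0.25, 0.18, 0.12, 0.07, 0.04],
    "carpet":  [0.02, 0.06, 0.14, 0.37, 0.66, 0.65],
}
AREA_M2 = {"plywood": 77.0, "glass": 8.0, "carpet": 25.0}
VOLUME_M3 = 125.0

def t60_from_absorbing_area(volume, absorbing_area):
    """T60 = -2 log(0.001) V / (g c A), following (8)."""
    tau = volume / (GEOMETRIC_CONSTANT * SPEED_OF_SOUND * absorbing_area)
    return -2.0 * math.log(0.001) * tau

def material_t60(material, band):
    """T60 the room would have if only this material absorbed, as in (9)."""
    absorbing_area = AREA_M2[material] * ABSORPTION[material][band]
    return t60_from_absorbing_area(VOLUME_M3, absorbing_area)

def room_t60(band):
    """T60 of the room with all materials absorbing together, as in (9)."""
    total = sum(AREA_M2[m] * ABSORPTION[m][band] for m in AREA_M2)
    return t60_from_absorbing_area(VOLUME_M3, total)

for band, freq in enumerate(BANDS_HZ):
    per_material = {m: round(material_t60(m, band), 2) for m in AREA_M2}
    print(freq, "Hz", per_material, "room:", round(room_t60(band), 2))
```

The decay rates add, so the room T60 is always shorter than the shortest material T60 in the same band.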
4.1. T60 filter design
By specifying the absorption coefficients and the volume to surface area ratio in a room, T60 filters for several materials can be designed. Table 1 shows the absorption coefficients of three common construction materials and air in octave bands. In Fig. 5, filter fits to the T60 responses of plywood, glass, carpet, and air are shown.

Often (Figs. 5b, 5d), a simple first order shelf filter is enough to model the desired T60 response. The first order shelf filter, parameterized by its DC gain, $\gamma_0$, Nyquist gain, $\gamma_\pi$, and transition frequency, $\omega_T$, is given by

$$H(s) = \sqrt{\gamma_0 \gamma_\pi} \; \frac{\frac{s}{\omega_T} + \left(\frac{\gamma_0}{\gamma_\pi}\right)^{\frac{1}{2}}}{\frac{s}{\omega_T} \left(\frac{\gamma_0}{\gamma_\pi}\right)^{\frac{1}{2}} + 1}. \tag{10}$$

For materials like plywood that have resonant shelf-like T60 characteristics (Fig. 5a), we can cascade a peak/notch biquad filter with a first order shelf filter. Thus, a third order filter is needed. The transfer function of the peak (or notch) biquad, parameterized by its peak frequency, $\omega_c$, gain at the peak frequency, $\gamma_{\omega_c}$, and quality factor, $Q$, is given by

$$H(s) = \frac{\left(\frac{s}{\omega_c}\right)^2 + \frac{\gamma_{\omega_c}}{Q} \frac{s}{\omega_c} + 1}{\left(\frac{s}{\omega_c}\right)^2 + \frac{1}{Q} \frac{s}{\omega_c} + 1}. \tag{11}$$
Figure 5: T60 filter fits to different materials for a delay line length of 10 ms: (a) plywood, (b) glass, (c) carpet, (d) air. Each panel plots the material T60 filter response (dB) against frequency (Hz). Circles represent theoretical T60 values calculated according to Sabine’s equation.
The details of converting these analog filter coefficients to digital filter coefficients are given in [18].
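The analog prototypes in (10) and (11) can be checked numerically by evaluating them on the imaginary axis. The following Python sketch does this under the assumption that the shelf has gain $\gamma_0$ at DC, $\gamma_\pi$ at high frequencies, and their geometric mean at $\omega_T$; the function names and parameter values are hypothetical, and no analog-to-digital conversion is attempted here.

```python
import math

def shelf_response(omega, gamma_0, gamma_pi, omega_t):
    """First order shelf of (10), evaluated at s = j*omega."""
    s = 1j * omega / omega_t
    ratio = math.sqrt(gamma_0 / gamma_pi)
    return math.sqrt(gamma_0 * gamma_pi) * (s + ratio) / (s * ratio + 1.0)

def peak_response(omega, gain, q, omega_c):
    """Peak/notch biquad of (11), evaluated at s = j*omega."""
    s = 1j * omega / omega_c
    return (s * s + (gain / q) * s + 1.0) / (s * s + s / q + 1.0)

def cascade_response(omega, shelf_params, peak_params):
    """Third order filter: shelf cascaded with a peak/notch biquad."""
    return shelf_response(omega, *shelf_params) * peak_response(omega, *peak_params)

# Hypothetical parameters: per-pass gains close to unity, as for a T60 filter.
omega_t = 2.0 * math.pi * 1000.0
omega_c = 2.0 * math.pi * 250.0
shelf_params = (0.95, 0.80, omega_t)   # gamma_0, gamma_pi, omega_T
peak_params = (0.90, 1.5, omega_c)     # gain at omega_c, Q, omega_c

dc_gain = abs(shelf_response(0.0, *shelf_params))
transition_gain = abs(shelf_response(omega_t, *shelf_params))
high_gain = abs(shelf_response(1e9 * omega_t, *shelf_params))
peak_gain = abs(peak_response(omega_c, *peak_params))
```

A peak gain below one turns the biquad into a notch, which is the case of interest when a material absorbs more strongly around one frequency.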
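Some materials have a steeper T60 slope, and a first or second order shelf filter is not enough to model their responses (Fig. 5c). For such materials, we use the method in [19] to cascade multiple second order shelf filters ($N_O$ biquads per octave) to achieve a desired transition bandwidth in number of octaves, $\beta$, upper cutoff frequency, $\omega_u$, DC gain, $\gamma_0$, and Nyquist gain, $\gamma_\pi$. The total number of biquads needed is $N = \beta N_O$. The center frequencies, $\omega_{c\mu}$, of the $N$ filters are distributed over the $\beta$-octave transition band below $\omega_u$ following [19], and the DC and Nyquist gains of each of the filters are $\gamma_{0\mu} = \sqrt[N]{\gamma_0}$ and $\gamma_{\pi\mu} = \sqrt[N]{\gamma_\pi}$. For $Q = \frac{1}{\sqrt{2}}$, the transfer function of the cascaded shelf filters is given by

$$H(s) = \prod_{\mu=0}^{N-1} H_\mu(s), \qquad H_\mu(s) = \frac{\left(\frac{s}{\omega_{c\mu}}\right)^2 \gamma_{\pi\mu}^{\frac{1}{2}} + \frac{s}{\omega_{c\mu}} \frac{(\gamma_{0\mu} \gamma_{\pi\mu})^{\frac{1}{4}}}{Q} + \gamma_{0\mu}^{\frac{1}{2}}}{\left(\frac{s}{\omega_{c\mu}}\right)^2 \gamma_{\pi\mu}^{-\frac{1}{2}} + \frac{s}{\omega_{c\mu}} \frac{(\gamma_{0\mu} \gamma_{\pi\mu})^{-\frac{1}{4}}}{Q} + \gamma_{0\mu}^{-\frac{1}{2}}}. \tag{12}$$

For designing the T60 filter of carpet (Fig. 5c), we chose $\beta = 3$ and $N_O = 1$, giving a total of 3 biquads and a filter order of 6.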
4.2. Evaluation
We synthesize the RIR of a 5 × 5 × 5 m³ cubical room with a carpeted floor (25 m²), a glass window on a wall (8 m²), and plywood on the ceiling and the rest of the walls (77 m²) with a 16 delay line GFDN, with 8 delay lines dedicated to modeling plywood, and 4 delay lines each for carpet and glass. We vary the mixing matrix from minimum to maximum occupancy (identity to Hadamard). T60s for smaller mixing angles are longer. The modal decomposition of the GFDN, calculated according to [5], for four different mixing angles is shown in Fig. 6. As per our previous work [6], mode dampings approach each other and mode frequencies repel as mixing increases. This effect is clearly visible: with no mixing, the mode T60 responses resemble the individual filter characteristics of the three materials, but as mixing increases, they scatter towards each other. For a fully mixed GFDN, the T60s are well mixed and converge within a narrow band, thus giving a more diffused RIR.

However, the unusually high T60 of the modes produces very long RIRs. This is because we have not taken into account the effect of air absorption in the room, which is discussed in the following section.
Figure 6: GFDN modes (pole T60 in s vs. frequency in Hz) as a function of the mixing matrix angle, for fraction mixing of (a) 0, (b) 0.25, (c) 0.5, and (d) 1. An increase in mixing causes modes to approach each other in damping and scatter.
5. AIR ABSORPTION AND ROOM RESIZING
As described by Sabine’s equation (8), reverberation time is related to the volume and surface area of the room. In small rooms, the reverberation time and character are predominantly a result of the material properties, but in larger rooms, the effect of air absorption becomes significant as the volume increases. The architecture of the GFDN described above can be useful for realistically modeling rooms of different sizes. In addition to having each delay line group represent a single material, we can cascade an air absorption filter (first-order shelf) with each delay line. Naturally, this increases the filter order for each delay line; however, it improves our ability to model the reverberation characteristics of realistic rooms.

Fig. 7 shows the frequency responses of the T60 filters, impulse responses, and spectrograms of the room described in §4.2 with and without the effect of air absorption. Since this is a medium sized room, the inclusion of air absorption has a noticeable effect in damping the high frequencies. It also has a significant effect in making the reverberation sound natural and less metallic.
Now, say we have a GFDN that models a room that we like, but we want to increase or decrease the size of the room. Since each delay line encodes the size, material properties, and ratio of absorbing surface area to volume, we can resize the room by scaling the delay line lengths, $\tau$, and recalculating the $g(z)$ filter coefficients to account for the changes in surface area and volume.

Let $L_0$ be the nominal length of the original room. To resize the room to a nominal length $L$, we proportionally scale the delay line lengths

$$\tau_{scaled} = \frac{L}{L_0} \tau_{original}. \tag{13}$$

Then, based on the new absorbing surface area, volume, and delay line lengths, we recalculate the target T60s and the $g(z)$ filters for each delay line as described in §4. For small rooms, the material properties will be more significant, and in large rooms, air absorption will be more significant.
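The resizing steps can be sketched as follows. This minimal Python illustration assumes uniform scaling, so that the volume grows with the cube and each surface area with the square of the scale factor $L / L_0$, and it reuses the Sabine-style relation of (9) with assumed constants (speed of sound 343 m/s, $g = 1/4$). The delay lengths and the single-band absorption value are hypothetical, and air absorption is left out for brevity.

```python
import math

SPEED_OF_SOUND = 343.0      # m/s, assumed
GEOMETRIC_CONSTANT = 0.25   # assumed value of g

def material_t60(volume, area, absorption):
    """Material T60 at one frequency, following (9)."""
    tau = volume / (GEOMETRIC_CONSTANT * SPEED_OF_SOUND * area * absorption)
    return -2.0 * math.log(0.001) * tau

def resize_room(delays_s, volume, area, scale):
    """Scale delays as in (13); volume and area follow from uniform scaling."""
    scaled_delays = [scale * tau for tau in delays_s]
    return scaled_delays, volume * scale ** 3, area * scale ** 2

def decay_gain(delay_s, t60_s):
    """Per-pass gain for a target T60, as in (1)."""
    return 0.001 ** (delay_s / t60_s)

# Hypothetical group modeling carpet at 1 kHz in a 5 x 5 x 5 m room.
delays = [0.011, 0.013]
volume, area, absorption = 125.0, 25.0, 0.37
scale = 2.0   # L / L0: double the nominal length

original_t60 = material_t60(volume, area, absorption)
new_delays, new_volume, new_area = resize_room(delays, volume, area, scale)
new_t60 = material_t60(new_volume, new_area, absorption)
new_gains = [decay_gain(tau, new_t60) for tau in new_delays]
```

Scaling dimensions independently, as discussed in the text, would replace the single `scale` with per-dimension factors and recompute each surface area from the room geometry.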
Updating the delay line lengths scales the room mode frequencies and adjusts the temporal spacing of the early reflections and the mixing time, while modifying the filters correctly updates the target frequency dependent T60s for a room of the new volume and surface area. This means that we do not need to modify the mixing matrix. Note that the method for room size control here is a refinement of the methods described in [12], since we explicitly have filters for materials and air absorption in each delay line. If real-time room size control is desired, we can forgo modifying the delay line lengths to avoid pitch shift artifacts. Moreover, if we start with the geometry of the room we are modeling, it is easy to scale the dimensions independently. For example, we can raise the roof by modifying only the delay line lengths associated with the room’s height. We have to modify the $g(z)$ filters associated with air absorption since the room’s volume is changed; however, we only need to change the material filters associated with the changes to the surface area of the walls, as the floor and ceiling do not change. Alternatively, we do not need to explicitly know the geometry of the room. Since we group some delay lines together, we can modify individual groups to modify different components of the room abstractly.
Fig. 8 shows the impulse response, modes, and spectrogram for the GFDN designed for the medium sized room described in §4.2. We additionally show impulse responses and spectrograms of the same room, scaled to have the nominal length halved and doubled. One can clearly see how the room sizing operation affects the mode frequencies, early reflection spacing, mixing time, and frequency dependent T60s.

Figure 7: Top (a): T60 filter responses (magnitude in dB vs. frequency in Hz) of a GFDN designed to model a medium sized room without and with air absorption. Eight delay lines are used to model plywood, four for carpet, and four for glass. Middle (b): impulse responses of the GFDN (amplitude vs. time in milliseconds) without and with air absorption. Bottom (c): spectrograms of the GFDN without and with air absorption.

Figure 8: Top (a): impulse responses (amplitude vs. time in milliseconds) of a GFDN designed to model a medium sized room and scaled to have its nominal length halved and doubled (room scales 0.5, 1, and 2). Middle (b): modes of the same rooms (pole T60 in s vs. frequency in Hz). Bottom (c): spectrograms of the same rooms.
6. CONCLUSION
In this paper, we have proposed the Grouped Feedback Delay Network, which has different decay filters in different groups of delay lines, motivated by the fact that in real rooms neighboring modes do not have a single T60. Instead, they are distributed in a band. We have used the GFDN to synthesize RIRs of coupled rooms, where one room is significantly larger than the other. We have also discussed the design of a parameterized orthonormal coupled mixing matrix that controls the occupancy of the individual rooms and the amount of coupling between them. Single rooms composed of different materials (or different absorbing surfaces) have been modeled with the GFDN. Delay line attenuation filters have been designed to represent the T60 characteristics of different absorbing materials in the room, instead of the space as a whole. Unlike [10], our filters are of a lower order than a 10-band GEQ; hence the GFDN is computationally more efficient. Modal analysis has shown that the mode T60s of the synthesized RIRs follow the individual groups’ decay responses when there is no mixing, and approach each other as mixing increases, as previously investigated in [6]. Finally, we have discussed the effect of air absorption in attenuating the overall T60 response of the modeled room. Methods for room resizing by altering the delay line lengths and decay filter gains have been proposed. We have provided relevant sound examples wherever applicable.

The GFDN cannot be used to exactly match the measured T60 response of a particular space. However, it is a cheap way to generate approximate, physically informed RIRs when the configuration of the room is known. Therefore, we think it will find applications in VR audio, where cheap, approximate, and dynamic artificial reverberation is a requirement. Perceptual comparison of GFDN RIRs with those synthesized by ray-tracing, image-source, or FDTD methods is a topic we leave open for future work.
7. ACKNOWLEDGEMENT
The authors would like to thank Sebastian J. Schlecht for sharing his FDN Toolbox [20] written in MATLAB, which was used to implement FDN modal analysis with the Ehrlich-Aberth algorithm.
8. REFERENCES
[1] Michael A. Gerzon, “Unitary (energy-preserving) multi-
channel networks with feedback,” Electron. Lett., vol. 12,
no. 11, pp. 278–279, 1976.
[2] Jean-Marc Jot and Antoine Chaigne, “Digital delay networks
for designing artificial reverberators, in Proc. Audio Eng.
Soc. Conv. 90, 1991.
[3] Jean-Marc Jot, “An analysis/synthesis approach to real-time
artificial reverberation, in IEEE Int. Conf. Acoust., Speech,
Signal Process., 1992, vol. 2, pp. 221–224.
[4] Sebastian J. Schlecht and Emanuël A. P. Habets, “Dense
reverberation with delay feedback matrices, in IEEE
Workshop Appl. Signal Process. Audio Acoustics (WASPAA),
2019, pp. 150–154.
[5] Sebastian J. Schlecht and Emanuël A. P. Habets, “Modal
decomposition of feedback delay networks,” IEEE Trans.
Signal Process., vol. 67, no. 20, pp. 5340–5351, 2019.
[6] Orchisama Das, Elliot K. Canfield-Dafilou, and Jonathan S.
Abel, “On the behavior of delay network reverberator
modes,” in IEEE Workshop Appl. Signal Process. Audio
Acoustics (WASPAA), 2019, pp. 50–54.
[7] Sebastian J. Schlecht and Emanuël A. P. Habets, “Time-
varying feedback matrices in feedback delay networks and
their application in artificial reverberation, J. Acoust. Soc.
Amer., vol. 138, no. 3, pp. 1389–1398, 2015.
[8] Benoit Alary, Archontis Politis, Sebastian J. Schlecht, and
Vesa Välimäki, “Directional feedback delay network,” J.
Audio Eng. Soc., vol. 67, no. 10, pp. 752–762, 2019.
[9] Sebastian J. Schlecht and Emanuël A. P. Habets, “Accurate
reverberation time control in feedback delay networks, Int.
Conf. Digit. Audio Effects, pp. 337–344, 2017.
[10] Karolina Prawda, Vesa Välimäki, Sebastian J Schlecht, et al.,
“Improved reverberation time control for feedback delay net-
works,” in Int. Conf. Digit. Audio Effects, 2019.
[11] Wallace C. Sabine, Collected papers on acoustics, Peninsula
Publishing, Los Alto, CA, 1993.
[12] Elliot K. Canfield-Dafilou and Jonathan S. Abel, “Resizing
rooms in convolution, delay network, and modal reverbera-
tors,” in Int. Conf. Digit. Audio Effects, 2018.
[13] Carl F. Eyring, “Reverberation time measurements in cou-
pled rooms,” J. Acoust. Soc. Amer., vol. 3, no. 2A, pp. 181–
206, 1931.
[14] David T. Bradley and Lily M. Wang, “The effects of simple
coupled volume geometry on the objective and subjective re-
sults from nonexponential decay, J. Acoust. Soc. Amer., vol.
118, no. 3, pp. 1480–1490, 2005.
[15] John S. Anderson and Magdalena Bratos-Anderson, “Acoustic
coupling effects in St Paul’s Cathedral, London,” J. Sound
Vib., vol. 236, no. 2, pp. 209–225, 2000.
[16] Sebastian J. Schlecht and Emanuël A. P. Habets, “Sign-
agnostic matrix design for spatial artificial reverberation with
feedback delay networks,” in Proc. Audio Eng. Soc. Int.
Conf. Spatial Reproduction-Aesthetics Sci., 2018.
[17] Jonathan S. Abel and Patty Huang, “A simple, robust mea-
sure of reverberation echo density,” in Audio Eng. Soc. Conv.
121, 2006.
[18] Julius O. Smith, Physical Audio Signal Processing,
http://ccrma.stanford.edu/~jos/pasp/, 2010.
[19] Frank Schultz, Nara Hahn, and Sascha Spors, “Shelving filter
cascade with adjustable transition slope and bandwidth,” in
Audio Eng. Soc. Conv. 148, 2020.
[20] Sebastian J. Schlecht, “FDNTB: The Feedback Delay Network
Toolbox,” in Proceedings of the 23rd Int. Conf. on
Digital Audio Effects (DAFx2020), G. Evangelista, Ed., Sept.
2020, vol. 1.