Conference PaperPDF Available

On The Behavior of Delay Network Reverberator Modes

Authors:

Abstract

The mixing matrix of a Feedback Delay Network (FDN) reverberator is used to control the mixing time and echo density profile. In this work, we investigate the effect of the mixing matrix on the modes (poles) of the FDN with the goal of using this information to better design the various FDN parameters. We find the modal decomposition of delay network reverberators using a state space formulation, showing how modes of the system can be extracted by eigenvalue decomposition of the state transition matrix. These modes, and subsequently the FDN parameters, can be designed to mimic the modes in an actual room. We introduce a parameterized orthonormal mixing matrix which can be continuously varied from identity to Hadamard. We also study how continuously varying diffusion in the mixing matrix affects the damping and frequency of these modes. We observe that modes approach each other in damping and then deflect in frequency as the mixing matrix changes from identity to Hadamard. We also quantify the perceptual effect of increasing mixing by calculating the normalized echo density (NED) of the FDN impulse responses over time.
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
ON THE BEHAVIOR OF DELAY NETWORK REVERBERATOR MODES
Orchisama Das, Elliot K. Canfield-Dafilou, Jonathan S. Abel
Center for Computer Research in Music and Acoustics,
Stanford University, Stanford, CA 94305 USA,
orchi|kermit|abel@ccrma.stanford.edu
ABSTRACT
The mixing matrix of a Feedback Delay Network (FDN) rever-
berator is used to control the mixing time and echo density profile.
In this work, we investigate the effect of the mixing matrix on the
modes (poles) of the FDN with the goal of using this information
to better design the various FDN parameters. We find the modal
decomposition of delay network reverberators using a state space
formulation, showing how modes of the system can be extracted
by eigenvalue decomposition of the state transition matrix. These
modes, and subsequently the FDN parameters, can be designed to
mimic the modes in an actual room. We introduce a parameterized
orthonormal mixing matrix which can be continuously varied from
identity to Hadamard. We also study how continuously varying dif-
fusion in the mixing matrix affects the damping and frequency of
these modes. We observe that modes approach each other in damp-
ing and then deflect in frequency as the mixing matrix changes from
identity to Hadamard. We also quantify the perceptual effect of in-
creasing mixing by calculating the normalized echo density (NED)
of the FDN impulse responses over time.
Index TermsFeedback Delay Network, Artificial Reverber-
ation, Modal Analysis, Normalized Echo Density, Mixing Matrix
1. INTRODUCTION
Artificial reverberation techniques aim to synthesize an impulse re-
sponse using a combination of filters. An ideal artificial reverber-
ator reproduces a set of sparse early reflections which increase in
density over time, building toward late reverberation where the im-
pulse density is high and statistically Gaussian. In other words,
the “echo-density” (number of echoes per second) should increase
quadratically in time and is a good psychoacoustic measure of per-
ceived reverberation. Another characteristic of an ideal reverberator
is the frequency response—the number of modes should increase
with frequency squared, which means high frequency modes are
densely packed.
The first artificial reverberators were proposed by Schroeder [1]
and used comb filters in parallel followed by allpass filters in series.
While this architecture produces the desired increase in echo density
over time, unnatural coloration in the impulse response persisted
[2]. Gerzon [3] generalized delay network reverberators, suggesting
the use of a unitary feedback matrix to mix the outputs of the delay
lines into the inputs of each other. Jot and Chaigne later proposed
the Feedback Delay Network (FDN) [4, 5] which matched the delay
line lengths with shelf filters designed to yield a desired frequency
dependent T60. Since then, FDNs have gained popularity for creat-
ing efficient artificial reverberation. Some important contributions
in FDN research include [6, 7] in which the authors propose a cir-
culant feedback matrix for efficient implementation and maximum
diffusion, and those by Schlecht [8, 9] which deal with time-varying
FDNs and their practical implementation. Schlecht also studied the
properties of mixing matrices that produce lossless FDNs [10]. In a
recent paper [11], he investigated the modal decomposition of feed-
back delay networks using the Ehrlich-Aberth iteration for finding
poles and also studied the statistical distribution of mode frequen-
cies and amplitudes. In this paper, we explicitly derive the state
transition matrix for the case of frequency dependent T60s.
The design of an optimum mixing matrix to produce a desired
perceptual effect is still somewhat ambiguous. In this paper, we
first derive the FDN state transition matrix for scalar gains and first-
order shelf filters (for frequency dependent decay rates), and show
that the poles (modes) of the system are given by eigenvalues of the
state transition matrix. Next, we use the concept of homotopy [12]
to gradually alter the mixing matrix from a state of minimum dif-
fusion to maximum diffusion and observe how the pole trajectories
vary with mixing. We observe coupling between nearby modes,
where they first approach each other and then deflect, similar to
what was observed by Weinreich in piano strings [13]. We discuss
different ways of designing delay line T60 filters to model rooms
which have walls made of different materials. Then, we calculate
the normalized echo density (NED) [14] of the impulse responses
at different levels of mixing and compare how mixing affects the
time at which the late reverb starts (mixing time). NED has been
shown to be a good indicator of the psychoacoustic perception of
reverb [15, 16]. The effect of mixing time on echo density, which is
shown to be a polynomial function dependent on delay line lengths,
has been studied in [17]. Using curve fitting and empirical estima-
tion, we derive a parametric function relating mixing time to mean
delay line length and mixing matrix. This relationship between mix-
ing time and level of diffusion sheds some light on designing FDN
mixing matrices to achieve a desired perceptual effect.
The rest of the paper is organized as follows: in §2 we setup the
FDN state space equations and derive the state transition matrix for
frequency independent and frequency dependent decay rates. We
show that the modes of the FDN are eigenvalues of the state tran-
sition matrix. In §3, we discuss how to smoothly vary the mixing
matrix from zero to maximum diffusion. In §4, we see how mixing
affects the mode trajectories and discuss how different decay filters
can be designed. We also generate FDN impulse responses for dif-
ferent mixing matrices and show how their NEDs evolve with time.
Finally, we come up with a parametric equation relating the mix-
ing time to the mean delay line length and the mixing matrix. We
conclude the paper in §5 and delineate scope for future work.
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
2. FEEDBACK DELAY NETWORK MODAL
DECOMPOSITION
Let us consider a feedback delay network with Ndelay lines of
length τ1, τ2,...,τNsamples each as shown in Fig. 1. The mix-
ing matrix denoted by Mis typically orthonormal and of the order
(N×N), the 60 dB decay time of each delay line (T60) is controlled
by the filter gi(z). The input and output at time nis given by u(n)
and y(n)respectively. The vectors band cdenote the input and
output gains and dis the direct path gain.
The FDN can be represented as a state space system, with the
state vector xdefined by the values stored in each memory element.
x= [x1, x2,...,xτ1, xτ1+1,...,xPiτi]T(1)
With a state transition matrix A, the state space equations can be
written as
x(n) = Ax(n1) + bu(n)
y(n) = cTx(n) + du(n)
Y(z) = hcT(zIA)1b+diU(z)
(2)
2.1. State Transition Matrix
It is obvious from (2) that the modes of the FDN are the eigenvalues
of the matrix A. In this section, we will derive Afor two cases—
first the case of frequency independent T60s, in which case the delay
line filter is a scalar gain, and second the more commonly used fre-
quency dependent T60 case, in which gi(z)is typically given by a
low-shelf filter.
Since Ais a large sparse matrix of the order (PN
i=1 τi×
PN
i=1 τi), we will use block matrix notation, and denote xas a
stacked vector [˜
xT
1,...,˜
xT
N]and Aas a block matrix with N×N
sub-matrices.
˜
x1(n)
˜
x2(n)
.
.
.
˜
xN(n)
=
˜
A11 ˜
A12 . . . ˜
A1N
˜
A21 ˜
A22 . . . ˜
A2N
.
.
..
.
.....
.
.
˜
AN1˜
A2N. . . ˜
ANN
˜
x1(n1)
˜
x2(n1)
.
.
.
˜
xN(n1)
(3)
2.1.1. Frequency independent decay times
In this case, gi(z) = gifor i= 1 . . . N . Each sub-vector of the
state vector xcan be written as
˜
xT
i= [xPi1
j=1 τj+1,...,xPi
j=1 τj]T(4)
The sub-matrices, ˜
Aij are of the order (τi×τj) for i, j = 1 . . . N .
The diagonal blocks can be written as
˜
Aii =
0. . . 0giMii
1. . . 0 0
.
.
.....
.
..
.
.
0. . . 1 0
,(5)
where Mii is the i, ith element of M. The off-diagonal blocks have
zeros everywhere, except the (1, τj)-th element which is given by
˜
Aij (1, τj) = gjMji (6)
M
d
b1
b2
b3
b4
z1z1... z1
τ1
x1x2··· xτ1
z1z1... z1
τ2
xτ1+1 ··· xτ1+τ2
z1z1... z1
τ3
xτ1+τ2+1 ··· xτ1+τ2+τ3
z1z1... z1
τ4
xτ1+τ2+τ3+1 ···xτ1+τ2+τ3+τ4
g1(z)
g2(z)
g3(z)
g4(z)
c1
c2
c3
c4
v1
v2
v3
v4
+
u(n)
y(n)
Figure 1: State space FDN block diagram.
+
z1
+
wi(n)
x(n)b0i
b1ia1i
vi(n)
Figure 2: First order shelf filter block diagram.
2.1.2. Frequency dependent decay times
In this case, the delay-line filter is a first-order low shelf filter of the
form
gi(z) = b0i+b1iz1
1a1iz1(7)
This ensures that the lower frequency modes have a longer decay
than the higher frequency modes, which is consistent with what is
observed in actual rooms. We design a shelf filter by setting the
filter coefficients according to desired gains at DC and Nyquist. The
direct form II diagram of the shelf filter is given in Fig. 2. The state
vector now has an additional state wiper sub-vector.
˜
xT
i= [xPi1
j=1 τj+1,...,xPi
j=1 τj, wi]T(8)
The sub-matrices, ˜
Aij are of the order (τi+ 1 ×τj+ 1) for
i, j = 1 . . . N . The diagonal blocks can be written as
˜
Aii =
0. . . 0b0iMii Mii
1. . . 0 0 0
.
.
.....
.
..
.
..
.
.
0. . . 1b1ia1ib01 a1i
(9)
The off-diagonal blocks have zeros everywhere, except the (1, τj)
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
and (1, τj+ 1)-th elements which are given by
˜
Aij (1, τj) = b0jMji ,
˜
Aij (1, τj+ 1) = Mj i
(10)
3. HOMOTOPY
Now that we have derived the state transition matrix of the FDN,
hence its modes, we wish to observe how these modes change as
we modify the mixing matrix M. For this, we must alter the mix-
ing matrix smoothly from a state of no mixing to maximum mix-
ing. A slow and continuous change from one function to another
is known as homotopy [12]. In [8], the authors create a feedback
matrix evolution equivalent to a recursive update using linear mod-
ulation functions. In this paper, we parameterize Mas a function
of θ. Starting with the (2×2) rotation matrix, R(θ)which is or-
thonormal, we can generate an (N×N)orthonormal mixing matrix
M(θ)by taking the Kronecker product of R(θ)with itself log2N
times. It is to be noted that the Kronecker product of two orthonor-
mal matrices is also orthonormal [18].
R(θ) = cos θsin θ
sin θcos θ
MN×N(θ) = R(θ)R(θ). . . R(θ)
(11)
Starting with θ= 0 gives us M(0) = IN×N, i.e, Nparallel
delay lines with no mixing among them. By incrementally increas-
ing θby radians, new mixing matrices M(θ+)are generated
with increased mixing among delay lines, until θ=π
4, when the
mixing matrix becomes Hadamard and depicts the case of maxi-
mum mixing. From now on, we will define θas the mixing angle,
and parametrize the mixing matrix as M(θ).
4. RESULTS AND DISCUSSION
4.1. Mode Trajectories
In Fig. 3 we show the modes in the complex plane and the frequency
dependent T60 for a FDN with two delay lines, with τ1= 5 (in red)
and τ2= 19 (in blue); g1(z)is a constant with T60 = 40 sam-
ples, g2(z)is a low-shelf filter with T60DC = 150 samples and
T60Nyq uist = 50 samples. The color intensity increases from light
to dark as θgoes from 0to π
4. We have squared the radii of the
modes in Fig. 3a to make the pole movement easier to see.
The modes are in complex conjugate pairs. With no mixing,
they are distributed uniformly in frequency. As mixing increases,
the modes in the delay lines that are close in frequency approach
each other in damping rapidly, and then deflect in frequency. This
coupling behavior is more accentuated in the lower frequencies,
with the modes at DC approaching each other and bifurcating. The
modes at higher frequencies show less movement, and the mode at
Nyquist remains stationary. In fact, once fully mixed, the shorter
delay line that started with a constant T60 shows low-pass behavior.
The coupling among these modes is similar to what was ob-
served by Weinreich in piano strings that are tuned with a slight
deviation in frequency [13]. Due to this mistuning, the strings have
slightly different angular frequencies and are coupled due to the dy-
namic motion of the bridge. The frequencies of the normal modes
of the strings as a function of the deviation in angular frequency
follow the same trajectory as our delay line modes when mixing
increases.
-1 -0.5 0 0.5 1
-1
-0.5
0
0.5
1
(a) Poles in the complex plane.
-1 -0.5 0 0.5 1
Normalized Frequency (rad/s)
0
50
100
150
T60 in samples
(b) Frequency dependent T60.
Figure 3: Modes of FDN for two delay line case with varying
amounts of mixing. Red indicates a delay line with five taps and
a scalar gain. Blue indicates a delay line with nineteen taps with
a shelf filter. Lighter colors show the low mixing case and darker
colors maximum mixing.
4.2. Designing different decay filters
Typically in an FDN reverberator, the decay filters are designed
such that all delay lines independently produce the same T60 fre-
quency response. However, it is likely that the physical configura-
tion of a room would require multiple, concurrent T60 responses.
One might use one set of filters to model air absorption while an-
other set of filters can be used to model the absorption due to the
materials in the room. For example, a church might have a pair
of parallel walls with glass windows, and another set of walls cov-
ered with drapes. The absorption coefficient of glass decreases with
increase in frequency, so its T60 response can be modeled by a low-
shelf filter, while the T60 response of the drapery is better modeled
with low-shelf resonant filter [19]. It would be realistic to assign
part of the delay line filters to model the glass and the others to
model the drapery. In [20], the T60 filters were designed such that
all walls mimic the frequency-dependent absorption of cotton car-
pet. The mixing matrix could be adjusted to emulate the occupancy
of the church—adding furniture would increase interaction between
the modes, which is equivalent to increasing the mixing among the
delay lines. Similarly, FDNs could have different filters in the delay
lines with high diffusion to mimic the reverberant characteristics of
coupled rooms [21]. Designing mixing matrices for coupled rooms
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
0
2
Amplitude
10 20 50 100 200 500 1000 2000
Time(ms)
0
0.5
1
NED
Figure 4: Impulse Responses and NED profiles for mixing matrices
with θset to π
40 ,π
8, and π
4, corresponding to 10%,50%, and 100%
mixing respectively (bottom to top).
with bleed between them has been previously studied in [22].
4.3. Echo Density and Mixing Time
In Fig. 4, we show how the normalized echo density profile (cal-
culated with a 50 ms Hanning window) changes with change in
mixing angle θfor a FDN with 16 delay lines, with delay lengths
uniformly distributed between 1020 ms at a sampling frequency
of 48 kHz with T60DC = 4 s and T60Ny quist = 2 s. As ex-
pected, as the amount of mixing increases, the echo density be-
comes Gaussian more quickly. Audio examples of the impulse
responses are available at https://ccrma.stanford.edu/
˜orchi/FDN/IR.html
A FDN artificial reverberator has many parameters that need
to be tuned, such as mixing matrix, number of delay lines, their
lengths etc. to produce a desired perceptual effect. The tuning of
these parameters is still considered somewhat an art. The mixing
time of a feedback delay network is defined as the time in which
the echo density reaches a threshold, T= 0.9in this case [15]. In
[17], the authors come up with a closed form solution for choosing
a mean delay length, given a desired mixing time. We wish to relate
mixing time with the mixing angle. Figure 5 shows the mixing time
against the mixing angle for the same FDN for 4 mean delay line
lengths, ¯τof 10,20,50, and 100 ms. For each mean delay line
length, we perform Monte Carlo simulations, such that each of the
16 delay line lengths are random and uniformly distributed on the
interval
τi∼ U ¯τi
φ,¯τiφ,(12)
where φis the golden ratio, and plot the average mixing time. An
exponential relationship is observed between mixing time and mix-
ing angle θ. For θ= 0, there is no mixing and the echo density
never becomes Gaussian, which means the mixing time is theoreti-
cally infinite. For a given mean delay line length, it is possible to fit
a curve to this plot using non-linear least squares and estimate what
value of θwould produce the desired mixing time. The parametric
equation for tmix in seconds, given θand a mean delay line length
Fraction of mixing
Mixing time(s)
10ms
20ms
50ms
100ms
Figure 5: Mixing time as a function of mixing angle θ(divided
by π
4) for average delay line lengths ¯τof 10,20,50, and 100 ms
(bottom to top). Dashed lines are fitted curves using the parametric
equation.
of ¯τin ms is given in (13).
tmix =αexp (βθ) + γ
α= 0.397¯τ20.02γ
β= 20.246 0.09¯τ
γ= 0.0075¯τ
(13)
Combined with Schlecht’s method for predicting mean delay
line length [17], our method for picking a mixing matrix M(θ)to
achieve a desired mixing time will aid in the design of FDN re-
verberators. These results are also directly applicable to the FDN
resizing algorithm described in [23].
5. CONCLUSION
We have studied the modal decomposition of a Feedback Delay Net-
works. While the modal decomposition for a FDN with a scalar
gain is well known [6], we explicitly derived the modal decompo-
sition with a shelf filter in the delay lines. We changed the mixing
matrix smoothly from minimum mixing to maximum mixing and
observed coupling among nearby modes from different delay lines.
We used a 2delay line example with a small number of modes for
visualizing modal behavior. We also studied how mixing affects the
echo density profile and mixing time of the FDN impulse response,
and came up with a parametric equation to choose a mixing matrix,
given a mean delay line length and a desired mixing time.
We could not do modal decompositions of FDNs with longer
delay lines that are commonly used in practice, because of the lim-
ited efficiency of MATLAB’s eig function when dealing with large
matrices. Nor could we find a good numeric method that could effi-
ciently and accurately give us all the eigenvalues of a large, sparse,
non-symmetric matrix. It would be interesting to use the numerical
method described in [11] to find the eigenvalues of large state tran-
sition matrices with many more states in a future work. We could
then generalize the observed coupled modal behavior in this paper
for practical FDNs with thousands of modes.
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
6. REFERENCES
[1] M. R. Schroeder, “Natural sounding artificial reverberation,”
Journal of the Audio Engineering Society, vol. 10, no. 3, pp.
219–223, 1962.
[2] J. A. Moorer, “About this reverberation business,” Computer
music journal, pp. 13–28, 1979.
[3] M. Gerzon, “Unitary (energy-preserving) multichannel net-
works with feedback,” Electronics Letters, vol. 12, no. 11, pp.
278–279, 1976.
[4] J.-M. Jot and A. Chaigne, “Digital delay networks for de-
signing artificial reverberators,” in Audio Engineering Society
Convention 90. Audio Engineering Society, 1991.
[5] J.-M. Jot, “An analysis/synthesis approach to real-time artifi-
cial reverberation, in [Proceedings] ICASSP-92: 1992 IEEE
International Conference on Acoustics, Speech, and Signal
Processing, vol. 2. IEEE, 1992, pp. 221–224.
[6] D. Rocchesso and J. O. Smith, “Circulant and elliptic feed-
back delay networks for artificial reverberation,IEEE Trans-
actions on Speech and Audio Processing, vol. 5, no. 1, pp.
51–63, 1997.
[7] D. Rocchesso, “Maximally diffusive yet efficient feedback de-
lay networks for artificial reverberation, IEEE Signal Pro-
cessing Letters, vol. 4, no. 9, pp. 252–255, 1997.
[8] S. J. Schlecht and E. A. Habets, “Time-varying feedback ma-
trices in feedback delay networks and their application in arti-
ficial reverberation, The Journal of the Acoustical Society of
America, vol. 138, no. 3, pp. 1389–1398, 2015.
[9] ——, “Practical considerations of time-varying feedback de-
lay networks,” in Audio Engineering Society Convention 138.
Audio Engineering Society, 2015.
[10] ——, “On lossless feedback delay networks,IEEE Trans-
actions on Signal Processing, vol. 65, no. 6, pp. 1554–1564,
2017.
[11] ——, “Modal decomposition of feedback delay networks,
arXiv preprint arXiv:1901.08865, 2019.
[12] J.-H. He, “Homotopy perturbation technique,” Computer
methods in applied mechanics and engineering, vol. 178, no.
3-4, pp. 257–262, 1999.
[13] G. Weinreich, “Coupled piano strings,” The Journal of the
Acoustical Society of America, vol. 62, no. 6, pp. 1474–1484,
1977.
[14] J. S. Abel and P. Huang, “A simple, robust measure of rever-
beration echo density,” in Audio Engineering Society Conven-
tion 121. Audio Engineering Society, 2006.
[15] P. Huang, J. S. Abel, H. Terasawa, and J. Berger, “Reverbera-
tion echo density psychoacoustics,” in Audio Engineering So-
ciety Convention 125. Audio Engineering Society, 2008.
[16] A. Lindau, L. Kosanke, and S. Weinzierl, “Perceptual evalua-
tion of model-and signal-based predictors of the mixing time
in binaural room impulse responses,” Journal of the Audio En-
gineering Society, vol. 60, no. 11, pp. 887–898, 2012.
[17] S. J. Schlecht and E. A. Habets, “Feedback delay networks:
Echo density and mixing time,” IEEE/ACM Transactions on
Audio, Speech, and Language Processing, vol. 25, no. 2, pp.
374–383, 2017.
[18] P. A. Regalia and M. K. Sanjit, “Kronecker products, unitary
matrices and signal processing applications,” SIAM review,
vol. 31, no. 4, pp. 586–613, 1989.
[19] J. S. Abel, D. P. Berners, and J. Herrera, “Signal processing
techniques for digital audio effects,Course Notes for Stan-
ford Music 424, 2004.
[20] E. De Sena, H. Hacιhabibo˘
glu, Z. Cvetkovi´
c, and J. O. Smith,
“Efficient synthesis of room acoustics via scattering delay net-
works,” IEEE/ACM Transactions on Audio, Speech and Lan-
guage Processing (TASLP), vol. 23, no. 9, pp. 1478–1492,
2015.
[21] J. S. Abel and M. J. Wilson, “Luciverb: Iterated convolution
for the impatient,” in Audio Engineering Society Convention
133. Audio Engineering Society, 2012.
[22] S. J. Schlecht and E. A. Habets, “Sign-agnostic matrix de-
sign for spatial artificial reverberation with feedback de-
lay networks,” in AES International Conference on Spatial
Reproduction-Aesthetics and Science. Audio Engineering
Society, 2018.
[23] E. K. Canfield-Dafilou and J. S. Abel, “Resizing rooms in
convolution, delay network, and modal reverberators,” in Pro-
ceedings of the 21st International Conference on Digital Au-
dio Effects, 2018.
... Since then FDNs have become one of the most popular structures for synthesizing reverberation due to the relative efficiency of the approach. Recent research on FDNs has focused on mixing matrix design to increase echo density [6], modal analysis [7,8], time-varying FDNs [9], scattering FDNs [10], and reverberation time control by accurate design of the decay filters [11,12]. ...
... A room with many objects and complex geometry will mix faster than an empty room with simple geometry. The mixing matrix can be designed to have a desired mixing time according to the method in [8], where the Kronecker product of a 2 × 2 rotation/reflection matrix (parameterized by an angle θ) with itself is taken log 2 (N) times to give an N × N orthonormal matrix, M(θ) ...
Article
Full-text available
Delay Network reverberators are an efficient tool for synthesizing reverberation. We propose a novel architecture, called the Grouped Feedback Delay Network (GFDN) reverberator, with groups of delay lines sharing different target decay rates, and use it to simulate coupled room acoustics. Coupled spaces are common in apartments, concert halls, and churches where two or more volumes with different reverberation characteristics are linked via an aperture. The difference in reverberation times (T60s) of the coupled spaces leads to unique phenomena, such as multi-stage decay. Here the GFDN is used to simulate coupled spaces with groups of delay line filters representing the T60 s of the coupled rooms. A parameterized, orthonormal mixing matrix is presented that provides control over the mixing times of the rooms and amount of coupling between the rooms. As an example application we measure a coupled bedroom and bathroom system separated by a door in an apartment and use the GFDN to synthesize the late field for different openings of the door separating the two rooms, thereby varying coupling between the rooms.
... Our prior work introduced a modal decomposition technique for FDNs [8], representing the FDN as a sum of complex onepole resonators, or modes. These modes find utility in system estimation [9], oscillation analysis [10], [11], resonance excitation optimization [12], [13], matching desired room modes [14], and enhancing the design of coupled room reverberators [15]- [17]. ...
Article
Full-text available
Feedback delay networks (FDNs) are used in audio processing and synthesis. The modal shapes of the system describe the modal excitation by input and output signals. Previously, the Ehrlich-Aberth method was used to find modes in large FDNs. Here, the method is extended to the corresponding eigenvectors indicating the modal shape. In particular, the computational complexity of the proposed analysis method does not depend on the delay-line lengths and is thus suitable for large FDNs, such as artificial reverberators. We show the relation between the compact generalized eigenvectors in the delay state space and the spatially extended modal shapes in the state space. We illustrate this method with an example FDN in which the suggested modal excitation control does not increase the computational cost. The modal shapes can help optimize input and output gains. This letter teaches how selecting the input and output points along the delay lines of an FDN adjusts the spectral shape of the system output.
... Our prior work introduced a modal decomposition technique for FDNs [8], representing the FDN as a sum of complex onepole resonators, or modes. These modes find utility in system estimation [9], oscillation analysis [10], [11], resonance excitation optimization [12], [13], matching desired room modes [14], and enhancing the design of coupled room reverberators [15]- [17]. ...
Preprint
Full-text available
Feedback delay networks (FDNs) are used in audio processing and synthesis. The modal shapes of the system describe the modal excitation by input and output signals. Previously, the Ehrlich-Aberth method was used to find modes in large FDNs. Here, the method is extended to the corresponding eigenvectors indicating the modal shape. In particular, the computational complexity of the proposed analysis method does not depend on the delay-line lengths and is thus suitable for large FDNs, such as artificial reverberators. We show the relation between the compact generalized eigenvectors in the delay state space and the spatially extended modal shapes in the state space. We illustrate this method with an example FDN in which the suggested modal excitation control does not increase the computational cost. The modal shapes can help optimize input and output gains. This letter teaches how selecting the input and output points along the delay lines of an FDN adjusts the spectral shape of the system output.
... Recent research on FDNs has focused on mixing matrix design to increase echo density [5], modal analysis [6], [7], time-varying FDNs [8], allpass FDNs for colorless reverberation [9] and scattering FDNs for dense reverberation [10]. Grouping of delay lines to control direction-dependent energy decay, known as the Directional Feedback Delay Network, was proposed by Alary et al. in [11]. ...
Article
Full-text available
Feedback Delay Networks are one of the most popular and efficient means of generating artificial reverberation. Recently, we proposed the Grouped Feedback Delay Network (GFDN), which couples multiple FDNs while maintaining system stability. The GFDN can be used to model reverberation in coupled spaces that exhibit multi-stage decay. The block feedback matrix determines the inter- and intra-group coupling. In this paper, we expand on the design of the block feedback matrix to include frequency-dependent coupling among the various FDN groups. We show how paraunitary feedback matrices can be designed to emulate diffraction at the aperture connecting rooms. Several methods for the construction of nearly paraunitary matrices are investigated. The proposed method supports the efficient rendering of virtual acoustics for complex room topologies in games and XR applications.
... The versatility of the FDN architecture allows for a wide variety of applications and extensions. It is used to produce binaural [195,196] and multichannel reverberation [205,206], as well as to synthesize multipleslope decay for coupled spaces [207,208]. Other artificial reverberation techniques, such as scattering delay networks [209,210], digital waveguide networks [204], digital waveguide mesh [211], and finite difference time domain methods [191,192,193,194] have a close relation to the FDN structure. ...
Thesis
Full-text available
In this dissertation, the discussion is centered around the sound energy decay in enclosed spaces. The work starts with the methods to predict the reverberation parameters, followed by the room impulse response measurement procedures, and ends with an analysis of techniques to digitally reproduce the sound decay. The research on the reverberation in physical spaces was initiated when the first formula to calculate room's reverberation time emerged. Since then, finding an accurate and reliable method to predict reverberation has been an important area of acoustic research. This thesis presents a comprehensive comparison of the most commonly used reverberation time formulas, describes their applicability in various scenarios, and discusses their accuracy when compared to results of measurements. The common sources of uncertainty in reverberation time calculations, such as bias introduced by air absorption and error in sound absorption coefficient, are analyzed as well. The thesis shows that decreasing such uncertainties leads to a good prediction accuracy of Sabine and Eyring equations in diverse conditions regarding sound absorption distribution. The measurement of the sound energy decay plays a crucial part in understanding the propagation of sound in physical spaces. Nowadays, numerous techniques to capture room impulse responses are available, each having its advantages and drawbacks. In this dissertation, the majority of commonly used measurement techniques are listed, whereas the exponential swept-sine is described in more detail. This work elaborates on the external factors that may impair the measurements and introduce error to their results, such as stationary and non-stationary noise, as well as time variance. The dissertation introduces Rule of Two, a method of detecting nonstationary disturbances in sweep measurements. It also shows the importance of using median as a robust estimator in non-stationary noise detection. Artificial reverberation is a popular sound effect, used to synthesize sound energy decay for the purpose of audio production. This dissertation offers an insight into artificial reverberation algorithms based on recursive structures. The filter design proposed in this work offers precise control over the decay rate while being efficient enough for real-time implementation. The thesis discusses the role of the delay lines and feedback matrix in achieving high echo density in feedback delay networks. It also shows that four velvet-noise sequences are sufficient to obtain smooth output in interleaved velvet noise reverberator. The thesis shows that the accuracy of reproduction increases the perceptual similarity between measured and synthesised impulse responses. The insights collected in this dissertation offer insights into the intricacies of reverberation prediction, measurement and synthesis. The results allow for reliable estimation of parameters related to sound energy decay, and offer an improvement in the field of artificial reverberation.
... The modal density is another key characteristics of reverberation [191]. A popular method to increase the modal density in delay networks is to vary slightly the length of delay lines over time thus producing more modes [182,192,116,178], which requires the use of fractional delay lines [193]. ...
Thesis
Full-text available
Available online with the related articles at: http://urn.fi/URN:ISBN:978-952-64-0472-1 In this dissertation, the reproduction of reverberant sound fields containing directional characteristics is investigated. A complete framework for the objective and subjective analysis of directional reverberation is introduced, along with reverberation methods capable of producing frequency- and direction-dependent decay properties. Novel uses of velvet noise are also proposed for the decorrelation of audio signals as well as artificial reverberation. The methods detailed in this dissertation offer the means for the auralization of reverberant sound fields in real-time, with applications in the context of Immersive sound reproduction such as virtual and augmented reality.
... A higher number of delays increases both modal and echo density, but also the computational complexity. Although attempts have been made [21,22], it remains open how to achieve spectrally and temporally smooth FDNs with a minimal number of delays. A closely connected topic is the choice of delay length. ...
Conference Paper
Full-text available
Feedback delay networks (FDNs) are recursive filters, which are widely used for artificial reverberation and decorrelation. While there exists a vast literature on a wide variety of reverb topologies, this work aims to provide a unifying framework to design and analyze delay-based reverberators. To this end, we present the Feedback Delay Network Toolbox (FDNTB), a collection of the MAT-LAB functions and example scripts. The FDNTB includes various representations of FDNs and corresponding translation functions. Further, it provides a selection of special feedback matrices, topologies, and attenuation filters. In particular, more advanced algorithms such as modal decomposition, time-varying matrices, and filter feedback matrices are readily accessible. Furthermore, our toolbox contains several additional FDN designs. Providing MATLAB code under a GNU-GPL 3.0 license and including illustrative examples, we aim to foster research and education in the field of audio processing.
... To achieve smooth reverberation, a sufficient echo density, i.e., the number of echoes per time unit produced by the algorithm and their distribution [26], should be obtained. Echo density is affected by a few factors, such as the lengths and the distribution of the delay lines, the type of the feedback matrix [30] and its size, all of which are discussed below. ...
Conference Paper
Full-text available
Reverberation is one of the most important effects used in audio production. Although nowadays numerous real-time implementations of artificial reverberation algorithms are available, many of them depend on a database of recorded or pre-synthesized room impulse responses, which are convolved with the input signal. Implementations that use an algorithmic approach are more flexible but do not let the users have full control over the produced sound, allowing only a few selected parameters to be altered. The real-time implementation of an artificial reverberation synthesizer presented in this study introduces an audio plugin based on a feedback delay network (FDN), which lets the user have full and detailed insight into the produced reverb. It allows for control of reverberation time in ten octave bands, simultaneously allowing adjusting the feedback matrix type and delay-line lengths. The proposed plugin explores various FDN setups, showing that the lowest useful order for high-quality sound is 16, and that in the case of a Householder matrix the implementation strongly affects the resulting reverberation. Experimenting with delay lengths and distribution demonstrates that choosing too wide or too narrow a length range is disadvantageous to the synthesized sound quality. The study also discusses CPU usage for different FDN orders and plugin states.
Conference Paper
Full-text available
Artificial reverberation is an audio effect used to simulate the acoustics of a space while controlling its aesthetics, particularly on sounds recorded in a dry studio environment. Delay-based methods are a family of artificial reverberators using recirculating delay lines to create this effect. The feedback delay network is a popular delay-based reverberator providing a comprehensive framework for parametric reverberation by formalizing the recirculation of a set of interconnected delay lines. However, one known limitation of this algorithm is the initial slow build-up of echoes, which can sound unrealistic, and overcoming this problem often requires adding more delay lines to the network. In this paper, we study the effect of adding velvet-noise filters, which have random sparse coefficients, at the input and output branches of the reverberator. The goal is to increase the echo density while minimizing the spectral coloration. We compare different variations of velvet-noise filtering and show their benefits. We demonstrate that with velvet noise, the echo density of a conventional feedback delay network can be exceeded using half the number of delay lines and saving over 50% of computing operations in a practical configuration using low-order attenuation filters.
Article
Full-text available
Feedback delay networks (FDNs) belong to a general class of recursive filters which are widely used in sound synthesis and physical modeling applications. We present a numerical technique to compute the modal decomposition of the FDN transfer function. The proposed pole finding algorithm is based on the Ehrlich-Aberth iteration for matrix polynomials and has improved computational performance of up to three orders of magnitude compared to a scalar polynomial root finder. The computational performance is further improved by bounds on the pole location and an approximate iteration step. We demonstrate how explicit knowledge of the FDN's modal behavior facilitates analysis and improvements for artificial reverberation. The statistical distribution of mode frequency and residue magnitudes demonstrate that relatively few modes contribute a large portion of impulse response energy.
Conference Paper
Full-text available
Feedback delay networks (FDNs) are an ef?cient tool for creating artificial reverberation. Recently, various designs for spatially extending the FDN were proposed. A central topic in the design of spatial FDNs is the choice of the feedback matrix that governs the interaction between spatially distributed elements and therefore the spatial impression. In the design prototype, the feedback matrix is chosen to be unilossless such that the reverberation time is infinite. However, in physics- and aesthetics-driven design of spatial FDNs, the target feedback matrix is not necessarily unilossless. This contribution proposes an optimization method for finding a close unilossless feedback matrix and improves the accuracy by relaxing the specification of the target matrix phase component and focussing on the sign-agnostic component.
Article
Full-text available
An acoustic reverberator consisting of a network of delay lines connected via scattering junctions is proposed. All parameters of the reverberator are derived from physical properties of the enclosure it simulates. It allows for simulation of unequal and frequency-dependent wall absorption, as well as directional sources and microphones. The reverberator renders the first-order reflections exactly, while making progressively coarser approximations of higher-order reflections. The rate of energy decay is close to that obtained with the image method (IM) and consistent with the predictions of Sabine and Eyring equations. The time evolution of the normalized echo density, which was previously shown to be correlated with the perceived texture of reverberation, is also close to that of IM. However, its computational complexity is one to two orders of magnitude lower, comparable to the computational complexity of a feedback delay network (FDN), and its memory requirements are negligible.
Article
Full-text available
The mixing time of room impulse responses denotes the moment when the diffuse reverberation tail begins. A diffuse ("mixed") sound field can physically be defined by (1) equidistribution of acoustical energy and (2) a uniform acoustical energy flux over the complete solid angle. Accordingly, the perceptual mixing time could be regarded as the moment when the diffuse tail cannot be distinguished from that of any other position or listener's orientation in the room. This, for instance, provides an opportunity for reducing the part of binaural room impulse responses that has to be updated dynamically in Virtual Acoustic Environments. Several authors proposed model- and signal-based estimators for the mixing time in rooms. Our study aims at an evaluation of all measures as predictors of a perceptual mixing time. Therefore, we collected binaural impulse response data sets with an adjustable head and torso simulator for a representative sample of rectangular shaped rooms. Altering the transition time into a homogeneous diffuse tail in real time in an adaptive, forced-choice listening test, we determined just audible perceptual mixing times.We evaluated the performance of all potential predictors by linear regression and finally obtained formulae to estimate the perceptual mixing time from measured impulse responses or physical properties of the room.
Article
Feedback delay networks (FDNs) are frequently used to generate artificial reverberation. This contribution discusses the temporal features of impulse responses produced by FDNs, i.e., the number of echoes per time unit and its evolution over time. This so-called echo density is related to known measures of mixing time and their psychoacoustic correlates such as auditive perception of the room size. It is shown that the echo density of FDNs follows a polynomial function, whereby the polynomial coefficients can be derived from the lengths of the delays for which an explicit method is given. The mixing time of impulse responses can be predicted from the echo density, and conversely, a desired mixing time can be achieved by a derived mean delay length. A Monte Carlo simulation confirms the accuracy of the derived relation of mixing time and delay lengths.
Article
Lossless Feedback Delay Networks (FDNs) are commonly used as a design prototype for artificial reverberation algorithms. The lossless property is dependent on the feedback matrix, which connects the output of a set of delays to their inputs, and the lengths of the delays. Both, unitary and triangular feedback matrices are known to constitute lossless FDNs, however, the most general class of lossless feedback matrices has not been identified. In this contribution, it is shown that the FDN is lossless for any set of delays, if all irreducible components of the feedback matrix are diagonally similar to a unitary matrix. The necessity of the generalized class of feedback matrices is demonstrated by examples of FDN designs proposed in literature.
Article
A series of psychoacoustic experiments were carried out to explore the relationship between an objective measure of reverberation echo density, called the normalized echo density (NED), and subjective perception of the time-domain texture of reverberation. In one experiment, 25 subjects evaluated the dissimilarity of signals having static echo densities. The reported dissimilarities matched absolute NED differences with an R2 of 93%. In a 19-subject experiment, reverberation impulse responses having evolving echo densities were used. With an R2 of 90%, the absolute log ratio of the late field onset times matched reported dissimilarities between impulse responses. In a third experiment, subjects consistently reported breakpoints in echo pattern character at NEDs of 0.3 and 0.7.
Conference Paper
Feedback delay networks (FDNs) can be efficiently used to generate parametric artificial reverberation. Recently, the authors proposed a novel approach to time-varying FDNs by introducing a time-varying feedback matrix. The formulation of the time-varying feedback matrix was given in the complex eigenvalue domain, whereas this contribution specifies the requirements for real valued time-domain processing. In addition, the computational costs of different time-varying feedback matrices, which depend on the matrix type and modulation function, are discussed. In a performance evaluation, the proposed orthogonal matrix modulation is compared to a direct interpolation of the matrix entries.
Article
This paper introduces a time-variant reverberation algorithm as an extension of the feedback delay network (FDN). By modulating the feedback matrix nearly continuously over time, a complex pattern of concurrent amplitude modulations of the feedback paths evolves. Due to its complexity, the modulation produces less likely perceivable artifacts and the time-variation helps to increase the liveliness of the reverberation tail. A listening test, which has been conducted, confirms that the perceived quality of the reverberation tail can be enhanced by the feedback matrix modulation. In contrast to the prior art time-varying allpass FDNs, it is shown that unitary feedback matrix modulation is guaranteed to be stable. Analytical constraints on the pole locations of the FDN help to describe the modulation effect in depth. Further, techniques and conditions for continuous feedback matrix modulation are presented.
Article
A simple, robust method for measuring echo density from a reverberation impulse response is presented. Based on the property that a reverberant field takes on a Gaussian distribution once an acoustic space is fully mixed, the measure counts samples lying outside a standard deviation in a given impulse response window and normalizes by that expected for Gaussian noise. The measure is insensitive to equalization and reverberation time, and is seen to perform well on both artificial reverberation and measurements of room impulse responses. Listening tests indicate a correlation between echo density measured in this way and perceived temporal quality or texture of the reverberation.