Conference PaperPDF Available

Dense Reverberation With Delay Feedback Matrices

Authors:

Abstract

This paper received the best paper award at WASPAA 2019. Feedback delay networks (FDNs) belong to a general class of re-cursive filters which are widely used in artificial reverberation and decorrelation applications. One central challenge in the design of FDNs is the generation of sufficient echo density in the impulse response without compromising the computational efficiency. In a previous contribution, we have demonstrated that the echo density of an FDN grows polynomially over time, and that the growth depends on the number and lengths of the delays. In this work, we introduce so-called delay feedback matrices (DFMs) where each matrix entry is a scalar gain and a delay. While the computational complexity of DFMs is similar to a scalar-only feedback matrix, we show that the echo density grows significantly faster over time, however, at the cost of non-uniform modal decays.
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
DENSE REVERBERATION WITH DELAY FEEDBACK MATRICES
Sebastian J. Schlecht and Emanuël A.P. Habets
International Audio Laboratories Erlangen, Germany
sebastian.schlecht@audiolabs-erlangen.de
ABSTRACT
Feedback delay networks (FDNs) belong to a general class of re-
cursive filters which are widely used in artificial reverberation and
decorrelation applications. One central challenge in the design of
FDNs is the generation of sufficient echo density in the impulse
response without compromising the computational efficiency. In a
previous contribution, we have demonstrated that the echo density
of an FDN grows polynomially over time, and that the growth de-
pends on the number and lengths of the delays. In this work, we
introduce so-called delay feedback matrices (DFMs) where each
matrix entry is a scalar gain and a delay. While the computational
complexity of DFMs is similar to a scalar-only feedback matrix,
we show that the echo density grows significantly faster over time,
however, at the cost of non-uniform modal decays.
Index TermsFeedback Delay Network, Artificial Reverber-
ation, Echo Density
1. INTRODUCTION
If sound is emitted in a room, the sound waves travel across the
space and are repeatedly reflected at the room boundaries resulting
in acoustic reverberation [1]. If the sound is reflected at a smooth
boundary the reflection is coherent (specular), while it is incoher-
ent (scattered) when reflected by a rough surface. In small to large
rooms, the mean free path of specular reflections is between 10 -
100 ms, whereas the time scale of incoherent reflections is a dense
response of a few milliseconds [2]. In geometric room acoustic, an
incoherent reflection can be effectively generated from a set of close
image sources [2], see Fig. 1. Consequently, scattering increases the
total number of image sources and therefore the echo density [3]. In
this work, we propose a method to effectively introduce scattering-
like effects for artificial reverberation filter structure.
Many artificial reverberators have been developed in recent
years [4] among which the feedback delay network (FDN), orig-
inally proposed by Gerzon [5] and further developed by Jot and
Chaigne [6], is one of the most popular. The FDN consists of N
delay lines combined with attenuation filters which are fed back
via a scalar feedback matrix A. To maximize the echo density,
we assume throughout this work that feedback matrix Ais dense,
i.e., all matrix entries are non-zero [7]. In [3], the present authors
demonstrated that the absolute echo density of an FDN grows poly-
nomially with the degree equal to the number of delay lines Nand
inversely to the delay lengths, i.e., shorter delay lines produce faster
increasing absolute echo density. A major challenge of FDN design
is to achieve sufficient echo density because of an inherent trade-
off between three aspects: computational complexity, mode density
The International Audio Laboratories Erlangen are a joint institution of
the Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) and Fraun-
hofer Institut für Integrierte Schaltungen IIS.
Source
Receiver
Wall 1
Wall 2
First order
image sources
Second order
image sources
Figure 1: Second order reflections at two rough surfaces represented
with an equivalent set of image sources (inspired by [2]). The en-
ergy of the image sources is indicated by the color saturation.
and echo density. A higher number of delays increases both modal
and echo density, but also the computational complexity. With a
fixed number delays, shorter delays increases the echo density, how-
ever, results simultaneously in a lower modal density [6].
To reproduce a scattering-like effect, we require a set of long
delays which are proportional to the mean free path [8], and a set of
filters to add the short-term density to each reflection (see Fig. 1).
Feedforward-feedback allpass filters have been introduced in series
to increase the short-term echo density [9, 10]. Alternatively, all-
pass filters may be placed after the delay lines [11] which in turn
doubles the effective size of the FDN [12]. Instead of allpass filters,
FIR filters with pseudo-random exponentially decaying coefficients
were proposed [13].
In this contribution, we propose an FDN with a delay feedback
matrix (DFM), i.e., each matrix entry has a delay and a gain. We
demonstrate that the DFM can increase the echo density signifi-
cantly without introducing further filtering. In Section 2, we present
the new filter structure and compare it to the standard FDN. In Sec-
tion 3, we demonstrate the echo density of the proposed FDN. In
Section 4, an example structure is presented.
2. PROPOSED FEEDBACK DELAY NETWORK
In the following, we present the proposed extension to the FDN and
explain the stability conditions.
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
A(z)
X(z)b1
zm1
c1Y(z)
b2
zm2
c2
b3
zm3
c3
Figure 2: Delay feedback delay network with three delays (N= 3)
and a delay feedback matrix A(z)instead of the standard scalar
feedback matrix A.
2.1. Delay Feedback Matrix
The proposed FDN is similarly structured as a standard FDN, but
employs a delay feedback matrix (DFM) A(z)instead of a scalar
feedback matrix A(see Fig. 2). The transfer function of the pro-
posed FDN is
H(z) = c>[Dm(z1)A(z)]1b+d, (1)
where the column vectors band cof bi’s and cis, re-
spectively. The lengths of the Ndelays in samples are
given by m= [m1,...,mN]NN.Dm(z) =
diag[zm1, zm2,...,zmN]is the diagonal N×Ndelay ma-
trix. The attenuation filters which are usually applied in the feed-
back loop are omitted for clarity and can be added as an additional
processing unit in series with the delay lines [6]. The N×NDFM
A(z)is given by
A(z) = UDM(z),(2)
where denotes the Hadamard (element-wise) product, Uis a
scalar matrix and Mare the non-negative integer matrix delays and
the matrix entries of the delay matrix DM(z)are [DM(z)]ij =
zMij where 1i, j N. If all matrix delays are zero, i.e.,
M=0, the DFM becomes the standard scalar feedback matrix.
For this reason, we denote the zero matrix delay with MS=0.
From a design perspective, the matrix delays of the DFM M
should be relatively small compared to the main delays m. This
is because the attenuation filters are typically proportional to the
main delays and every deviation of this proportionality by the DFM
distorts the reverberation time specification. To achieve a scattering-
like effect, the matrix delays Mare set to less than 10% of the
corresponding main delays m(see Section 3).
The implementation of the DFM is similar to the scalar feed-
back matrix. The main difference is that the signal vector is re-
trieved by N2memory access operations instead of N. All other
arithmetic operations, i.e., element-wise multiplication with Uand
the writing operations to the delay lines, are unaltered. Conse-
quently, the additional computational complexity introduced by the
DFM is relatively small.
2.2. Stability
The poles of the proposed FDN are the eigenvalues of the polyno-
mial matrix P(z) = Dm(z1)A(z), which in turn are the roots
of the polynomial p(z) = det(P(z)) [14]. The polynomial degree
of p(z)is then equal to the total number of system poles of the pro-
posed FDN. FDNs are commonly designed as lossless systems, i.e.,
all system poles lie on the unit circle, to allow precise control over
the reverberation time by introducing additional proportional atten-
uation filters [6]. The lossless property of general unitary-networks
which in particular applies to the proposed FDN, was described by
Gerzon [15]. A sufficient lossless condition is given by A(z)being
paraunitary, i.e., A(z1)>A(z) = Ifor real coefficients, where I
is the identity matrix [15].
Consequently, an FDN with a scalar feedback matrix, i.e.,
MS=0, is lossless if Uis unitary. Further, we show that A(z)is
paraunitary, if
MP
ij =mout
i+min
jfor 1i, j N, (3)
where mout,min NNare vectors of integer delays in samples
and we denote MP
ij as the matrix elements of matrix delays MP.
From close inspection, it can be observed that
A(z) = UDMP(z) = Dmin (z)U Dmout (z).(4)
As paraunitary matrices are closed under multiplication and
Dmin (z)and Dmout (z)are paraunitary, we can conclude that
A(z)is paraunitary, if Uis unitary.
Unlike the scalar case MSand the paraunitary case MP,A(z)
is not necessarily paraunitary for other matrix delays MNP such
that no lossless FDN might exist for such matrix delays. In the
following, we show a sufficient condition for the proposed FDN to
be stable instead. To establish the stability condition, we recall a
version of Rouché’s theorem for polynomial matrices:
Theorem 1 (Rouché, [16]).Let Q(z)and R(z)be matrix polyno-
mials. Assume that Q(z)is invertible on the simple closed curve
γG. If kQ(z)1R(z)k2<1for all zγ, then Q(z) + R(z)
and Q(z)have the same number of eigenvalues inside γ, counting
multiplicities.
With this theorem, we can prove the following stability condi-
tion:
Theorem 2. The proposed FDN in (1) is stable if
kA(eıω)k2<1for all ω . (5)
Proof. So, let us assign Q(z) = Dm(z1)and R(z) = A(z)
such that P(z) = Q(z) + R(z). Further, let us choose the unit cir-
cle as the closed curve γ. Now, all the eigenvalues of Q(z)are zero
and therefore within γ. For zγ, we have z=eıω and Q(eıω)
is unitary for all ωsuch that kQ(eıω)1R(eıω )k2=kR(eıω )k2.
Thus according to Theorem 1, all eigenvalues of P(z)are within
the unit circle if kR(eıω)k2<1.
Theorem 2 provides also practical means to stabilize any non-
stable FDN by dividing A(z)by maxωkA(eıω)k2. Nonetheless,
the missing lossless property makes the non-paraunitary DFM more
tuning-intensive to avoid unwanted ringing modes as the modal de-
cays are possibly non-uniform. While there might be possible op-
timization schemes to improve the modal decay distribution, in this
pioneering study, we choose the non-paraunitary DFM empirically.
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
m1
m2
time n
(a) Standard FDN with scalar feedback matrix.
m1
m2
time n
(b) Proposed FDN with a paraunitary DFM.
m1
m2
time n
(c) Proposed FDN with a non-paraunitary DFM.
Figure 3: Echo lattice FDNs with two delay lines and three different DFMs: scalar MS, paraunitary MPand non-paraunitary MNP . Each
dot indicates a single echo. The x and y coordinates indicate the time component associated with the first and second delay line, respectively,
m1and m2. The echo time nis indicated by the diagonal dotted line. The matrix delays Mare added diagonally. The red arrows indicate
the three paths p∈ {[1,2,2],[2,1,2],[2,1,1]}. More details regarding the echo lattice can be found in [3].
3. ECHO DENSITY
In this section, we present the echo density1of the proposed FDN
and show that it grows significantly faster with reflection order for
paraunitary and non-paraunitary DFMs.
3.1. Absolute Echo Density
The absolute echo density, i.e., the number of echoes at the output
per time unit, of the standard FDN is polynomial over time and
the exact polynomial is given in [3]. Echoes are closely related
to echo paths pdescribing the sequence of delay lines which are
traversed by an impulse input until it reaches the output. Formally,
an echo path can be defined as p∈ {1,2,...,N}lof length lwhich
denotes the number of involved delay lines. The echo time npis
defined as the time at which the echo appears at the output of the
FDN. In standard FDN, the echo time npof path pis given by
np=Pl
i=1 mpi[3], whereas the more general version for the
proposed FDN is
np=
l
X
i=1
mpi+
l1
X
i=1
Mpi+1,pi.(6)
We rewrite (6) in terms of the delay count
[cp]i=|{k|pk=i}| for 1iN(7)
and the matrix routing count
[Cp]ij =|{k|pk=jand pk+1 =i}| for 1i, j N, (8)
where |·| denotes the cardinality of a set. Hence, the delay and ma-
trix routing counts denote the number of occurrences of delay lines
and matrix delays in an echo path p, respectively. These counts sim-
plify the following development as they fully determine the echo
time np. In fact, the echo time of path pmay be expressed by
np=
N
X
i=1
[cpm]i+
N
X
i,j=1
[CpM]ij .(9)
1Although echo typically refers to a subjectively distinct reflection [1],
we refer to a non-zero output of the FDN as an echo. This definition is in
accordance to the term echo density in the literature [17, 3].
It is crucial to observe that the echo time npfor a scalar feed-
back matrix (MS=0) is invariant under permutations of pbe-
cause cpis invariant under permutations. The number of paths pof
length lwith different echo times npis
ES(l) = N+l1
N1!.(10)
This quantity is a special case of the equilateral approximation given
in [3]. For a paraunitary DFM with MPas in (3) with mout and
min, we have
MP
pi+1,pi=mout
pi+1,+min
pi(11)
such that npis invariant under permutation except for the first and
last element. Thus, the number of paths pof length l2with
different echo times npis
EP(l) = N2 N+l3
N1!.(12)
Because of the bound on the binomial coeffcients q
rqr
r!, the
number of paths ES(l)and EP(l)grows polynomially with length
l. For a non-paraunitary DFM with properly chosen matrix de-
lays MNP, the echo time npis different for each permutation of
pwhere the contribution of MNP is different. More precisely,
the echo times of two paths pand qare equal if cp=cqand
Cp=Cq. The number of paths ENP(l)is difficult to give in
closed form. In fact, ENP(l)is equal to the number of 2-abelian
equivalence classes of words of length lover an alphabet of size N
[18]. In [19, Theorem 5.1], Cassaigne et al. recently showed that
ENP(l)is asymptotically equal to r lN(N1) ,(13)
where ris a rational constant depending on N. Thus, ENP (l)grows
polynomially, however with degree N(N1) instead of N. Fig. 3
illustrates the echo times of an FDNs with different DFMs in the
echo lattice. In the following, we present a quantitative analysis of
the absolute echo densities depending on the feedback matrix.
The derivation of a closed form expression of the absolute echo
density requires the usages of sophisticated methods (see [3]) which
is beyond the scope for this work. However, if we neglect the spe-
cific delay lengths, the number of echo paths with discrete echo
times can be related to the absolute echo density. A first approx-
imation of the absolute echo density can be given by scaling the
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
0 500 1,000 1,500 2,000 2,500 3,000
0
2
4
6
Time [ms]
Amplitude and Echo Density [lin]
SP NP
Figure 4: Echo density profile and impulse response of the proposed
FDN with three different DFMs: scalar (S), paraunitary (P), and,
non-paraunitary (NP) and damping factor γ. The solid line indicates
the impulse response, whereas the dashed line is the measured echo
density profile. The three plots are offset for better readability by 2.
number of echo paths with the geometric mean of the main delay
lengths QN
i=1 m1/N
i.
Similar to the standard FDN case, the main delays mand delay
matrix Mshould be chosen such that as few as possible echo times
npcoincide. Naturally, this condition is only satisfiable in the early
part of the impulse response and that is only where it mostly matters
from a perceptual point of view [3].
3.2. Echo Density Profile
Abel and Huang proposed in [17, 20] the echo density profile. The
underlying assumption is that the sound pressure amplitudes in a
reverberant sound field exhibit a Gaussian distribution. With a short
sliding rectangular window of 23 ms, the empirical standard de-
viation of impulse response amplitudes is calculated. To deter-
mine how well the empirical amplitude distribution approximates
a Gaussian behavior, the proportion of samples outside the empir-
ical standard deviation is determined and compared to the propor-
tion expected for a Gaussian distribution. With increasing time and
increasing echo density, the echo density profile increases reaches
one for a fully dense frame of the impulse response. The perceptual
validity was confirmed in [21].
4. EXAMPLES
We give an example of the proposed FDN and compare it to a
standard FDN. The number of delay lines N= 4, and the delays
are m= [15805,5001,9535,7201] samples, with a sampling fre-
quency of 48 kHz. The delays are chosen extremely high on pur-
pose to make the difference in echo density both easily audible and
visible. Nonetheless, the echo density is affected equivalently for
proportionally shorter delays. The matrix gains are an orthogonal
Hadamard matrix
U=
0.5 0.5 0.5 0.5
0.50.5 0.50.5
0.5 0.50.50.5
0.50.50.5 0.5
(14)
Table 1: Number of paths with different echo times for N= 4 and
the three different delay matrices MS,MP, and, MNP .
l2345678
ES(l)10 20 35 56 84 120 165
EP(l)16 64 160 320 560 896 1344
ENP(l)16 64 244 856 2728 7892 20876
and input band output gains care all 1. The delay matrices are
MP=
456 1 10 447
751 296 305 742
511 56 65 502
647 192 201 638
,
MNP =
963 950 556 770
139 858 489 21
286 3 773 137
610 525 162 117
.
We have introduced a reverberation time of approximately 3 s by
replacing each delay element z1by γz1, where γ= 0.99995
for the scalar and paraunitary case, and γ= 0.99992 for the non-
paraunitary case to guarantee stability according to Theorem 2. Ta-
ble 1 shows the number of paths with different echo times for the
three DFM types: ES(l),EP(l), and, ENP(l). As shown in Sec-
tion 3, the number of echoes produced by the paraunitary and non-
paraunitary DFMs are significantly higher than for the scalar DFM.
Fig. 4 shows the impulse responses and the corresponding echo
density profiles. As predicted by the number of echo paths in Sec-
tion 3, the echo density profile of the impulse responses increases
from the scalar (S) to the paraunitary (P), and further to the non-
paraunitary (NP) feedback matrix. For instance, the echo density
profile at time 1.5 s in this order is 0.05, 0.25, and, 0.82, respec-
tively. For illustration, we have synthesized audio examples from
the three DFMs depicted in Fig. 4 and provided them online2. It is
easily audible that the impulse responses of (P) and (NP) become
dense much more quickly than for (S). However, at the end of the
impulse response of (NP), slight metallic ringing is perceivable, due
to the non-uniform distribution of modal decays caused by the non-
lossless property discussed in Section 2.
5. CONCLUSION
In this work, we propose an extension to the feedback delay net-
work (FDN) by replacing the scalar feedback matrix with a delay
feedback matrix (DFM), which introduces a different delay for each
feedback matrix entry. Based on the stability property, we present
two types of DFMs: paraunitary and non-paraunitary. The echo
density of FDNs with such DFMs is discussed in terms of the num-
ber of echo paths of increasing length. Alternatively, we analyze
the perceived echo density of the impulse responses with the echo
density profile. In an example, we show that both proposed DFMs
increases the echo density significantly. While the non-paraunitary
DFM surpasses the echo density of the paraunitary DFM, it also in-
troduces metallic ringing due to non-uniform modal decays which
impedes the tuning for high-quality artificial reverberation.
2https://www.audiolabs-erlangen.de/resources/
2019-WASPAA- DFM-FDN/
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 20-23, 2019, New Paltz, NY
6. REFERENCES
[1] H. Kuttruff, Room Acoustics, Fifth Edition. CRC Press, June
2009.
[2] S. Siltanen, T. Lokki, S. Tervo, and L. Savioja, “Modeling
incoherent reflections from rough room surfaces with image
sources,” J. Acoust. Soc. Amer., vol. 131, no. 6, pp. 4606–
4614, 2012.
[3] S. J. Schlecht and E. A. P. Habets, “Feedback delay networks:
Echo density and mixing time,” IEEE/ACM Trans. Audio,
Speech, Lang. Proc., vol. 25, no. 2, pp. 374–383, 2017.
[4] V. Välimäki, J. D. Parker, L. Savioja, J. O. Smith III, and
J. S. Abel, “Fifty years of artificial reverberation,IEEE/ACM
Trans. Audio, Speech, Lang. Proc., vol. 20, no. 5, pp. 1421–
1448, July 2012.
[5] M. A. Gerzon, “Synthetic stereo reverberation: Part One,” Stu-
dio Sound, vol. 13, pp. 632–635, 1971.
[6] J. M. Jot and A. Chaigne, “Digital delay networks for design-
ing artificial reverberators,” in Proc. Audio Eng. Soc. Conv.,
Paris, France, Feb. 1991, pp. 1–12.
[7] D. Rocchesso, “Maximally diffusive yet efficient feedback de-
lay networks for artificial reverberation,IEEE Signal Pro-
cess. Lett., vol. 4, no. 9, pp. 252–255, 1997.
[8] ——, “The Ball within the Box: A Sound-Processing
Metaphor,” Comput. Music J., vol. 19, no. 4, pp. 47–57, 1995.
[9] M. R. Schroeder and B. F. Logan, “"Colorless" artificial rever-
beration,” Audio, IRE Transactions on, vol. AU-9, no. 6, pp.
209–214, 1961.
[10] J. A. Moorer, “About this reverberation business,Comput.
Music J., vol. 3, no. 2, pp. 13–17, June 1979.
[11] L. Dahl and J. M. Jot, “A Reverberator based on Absorbent
All-pass Filters,” in Proc. Int. Conf. Digital Audio Effects
(DAFx), Verona, Italy, Dec. 2000, pp. 1–6.
[12] S. J. Schlecht and E. A. P. Habets, “Time-varying feedback
matrices in feedback delay networks and their application in
artificial reverberation,J. Acoust. Soc. Amer., vol. 138, no. 3,
pp. 1389–1398, Sept. 2015.
[13] P. Rubak and L. G. Johansen, “Artificial Reverberation Based
on a Pseudo-Random Impulse Response,” in Proc. Audio Eng.
Soc. Conv., Amsterdam, The Netherlands, May 1998, pp. 1–
14.
[14] S. J. Schlecht and E. A. P. Habets, “Modal decomposition
of feedback delay networks ,” IEEE Trans. Signal Process.,
2019.
[15] M. A. Gerzon, “Unitary (energy-preserving) multichannel
networks with feedback,” Electronics Letters, vol. 12, no. 11,
pp. 278–279, 1976.
[16] T. R. Cameron, “Spectral Bounds for Matrix Polynomials with
Unitary Coefficients,Electronic Journal of Linear Algebra,
vol. 30, no. 1, pp. 585–591, Feb. 2015.
[17] J. S. Abel and P. P. Huang, “A Simple, Robust Measure of
Reverberation Echo Density,” in Proc. Audio Eng. Soc. Conv.,
San Francisco, CA, USA, Oct. 2006, pp. 1–10.
[18] J. Karhumäki, S. Puzynina, M. Rao, and M. A. Whiteland,
“On cardinalities of k-abelian equivalence classes,Theoreti-
cal Computer Science, vol. 658, pp. 190–204, Jan. 2017.
[19] J. Cassaigne, J. Karhumäki, S. Puzynina, and M. A. White-
land, “k-Abelian Equivalence and Rationality,” Fundamenta
Informaticae, vol. 154, no. 1-4, pp. 65–94, Jan. 2017.
[20] P. P. Huang and J. S. Abel, “Aspects of reverberation echo den-
sity,” in Proc. Audio Eng. Soc. Conv., New York, NY, USA,
Oct. 2007, pp. 1–7.
[21] A. Lindau, L. Kosanke, and S. Weinzierl, “Perceptual Eval-
uation of Model- and Signal-Based Predictors of the Mixing
Time in Binaural Room Impulse Responses,J. Audio Eng.
Soc., vol. 60, no. 11, pp. 887–898, 2012.
... Recent research on FDNs has focused on mixing matrix design to increase echo density [5], modal analysis [6], [7], time-varying FDNs [8], allpass FDNs for colorless reverberation [9] and scattering FDNs for dense reverberation [10]. Grouping of delay lines to control direction-dependent energy decay, known as the Directional Feedback Delay Network, was proposed by Alary et al. in [11]. ...
... Schlecht has proposed adding delays [5] and filters [10] to the feedback matrix in an FDN. This has the advantage of increasing the echo density of the RIR more quickly, leading to dense, rich-sounding reverberation. ...
... The feedback matrix is energy-preserving if it is paraunitary [5]. A paraunitary matrix must satisfy the following criteria ...
Article
Full-text available
Feedback Delay Networks are one of the most popular and efficient means of generating artificial reverberation. Recently, we proposed the Grouped Feedback Delay Network (GFDN), which couples multiple FDNs while maintaining system stability. The GFDN can be used to model reverberation in coupled spaces that exhibit multi-stage decay. The block feedback matrix determines the inter- and intra-group coupling. In this paper, we expand on the design of the block feedback matrix to include frequency-dependent coupling among the various FDN groups. We show how paraunitary feedback matrices can be designed to emulate diffraction at the aperture connecting rooms. Several methods for the construction of nearly paraunitary matrices are investigated. The proposed method supports the efficient rendering of virtual acoustics for complex room topologies in games and XR applications.
... Since a time-varying feedback matrix is less likely to cause artifacts in the reverberation sound, this method has been found to improve the sound quality of the reverberation tail [23]. Another approach is to introduce short delays in the feedback matrix, so that each matrix element consists of a gain and a delay [24]. Also, separate early reflection modules using finite impulse response (FIR) filters for FDNs have been suggested [25]. ...
... FDN32 is considered to produce a sufficiently dense impulse response, whereas FDN16 has an impulse response that is slightly too sparse and would benefit from improvement. Figure 7: Normalized echo density of two conventional FDNs, a DFM-FDN [24], and the three proposed VFDN structures. The OVN30 configuration has VNSs only at the output, and the OVN15 and VN15 configurations have VNSs both at the input and the output. ...
... All the proposed structures in Fig. 7 have the same delay lines and feedback matrix as the FDN16. The delay feedback matrix (DFM) corresponds to a recently proposed FDN structure having delay lines in its feedback matrix [24]. Fig. 7 shows the growth of the echo density of the VFDN16 with the short VN15 sequences is even faster than that of the target FDN32. ...
Conference Paper
Full-text available
Artificial reverberation is an audio effect used to simulate the acoustics of a space while controlling its aesthetics, particularly on sounds recorded in a dry studio environment. Delay-based methods are a family of artificial reverberators using recirculating delay lines to create this effect. The feedback delay network is a popular delay-based reverberator providing a comprehensive framework for parametric reverberation by formalizing the recirculation of a set of interconnected delay lines. However, one known limitation of this algorithm is the initial slow build-up of echoes, which can sound unrealistic, and overcoming this problem often requires adding more delay lines to the network. In this paper, we study the effect of adding velvet-noise filters, which have random sparse coefficients, at the input and output branches of the reverberator. The goal is to increase the echo density while minimizing the spectral coloration. We compare different variations of velvet-noise filtering and show their benefits. We demonstrate that with velvet noise, the echo density of a conventional feedback delay network can be exceeded using half the number of delay lines and saving over 50% of computing operations in a practical configuration using low-order attenuation filters.
... showing that the system poles are the eigenvalues of A [3]. Unfortunately, for large delays m, this eigenvalue problem quickly becomes numerically intractable [32]. An efficient numerical method to compute the modal decomposition (5) from (2) using the Ehrlich-Aberth Iteration is presented in [32]. ...
... showing that the system poles are the eigenvalues of A [3]. Unfortunately, for large delays m, this eigenvalue problem quickly becomes numerically intractable [32]. An efficient numerical method to compute the modal decomposition (5) from (2) using the Ehrlich-Aberth Iteration is presented in [32]. ...
Article
Full-text available
Feedback delay networks (FDNs) are used in audio processing and synthesis. The modal shapes of the system describe the modal excitation by input and output signals. Previously, the Ehrlich-Aberth method was used to find modes in large FDNs. Here, the method is extended to the corresponding eigenvectors indicating the modal shape. In particular, the computational complexity of the proposed analysis method does not depend on the delay-line lengths and is thus suitable for large FDNs, such as artificial reverberators. We show the relation between the compact generalized eigenvectors in the delay state space and the spatially extended modal shapes in the state space. We illustrate this method with an example FDN in which the suggested modal excitation control does not increase the computational cost. The modal shapes can help optimize input and output gains. This letter teaches how selecting the input and output points along the delay lines of an FDN adjusts the spectral shape of the system output.
... showing that the system poles are the eigenvalues of A [3]. Unfortunately, for large delays m, this eigenvalue problem quickly becomes numerically intractable [32]. An efficient numerical method to compute the modal decomposition (5) from (2) using the Ehrlich-Aberth Iteration is presented in [32]. ...
... showing that the system poles are the eigenvalues of A [3]. Unfortunately, for large delays m, this eigenvalue problem quickly becomes numerically intractable [32]. An efficient numerical method to compute the modal decomposition (5) from (2) using the Ehrlich-Aberth Iteration is presented in [32]. ...
Preprint
Full-text available
Feedback delay networks (FDNs) are used in audio processing and synthesis. The modal shapes of the system describe the modal excitation by input and output signals. Previously, the Ehrlich-Aberth method was used to find modes in large FDNs. Here, the method is extended to the corresponding eigenvectors indicating the modal shape. In particular, the computational complexity of the proposed analysis method does not depend on the delay-line lengths and is thus suitable for large FDNs, such as artificial reverberators. We show the relation between the compact generalized eigenvectors in the delay state space and the spatially extended modal shapes in the state space. We illustrate this method with an example FDN in which the suggested modal excitation control does not increase the computational cost. The modal shapes can help optimize input and output gains. This letter teaches how selecting the input and output points along the delay lines of an FDN adjusts the spectral shape of the system output.
... A main challenge in designing FDNs is to generate a sufficient echo density in the RIR while maintaining computational efficiency [26]. To accelerate the echo density growth over time and reproduce a scattering-like effect, the mixing matrix U can be generalized to a filter matrix [27], where each entry is a finite impulse response (FIR) filter. ...
Conference Paper
Full-text available
This paper seeks to improve the state-of-the-art in delay-network-based analysis-synthesis of measured room impulse responses (RIRs). We propose an informed method incorporating improved energy decay estimation and synthesis with an optimized feedback delay network. The performance of the presented method is compared against an end-to-end deep-learning approach. A formal listening test was conducted where participants assessed the similarity of reverberated material across seven distinct RIRs and three different sound sources. The results reveal that the performance of these methods is influenced by both the excitation sounds and the reverberation conditions. Nonetheless, the proposed method consistently demonstrates higher similarity ratings compared to the end-to-end approach across most conditions. However, achieving an indistinguishable synthesis of measured RIRs remains a persistent challenge, underscoring the complexity of this problem. Overall, this work helps improve the sound quality of analysis-based artificial reverberation.
... The primary criterion for choosing the feedback matrix for an FDN is ensuring losslessness of the structure, i.e., the energy of the system should not decay when the attenuation filters are not in use [202,203,250]. Apart from this, the matrices are also used to enhance specific properties of the FDN, such as the increase in the echo density, computational efficiency, and spectral flatness [26,225,226,251,252,253]. ...
Thesis
Full-text available
In this dissertation, the discussion is centered around the sound energy decay in enclosed spaces. The work starts with the methods to predict the reverberation parameters, followed by the room impulse response measurement procedures, and ends with an analysis of techniques to digitally reproduce the sound decay. The research on the reverberation in physical spaces was initiated when the first formula to calculate room's reverberation time emerged. Since then, finding an accurate and reliable method to predict reverberation has been an important area of acoustic research. This thesis presents a comprehensive comparison of the most commonly used reverberation time formulas, describes their applicability in various scenarios, and discusses their accuracy when compared to results of measurements. The common sources of uncertainty in reverberation time calculations, such as bias introduced by air absorption and error in sound absorption coefficient, are analyzed as well. The thesis shows that decreasing such uncertainties leads to a good prediction accuracy of Sabine and Eyring equations in diverse conditions regarding sound absorption distribution. The measurement of the sound energy decay plays a crucial part in understanding the propagation of sound in physical spaces. Nowadays, numerous techniques to capture room impulse responses are available, each having its advantages and drawbacks. In this dissertation, the majority of commonly used measurement techniques are listed, whereas the exponential swept-sine is described in more detail. This work elaborates on the external factors that may impair the measurements and introduce error to their results, such as stationary and non-stationary noise, as well as time variance. The dissertation introduces Rule of Two, a method of detecting nonstationary disturbances in sweep measurements. It also shows the importance of using median as a robust estimator in non-stationary noise detection. Artificial reverberation is a popular sound effect, used to synthesize sound energy decay for the purpose of audio production. This dissertation offers an insight into artificial reverberation algorithms based on recursive structures. The filter design proposed in this work offers precise control over the decay rate while being efficient enough for real-time implementation. The thesis discusses the role of the delay lines and feedback matrix in achieving high echo density in feedback delay networks. It also shows that four velvet-noise sequences are sufficient to obtain smooth output in interleaved velvet noise reverberator. The thesis shows that the accuracy of reproduction increases the perceptual similarity between measured and synthesised impulse responses. The insights collected in this dissertation offer insights into the intricacies of reverberation prediction, measurement and synthesis. The results allow for reliable estimation of parameters related to sound energy decay, and offer an improvement in the field of artificial reverberation.
... Recent research extended Schroeder allpass filters and reverberators, e.g., allowing frequency-dependent gains in Schroeder allpass filters [35]; connecting FDNs to room geometry [36]; adding controls of directional distribution of sound to FDNs [37]; imbuing FDNs with the allpass prop- erty [34]; generalizing FDN feedback to a matrix of filters [38], including the case of velvet noise [39,40] feedback matrices in particular [41]; and studying coupled and parallel FDNs [42,43]. This article complements these works, providing new insight on Schroeder allpass filters and FDN architectures with good time-varying properties. ...
... To minimize this occurrence, [188] proposed to quantifying this overlap to help select an optimal set of lengths. Recently, [189] proposed adding small delays in the recirculating matrix to ensure that every recirculation path has a unique combined length. However, these small delays have an impact on the modal distribution, which was improved in [190]. ...
Thesis
Full-text available
Available online with the related articles at: http://urn.fi/URN:ISBN:978-952-64-0472-1 In this dissertation, the reproduction of reverberant sound fields containing directional characteristics is investigated. A complete framework for the objective and subjective analysis of directional reverberation is introduced, along with reverberation methods capable of producing frequency- and direction-dependent decay properties. Novel uses of velvet noise are also proposed for the decorrelation of audio signals as well as artificial reverberation. The methods detailed in this dissertation offer the means for the auralization of reverberant sound fields in real-time, with applications in the context of Immersive sound reproduction such as virtual and augmented reality.
... Since then, FDNs have become one of the most popular structures for synthesizing reverberation due to the relative efficiency of the approach. Recent research on FDNs has focused on mixing matrix design to increase echo density [6], modal analysis [7,8], time-varying FDNs [9], scattering FDNs [10], and reverberation time control by accurate design of the decay filters [11,12]. ...
Article
Full-text available
Delay Network reverberators are an efficient tool for synthesizing reverberation. We propose a novel architecture, called the Grouped Feedback Delay Network (GFDN) reverberator, with groups of delay lines sharing different target decay rates, and use it to simulate coupled room acoustics. Coupled spaces are common in apartments, concert halls, and churches where two or more volumes with different reverberation characteristics are linked via an aperture. The difference in reverberation times (T60s) of the coupled spaces leads to unique phenomena, such as multi-stage decay. Here the GFDN is used to simulate coupled spaces with groups of delay line filters representing the T60 s of the coupled rooms. A parameterized, orthonormal mixing matrix is presented that provides control over the mixing times of the rooms and amount of coupling between the rooms. As an example application we measure a coupled bedroom and bathroom system separated by a door in an apartment and use the GFDN to synthesize the late field for different openings of the door separating the two rooms, thereby varying coupling between the rooms.
... An FDN is lossless if Apzq is paraunitary, i.e., Apz -1 q H Apzq " I, where I is the identity matrix and¨H denotes the complex conjugate transpose [31]. For real scalar matrices A, the FDN is lossless if A is orthogonal, i.e., A J A " I. However also non-orthogonal feedback matrices may yield lossless FDNs [32,33], and we give some examples in Section 3. ...
Conference Paper
Full-text available
Feedback delay networks (FDNs) are recursive filters, which are widely used for artificial reverberation and decorrelation. While there exists a vast literature on a wide variety of reverb topologies, this work aims to provide a unifying framework to design and analyze delay-based reverberators. To this end, we present the Feedback Delay Network Toolbox (FDNTB), a collection of the MAT-LAB functions and example scripts. The FDNTB includes various representations of FDNs and corresponding translation functions. Further, it provides a selection of special feedback matrices, topologies, and attenuation filters. In particular, more advanced algorithms such as modal decomposition, time-varying matrices, and filter feedback matrices are readily accessible. Furthermore, our toolbox contains several additional FDN designs. Providing MATLAB code under a GNU-GPL 3.0 license and including illustrative examples, we aim to foster research and education in the field of audio processing.
Article
Full-text available
Feedback delay networks (FDNs) belong to a general class of recursive filters which are widely used in sound synthesis and physical modeling applications. We present a numerical technique to compute the modal decomposition of the FDN transfer function. The proposed pole finding algorithm is based on the Ehrlich-Aberth iteration for matrix polynomials and has improved computational performance of up to three orders of magnitude compared to a scalar polynomial root finder. The computational performance is further improved by bounds on the pole location and an approximate iteration step. We demonstrate how explicit knowledge of the FDN's modal behavior facilitates analysis and improvements for artificial reverberation. The statistical distribution of mode frequency and residue magnitudes demonstrate that relatively few modes contribute a large portion of impulse response energy.
Article
Full-text available
The mixing time of room impulse responses denotes the moment when the diffuse reverberation tail begins. A diffuse ("mixed") sound field can physically be defined by (1) equidistribution of acoustical energy and (2) a uniform acoustical energy flux over the complete solid angle. Accordingly, the perceptual mixing time could be regarded as the moment when the diffuse tail cannot be distinguished from that of any other position or listener's orientation in the room. This, for instance, provides an opportunity for reducing the part of binaural room impulse responses that has to be updated dynamically in Virtual Acoustic Environments. Several authors proposed model- and signal-based estimators for the mixing time in rooms. Our study aims at an evaluation of all measures as predictors of a perceptual mixing time. Therefore, we collected binaural impulse response data sets with an adjustable head and torso simulator for a representative sample of rectangular shaped rooms. Altering the transition time into a homogeneous diffuse tail in real time in an adaptive, forced-choice listening test, we determined just audible perceptual mixing times.We evaluated the performance of all potential predictors by linear regression and finally obtained formulae to estimate the perceptual mixing time from measured impulse responses or physical properties of the room.
Article
Full-text available
Article
Two words u and v are said to be k-abelian equivalent if, for each word x of length at most k, the number of occurrences of x as a factor of u is the same as for v. We study some combinatorial properties of k-abelian equivalence classes. Our starting point is a characterization of k-abelian equivalence by rewriting, so-called k-switching. Using this characterization we show that, over any fixed alphabet, the language of lexicographically least representatives of k-abelian equivalence classes is a regular language. From this we infer that the sequence of the numbers of equivalence classes is -rational. Furthermore, we show that the above sequence is asymptotically equal to a certain polynomial depending on k and the alphabet size.
Article
Feedback delay networks (FDNs) are frequently used to generate artificial reverberation. This contribution discusses the temporal features of impulse responses produced by FDNs, i.e., the number of echoes per time unit and its evolution over time. This so-called echo density is related to known measures of mixing time and their psychoacoustic correlates such as auditive perception of the room size. It is shown that the echo density of FDNs follows a polynomial function, whereby the polynomial coefficients can be derived from the lengths of the delays for which an explicit method is given. The mixing time of impulse responses can be predicted from the echo density, and conversely, a desired mixing time can be achieved by a derived mean delay length. A Monte Carlo simulation confirms the accuracy of the derived relation of mixing time and delay lengths.
Article
Two words u and v are k-abelian equivalent if, for each word x of length at most k, x occurs equally many times as a factor in both u and v. The notion of k-abelian equivalence is an intermediate notion between the abelian equivalence and the equality of words. In this paper, we study the equivalence classes induced by the k-abelian equivalence, mainly focusing on the cardinalities of the classes. In particular, we are interested in the number of singleton k-abelian classes, i.e., classes containing only one element. We find a connection between the singleton classes and cycle decompositions of the de Bruijn graph. We show that the number of classes of words of length n containing one single element is of order O(nNm(k1)1)\mathcal O(n^{N_m(k-1)-1}), where Nm(l)=1ldlφ(d)ml/dN_m(l) = \tfrac{1}{l}\sum_{d\mid l} \varphi(d)m^{l/d} is the number of necklaces of length l over an m-ary alphabet. We conjecture that the upper bound is sharp. We also remark that, for k even and m=2m = 2, the lower bound Ω(nNm(k1)1)\Omega(n^{N_m(k-1)-1}) follows from an old conjecture on the existence of Gray codes for necklaces of odd length. We verify this conjecture for necklaces of length up to 15.
Article
It is well known that the eigenvalues of any unitary matrix lie on the unit circle. The purpose of this paper is to prove that the eigenvalues of any matrix polynomial, with unitary coefficients, lie inside the annulus A1/2, 2(0):= {z Є C | 1/2 < |z| < 2}. The foundations of this result rely on an operator version of Rouché’s theorem and the intermediate value theorem. © 2015, International Linear Algebra Society. All rights reserved.
Article
This paper introduces a time-variant reverberation algorithm as an extension of the feedback delay network (FDN). By modulating the feedback matrix nearly continuously over time, a complex pattern of concurrent amplitude modulations of the feedback paths evolves. Due to its complexity, the modulation produces less likely perceivable artifacts and the time-variation helps to increase the liveliness of the reverberation tail. A listening test, which has been conducted, confirms that the perceived quality of the reverberation tail can be enhanced by the feedback matrix modulation. In contrast to the prior art time-varying allpass FDNs, it is shown that unitary feedback matrix modulation is guaranteed to be stable. Analytical constraints on the pole locations of the FDN help to describe the modulation effect in depth. Further, techniques and conditions for continuous feedback matrix modulation are presented.
Article
A simple, robust method for measuring echo density from a reverberation impulse response is presented. Based on the property that a reverberant field takes on a Gaussian distribution once an acoustic space is fully mixed, the measure counts samples lying outside a standard deviation in a given impulse response window and normalizes by that expected for Gaussian noise. The measure is insensitive to equalization and reverberation time, and is seen to perform well on both artificial reverberation and measurements of room impulse responses. Listening tests indicate a correlation between echo density measured in this way and perceived temporal quality or texture of the reverberation.