Johannes M. Arend

Johannes M. Arend
Technische Universität Berlin | TUB · Institut für Sprache und Kommunikation

Dr. rer. nat.

About

74
Publications
15,529
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
491
Citations
Citations since 2016
72 Research Items
485 Citations
2016201720182019202020212022050100150
2016201720182019202020212022050100150
2016201720182019202020212022050100150
2016201720182019202020212022050100150
Introduction
Johannes M. Arend currently works at the Audio Communication Group, Technical University of Berlin. Johannes does research in Spatial Audio.

Publications

Publications (74)
Article
Full-text available
High-quality spatial audio reproduction over headphones requires head-related transfer functions (HRTFs) with high spatial resolution. However, acquiring datasets with a large number of (individual) HRTFs is not always possible, and using large datasets can be problematic for real-time applications with limited resources. Consequently, interpolatio...
Article
Full-text available
Parametric spatial audio rendering is a popular approach for low computing capacity applications, such as augmented reality systems. However most methods rely on spatial room impulse responses (SRIR) for sound field rendering with 3 degrees of freedom (DoF), i.e., for arbitrary head orientations of the listener, and often require multiple SRIRs for...
Article
Full-text available
It is commonly believed that near-field head-related transfer functions (HRTFs) provide perceptual benefits over far-field HRTFs that enhance the plausibility of binaural rendering of nearby sound sources. However, to the best of our knowledge, no study has systematically investigated whether using near-field HRTFs actually provides a perceptually...
Article
Full-text available
Conventional individual head-related transfer function (HRTF) measurements are demanding in terms of measurement time and equipment. For more flexibility, free body movement (FBM) measurement systems provide an easy-to-use way to measure full-spherical HRTF datasets with less effort. However, having no fixed measurement installation implies that th...
Conference Paper
Full-text available
The spatial processing disorder (SPD) is an auditory processing disorder characterized as a reduced ability to use binaural cues to localize, segregate, and group sound sources in the space selectively. It affects up to 5% of the school-aged population, impairing the normal development of their listening, learning, and communication skills. Sound s...
Conference Paper
Full-text available
While having normal hearing thresholds, patients with spatial processing disorder (SPD) suffer from a reduced ability to use binaural cues to achieve spatial release from masking (SRM). As a result, their ability to localize sound sources and to follow and understand speech in acoustically challenging situations is impaired. There is a high preval...
Thesis
Binaural rendering aims to immerse the listener in a virtual acoustic scene, making it an essential method for spatial audio reproduction in virtual or augmented reality (VR/AR) applications. The growing interest and research in VR/AR solutions yielded many different methods for the binaural rendering of virtual acoustic realities, yet all of them...
Conference Paper
Full-text available
The human voice directivity is highly dynamic, with rapid changes between different phonemes. Even though the human voice directivity has been the subject of various studies, the perceptual role of these dynamic changes is still quite unexplored. We present a first analysis and visualization of human voice directivity with its time-variant charact...
Data
This video visualizes dynamic voice directivity for one spoken sentence showing how the polar plots and the directicity index change over time.
Data
The datasets contain measured directivity patterns in SOFA format. The directivity patterns were measured in the anechoic chamber of TH Köln applying a surrounding spherical microphone array, which has a diameter of 2m and a shape of a pentakis dodecahedron with 32 cardioid microphones (Rode NT5) located at the vertices.
Article
Full-text available
While speaking, hand postures, such as holding a hand in front of the mouth or cupping the hands around the mouth, influence human voice directivity. This study presents and analyzes spherical voice directivity datasets of an articulated [a] with and without hand postures. The datasets were determined from measurements with 13 subjects in a surroun...
Article
Full-text available
Dynamic binaural synthesis requires binaural room impulse responses (BRIRs) for each head orientation of the listener. Such BRIRs can either be measured with a dummy head or calculated from the spherical microphone array (SMA) data. Because the dense dummy head measurements require enormous effort, alternatively sparse measurements can be performed...
Article
Zusammenfassung: Eine Reihe wissenschaftlicher Publikationen legt nahe, dass die Lästigkeit von Windenergieanlagen durch Amplitudenmodulationen der Anlagengeräusche beeinflusst wird. In dieser Studie wurde ein Hörversuch mit 79 Probanden aus belästigten und nicht-belästigten Gruppen durchgeführt, die die Lästigkeit in Abhängigkeit von variierenden...
Article
Full-text available
High-quality rendering of spatial sound fields in real-time is becoming increasingly important with the steadily growing interest in virtual and augmented reality technologies. Typically, a spherical microphone array (SMA) is used to capture a spatial sound field. The captured sound field can be reproduced over headphones in real-time using binaura...
Article
Full-text available
Full text at https://www.dega-akustik.de/fileadmin/dega-akustik.de/publikationen/akustik-journal/21-03/akustik_journal_2021_03_online_artikel2.pdf Abstract: Wusch, Wusch, Wusch.“ Die Geräusche von Windenergieanlagen werden von Anwohnenden in der Regel als ein „Wuschen“ oder als ein an- und absteigendes Rauschen bezeichnet. Ursache dafür: Ähnlich...
Conference Paper
Full-text available
Open-source database download: https://doi.org/10.5281/zenodo.5031335
Conference Paper
Full-text available
The use of individual head-related transfer functions (HRTFs) can improve the quality of binaural reproduction in virtual acoustic environments. However, measuring individual HRTFs for a large number of directions is usually challenging in terms of time, effort, hardware, and setup requirements. This work presents an HRTF measurement system that ca...
Conference Paper
Full-text available
Current literature suggests that annoyance of wind turbine noise is strongly affected by amplitude modulations (AM). A survey was carried out at five German residential study sites near wind turbines with a total of about 500 residents to study the effects of AM in more detail. This manuscript reports preliminary results of the first three areas ob...
Conference Paper
Full-text available
The Interactive Virtual Environment System (IVES) is a toolkit aiding the production of immersive audiovisual 3D virtual environments for screen-based or virtual reality (VR) applications with loudspeaker-or headphone-based spatial audio reproduction. It is developed within Cycling 74s Max programming environment and consists of a set of interface-...
Data
Supplementary materials for "Investigating phoneme-dependencies of spherical voice directivity patterns" The Journal of the Acoustical Society of America, 149(6) pp. 4553-4564 (2021); https://doi.org/10.1121/10.0005401. The dataset contains measured directivities in SOFA format according to the SOFA convention "GeneralTF". The directivities were...
Article
Dynamic directivity is a specific characteristic of the human voice, showing time-dependent variations while speaking or singing. To study and model the human voice's articulation-dependencies and provide datasets that can be applied in virtual acoustic environments, full-spherical voice directivity measurements were carried out for 13 persons whil...
Article
Full-text available
Nearby sound sources provide distinct binaural cues, mainly in the form of interaural level differences, which vary with respect to distance and azimuth. However, there is a long-standing controversy regarding whether humans can actually utilize binaural cues for distance estimation of nearby sources. Therefore, we conducted three experiments using...
Article
With the COVID-19 pandemic, the wearing of face masks covering mouth and nose has become ubiquitous all around the world. This study investigates the impact of typical face masks on voice radiation. To analyze the transmission loss caused by masks and the influence of masks on directivity, this study measured the full-spherical voice directivity of...
Data
Supplementary materials for "Impact of face masks on voice radiation" The Journal of the Acoustical Society of America, 148(6) pp. 3663-3670 (2020); https://doi.org/10.1121/10.0002853 . The dataset contains all measured directivities in SOFA format according to the SOFA convention "GeneralFIR". The directivities were measured on a 2702 sampling po...
Conference Paper
Full-text available
The spatial decomposition method (SDM) aims at parameterizing a sound field as a succession of plane waves, allowing the analysis and rendering of multichannel room impulse responses (RIRs). The method was originally developed for the use with open microphone arrays, utilizing time differences of arrival to compute directional estimates. A later ve...
Article
Full-text available
The spatial decomposition method (SDM) can be used to parameterize and reproduce a sound field based on measured multichannel room impulse responses (RIRs). In this paper we propose optimizations of SDM to address the following questions and issues that have recently emerged in the development of the method: (a) accuracy in direction-of-arrival (DO...
Conference Paper
Full-text available
Employing a finite number of discrete microphones, instead of a continuous distribution according to theory, reduces the physical accuracy of sound field representations captured by a spherical microphone array. For a binaural reproduction of the sound field, a number of approaches have been proposed in the literature to mitigate the perceptual imp...
Article
To describe the sound radiation of the human voice into all directions, measurements need to be performed on a spherical grid. However, the resolution of such captured directivity patterns is limited and methods for spatial upsampling are required, for example by interpolation in the spherical harmonics (SH) domain. As the number of measurement dir...
Conference Paper
Spatial upsampling of head-related transfer functions (HRTFs) measured on a sparse grid is an important issue, particularly relevant when capturing individual datasets. While early studies mostly used nearest-neighbor approaches, ongoing research focuses on interpolation in the spherical harmonics (SH) domain. The interpolation can either be perfor...
Conference Paper
Binaural auralization applying binaural room impulse responses (BRIRs) requires a high measurement effort and immense computing capacity when rendered with maximum spatial resolution. In this study, we conducted an adaptive ABX listening test to determine the minimum grid resolution (i.e., the spatial resolution) of BRIRs sufficient to achieve an a...
Article
Full-text available
Spherical microphone arrays (SMAs) are widely used to capture spatial sound fields that can then be rendered in various ways as a virtual acoustic environment (VAE) including head-phone-based binaural synthesis. Several practical limitations have a significant impact on the fidelity of the rendered VAE. The finite number of microphones of SMAs lead...
Article
In the paper “Directional Equalization of Sparse Head-Related Transfer Function Sets for Spatial Upsampling”, published in IEEE/ACM Transactions on Audio, Speech, andLanguage Processing (Volume: 27, Issue: 6, June 2019, DOI: 10.1109/TASLP.2019.2908057) [1],Eq. (2) and Eq. (6) are incorrect. In this Comment Correspondence we provide a correction of...
Conference Paper
Full-text available
The directivity pattern of the human voice plays an important role in the realization of communication scenarios in virtual acoustic environments . In contrast to many other sound sources, a specific characteristic of the human voice is its dynamic directivity, i.e. time-dependent changes in articulation or singing. As it is nearly impossible for s...
Poster
Full-text available
Determining full-spherical individual sets of head-related transfer functions (HRTFs) based on sparse measure- ments is a prerequisite for various applications in virtual acoustics. However, when applying HRTF interpolation in the spatially continuous spherical harmonics (SH) domain, the number of measured HRTFs limits the maximal accessible SH ord...
Conference Paper
Full-text available
Determining full-spherical individual sets of head-related transfer functions (HRTFs) based on sparse measure- ments is a prerequisite for various applications in virtual acoustics. However, when applying HRTF interpolation in the spatially continuous spherical harmonics (SH) domain, the number of measured HRTFs limits the maximal accessible SH ord...
Conference Paper
Full-text available
We present spherical high-density measurement data of head-related transfer functions (HRTFs) and analyze the influence of wearing headgear during the measurements. For this we captured datasets from a Neumann KU100 and a HEAD acoustics HMS II.3 dummy head either equipped with a bicycle helmet, a baseball cap, an Oculus Rift head- mounted display,...
Conference Paper
Full-text available
Many immersive audio applications rely on a dense set of head-related transfer functions (HRTFs). However, often only measurements on a specific sparse grid are available. To obtain dense HRTF sets from sparse mea- surements, one common approach is to apply spatial interpolation in the spherical harmonics (SH) domain. However, the SH representation...
Conference Paper
Full-text available
Determining full-spherical individual sets of head-related transfer functions (HRTFs) based on sparse measurements is a prerequisite for various applications in virtual acoustics. To obtain dense sets from sparse measurements, spatial upsampling of sparse HRTF sets in the spatially continuous spherical harmonics (SH) domain can be performed by an i...
Conference Paper
Full-text available
HRTFs are virtually independent of sound source distance in the far field, but vary significantly in the near field. The change of an HRTF when a sound source shifts in distance can be described by a distance variation function (DVF). To synthesize near-field HRTFs, it is a common method to apply DVFs to far-field HRTFs. In this study, we present a...
Conference Paper
Full-text available
Directivity patterns of human speakers are required for various applica- tions in virtual acoustics. They can either be measured sequentially for an arbitrary number of directions or simultaneously using a surrounding microphone array. In the latter, the resolution of the directivity pattern is limited by the number of array microphones, and approp...
Conference Paper
Full-text available
The increased research in spatial audio processing and technical achie- vements in mixed reality systems created a demand for appropriate mi- crophone arrays to record and reproduce immersive spatial audio. Cur- rent findings in research show that high order spherical microphone ar- rays can be a good choice for an adequate reproduction, but unfort...
Conference Paper
Full-text available
Research on virtual acoustic environments (VAEs) has gained more and more interest in the last decades. In this context, VAEs have become an accepted research tool for performing psychoacoustic experiments. However, such experiments were often carried out under varying con- ditions. There were differences in the VAE systems, in the method of report...
Article
Acquiring decent full-spherical sets of head-related transfer functions (HRTFs) based on a small number of measurements is highly desirable. For spatial upsampling, HRTF interpolation in the spatially continuous spherical harmonics (SH) domain is a common approach. However, the number of measured HRTFs limits the assessable SH order, resulting in o...
Conference Paper
Reactive virtual acoustic environments (VAEs) that respond to any user-generated sound with an appropriate acoustic room response enable immersive audio applications with enhanced sonic interaction between the user and the VAE. This paper presents a reactive VAE that has two clear advantages in comparison to other systems introduced so far: it gene...
Conference Paper
The paper describes a method for obtaining spherical sets of head-related transfer functions (HRTFs) based on a small number of measurements in reverberant environments. For spatial upsampling, we apply HRTF interpolation in the spherical harmonics (SH) domain. However, the number of measured directions limits the maximal accessible SH order, resul...
Conference Paper
Full-text available
In this paper, we analyze different auditory distance cues in dynamic binaural synthesis. We compare the contributions of sound intensity, direct-to-reverberant ratio (DRR), and near-field cues. For the auralization, we use the BinRIR method, which allows to generate binaural room impulse responses (BRIRs) for dynamic binaural synthesis based on on...
Poster
Full-text available
In the ongoing research project NarDasS, we developed a system for binaural reproduction of self-generated sound in virtual acoustic environments (VAEs). A 32-channel surrounding spherical microphone array captures the direction-dependent sound of an acting user. Based on these captured signals and specific BRIRs (BRIR - Binaural Room Impulse Respo...
Conference Paper
Full-text available
In the ongoing research project NarDasS, we developed a system for binaural reproduction of self-generated sound in virtual acoustic environments (VAEs). A 32-channel surrounding spherical microphone array captures the direction-dependent sound of an acting user. Based on these captured signals and specific BRIRs (BRIR - Binaural Room Impulse Respo...
Conference Paper
Full-text available
The auralization of rooms with dynamic binaural synthesis using binaural room impulse responses (BRIRs) is an established approach in virtual audio. The BRIRs can either be obtained by simulations or by measurements. Up to now changed acoustical properties, as they occur when a room is altered in a renovation, cannot easily be considered in a measu...
Conference Paper
Full-text available
User interaction with the virtual acoustic environment (VAE) has become of increasing interest in research. However, so far little attention has been paid to interaction by means of self-generated sound, like, for example, the own voice, even though this would open up new possibilities for natural interaction. Moreover, there is evidence that adequ...
Conference Paper
Full-text available
Binaural room impulse responses (BRIRs) are often applied in spatial audio for the auralization of acoustical environments. In the same field of research, parametric audio coding is an established approach and part of different standards. The presented investigation aims for a parametric description of the sound field to synthesize BRIRs. The model...
Conference Paper
Full-text available
The auralization of acoustic environments over headphones is often realized with data-based dynamic binaural synthesis. The required binaural room impulse responses (BRIRs) for the convo-lution process can be acquired by performing measurements with an artificial head for different head orientations and positions. This procedure is rather costly an...
Poster
Full-text available
Presented at the 20th International Conference on Digital Audio Effects (DAFx-17), Edinburgh, UK
Poster
Full-text available
presented at the 173rd Meeting of Acoustical Society of America
Poster
Full-text available
Presented at the 173rd Meeting of the Acoustical Society of America and the 8th Forum Acusticum
Conference Paper
Head-related transfer functions (HRTFs) describe the directional filtering caused by the head, pinna, and torso and are an essential component of binaural synthesis systems. Currently most of these systems are based on far-field HRTFs and thus do not consider acoustical specifics of nearby sound sources. One reason might be that full spherical near...
Conference Paper
Binaural synthesis is often applied in the field of spatial audio to create a virtual acoustic environment using binaural room impulse responses (BRIRs). In the same area of research, spherical microphone arrays are gaining importance and allow for a spatio-temporal analysis. We present a new approach to describe the acoustical environment by a par...
Article
Virtual acoustics aims to immerse the user in a virtual acoustic environment (VAE). However, most VAE systems do not feed self-generated sound back into the virtual room, even though there is evidence that adequate reproduction of self-generated sound affects the user’s perception and might even enhance immersion. Thus, if at all possible, sonic in...
Article
Binaural room impulse responses(BRIRs) are often applied in spatial audio for the auralization of acoustical environments. In the same field of research, parametric audio coding is an established approach and part of different standards. The presented investigation aims for a parametric description of the sound field in order to synthesize BRIRs fo...
Conference Paper
Full-text available
In the field of virtual audio, measured binaural room impulse responses (BRIRs) are used for the auralization of acoustical environments applying binaural synthesis. Depending on the application, it can be desirable to scale down the resolution of the BRIRs and thus reduce computational effort and the amount of data. In this paper, a new approach f...
Conference Paper
Full-text available
Dynamic binaural synthesis serves as a powerful tool in the fields of psychoacoustic research, virtual environments or architectural acoustics. However, most systems are based on far-field HRTFs and therefore ignore the acoustical specifics of sound sources in the near-field. For nearby sound sources, ILDs, ITDs and spectral properties vary signifi...
Conference Paper
Full-text available
The aim of virtual acoustics is to immerse the user in a virtual acoustic environment (VAE). Usually, the user is only a passive listener in a pre-defined scene and, depending on the quality of the system, has the possibility to move within this scene or to change its acoustic properties. However, most of these systems cannot feed self-generated so...
Conference Paper
Full-text available
The perceptual refinement of dynamic binaural synthesis has been subject to research for the past years. The basic principle relies on head-related transfer functions (HRTFs), which describe the directional filtering caused by the head, pinna, and torso. However, most systems are based on far-field HRTFs and therefore ignore the acoustical specific...
Conference Paper
Auralizing rooms with data-based dynamic binaural synthesis is an established approach in virtual acoustics. Generally measured binaural room impulse responses (BRIRs) are used to create a virtual acoustic environment (VAE) over headphones. Depending on the application, it is desirable to reduce the amount of data by decreasing the resolution of th...
Conference Paper
Digital audio watermarking is an extensive research area, currently with a strong focus on improving established watermarking schemes for conventional audio content. At the same time, headphone-based virtual acoustics, as part of virtual or augmented reality (VR/AR) technologies, takes on greater significance, and therefore methods are needed to pr...