Woon-Seng Gan

Woon-Seng Gan
Nanyang Technological University | ntu · School of Electrical and Electronic Engineering

PhD

About

396
Publications
132,623
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
4,357
Citations
Additional affiliations
January 1997 - December 2012
Nanyang Technological University
May 1993 - present
Nanyang Technological University
Position
  • Professor (Full)
Description
  • Spatial Audio, Active Noise Control, and Digital Signal Processing
January 1991 - April 1993
University of Strathclyde
Position
  • Researcher

Publications

Publications (396)
Article
Traditional decibel-based measures in predicting annoyance from construction activities are limited to reflect high acoustic variability of construction machinery noises. Hence, a multidimensional approach based on perceptual attributes and psychoacoustic parameters is proposed. In-situ audio-visual recordings of 16 construction machinery in operat...
Presentation
Full-text available
Presentation on assessing the open-circuit voltage calibration procedure for headphone reproduction of acoustic environments. This presentation accompanies the paper: https://doi.org/10.48550/arXiv.2205.04728
Preprint
Full-text available
To increase the availability and adoption of the soundscape standard, a low-cost calibration procedure for reproduction of audio stimuli over headphones was proposed as part of the global ``Soundscape Attributes Translation Project'' (SATP) for validating ISO/TS~12913-2:2018 perceived affective quality (PAQ) attribute translations. A previous preli...
Preprint
Full-text available
Studies involving soundscape perception often exclude participants with hearing loss to prevent impaired perception from affecting experimental results. Participants are typically screened with pure tone audiometry, the "gold standard" for identifying and quantifying hearing loss at specific frequencies, and excluded if a study-dependent threshold...
Preprint
Full-text available
Studies involving soundscape perception often exclude participants with hearing loss to prevent impaired perception from affecting experimental results. Participants are typically screened with pure tone audiometry, the "gold standard" for identifying and quantifying hearing loss at specific frequencies, and excluded if a study-dependent threshold...
Preprint
Choosing optimal maskers for existing soundscapes to effect a desired perceptual change via soundscape augmentation is non-trivial due to extensive varieties of maskers and a dearth of benchmark datasets with which to compare and develop soundscape augmentation models. To address this problem, we make publicly available the ARAUS (Affective Respons...
Article
Although hospitals are notorious for poor acoustics, the acoustic environment is usually evaluated in silos, or in conjunction with few indoor environmental quality (IEQ) factors. With only anecdotcal evidence, it is important to first establish a holistic baseline perception of the hospital acoustic environment before commissioning costly measurem...
Article
Full-text available
The ecological validity of soundscape studies usually rests on the choice of soundscapes that are representative of the perceptual space under investigation. For example, a soundscape pleasantness study might investigate locations with soundscapes ranging from “pleasant” to “annoying”. The choice of soundscapes is typically researcher led, but a pa...
Preprint
Full-text available
Convolutional recurrent networks (CRN) integrating a convolutional encoder-decoder (CED) structure and a recurrent structure have achieved promising performance for monaural speech enhancement. However, feature representation across frequency context is highly constrained due to limited receptive fields in the convolutions of CED. In this paper, we...
Preprint
Full-text available
Translation of perceptual descriptors such as the perceived affective quality attributes in the sound-scape standard (ISO/TS 12913-2:2018) is an inherently intricate task, especially if the target language is used in multiple countries. Despite geographical proximity and a shared language of Bahasa Melayu (Standard Malay), differences in culture an...
Preprint
Full-text available
The ecological validity of soundscape studies usually rests on a choice of soundscapes that are representative of the perceptual space under investigation. For example, a soundscape pleasantness study might investigate locations with soundscapes ranging from "pleasant" to "annoying". The choice of soundscapes is typically researcher-led, but a part...
Conference Paper
Full-text available
A major issue in open-domain dialogue generation is the agent's tendency to generate repetitive and generic responses. The lack in response diversity has been addressed in recent years via the use of latent variable models, such as the Conditional Variational Auto-Encoder (CVAE), which typically involve learning a latent Gaussian distribution over...
Preprint
Full-text available
The introduction of ISO 12913-2:2018 has provided a framework for standardized data collection and reporting procedures for soundscape practitioners. A strong emphasis was placed on the use of calibrated head and torso simulators (HATS) for binaural audio capture to obtain an accurate subjective impression and acoustic measure of the soundscape und...
Article
Augmented or mixed reality (AR/MR) is emerging as one of the key technologies in the future of computing. Audio cues are critical for maintaining a high degree of realism, social connection, and spatial awareness for various AR/MR applications, such as education and training, gaming, remote work, and virtual social gatherings to transport the user...
Conference Paper
Full-text available
Soundscape augmentation is an emerging approach for noise mitigation by introducing additional sounds known as "maskers" to increase acoustic comfort. Traditionally, the choice of maskers is often predicated on expert guidance or post-hoc analysis which can be time-consuming and sometimes arbitrary. Moreover, this often results in a static set of m...
Preprint
Full-text available
The selection of maskers and playback gain levels in a soundscape augmentation system is crucial to its effectiveness in improving the overall acoustic comfort of a given environment. Traditionally, the selection of appropriate maskers and gain levels has been informed by expert opinion, which may not representative of the target population, or by...
Preprint
Full-text available
Soundscape augmentation is an emerging approach for noise mitigation by introducing additional sounds known as "maskers" to increase acoustic comfort. Traditionally, the choice of maskers is often predicated on expert guidance or post-hoc analysis which can be time-consuming and sometimes arbitrary. Moreover, this often results in a static set of m...
Preprint
Full-text available
Although hospitals are notorious for poor acoustics, the acoustic environment is usually evaluated in silos, without considering other indoor environmental quality (IEQ) factors. In the absence of qualitative evidence, it is important to first establish a holistic baseline perception of the hospital acoustic environment before commissioning costly...
Conference Paper
Soundscape augmentation, which involves the addition of sounds known as "maskers" to a given soundscape, is a human-centric urban noise mitigation measure aimed at improving the overall sound-scape quality. However, the choice of maskers is often predicated on laborious processes and is inflexible to the time-varying nature of real-world soundscape...
Preprint
Full-text available
Translation of perceptual soundscape attributes from one language to another remains a challenging task that requires a high degree of fidelity in both psychoacoustic and psycholinguistic senses across the target population. Due to the inherently subjective nature of human perception, translating soundscape attributes using only small focus group d...
Patent
Full-text available
The disclosure concerns a system for actively cancelling an acoustic noise signal entering through an aperture of a room. The system includes a sensor to detect the acoustic noise signal and convert it into an electronic noise signal; a plurality of transducers to generate an acoustic anti-noise signal from an electronic anti-noise signal to cancel...
Article
Full-text available
Active Noise Control (ANC) headphones are commonly employed to create a quiet zone around the ears of users. In conventional ANC technique, the ambient noise is picked up by the reference microphones on the earcups of ANC headphones then relayed to the ANC controller, which generates anti-noise to suppress it. In wireless ANC system, the reference...
Preprint
Full-text available
In recent years, latent variable models, such as the Conditional Variational Auto Encoder (CVAE), have been applied to both personalized and empathetic dialogue generation. Prior work have largely focused on generating diverse dialogue responses that exhibit persona consistency and empa-thy. However, when it comes to the contextual coherence of the...
Conference Paper
Full-text available
Conventional approaches to personalized dialogue generation typically require a large corpus , as well as predefined persona information. However, in a real-world setting, neither a large corpus of training data nor persona information are readily available. To address these practical limitations, we propose a novel multi-task meta-learning approac...
Technical Report
Full-text available
Controllability, despite being a much-desired property of a generative model, remains an ill-defined concept that is difficult to measure. In the context of neural music generation, a controllable system often implies an intuitive interaction between human agents and the neural model, allowing the relatively opaque neural model to be controlled by...
Conference Paper
Full-text available
Sound event localization and detection (SELD) is an emerging research topic that aims to unify the tasks of sound event detection and direction-of-arrival estimation. As a result, SELD inherits the challenges of both tasks, such as noise, reverberation, interference, polyphony, and non-stationarity of sound sources. Furthermore, SELD often faces an...
Preprint
Full-text available
Polyphonic sound event localization and detection (SELD) has many practical applications in acoustic sensing and monitoring. However, the development of real-time SELD has been limited by the demanding computational requirement of most recent SELD systems. In this work, we introduce SALSA-Lite, a fast and effective feature for polyphonic SELD using...
Article
Full-text available
The minimum output variance filtered reference least mean square (MOV-FxLMS algorithm is a effective algorithm that utilizes the penalty mechanism to help the active noise control (ANC system achieve noise cancellation with constrained output variance or power. As it can constrain output power, the MOV-FxLMS algorithm can freely determine the ANC s...
Preprint
Full-text available
Construction workers and residents live near around construction sites are exposed to noises that might cause hearing loss, high blood pressure, heart disease, sleep disturbance and stress. Regulations has been carried out by national governments to limit the maximum permissible noise levels for construction works. A four-channel active noise contr...
Preprint
Full-text available
This paper introduces SINGA:PURA, a strongly labelled polyphonic urban sound dataset with spatiotemporal context. The data were collected via several recording units deployed across Singapore as a part of a wireless acoustic sensor network. These recordings were made as part of a project to identify and mitigate noise sources in Singapore, but also...
Conference Paper
Full-text available
This study aims to assess objective and subjective aspects of equipment and operational noises on construction sites based on a laboratory experiment. Sixteen audiovisual recordings of machines on construction sites were used as stimuli. In total, 53 participants took part in the laboratory experiments. The participants assessed noises of construct...
Preprint
Full-text available
Echo and noise suppression is an integral part of a full-duplex communication system. Many recent acoustic echo cancellation (AEC) systems rely on a separate adaptive filtering module for linear echo suppression and a neural module for residual echo suppression. However, not only do adaptive filtering modules require convergence and remain suscepti...
Preprint
Full-text available
Sound event localization and detection (SELD) consists of two subtasks, which are sound event detection and direction-of-arrival estimation. While sound event detection mainly relies on time-frequency patterns to distinguish different sound classes, direction-of-arrival estimation uses amplitude and/or phase differences between microphones to estim...
Article
Full-text available
Active noise control (ANC) technology is increasingly ubiquitous in wearable audio devices, or hearables. Owing to its low computational complexity, high robustness, and exemplary performance in dealing with dynamic noise, the fixed-coefficient control filter strategy plays a central role in portable ANC implementation. Unlike its traditional adapt...
Conference Paper
Full-text available
This paper introduces SINGA:PURA, a strongly labelled polyphonic urban sound dataset with spatiotemporal context. The data were collected via several recording units deployed across Singapore as a part of a wireless acoustic sensor network. These recordings were made as part of a project to identify and mitigate noise sources in Singapore, but also...
Article
In real active noise control system implementation, the arrangement of secondary sources and error microphones have significant effect on the performance of the system. Analytical and experimental ways are usually combined to determine the best system layout. In this paper, we use COSMOL Multiphysics to accurately model the acoustic environment in...
Article
Active noise control (ANC) headphone is widely used to attenuate the noise around human' s ear. The microphone mounted on the conventional ANC headphones collected the mixed reference signals when more than one noise sources are often present in the surrounding. In this case, the uncorrelated noise sources involved in the mixed reference usually de...
Conference Paper
Full-text available
Studies involving subjective evaluation require feedback from human participants to assess the performance of a system or an environment. A participant is typically presented with a set of metrics to be observed and they present their assessment accordingly. Investigator-led in-situ soundscape evaluation in ISO 12913-2 collects perceptual responses...
Conference Paper
Full-text available
Acoustic parameters obtained from calibrated acoustic equipment are part of the minimum sound-scape reporting requirements as stated in Annex A of ISO 12913-2. To dynamically monitor the acoustic environment of a large area, a large network of acoustic sensors could be deployed, albeit at significant cost. Micro-Electro-Mechanical Systems (MEMS) mi...
Preprint
Full-text available
Sound event localization and detection (SELD) is an emerging research topic that aims to unify the tasks of sound event detection and direction-of-arrival estimation. As a result, SELD inherits the challenges of both tasks, such as noise, reverberation, interference, polyphony, and non-stationarity of sound sources. Furthermore, SELD often faces an...
Preprint
Full-text available
The Sørensen-Dice Coefficient has recently seen rising popularity as a loss function (also known as Dice loss) due to its robustness in tasks where the number of negative samples significantly exceeds that of positive samples, such as semantic segmentation, natural language processing, and sound event detection. Conventional training of polyphonic...
Research Proposal
Full-text available
This is a pre-registration of a scoping review based on the PRISMA Extension for Scoping Reviews (PRISMA-ScR). This scoping review identifies, classifies and examines active control strategies employed to reduce the transmission of airborne noise from heavy or auxiliary equipment used in the construction sector.
Article
Full-text available
Cognitive fatigue is a psychological state characterised by feelings of tiredness and impaired cognitive functioning arising from high cognitive demands. This paper examines the recent research progress on the assessment of cognitive fatigue and provides informed recommendations for future research. Traditionally, cognitive fatigue is introspective...
Article
Active noise control (ANC) is gaining credence as an effective approach in mitigating low-frequency urban noise. Current ANC algorithms that attenuate noise across the full audio frequency band, often exert control effort excessively at higher frequencies. Moreover, such unrestrained control unavoidably attenuates some critical sounds, such as alar...
Article
Full-text available
Urban noise pollution is an omnipresent but often neglected threat to public health that must be addressed urgently. Passive noise control measures, which are less effective at reducing low-frequency noise and are often bulky and may impede airflow. As evidenced in automobiles, active control of cabin noise has resulted in lighter cars due to reduc...
Article
Full-text available
Multichannel active noise control (MCANC) is widely regarded as an effective solution to achieve a significantly large noise-cancellation area in a complicated acoustic field. However, the computational complexity of MCANC algorithms, such as the multichannel filter-x least mean square (McFxLMS) algorithm, grows exponentially with an increased chan...
Article
Full-text available
With the advent of efficient low-cost processors and electroacoustic components, there is renewed interest in the practical implementation of active noise control (ANC). However, the slow convergence of conventional adaptive algorithms deployed in ANC restricts its handling of typical amplitude-varying noise. Hence, we proposed a modified model-agn...
Article
The feedforward active noise control (FF ANC) technique has been widely used to cancel the broadband noise in many practical applications. However, it fails to cope with the uncorrelated narrow-band disturbance picked up by the error sensor, which is independent of the reference signal picked up by the reference sensor. Hence, the alternative switc...
Preprint
Full-text available
Polyphonic sound event detection and localization (SELD) task is challenging because it is difficult to jointly optimize sound event detection (SED) and direction-of-arrival (DOA) estimation in the same network. We propose a general network architecture for SELD in which the SELD network comprises sub-networks that are pretrained to solve SED and D...
Article
Full-text available
The push for greater urban sustainability has increased the urgency of the search for noise mitigation solutions that allow for natural ventilation into buildings. Although a viable active noise control (ANC) solution with up to 10 dB of global attenuation between 100 Hz and 1000 Hz was previously developed for an open window, it had limited low-fr...
Article
Before introducing natural sounds to potentially improve the soundscape quality, it is important to understand how key contextual factors (i.e. expected activities and audio-visual congruency) affect the soundscape in a given location. In this study, the perception of eight natural sounds (i.e. 4 birdsongs, 4 water sounds) at five urban recreationa...
Data
Presentation slides of the paper titled: "Virtual, augmented and mixed reality tools in soundscape evaluation and design", at Internoise 2020 e-Congress
Data
Video presentation of the paper titled: "Virtual, augmented and mixed reality tools in soundscape evaluation and design", at Internoise 2020 e-Congress
Article
Many signal processing-based methods for sound source direction-of-arrival estimation produce a spatial pseudo-spectrum of which the local maxima strongly indicate the source directions. Due to different levels of noise, reverberation and different number of overlapping sources, the spatial pseudo-spectra are noisy even after smoothing. In addition...