About
14 Publications
1,529 Reads
65 Citations
Publications (14)
In interactive audio services, users can render audio objects freely to suit their preferences, and the spatial audio object coding (SAOC) scheme performs fairly well in terms of both bitrate and audio quality. However, perceptible degradation in audio quality can occur when an object is suppressed or played alone. To complement this, the SAOC sche...
We propose an automatic sound scene control system that uses an image sensor network to keep the sound scene constant regardless of the users' movement. In the proposed system, the image sensor network detects the human location in the multichannel playback environment and the SSC (sound scene control) module automatically controls the s...
This paper proposes a method that efficiently detects the transient component in a multi-channel audio signal and extracts spatial cues accordingly, in order to improve the performance of spatial-cue-based multi-channel audio coding. The proposed transient signal detection algorithm was implemented to use energy as well as the inp...
This paper presents mastering signal processing with a residual coding scheme in spatial audio object coding. The proposed method eliminates the difference between the original down-mix signal and the compensated down-mix signal and thereby enhances the sound quality. Experimental results show that the proposed method can greatly impr...
Interactive audio services (IASs) usually provide users with audio editing functionality, so that users can render their own sounds according to their preferences. For IASs, spatial audio object coding (SAOC) is an appropriate multichannel coding tool that satisfies most of the required functionalities at a relatively low bit rate. Nevertheless, the S...
MPEG spatial audio object coding (SAOC) is a new audio coding standard that efficiently represents various audio objects as a down-mix signal and spatial parameters. Through the down-mix signal, MPEG SAOC is backward compatible with existing playback systems. If a mastering signal is used for providing CD-like sound quality instead of the down-mi...
An interactive audio service is a new conceptual audio service that provides users with opportunities for a variety of experiences with alternative and advanced audio services. In the interactive audio service, users can freely control various audio objects to make their own audio sounds. Spatial audio object coding (SAOC) is a useful technology t...
In this paper, a modified SAOC (Spatial Audio Object Coding) scheme with a harmonic elimination structure is proposed. The proposed structure uses the harmonic information of the vocal object to remove that object effectively and improves the quality of the vocal-removed sound. Subjective and objective evaluation results show the proposed scheme is superior t...
An interactive audio service provides audio editing functionality to users. In the service, users can control the desired audio objects to make their own audio sound using a spatial audio object coding (SAOC) scheme. However, the vocal object cannot be removed perfectly from the down-mix signal in the Karaoke mode of SAOC. Thus, in this paper, a...
Spatial Audio Object Coding (SAOC) handles a number of audio objects to provide a user with active audio services. It represents all objects as a stereo downmixed signal with some side information, so the bitrate can be significantly reduced compared to that of conventional audio coders. In spite of the advantage of bitrate reduction, the SAOC...
The channel level difference (CLD) is a main parameter in the reference model 0 (RM0) for MPEG Surround. Nevertheless, the CLD quantization method in the RM0 has problems such as the lack of theoretical background and inappropriate quantization levels. In this letter, a new CLD quantization method is proposed based on the virtual source location info...
Recent research in speech synthesis has focused mainly on naturalness, and emotional speech synthesis has become one of the highlighted research topics. Although many studies have addressed emotional speech in English or Japanese, studies in Korean are seldom found. This paper presents an analysis of emotional speech in Ko...