Kwangki Kim

Korea Nazarene University | KORNU · Department of Digital Contents

14 publications · 1,529 reads · 65 citations

Publications (14)
Article
In interactive audio services, users can render audio objects freely to suit their preferences, and the spatial audio object coding (SAOC) scheme performs well in terms of both bitrate and audio quality. However, perceptible audio quality degradation can occur when an object is suppressed or played alone. To complement this, the SAOC sche...
Article
We propose an automatic sound scene control system that uses an image sensor network to preserve a constant sound scene regardless of the users’ movement. In the proposed system, the image sensor network detects the human location in the multichannel playback environment and the SSC (sound scene control) module automatically controls the s...
Conference Paper
This paper proposes a method that efficiently detects transient components in a multi-channel audio signal and extracts spatial cues accordingly, in order to improve the performance of spatial-cue-based multi-channel audio coding. The proposed transient signal detection algorithm was implemented to use energy as well as the inp...
Conference Paper
This paper presents mastering signal processing with a residual coding scheme in spatial audio object coding. The proposed method can eliminate the difference between the original down-mix signal and the compensated down-mix signal and thereby enhance the sound quality. Experimental results show that the proposed method can greatly impr...
Article
Interactive audio services (IASs) usually provide users with audio editing functionality so that users can render their own sounds according to their preferences. For IASs, spatial audio object coding (SAOC) is an appropriate multichannel coding tool that satisfies most of the required functionalities at a relatively low bit rate. Nevertheless, the S...
Article
MPEG spatial audio object coding (SAOC) is a new audio coding standard that efficiently represents various audio objects as a down-mix signal and spatial parameters. Through the down-mix signal, MPEG SAOC is backward compatible with existing playback systems. If a mastering signal is used for providing CD-like sound quality instead of the down-mi...
Article
An interactive audio service is a new conceptual audio service that provides the users with opportunities for a variety of experiences on the alternative and advanced audio services. In the interactive audio service, users can freely control various audio objects to make their own audio sounds. A spatial audio object coding (SAOC) is a useful technology t...
Conference Paper
In this paper, a modified SAOC (Spatial Audio Object Coding) scheme with a harmonic elimination structure is proposed. The proposed structure uses the harmonic information of the vocal object to remove that object effectively, improving the quality of the vocal-removed sound. Subjective and objective evaluation results show the proposed scheme is superior t...
Conference Paper
An interactive audio service provides audio editing functionality to users. In the service, users can control the desired audio objects to make their own audio sound using a spatial audio object coding (SAOC) scheme. However, the vocal object cannot be removed perfectly from the down-mix signal in the Karaoke mode of SAOC. Thus, in this paper, a...
Conference Paper
Spatial Audio Object Coding (SAOC) handles a number of audio objects to provide a user with active audio services. It represents all objects as a stereo downmixed signal with some side information, and the bitrate can be significantly reduced compared to that of the conventional audio coders. In spite of the advantage of bitrate reduction, the SAOC...
Article
The channel level difference (CLD) is a main parameter in the reference model 0 (RM0) for MPEG Surround. Nevertheless, the CLD quantization method in the RM0 has problems such as the lack of theoretical background and inappropriate quantization levels. In this letter, a new CLD quantization method is proposed based on the virtual source location info...
Conference Paper
Recent research in speech synthesis has mainly focused on naturalness, and emotional speech synthesis has become one of the highlighted research topics. Although quite a few studies on emotional speech in English or Japanese have been reported, studies in Korean are seldom found. This paper presents an analysis of emotional speech in Ko...
