A self-generator method for initial filters of SIMO-ICA applied to blind separation of binaural sound mixtures
Graduate School of Information Science, Nara Institute of Science and Technology, Ikuma, Nara, Japan
DOI: 10.1109/ASPAA.2005.1540156 Conference: Applications of Signal Processing to Audio and Acoustics, 2005. IEEE Workshop on
Blind separation of binaural mixed sounds using single-input multiple-output (SIMO)-model-based independent component analysis (SIMO-ICA) with self-generator for initial filter (SIMO-ICA-SG) is now being studied by the authors. This method contains frequency-domain ICA (FDICA-PB), single-talk detection, direction of arrival (DOA) estimation, head related transfer function (HRTF) matrix bank, and SIMO-ICA. This paper describes robustness of SIMO-ICA-SG against the mismatch of HRTF matrix bank. To evaluate it, the sound decomposition experiments are carried out under the real acoustic conditions. The experimental results reveal that the decomposition performance of the proposed method with mismatched HRTF matrix bank is superior to those of the conventional methods, and almost the same as those of the proposed method with matched one
Available from: Kiyohiro Shikano
[Show abstract] [Hide abstract]
ABSTRACT: A new two-stage blind source separation (BSS) method for convolutive mixtures of speech is proposed, in which a single-input multiple-output (SIMO)-model-based independent component analysis (ICA) and a new SIMO-model-based binary masking are combined. SIMO-model-based ICA enables us to separate the mixed signals, not into monaural source signals but into SIMO-model-based signals from independent sources in their original form at the microphones. Thus, the separated signals of SIMO-model-based ICA can maintain the spatial qualities of each sound source. Owing to this attractive property, our novel SIMO-model-based binary masking can be applied to efficiently remove the residual interference components after SIMO-model-based ICA. The experimental results reveal that the separation performance can be considerably improved by the proposed method compared with that achieved by conventional BSS methods. In addition, the real-time implementation of the proposed BSS is illustrated.
[Show abstract] [Hide abstract]
ABSTRACT: Blind source separation for convolutive mixture of speech signals has been addressed in many literatures. However, widely applied Multichannel Blind Deconvolution (MBD) method suffers whitening effect or arbitrary filtering problem which results in dramatic decrease of Automatic Speech Recognition system's performance. In present paper, a new MBD based multistage method is proposed, in which contributions of each source to every microphone are final goal rather than original signals. In detail, MBD is first implemented using entropy maximization criterion combined with Natural Gradient (NG) algorithm, then compensation matrix is constructed, based on which sources are recovered to its contribution to every microphone, i.e., whitening effect or arbitrary filtering problem has been transformed to fixed filtering problem. After compensation processing, for a certain source, it becomes Single Input and Multi-Output (SIMO) problem. Thus, not only spatial quality of source can be preserved, but also SIMO blind deconvolution can be further applied to fully recover temporal structure of speech signal. Finally, experiment shows validity and superiority over other methods in both spectra preservation efficiency and speed.
Data provided are for informational purposes only. Although carefully collected, accuracy cannot be guaranteed. The impact factor represents a rough estimation of the journal's impact factor and does not reflect the actual current impact factor. Publisher conditions are provided by RoMEO. Differing provisions from the publisher's actual policy or licence agreement may be applicable.