Noise-robust automatic speech recognition using mainlobe-resilient time-frequency quantile-based noise estimation
ABSTRACT In standard speech recognition systems in which training data are clean speech, the presence of background noise in received signal can severely deteriorate the recognition performance. This paper presents a simple noise-robust speech recognition system based on a modified noise spectral estimation method called mainlobe-resilient time-frequency quantile-based noise estimation (M-R T-F QBNE), which focuses on the mainlobes at harmonic frequencies. We estimate the global signal-to-noise ratio (SNR) and select a recognition model, which is best matched to the SNR operating range. Experimental results show that the recognition accuracy of the proposed recognition system is higher than that of the AURORA2 clean training baseline by 23%. Compared to multicondition training, the proposed method achieves comparable recognition accuracy.