Computer Engineering and Applications ›› 2016, Vol. 52 ›› Issue (20): 149-153.
Previous Articles Next Articles
XUE Juntao, WENG Yuru, ZHANG Jun
Online:
Published:
薛俊韬,翁玉茹,张 军
Abstract: In view of the problem that speech endpoint detection based on Empirical Mode Decomposition(EMD) loses its accuracy and adaptive in adverse environments, this paper proposes a novel speech endpoint detection algorithm based on EMD and cross-entropy. EMD decomposition characteristic is analyzed that probability distribution of white noise in each Intrinsic Mode Functions(IMF) is identified and unrelated to noise amplitude. Since probability distribution of white noise is different from that of speech signal, cross-entropy is used to reflect the difference of speech-frames and noise-frames. EMD-energy feature and cross-entropy are complementary so that they are combined to be a comprehensive determination for speech endpoint detection. Adaptive threshold is set to adapt to negative environments. It catches the changes of noise energy and then it is self-updated to improve accuracy in speech endpoint detection. Simulation results indicate that it is effective and superior in the presence of low Signal-to-Noise Ratio(SNR) and non-stationary noise.
Key words: endpoint detection, Empirical Mode Decomposition(EMD), cross entropy, adaptive threshold, low Signal-to-Noise Ratio(SNR)
摘要: In view of the problem that speech endpoint detection based on Empirical Mode Decomposition(EMD) loses its accuracy and adaptive in adverse environments, this paper proposes a novel speech endpoint detection algorithm based on EMD and cross-entropy. EMD decomposition characteristic is analyzed that probability distribution of white noise in each Intrinsic Mode Functions(IMF) is identified and unrelated to noise amplitude. Since probability distribution of white noise is different from that of speech signal, cross-entropy is used to reflect the difference of speech-frames and noise-frames. EMD-energy feature and cross-entropy are complementary so that they are combined to be a comprehensive determination for speech endpoint detection. Adaptive threshold is set to adapt to negative environments. It catches the changes of noise energy and then it is self-updated to improve accuracy in speech endpoint detection. Simulation results indicate that it is effective and superior in the presence of low Signal-to-Noise Ratio(SNR) and non-stationary noise.
关键词: endpoint detection, Empirical Mode Decomposition(EMD), cross entropy, adaptive threshold, low Signal-to-Noise Ratio(SNR)
XUE Juntao, WENG Yuru, ZHANG Jun. Speech endpoint detection based on EMD and cross-entropy[J]. Computer Engineering and Applications, 2016, 52(20): 149-153.
薛俊韬,翁玉茹,张 军. 基于EMD和交叉熵的语音端点检测算法[J]. 计算机工程与应用, 2016, 52(20): 149-153.
0 / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://cea.ceaj.org/EN/
http://cea.ceaj.org/EN/Y2016/V52/I20/149