计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (20): 147-150.
• 数据库、信号与信息处理 • 上一篇 下一篇
王景芳
收稿日期:
修回日期:
出版日期:
发布日期:
WANG Jingfang
Received:
Revised:
Online:
Published:
摘要: 提出了一种适应复杂环境下的高效的实时语音端点检测算法,给出了每帧声信号在滤波中的噪声功率谱的推算方法。先将每帧语音的频谱进行迭代维纳滤波,再将它划分成若干个子带并计算出每个子带的频谱熵,然后把相继若干帧的子带频谱熵经过一组中值滤波器获得每帧的频谱熵,根据频谱熵的值对输入的语音进行分类。实验结果表明,该算法能够有效地区分语音和噪声,可以显著地提高语音识别系统的性能,在不同的噪声环境条件下具有鲁棒性。该算法计算代价小,简单易实现,适合实时语音识别系统的应用。
关键词: 语音端点检测, 迭代维纳滤波, 子带频谱熵, 自适应处理, 鲁棒性
Abstract: This paper proposes an effective real-time voice activity detection algorithm in various noisy environments in the acoustical signal filting.It makes the iterative methods of estimating the noise power spectrum.The speech spectrum is filted with iterative Wiener method.The filting spectrum is divided into several subbands and the spectral entropy of each subband is estimated.Median filters are applied to a sequence of the subband entropies to obtain the spectral entropy of each frame.The speech/noise classification is based on the spectral entropy.The experimental results show that the proposed algorithm can distinguish speech from noise effectively and improve the performance of automatic speech recognition system significantly.It is proved to be robust under various noisy environments.The algorithm is of low computational complexity which is suitable for real-time speech recognition system application.
Key words: voice activity detection, iterative Wiener filtering, subband spectrum entropy, adaptive processing, robustness
王景芳. 实时语音端点鲁棒检测[J]. 计算机工程与应用, 2011, 47(20): 147-150.
WANG Jingfang. Real-time voice activity robust detection[J]. Computer Engineering and Applications, 2011, 47(20): 147-150.
0 / 推荐
导出引用管理器 EndNote|Ris|BibTeX
链接本文: http://cea.ceaj.org/CN/
http://cea.ceaj.org/CN/Y2011/V47/I20/147