Computer Engineering and Applications ›› 2011, Vol. 47 ›› Issue (20): 147-150.

• 数据库、信号与信息处理 • Previous Articles     Next Articles

Real-time voice activity robust detection

WANG Jingfang   

  1. Department of Electric Engineering,Hunan International Economics University,Changsha 410205,China
  • Received:1900-01-01 Revised:1900-01-01 Online:2011-07-11 Published:2011-07-11

实时语音端点鲁棒检测

王景芳   

  1. 湖南涉外经济学院 电气工程系,长沙 410205

Abstract: This paper proposes an effective real-time voice activity detection algorithm in various noisy environments in the acoustical signal filting.It makes the iterative methods of estimating the noise power spectrum.The speech spectrum is filted with iterative Wiener method.The filting spectrum is divided into several subbands and the spectral entropy of each subband is estimated.Median filters are applied to a sequence of the subband entropies to obtain the spectral entropy of each frame.The speech/noise classification is based on the spectral entropy.The experimental results show that the proposed algorithm can distinguish speech from noise effectively and improve the performance of automatic speech recognition system significantly.It is proved to be robust under various noisy environments.The algorithm is of low computational complexity which is suitable for real-time speech recognition system application.

Key words: voice activity detection, iterative Wiener filtering, subband spectrum entropy, adaptive processing, robustness

摘要: 提出了一种适应复杂环境下的高效的实时语音端点检测算法,给出了每帧声信号在滤波中的噪声功率谱的推算方法。先将每帧语音的频谱进行迭代维纳滤波,再将它划分成若干个子带并计算出每个子带的频谱熵,然后把相继若干帧的子带频谱熵经过一组中值滤波器获得每帧的频谱熵,根据频谱熵的值对输入的语音进行分类。实验结果表明,该算法能够有效地区分语音和噪声,可以显著地提高语音识别系统的性能,在不同的噪声环境条件下具有鲁棒性。该算法计算代价小,简单易实现,适合实时语音识别系统的应用。

关键词: 语音端点检测, 迭代维纳滤波, 子带频谱熵, 自适应处理, 鲁棒性