计算机工程与应用 ›› 2012, Vol. 48 ›› Issue (29): 114-118.

• 数据库、信号与信息处理 • 上一篇    下一篇

多类噪声环境下的语音端点检测

汤  霖,姜世芬   

  1. 江门职业技术学院 电子与信息技术系,广东 江门 529090
  • 出版日期:2012-10-11 发布日期:2012-10-22

Endpoint detection in multi types noise condition

TANG Lin, JIANG Shifen   

  1. Department of Electronic and Information Technology, Jiangmen Polytechnic, Jiangmen, Guangdong 529090, China
  • Online:2012-10-11 Published:2012-10-22

摘要: 在既有平稳噪音又有突发噪声的环境下进行语音端点检测是一项挑战。在选择抗噪特征的基础上,提出了自适应判定阈值和用多层感知器进行语噪鉴别的语音端点检测办法。实验结果表明,选择的语音参数比传统的帧能量和过零率在信噪比为0 dB时,正确的语音端点检出率高出27%,而多层感知器在正常环境下,检出94.47%的开关门声、咳嗽声、翻书声和呼吸声等孤立突发噪声。

关键词: 语音端点检测, 语音处理, 抗噪特征, 多层感知器

Abstract: It is a challenge to detect voice endpoints in the condition that includes stationary noise and instantaneous noise. This paper presents a method that uses self-adaptation detection thresholds and multi layer perceptron to recognize noise and voice based on the selected anti noise features. Experimental results show that the correct voice endpoints detection rate is 27% higher by using the selected features than using conventional frame energy and cross zero rate in 0 dB SNR, and the use of multi layer perceptron achieves 94.47% isolated instantaneous noise recognition rate in normal condition, those types of noise include the sound of opening and closing door, cough sound, sound of turning pages and sound of breath, etc.

Key words: voice endpoint detection, speech processing, anti noise feature, multi layer perceptron