Computer Engineering and Applications ›› 2007, Vol. 43 ›› Issue (36): 217-219.

• Engineering and Applications •

Speech endpoint detection and enhancements in car noisy environment

MA Long-hua¹, ZANG Yi-hua², LIU Li-qiang¹

  1. College of Automation, Harbin Engineering University, Harbin 150001, China
  2. North China Computing Technology Institute, Beijing 100083, China
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-12-21 Published:2007-12-21
  • Contact: MA Long-hua

Speech endpoint detection and enhancement techniques in in-car noise environments

马龙华¹, 臧义华², 刘利强¹

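  1. College of Automation, Harbin Engineering University, Harbin 150001
  2. Command Automation Research Office, North China Computing Technology Institute, Beijing 100083
  • Corresponding author: 马龙华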

Abstract: With the development of modern technology, cars carry more and more electronic devices, but it is dangerous for a driver to take their hands off the steering wheel to operate them. Using speech recognition as the input interface is one solution to this problem. General-purpose speech recognition systems, however, are sensitive to background noise, and an important reason is that the endpoints of noise-contaminated speech are difficult to locate accurately, so techniques are needed to address this. Speech enhancement is a popular one. This paper describes a new endpoint detection technique based on band-partitioning mel-frequency spectral entropy, with which the start point and end point of speech can be found accurately. Once the endpoints are known, spectral subtraction can be used to enhance the speech signal, after which speech recognition can work well.

Key words: speech enhancement, endpoint detection, band partitioning mel frequency spectral entropy

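Abstract (Chinese version): With the development of modern technology, vehicles carry more and more electronic devices, but it is dangerous for drivers to take their hands off the steering wheel to operate them while driving. One solution is for all of these devices to use speech recognition as their input interface. Ordinary speech recognition systems have a low recognition rate in noisy environments, and an important cause of this drop is inaccurate endpoint detection, so techniques must be developed to solve this problem. An algorithm based on sub-band mel spectral entropy is proposed that can accurately detect the start point and end point of speech. Once the speech endpoints are obtained, spectral subtraction can be used for speech enhancement, and the enhanced speech signal can then be recognized by an ordinary speech recognition system.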

Key words (Chinese version): speech enhancement, endpoint detection, sub-band mel spectral entropy
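
The abstract names the two processing stages but gives no formulas, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It groups each frame's power spectrum into mel-spaced sub-bands, takes the entropy of the band energy distribution, marks frames whose entropy falls well below that of the leading frames as speech, and then applies power spectral subtraction with a noise estimate. Everything specific here is an assumption introduced for illustration: the function names, the mel formula 2595·log10(1 + f/700) (a common convention), the defaults `n_bands=8`, `n_noise=5`, `ratio=0.6` and `floor=0.01`, the premise that the first few frames contain noise only, and the premise that noise spreads energy more evenly across bands than speech does. A naive DFT is used only to keep the sketch dependency-free.

```python
import cmath
import math


def hz_to_mel(f):
    # Common mel-scale convention (an assumption; the abstract gives none).
    return 2595.0 * math.log10(1.0 + f / 700.0)


def power_spectrum(frame):
    """Power spectrum (bins 0..N/2) of a Hann-windowed frame via a naive DFT."""
    n = len(frame)
    x = [s * (0.5 - 0.5 * math.cos(2.0 * math.pi * i / (n - 1)))
         for i, s in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spec.append(abs(acc) ** 2)
    return spec


def mel_band_entropy(spec, sample_rate, n_bands=8):
    """Entropy of the energy distribution over equal-width mel sub-bands."""
    n_fft = 2 * (len(spec) - 1)
    top = hz_to_mel(sample_rate / 2.0)
    energies = [0.0] * n_bands
    for k, p in enumerate(spec):
        mel = hz_to_mel(k * sample_rate / n_fft)
        band = min(int(mel / top * n_bands), n_bands - 1)
        energies[band] += p
    total = sum(energies) + 1e-12  # guard against an all-zero frame
    probs = [e / total for e in energies]
    return -sum(q * math.log(q) for q in probs if q > 0.0)


def detect_endpoints(spectra, sample_rate, n_noise=5, ratio=0.6):
    """Return (first, last) speech frame indices, or None if none is found.

    Assumes the first n_noise frames are noise only and uses a fraction of
    their mean entropy as the speech/non-speech threshold.
    """
    ent = [mel_band_entropy(s, sample_rate) for s in spectra]
    threshold = ratio * sum(ent[:n_noise]) / n_noise
    speech = [i for i, e in enumerate(ent) if e < threshold]
    return (speech[0], speech[-1]) if speech else None


def average_spectrum(spectra):
    """Bin-wise mean of several power spectra, used as the noise estimate."""
    return [sum(col) / len(spectra) for col in zip(*spectra)]


def spectral_subtract(spec, noise_spec, floor=0.01):
    """Power spectral subtraction with a spectral floor (no resynthesis)."""
    return [max(p - n, floor * p) for p, n in zip(spec, noise_spec)]
```

In use, each frame would be passed through `power_spectrum`, the resulting list given to `detect_endpoints`, the frames before the detected start averaged with `average_spectrum` to estimate the noise, and `spectral_subtract` applied to the frames between the endpoints. The sketch stops at the enhanced power spectrum. Rebuilding a waveform would additionally need the noisy phase and an inverse transform.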