计算机工程与应用 ›› 2014, Vol. 50 ›› Issue (16): 198-201.

• 信号处理 • 上一篇    下一篇

基于对数能量倒谱特征的端点检测算法

王  民1,孙  广1,沈利荣2,刘  利1   

  1. 1.西安建筑科技大学 信息与控制工程学院,西安 710055
    2.西安石油大学 光电油气测井与检测教育部重点实验室,西安 710065
  • 出版日期:2014-08-15 发布日期:2014-08-14

Voice activity detection using logarithmic energy and cepstrum Distance

WANG Min1, SUN Guang1, SHEN Lirong2, LIU Li1   

  1. 1.School of Information and Control Engineering, Xi’an University of Architecture and Technology, Xi’an 710055, China
    2.Key Laboratory of Photoelectric Logging and Detecting of Oil and Gas, Ministry of Education, Xi’an Shiyou University, Xi’an 710065, China
  • Online:2014-08-15 Published:2014-08-14

摘要: 端点检测技术是语音识别的关键技术之一,为了克服传统倒谱距离语音端点检测算法在低信噪比下检测效果的不理想,将对数能量(LE)特征和倒谱(C)特征相结合,提出了一种新的对数能量倒谱特征(LEC),采用模糊C均值聚类和贝叶斯信息准则(BIC)方法估计特征门限,得出了正确的语音端点判断,在三种典型噪声下,对信噪比从-5 dB到15 dB的带噪声语音进行仿真,结果表明LEC法的检测错误率仅为20.25%,明显低于倒谱法和对数能量法,能有效地确定语音的端点并改善语音识别效果。

关键词: 对数能量, 倒谱距离, 模糊C均值聚类, 贝叶斯信息准则(BIC), 端点检测

Abstract: Endpoint detection is one of the key technologies of speech recognition, in order to overcome the undesirable detection results of traditional cepstrum distance in speech endpoint detection algorithm under low signal to noise ratio, combined logarithmic energy feature(LE) with cepstrum features(C) for endpoint detection, proposes a new logarithmic energy cepstrum features(LEC), uses fuzzy C-means clustering and Bayesian information criterion to estimate features threshold, achieves better endpoint judgment, conducts the SNR simulation from -5 dB to 15 dB with noisy speech under three kinds of typical noise. The results indicate that the LEC method’ detection error rates is just 20.25% and significantly lower than cepstrum and logarithmic energy method, it also can effectively determine the speech endpoint and improve voice recognition results.

Key words: logarithmic energy, cepstrum distance, Fuzzy C-means clustering, Bayesian Information Criterions(BIC), Endpoint detection