Computer Engineering and Applications ›› 2014, Vol. 50 ›› Issue (15): 191-193.

Previous Articles     Next Articles

Using complex cepstrum peak filter for reverberation recognition by GMM

KONG Rong, WU Di, LIAO Qipeng, ZHU Junjie, ZHOU Qiang, TAO Zhi   

  1. School of Physical Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China
  • Online:2014-08-01 Published:2014-08-04

采用复倒谱峰值滤波GMM识别混响语音

孔  荣,吴  迪,廖启鹏,朱俊杰,周  强,陶  智   

  1. 苏州大学 物理科学与技术学院,江苏 苏州 215006

Abstract: The performance of speech recognition system will fall sharply in reverberant environment. In order to solve it, this paper proposes a method to use complex cepstrum peak filter to improve reverberation speech recognition rates by GMM. The GMM is constructed by MFCC parameters of pure speech, before identifying, this paper introduces the complex cepstrum peak filter to decrease speech distortions and improve the recognition rate in reverberant environment. The experimental results show that this method avoids estimating the room impulse response function in real conditions accurately, reduces the computational difficulty, and improves more than 4% of the system recognition rates in reverberant environment.

Key words: Gaussian Mixture Model(GMM), complex cepstrum, Mel Frequency Cepstrum Coefficient(MFCC)

摘要: 针对混响环境下语音识别系统性能急剧下降问题,提出一种采用复倒谱峰值滤波GMM识别混响语音的方法。通过训练纯净语音的MFCC特征参数构建高斯混合模型,在识别混响语音前引入复倒谱峰值滤波器以减少混响引起的语音失真而提高混响环境下语音识别率。经实验验证,该方法避免了在现实条件下准确估计房间冲击响应函数的麻烦,降低了计算难度,提高了混响环境下至少4%的系统识别率。

关键词: 高斯混合模型, 复倒谱, Mel频率倒谱系数(MFCC)