计算机工程与应用 ›› 2014, Vol. 50 ›› Issue (7): 217-220.

• 信号处理 • 上一篇    下一篇

说话人识别中MFCC参数提取的改进

胡政权,曾毓敏,宗  原,李梦超   

  1. 南京师范大学 物理科学与技术学院,南京 210046
  • 出版日期:2014-04-01 发布日期:2014-04-25

Improvement of MFCC parameters extraction in speaker recognition

HU Zhengquan, ZENG Yuming, ZONG Yuan, LI Mengchao   

  1. College of Physical Science and Technology, Nanjing Normal University, Nanjing 210046, China
  • Online:2014-04-01 Published:2014-04-25

摘要: 在说话人识别方面,最常用到的语音特征就是梅尔倒频谱系数(MFCC)。提出了一种改进的提取MFCC参数的方法,对传统的提取MFCC过程中计算FFT这一步骤进行频谱重构,对频谱进行噪声补偿重建,使之具有很好的抗噪性,逼近纯净语音的频谱。实验表明基于此改进提取的MFCC参数,可以明显提高说话人识别系统的识别率,尤其在低信噪比的环境下,效果明显。

关键词: MFCC参数, 频谱重建, 说话人识别

Abstract: In the speaker recognition, Mel Frequency Cepstrum Coefficient(MFCC) is the most commonly used speech features. This paper presents an improved method of extraction to take the MFCC parameters, in the FFT of this step in the traditional process of extraction of MFCC spectrum reconstruction, noise compensation for reconstruction of the spectrum, with good noise immunity, approaching pure voice spectrum. The experiments show that the improvements based on this extracted MFCC, can significantly improve the recognition rate for speaker recognition system, especially in low SNR environment, the effect is obvious.

Key words: MFCC parameters, spectrum reconstruction, speaker recognition