Computer Engineering and Applications ›› 2014, Vol. 50 ›› Issue (7): 217-220.
Previous Articles Next Articles
HU Zhengquan, ZENG Yuming, ZONG Yuan, LI Mengchao
Online:
Published:
胡政权,曾毓敏,宗 原,李梦超
Abstract: In the speaker recognition, Mel Frequency Cepstrum Coefficient(MFCC) is the most commonly used speech features. This paper presents an improved method of extraction to take the MFCC parameters, in the FFT of this step in the traditional process of extraction of MFCC spectrum reconstruction, noise compensation for reconstruction of the spectrum, with good noise immunity, approaching pure voice spectrum. The experiments show that the improvements based on this extracted MFCC, can significantly improve the recognition rate for speaker recognition system, especially in low SNR environment, the effect is obvious.
Key words: MFCC parameters, spectrum reconstruction, speaker recognition
摘要: 在说话人识别方面,最常用到的语音特征就是梅尔倒频谱系数(MFCC)。提出了一种改进的提取MFCC参数的方法,对传统的提取MFCC过程中计算FFT这一步骤进行频谱重构,对频谱进行噪声补偿重建,使之具有很好的抗噪性,逼近纯净语音的频谱。实验表明基于此改进提取的MFCC参数,可以明显提高说话人识别系统的识别率,尤其在低信噪比的环境下,效果明显。
关键词: MFCC参数, 频谱重建, 说话人识别
HU Zhengquan, ZENG Yuming, ZONG Yuan, LI Mengchao. Improvement of MFCC parameters extraction in speaker recognition[J]. Computer Engineering and Applications, 2014, 50(7): 217-220.
胡政权,曾毓敏,宗 原,李梦超. 说话人识别中MFCC参数提取的改进[J]. 计算机工程与应用, 2014, 50(7): 217-220.
0 / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://cea.ceaj.org/EN/
http://cea.ceaj.org/EN/Y2014/V50/I7/217