Improvement of MFCC parameters extraction in speaker recognition

Abstract

In speaker recognition, the Mel Frequency Cepstrum Coefficient (MFCC) is the most commonly used speech feature. This paper proposes an improved method for extracting MFCC parameters: at the FFT step of the conventional MFCC extraction process, the spectrum is reconstructed with noise compensation, so that the reconstructed spectrum is robust to noise and approximates the spectrum of clean speech. Experiments show that MFCC parameters extracted with this improvement significantly raise the recognition rate of a speaker recognition system, and the gain is most pronounced in low-SNR environments.

Keywords: MFCC parameters, spectrum reconstruction, speaker recognition
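
To make the pipeline described in the abstract concrete, the following is a minimal sketch of MFCC extraction with a noise compensation step inserted right after the FFT. The abstract does not specify how the spectrum is reconstructed, so plain spectral subtraction is used here purely as a stand-in, with the noise estimated from the first few frames on the assumption that they contain no speech. All function names, parameter values (frame length, hop, filter count, spectral floor) and the synthetic test signal are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular mel filters, shape (n_filters, n_fft // 2 + 1)."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):      # rising edge of the triangle
            fbank[i - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):     # falling edge of the triangle
            fbank[i - 1, k] = (right - k) / max(right - center, 1)
    return fbank


def frame_power_spectrum(signal, frame_len, hop, n_fft):
    """Split into Hamming-windowed frames and return the power spectrum per frame."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)
    return np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2


def compensate_noise(power, n_noise_frames=5, floor=0.01):
    """Stand-in for the paper's reconstruction: spectral subtraction.

    Assumes the first n_noise_frames frames are noise only (hypothetical choice).
    """
    noise = power[:n_noise_frames].mean(axis=0)
    # Keep a small fraction of the original power so the log stays well defined.
    return np.maximum(power - noise, floor * power)


def dct_ii(x, n_coeffs):
    """Unnormalized DCT-II along axis 1, keeping the first n_coeffs coefficients."""
    n = x.shape[1]
    k = np.arange(n_coeffs)[:, None]
    basis = np.cos(np.pi * k * (2 * np.arange(n)[None, :] + 1) / (2 * n))
    return x @ basis.T


def mfcc(signal, sample_rate=16000, frame_len=400, hop=160,
         n_fft=512, n_filters=26, n_coeffs=13, compensate=True):
    power = frame_power_spectrum(signal, frame_len, hop, n_fft)
    if compensate:
        power = compensate_noise(power)    # inserted right after the FFT step
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    return dct_ii(np.log(energies + 1e-10), n_coeffs)


if __name__ == "__main__":
    # Synthetic example: noise throughout, with a 440 Hz tone in the second half.
    rng = np.random.default_rng(0)
    t = np.arange(16000) / 16000.0
    tone = np.where(t >= 0.5, np.sin(2 * np.pi * 440.0 * t), 0.0)
    noisy = tone + 0.1 * rng.standard_normal(t.size)
    print(mfcc(noisy).shape)
```

Setting `compensate=False` gives the conventional MFCC path, so the two feature sets can be compared on the same input. Spectral subtraction is only one simple way to approximate a clean spectrum; the method in the paper may differ substantially.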

HU Zhengquan, ZENG Yuming, ZONG Yuan, LI Mengchao. Improvement of MFCC parameters extraction in speaker recognition[J]. Computer Engineering and Applications, 2014, 50(7): 217-220.
