计算机工程与应用 ›› 2013, Vol. 49 ›› Issue (9): 187-189.

• 图形图像处理 • 上一篇    下一篇

小波包变换与Teager能量算子结合的说话人识别

祝  鹏,王成儒   

  1. 燕山大学 信息科学与工程学院,河北 秦皇岛 066004
  • 出版日期:2013-05-01 发布日期:2016-03-28

Speaker recognition combining wavelet packet transform with Teager Energy Operator

ZHU Peng, WANG Chengru   

  1. College of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei 066004, China
  • Online:2013-05-01 Published:2016-03-28

摘要: 在说话人识别系统中,语音特征参数的提取是影响系统性能的关键因素之一。在研究了MFCC参数的基础上,结合MFCC参数在信号的低频部分具有高频率分辨率以及小波包变换可以对信号的高频部分进行分解以提高高频部分的频率分辨率的优点,将二者结合,将Teager能量算子引入到信号高频部分的能量参数求解,构造了一种新的混合特征参数,采用支持向量机实现说话人的分类识别。实验结果表明,该特征参数有效提高了说话人辨识系统的识别率。

关键词: 说话人识别, 梅尔频率倒谱系数, 小波包变换, Teager能量算子

Abstract: In speaker recognition system, the key factor is extracting a personality feature of the speaker. Based on analysis of MFCC parameter extraction, this paper constructs a hybrid parameter through combination of the low part of MFCC parameter and using the wavelet packet transform processing the high part of the signal, of which the Teager Energy Operator(TEO) is used. A Support Vector Machine(SVM) is used to do the classification work. Experimental data shows that the method is effective in raising the recognition rate.

Key words: speaker recognition, Mel Frequency Cepstrum Coefficient(MFCC), wavelet packet transform, Teager Energy Operator(TEO)