计算机工程与应用 ›› 2015, Vol. 51 ›› Issue (17): 228-232.

• 信息与信号处理 • 上一篇    下一篇

基于CFCC和相位信息的鲁棒性说话人辨识

李作强,高  勇   

  1. 四川大学 电子信息学院,成都 610065
  • 出版日期:2015-09-01 发布日期:2015-09-14

Robust speaker identification based on CFCC and phase information

LI Zuoqiang, GAO Yong   

  1. College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
  • Online:2015-09-01 Published:2015-09-14

摘要: 传统的说话人识别中,人们往往认为人耳对相位信息不敏感而忽略了相位信息对语音识别的影响。为了验证相位信息对说话人识别的影响,提出了一种提取相位特征参数的方法。分别在纯净语音和带噪语音条件下,基于高斯混合模型,通过将相位特征参数与耳蜗倒谱系数(CFCC)相结合,研究了相位信息对说话人辨识性能的影响。实验结果标明:相位信息在说话人识别中也有着重要的作用,将其应用于说话人辨识系统,可明显提高系统的识别率和鲁棒性。

关键词: 说话人辨识, 相位特征参数, 耳蜗倒谱系数(CFCC), 高斯混合模型(GMM)

Abstract: In the traditional speaker recognition, people tend to think that the human ear is not sensitive to phase information and ignore the influence of phase information on speech recognition. In order to verify the influence of phase information on speaker recognition, an algorithm to extract the phase cepstrum coefficient is proposed in this paper. Respectively, under the condition of pure and noise-corrupted speech, based on Gaussian Mixes Model (GMM), combining the Cochlear Filter Cepstrum Coefficient (CFCC) with the phase feature parameters, the influence of phase on speaker recognition performance is studied. The experimental results indicate that phase information also plays an important role in speaker recognition and when it is applied to speaker recognition system, the recognition rate and the robustness of system has obvious improvement.

Key words: speaker identification, phase feature parameters, Cochlear Filter Cepstrum Coefficient(CFCC), Gaussian Mixes Model(GMM)