Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (19): 134-136. DOI: 10.3778/j.issn.1002-8331.2009.19.041

• Databases and Information Processing •

Improved method for effectively extracting whispered speech formants

LV Gang (吕岗), ZHAO He-ming (赵鹤鸣), LIU Jian-xin (刘建新), GONG Cheng-hui (龚呈卉)

  1. School of Electronics and Information Engineering, Soochow University, Suzhou, Jiangsu 215021, China
  • Received: 2008-04-25 Revised: 2008-07-07 Online: 2009-07-01 Published: 2009-07-01
  • Corresponding author: LV Gang

Abstract: Whispered speech is excited by a noise source. Compared with normal speech, its formants are shifted and their bandwidths are broadened, so conventional linear prediction produces spurious peaks when it is used to extract whisper formants. Based on an analysis of the power spectrum, an improved algorithm is proposed. Following the principle that pole power remains unchanged, the algorithm uses a pole interaction factor to correct the formant bandwidths and thereby extracts the whisper formants accurately. Simulation experiments on Mandarin Chinese single-vowel phonemes demonstrate the effectiveness of the algorithm.

Key words: whispered speech, formant, linear prediction coding, pole interaction
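
For orientation, the conventional linear prediction approach that the abstract takes as its starting point can be sketched as follows. This is only an illustrative baseline under stated assumptions, not the authors' improved algorithm: the abstract does not give the definition of the pole interaction factor, so no bandwidth correction is attempted here. The function names (`lpc_autocorrelation`, `formants_from_lpc`, `synth_all_pole_impulse_response`), the sampling rate, and the test resonances are all hypothetical, and the demo signal is a synthetic all-pole impulse response rather than real whispered speech.

```python
import numpy as np


def lpc_autocorrelation(frame, order):
    """Return the LPC polynomial [1, a1, ..., ap] (autocorrelation method)."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])
    # Toeplitz normal equations: R a = -r[1:].
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, -r[1:])
    return np.concatenate(([1.0], a))


def formants_from_lpc(a, fs):
    """Convert LPC poles to (frequency_hz, bandwidth_hz) pairs sorted by frequency."""
    roots = np.roots(a)
    roots = roots[np.imag(roots) > 0]  # keep one pole of each conjugate pair
    freqs = np.angle(roots) * fs / (2 * np.pi)   # pole angle -> frequency
    bws = -np.log(np.abs(roots)) * fs / np.pi    # pole radius -> 3 dB bandwidth
    idx = np.argsort(freqs)
    return list(zip(freqs[idx], bws[idx]))


def synth_all_pole_impulse_response(resonances, fs, n):
    """Impulse response of an all-pole filter with the given (freq, bandwidth) pairs."""
    poles = []
    for f, bw in resonances:
        radius = np.exp(-np.pi * bw / fs)
        theta = 2 * np.pi * f / fs
        poles += [radius * np.exp(1j * theta), radius * np.exp(-1j * theta)]
    a = np.real(np.poly(poles))  # denominator polynomial A(z)
    h = np.zeros(n)
    for i in range(n):
        acc = 1.0 if i == 0 else 0.0  # unit impulse input
        for k in range(1, min(i, len(a) - 1) + 1):
            acc -= a[k] * h[i - k]
        h[i] = acc
    return h


# Hypothetical demo values: two resonances at an 8 kHz sampling rate.
fs = 8000
true_resonances = [(700.0, 100.0), (1200.0, 150.0)]
h = synth_all_pole_impulse_response(true_resonances, fs, 2000)
a_hat = lpc_autocorrelation(h, order=4)
estimates = formants_from_lpc(a_hat, fs)
for freq, bw in estimates:
    print(f"formant at {freq:.1f} Hz, bandwidth {bw:.1f} Hz")
```

On a clean all-pole signal like this one, each pole pair maps directly to one formant. The abstract's point is that for noise-excited whispered speech the broadened bandwidths make this direct mapping unreliable, which is what the proposed bandwidth correction is meant to address.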