有效提取耳语音共振峰的改进方法

doi:10.3778/j.issn.1002-8331.2009.19.041

计算机工程与应用 ›› 2009, Vol. 45 ›› Issue (19): 134-136.DOI: 10.3778/j.issn.1002-8331.2009.19.041

有效提取耳语音共振峰的改进方法

吕岗,赵鹤鸣,刘建新,龚呈卉

苏州大学电子信息学院，江苏苏州 215021

收稿日期:2008-04-25 修回日期:2008-07-07 出版日期:2009-07-01 发布日期:2009-07-01
通讯作者: 吕岗

Improved method for effectively extracting whisper speech formant

LV Gang,ZHAO He-ming,LIU Jian-xin,GONG Cheng-hui

School of Electronics and Information Engineering，Soochow University，Suzhou，Jiangsu 215021，China

Received:2008-04-25 Revised:2008-07-07 Online:2009-07-01 Published:2009-07-01
Contact: LV Gang

摘要/Abstract

摘要： 耳语音是噪声源激励，与正常音相比，其共振峰位置发生了偏移，带宽增宽。故采用传统的线性预测法提取耳语音共振峰时存在虚假峰问题。通过分析功率谱，提出了一种改进算法。根据极点功率不变的原则，利用极点交互因子修正共振峰的带宽，从而准确地提取出耳语音的共振峰。对汉语普通话单元音音素仿真实验的结果证明了该算法的有效性。

关键词: 耳语音, 共振峰, 线性预测编码, 极点交互

Abstract: Whisper is stirred by noise.Comparing with normal speech，the formant of whisper is shifted and the bandwidth is broadened，and that will bring up the problem of spurious peaks when using the tranditional conventional liner prediction coding for formant extraction.By analyzing power spectrum，an improved approach has been proposed.Based on the rule that the pole power is not change，the algorithm modifies the whisper formant bandwidths using pole interaction factor，and extracts formants exactly.Experimental results with mono-vowel phones in Mandarin speech prove the ability of this algorithm.

Key words: whispered speech, formant, liner prediction coding, pole interaction

吕岗,赵鹤鸣,刘建新,龚呈卉. 有效提取耳语音共振峰的改进方法[J]. 计算机工程与应用, 2009, 45(19): 134-136.

LV Gang,ZHAO He-ming,LIU Jian-xin,GONG Cheng-hui. Improved method for effectively extracting whisper speech formant[J]. Computer Engineering and Applications, 2009, 45(19): 134-136.

[1]	赵涛涛，杨鸿武. 结合EMD和加权Mel倒谱的语音共振峰提取算法[J]. 计算机工程与应用, 2015, 51(9): 207-212.
[2]	吴亮春，潘世永，何金瑞，张东海. 改进的基于小波包变换的语音特征提取算法[J]. 计算机工程与应用, 2011, 47(5): 210-212.
[3]	王艳1，冯宏伟1，张利平1，忽满利2. 基于元音检测的汉语连续语音声韵母分割[J]. 计算机工程与应用, 2011, 47(14): 134-136.
[4]	谈雪丹¹，顾济华¹，赵鹤鸣²，陶智¹，韩韬¹，吴俊¹. 基于HHT瞬时能频值的耳语音端点检测[J]. 计算机工程与应用, 2010, 46(29): 147-150.
[5]	张利平，冯宏伟，王艳. 基于元音检测的汉语连续语音端点检测方法[J]. 计算机工程与应用, 2010, 46(27): 114-116.
[6]	陈宁，万茂文. 语音信号共振峰频率估计的分段线性预测算法[J]. 计算机工程与应用, 2009, 45(28): 156-159.
[7]	朱颖,钱盛友. 一种改进的倒谱基音提取算法[J]. 计算机工程与应用, 2009, 45(15): 158-159.
[8]	纪友芳,刘桂斌. 一种改进的线性预测语音编码技术及实现[J]. 计算机工程与应用, 2009, 45(15): 163-165.
[9]	徐媛媛,袁晓,杨莎. 复子波提取语音信号特征信息[J]. 计算机工程与应用, 2008, 44(36): 224-226.
[10]	荣薇¹,陶智¹,顾济华¹,赵鹤鸣². 基于概率神经网络的汉语耳语音识别系统[J]. 计算机工程与应用, 2008, 44(17): 148-150.
[11]	荣薇¹,陶智¹,顾济华¹,赵鹤鸣². 基于改进LPCC和MFCC的汉语耳语音识别[J]. 计算机工程与应用, 2007, 43(30): 213-216.
[12]	施晓敏¹,顾济华¹,陶智¹,赵鹤鸣²,张晓俊¹. 基于听觉感知的电子耳蜗共振峰提取方案[J]. 计算机工程与应用, 2007, 43(29): 232-234.
[13]	孙静¹,陶智¹,顾济华¹,赵鹤鸣². 基于AD神经网络的耳语音增强的研究[J]. 计算机工程与应用, 2007, 43(29): 242-244.

有效提取耳语音共振峰的改进方法

Improved method for effectively extracting whisper speech formant

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 13

编辑推荐

Metrics