Computer Engineering and Applications ›› 2011, Vol. 47 ›› Issue (17): 151-153.

• 数据库、信号与信息处理 • Previous Articles     Next Articles

Harmonic sinusoidal speech modeling based on wavelet multi-resolution analysis

SUN Yan,YU Fengqin   

  1. School of Communication and Control Engineering,Jiangnan University,Wuxi,Jiangsu 214122,China

  • Received:1900-01-01 Revised:1900-01-01 Online:2011-06-11 Published:2011-06-11

小波多分辨率的谐波正弦语音建模

孙 艳,于凤芹   

  1. 江南大学 通信与控制工程学院,江苏 无锡 214122

Abstract: Harmonic sinusoidal speech model has the problem of fixed frame length,which can not get the best resolution of each harmonic,and the resolution determines the effect of speech modeling.A harmonic sinusoidal speech model based on wavelet multi-resolution analysis is proposed,decomposing an input harmonic speech signal into multi-resolution sub-band signals using the wavelet transform,and harmonic sinusoidal speech model is applied to each sub-band signal respectively.Finally,each sub-band signal modeled is synthesized.Simulation experiments show that the objective signal reconstruction error of the proposed model is reduced by about two orders of magnitude,and MOS’s grades have increased about 0.3 through PESQ software’s testing.

Key words: harmonic sinusoidal speech model, wavelet transform, multi-resolution, harmonic speech signal

摘要: 谐波正弦语音模型因固定帧长不能使每个谐波得到最佳分辨率,而分辨率决定着语音的建模效果。因此提出小波多分辨率的谐波正弦语音模型,将谐波语音信号通过小波变换分解成多分辨率子带信号,利用谐波正弦语音模型对这些子带信号独立建模,将建模后的各子带信号相加合成。仿真实验显示该模型的信号重构误差降低约两个数量级,通过PESQ软件测试得到的MOS分值约提高0.3。

关键词: 谐波正弦语音模型, 小波变换, 多分辨率, 谐波语音信号