Computer Engineering and Applications ›› 2011, Vol. 47 ›› Issue (29): 131-133.

• Database, Signal and Information Processing •

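Improved MELP low-bit-rate speech coder

FENG Xiaorong, LIU Xiaoming, TIAN Yu

  1. College of Communication Engineering, Chongqing University, Chongqing 400030, China
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2011-10-11 Published: 2011-10-11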

Improved MELP speech coder for low bit rate speech coding

FENG Xiaorong, LIU Xiaoming, TIAN Yu

  1. College of Communication Engineering, Chongqing University, Chongqing 400030, China
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2011-10-11 Published: 2011-10-11

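Abstract: In the mixed excitation linear prediction (MELP) speech coding algorithm, abrupt switching between speech frames degrades the quality of the synthesized speech and has become a prominent problem. To address it, an improved MELP model based on a transition-frame decision algorithm is proposed; it improves the accuracy of parameter estimation and effectively reduces noise in the speech. A dynamic unvoiced/voiced (U/V) decision threshold is introduced to classify speech frames into three types: voiced, unvoiced, and transition frames. Distinguishing transition frames from voiced and unvoiced frames avoids the U/V decision errors of the conventional approach and the abrupt switching between voiced and unvoiced frames. A bit allocation table for the improved MELP coding parameters is given, and PESQ-MOS tests show a clear improvement in synthesized speech quality, especially for high-pitched female voices.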

Keywords: mixed excitation linear prediction (MELP), unvoiced/voiced (U/V) decision, speech coding

Abstract: In the Mixed Excitation Linear Prediction (MELP) algorithm, abrupt switching between frame types degrades the quality of the synthesized speech, which is a serious problem. To solve it, a new approach is given for analyzing and synthesizing the transition segments, which are treated as transition frames. This improves the accuracy of parameter estimation and reduces the noise caused by forcing a transition frame into either the unvoiced or the voiced class. A new frame type decision algorithm is introduced, with a dynamic U/V threshold related to the pitch lag, which classifies each speech frame into one of three types: voiced, unvoiced, or transition. The classifier reduces U/V decision errors and avoids abrupt switching between voiced and unvoiced frames. An improved bit allocation table is introduced, and the PESQ-MOS test shows that the new MELP speech coder clearly improves synthesized speech quality, especially for high-pitched female speakers.

Key words: Mixed Excitation Linear Prediction (MELP), Unvoiced/Voiced (U/V) decision, speech coder
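
The three-way frame decision described in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual decision rule, so everything specific here is an assumption: the voicing strength is taken to be a normalized correlation in [0, 1], the pitch lag is assumed to lie in 20 to 160 samples (8 kHz sampling), and the linear threshold formula, its constants, and the names `dynamic_thresholds` and `classify_frame` are hypothetical, not the authors' method.

```python
from enum import Enum

# Assumed pitch-lag range in samples at 8 kHz sampling; not taken from the paper.
MIN_LAG, MAX_LAG = 20, 160


class FrameType(Enum):
    VOICED = "voiced"
    UNVOICED = "unvoiced"
    TRANSITION = "transition"


def dynamic_thresholds(pitch_lag):
    """Return (low, high) voicing thresholds for a given pitch lag.

    Hypothetical rule: the thresholds rise linearly with the pitch lag,
    so short lags (high-pitched voices) are judged against lower bounds.
    The constants 0.5, 0.1 and 0.25 are illustrative only.
    """
    lag = min(max(pitch_lag, MIN_LAG), MAX_LAG)   # clamp to the assumed range
    t = (lag - MIN_LAG) / (MAX_LAG - MIN_LAG)     # 0 at shortest lag, 1 at longest
    high = 0.5 + 0.1 * t
    low = high - 0.25
    return low, high


def classify_frame(voicing_strength, pitch_lag):
    """Classify a frame as voiced, unvoiced, or transition.

    Frames whose voicing strength falls between the two thresholds are
    labelled transition instead of being forced into a binary U/V decision.
    """
    low, high = dynamic_thresholds(pitch_lag)
    if voicing_strength >= high:
        return FrameType.VOICED
    if voicing_strength <= low:
        return FrameType.UNVOICED
    return FrameType.TRANSITION
```

With these illustrative constants, a frame with voicing strength 0.55 is classified as voiced at a pitch lag of 20 samples but as a transition frame at a lag of 160 samples, which shows how a lag-dependent threshold changes the decision for the same measured strength.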