改进的MELP低速率语音编码器

计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (29): 131-133.

• 数据库、信号与信息处理 • 上一篇下一篇

改进的MELP低速率语音编码器

冯晓荣，刘晓明，田雨

重庆大学通信工程学院，重庆 400030

收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2011-10-11 发布日期:2011-10-11

Improved MELP speech coder for low bit rate speech coding

FENG Xiaorong，LIU Xiaoming，TIAN Yu

College of Communication Engineering，Chongqing University，Chongqing 400030，China

Received:1900-01-01 Revised:1900-01-01 Online:2011-10-11 Published:2011-10-11

摘要/Abstract

摘要： 在混合激励线性预测（MELP）语音编码算法中，语音帧的突变转换导致合成语音质量的下降成为一个突出问题。为解决该问题，提出一种基于过渡帧判决算法的改进MELP模型，提高了参数估计的准确度，有效实现了语音的降噪处理。引入动态清浊音判决（U/V判决）门限将语音帧分为3种类型：浊音帧、清音帧、过渡帧，通过区分过渡帧和清浊音帧，避免了传统的U/V判决错误和清浊音帧的突变转换。给出了改进MELP编码参数比特分配表，通过PESQ—MOS测试表明，合成语音质量尤其是高频女声合成语音质量有了明显的改进。

关键词: 混合激励线性预测（MELP）, 清/浊音（U/V）判决, 语音编码

Abstract: In the Mixed Excitation Linear Prediction（MELP） algorithm，frames with excessive switchings result in quality degradation for synthesized speech，which is a serious problem.In order to solve this problem，a new approach is given to analyze and synthesize the transition segments which are called the transition frames.This can improve the accuracy of parameter estimation and reduce the noise caused by classifying the transition frame simply into U/V frame.A new frame type decision algorithm with dynamic U/V threshold related to the pitch lag is introduced which classifies the speech frame into three types：voiced，unvoiced，and transition.The classifier can reduce the U/V decision errors and avoid excessive switchings between voiced frame and unvoiced frame.An improved bit allocation table is introduced and the PESQ—MOS test shows that the synthesized speech quality has been greatly improved through the new MELP speech coder，especially for high frequency female speakers.

Key words: Mixed Excitation Linear Prediction（MELP）, Unvoiced/Voiced（U/V） decision, speech coder

冯晓荣，刘晓明，田雨. 改进的MELP低速率语音编码器[J]. 计算机工程与应用, 2011, 47(29): 131-133.

FENG Xiaorong，LIU Xiaoming，TIAN Yu. Improved MELP speech coder for low bit rate speech coding[J]. Computer Engineering and Applications, 2011, 47(29): 131-133.

HTML			PDF

最新录用	在线预览	正式出版	最新录用	在线预览	正式出版
0	0	0	0	0	69

来源	本网站	其他网站

次数	52	17
比例	75%	25%

摘要

最新录用	在线预览	正式出版

0	0	75

	来源	本网站

	次数	75
	比例	100%

[1]	熊燕. LSF参数转换分裂矢量量化的卡尔曼后滤波增强方法[J]. 计算机工程与应用, 2013, 49(10): 228-231.
[2]	唐骏1，2，袁江南1，2. AMR-WB固定码本快速搜索新方法[J]. 计算机工程与应用, 2012, 48(36): 14-18.
[3]	陈峰，吴玉成. LPC-10e到MELP语音编码转换[J]. 计算机工程与应用, 2011, 47(33): 159-161.
[4]	马苗苗，何勇军，韩纪庆. 说话人识别中用模型合成的编码畸变补偿研究[J]. 计算机工程与应用, 2011, 47(3): 135-138.
[5]	高国坪¹，陈水仙². 语音编码器中的分数基音估计算法[J]. 计算机工程与应用, 2010, 46(16): 163-165.
[6]	赵欢¹，范锦秀¹，张波涛². 一种新型快速的固定码本搜索方法[J]. 计算机工程与应用, 2010, 46(15): 135-137.
[7]	杜睿山¹,尚福华¹,李阳². 复合混沌映射在语音加密算法中的应用[J]. 计算机工程与应用, 2009, 45(7): 103-104.
[8]	纪友芳,刘桂斌. 一种改进的线性预测语音编码技术及实现[J]. 计算机工程与应用, 2009, 45(15): 163-165.
[9]	武淑红^1,2,张刚¹,张雪英¹. 改进的SOFM算法及其在低延迟语音编码中的应用[J]. 计算机工程与应用, 2009, 45(12): 124-125.
[10]	赵哲峰,张刚,谢克明,王一平. 低延迟低码率语音编码研究[J]. 计算机工程与应用, 2008, 44(34): 100-102.
[11]	赵群群,张雪英. 直接矢量量化方法在语音编码算法中的应用[J]. 计算机工程与应用, 2008, 44(15): 39-42.
[12]	王伟王伟达郭恒业. G.729A语音压缩算法分析及DSP实现[J]. 计算机工程与应用, 2007, 43(8期): 99-102.
[13]	李图平龚素文. 嵌入式SIMD处理器上G.729的优化方法研究[J]. 计算机工程与应用, 2007, 43(3期): 139-139.
[14]	刘思伟,吕海波,慕德俊. 基于G.729的自适应实时语音活动检测方法研究[J]. 计算机工程与应用, 2007, 43(34): 57-60.

改进的MELP低速率语音编码器

Improved MELP speech coder for low bit rate speech coding

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 14

编辑推荐 0

Metrics