计算机工程与应用 ›› 2013, Vol. 49 ›› Issue (19): 217-221.

• 信号处理 • 上一篇    下一篇

音素关联的多文种语音融合编码方法

孙广武,戴  永,喻世东,李  璇   

  1. 湘潭大学 智能计算与信息处理教育部重点实验室,湖南 湘潭 411105
  • 出版日期:2013-10-01 发布日期:2015-04-20

Phonemes associated multilingual speech fusion coding method

SUN Guangwu, DAI Yong, YU Shidong, LI Xuan   

  1. Key Laboratory of Intelligent Computing and Information Processing, Ministry of Education, Xiangtan University, Xiangtan, Hunan 411105, China
  • Online:2013-10-01 Published:2015-04-20

摘要: 依据异类文种之间、同类文种不同语音之间存在音素数据关联的特性,提出多文种语音数据融合编码方法。将不同文种存在的相同音素数据段块按段块模板截取语音样本序列,小波变换,提取特征矢量,生成共享模板集;任意字音或语句音串均按共享模板集提供的元素进行编码与解码;以模板音素串构成的语音记录库按(音节、音素)索引。实验结果表明,单字语音数据压缩比、语音数据存储量、语音还原分段信噪比、主观评价得分等参数均明显优于已有方法,语音还原质量良好。

关键词: 语音, 音素, 关联, 多文种, 融合编码

Abstract: Multilingual speech data fusion coding method is proposed according to the phoneme data correlation properties, which exist among heterogeneous languages and different words in the same language. Voice sample sequences of the same phoneme data segment in different languages are intercepted according to the segment templates, wavelet transform is done to those sequences, then feature vectors is extracted to generate shared template sets. Speech data of any word or sentence are coded or decoded according to the template sets. The speech record database made up of template phoneme sets is indexed according to(syllable, phoneme) structure. The single word compression ratio, speech data size, segmental signal-to-noise ratio(SNRS) and score of subjective evaluation(MOS) are significantly better than existing methods, also the voice restoration is of good quality.

Key words: speech, phoneme, correlation, multilingual, fusion coding