Computer Engineering and Applications, 2011, Vol. 47, Issue (16): 122-127.

• Database, Signal and Information Processing •

Prosody conversion from Mandarin to Xi’an dialect

GUO Weitong1, YANG Hongwu1, LIANG Qingqing2, PEI Dong1

  1. College of Physics and Electronic Engineering, Northwest Normal University, Lanzhou 730070, China
  2. School of Electronics and Information Engineering, Gansu Lianhe University, Lanzhou 730010, China
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2011-06-01 Published: 2011-06-01

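Abstract (translated from the Chinese): Dialect speech conversion is an important research topic in human-computer interaction. To convert Mandarin into the Xi’an dialect, the paper uses the Dialect Survey Word List (《方言调查字表》) to design a Xi’an dialect corpus comprising text and speech material, and records a parallel speech corpus in Mandarin and the Xi’an dialect. It proposes a dialect prosody conversion model based on normalized nonlinear polynomials, together with statistics-based conversion models for duration and for pause duration. The STRAIGHT algorithm is used to modify the Mandarin speech and thereby convert it into the Xi’an dialect. A MOS evaluation of the results gives average scores of 4.60 for converted monosyllables, 4.75 for disyllables, and 4.15 for sentences.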

Abstract: Dialect speech conversion is an important research topic in the field of human-computer speech communication. For conversion from Mandarin to the Xi’an dialect, a Xi’an dialect corpus is built based on the “word-list in dialectal survey”. A parallel speech corpus is recorded in the Xi’an dialect and Mandarin. Prosodic models based on the normalized nonlinear polynomial method are built for the Xi’an dialect by analyzing the differences in pitch, duration and pause duration between the Xi’an dialect and Mandarin. Mandarin speech is converted into the Xi’an dialect with the STRAIGHT algorithm. Subjective experiments show that the converted monosyllables, disyllables and sentences achieve average Mean Opinion Scores (MOS) of 4.60, 4.71 and 4.15, respectively.

Key words: dialect conversion, corpus, prosody modeling, duration model, pitch contour
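
The prosody conversion summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the model's equations, so the log-scale pitch normalization, the cubic polynomial degree, the pooling of contours per tone, the single mean-duration ratio, and every function and parameter name below are assumptions made for illustration only.

```python
import numpy as np


def normalize_f0(f0_hz, f0_min, f0_max):
    """Map F0 in Hz onto [0, 1] on a log scale (assumed normalization)."""
    f0_hz = np.asarray(f0_hz, dtype=float)
    span = np.log(f0_max) - np.log(f0_min)
    return (np.log(f0_hz) - np.log(f0_min)) / span


def denormalize_f0(f0_norm, f0_min, f0_max):
    """Inverse of normalize_f0: map [0, 1] back to Hz."""
    span = np.log(f0_max) - np.log(f0_min)
    return np.exp(np.asarray(f0_norm, dtype=float) * span + np.log(f0_min))


def fit_tone_polynomial(contours_norm, degree=3):
    """Fit one polynomial over normalized time [0, 1] to several
    normalized contours of the same (hypothetical) dialect tone."""
    times, values = [], []
    for contour in contours_norm:
        contour = np.asarray(contour, dtype=float)
        times.append(np.linspace(0.0, 1.0, len(contour)))
        values.append(contour)
    return np.polyfit(np.concatenate(times), np.concatenate(values), degree)


def convert_syllable(n_frames_src, coeffs_target, duration_ratio, f0_min, f0_max):
    """Generate a target-dialect F0 contour for one source syllable.

    duration_ratio is an assumed statistic, e.g. mean dialect duration
    divided by mean Mandarin duration for this syllable type.
    """
    n_out = max(2, int(round(n_frames_src * duration_ratio)))
    t = np.linspace(0.0, 1.0, n_out)
    contour_norm = np.clip(np.polyval(coeffs_target, t), 0.0, 1.0)
    return denormalize_f0(contour_norm, f0_min, f0_max)


# Toy example: one rising dialect contour, speaker range 100-400 Hz.
dialect_examples = [np.linspace(0.2, 0.8, 10)]
coeffs = fit_tone_polynomial(dialect_examples, degree=3)
converted = convert_syllable(10, coeffs, duration_ratio=1.5, f0_min=100.0, f0_max=400.0)
print(len(converted), converted[0], converted[-1])
```

In a full system the generated F0 contour and the new duration would be handed to a vocoder such as STRAIGHT to resynthesize the speech; that step is not shown here.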