计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (16): 19-21.

• 博士论坛 • 上一篇    下一篇

语调短语预测中的特征模板自动生成

刘方舟1,陶建华2   

  1. 1.湖南师范大学 数学与计算机科学学院,长沙 410081
    2.中国科学院 自动化研究所 模式识别国家重点实验室,北京 100190
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2011-06-01 发布日期:2011-06-01

Automatic feature template generation for intonational phrase prediction

LIU Fangzhou1,TAO Jianhua2   

  1. 1.College of Mathematics and Computer Science,Hunan Normal University,Changsha 410081,China
    2.National Laboratory of Pattern Recognition,Institute of Automation,Chinese Academy of Science,Beijing 100190,China
  • Received:1900-01-01 Revised:1900-01-01 Online:2011-06-01 Published:2011-06-01

摘要: 在语音合成系统中,语调短语的自动预测是影响合成语音的自然度和可懂度的关键因素之一。采用了最大熵(Maximum Entropy,ME)模型从无限制的文本中预测语调短语,并且提出了一个自动生成特征模板的层次聚类算法,从而减少了最大熵模型训练过程中的人工参与。实验结果表明,对于语调短语预测而言,最大熵模型明显优于分类与回归树(Classification And Regression Trees,CART)。相比手工总结的特征模板,自动生成的特征模板不仅将语调短语预测的F-score提高了3.18%,而且将最大熵模型的大小缩小了78.38%。

关键词: 语调短语, 特征模板, 最大熵(ME), 分类与回归树(CART)

Abstract: In Text-To-Speech(TTS) systems,intonational phrase prediction is important for both the naturalness and intelligibility of synthetic speech.This paper presents a Maximum Entropy(ME) model to predict intonational phrases from unrestricted text.Furthermore,a hierarchical clustering algorithm is proposed for automatic generation of feature templates,which minimizes the need for human supervision during ME model training.Results of comparative experiments show that,for the task of intonational phrase prediction,ME model obviously outperforms Classification And Regression Tree(CART).Compared with manual templates,templates automatically generated by the proposed approach not only make an improvement of 3.18% on the F-score of ME based intonational phrase prediction,but also reduce the size of ME model by up to 78.38%.

Key words: intonational phrase, feature template, Maximum Entropy(ME), Classification And Regression Tree(CART)