Automatic feature template generation for intonational phrase prediction

Computer Engineering and Applications, 2011, Vol. 47, Issue 16: 19-21.

LIU Fangzhou1, TAO Jianhua2

1. College of Mathematics and Computer Science, Hunan Normal University, Changsha 410081, China
2. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

Received: 1900-01-01 Revised: 1900-01-01 Online: 2011-06-01 Published: 2011-06-01

Abstract

In Text-To-Speech (TTS) systems, automatic prediction of intonational phrases is one of the key factors affecting the naturalness and intelligibility of synthetic speech. This paper uses a Maximum Entropy (ME) model to predict intonational phrases from unrestricted text and proposes a hierarchical clustering algorithm that generates feature templates automatically, reducing the human effort required during ME model training. Comparative experiments show that, for intonational phrase prediction, the ME model clearly outperforms Classification And Regression Trees (CART). Compared with manually written templates, the automatically generated templates improve the F-score of ME-based intonational phrase prediction by 3.18% and reduce the size of the ME model by up to 78.38%.

Key words: intonational phrase, feature template, Maximum Entropy (ME), Classification And Regression Tree (CART)
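
The abstract refers to feature templates and to the F-score without defining either. As a rough illustration only, the sketch below shows one common way a template can be expanded into a feature string at a candidate boundary, and how an F-score can be computed from gold and predicted boundary labels. The template format, the attribute names (`pos`, `word`), the `<PAD>` symbol, and both function names are assumptions made for this example; they are not taken from the paper.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical template format: a tuple of (attribute, relative offset) pairs.
Template = Tuple[Tuple[str, int], ...]


def instantiate(template: Template, tokens: Sequence[Dict[str, str]], i: int) -> str:
    """Expand one template into a feature string for the boundary after token i."""
    parts = []
    for attr, offset in template:
        j = i + offset
        # Positions outside the sentence are filled with a padding symbol.
        value = tokens[j][attr] if 0 <= j < len(tokens) else "<PAD>"
        parts.append(f"{attr}[{offset}]={value}")
    return "|".join(parts)


def f_score(gold: List[int], pred: List[int]) -> float:
    """Balanced F-score (harmonic mean of precision and recall) for label 1."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Toy sentence with made-up attributes; 1 marks an intonational phrase boundary.
    tokens = [{"word": "a", "pos": "n"}, {"word": "b", "pos": "v"}]
    template: Template = (("pos", 0), ("pos", 1))
    print(instantiate(template, tokens, 0))
    print(f_score([1, 0, 1, 1], [1, 1, 0, 1]))
```

In a setup like this, each instantiated string would become one binary feature of the ME model, so the number of templates directly affects model size; whether the paper's templates take this form is not stated in the abstract.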

LIU Fangzhou, TAO Jianhua. Automatic feature template generation for intonational phrase prediction[J]. Computer Engineering and Applications, 2011, 47(16): 19-21.
