Automatic feature template generation for intonational phrase prediction

Computer Engineering and Applications ›› 2011, Vol. 47 ›› Issue (16): 19-21.

• Doctoral Forum •


LIU Fangzhou¹, TAO Jianhua²

1. College of Mathematics and Computer Science, Hunan Normal University, Changsha 410081, China
2. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

Received: 1900-01-01   Revised: 1900-01-01   Online: 2011-06-01   Published: 2011-06-01


Abstract

In Text-To-Speech (TTS) systems, intonational phrase prediction is important for both the naturalness and the intelligibility of synthetic speech. This paper presents a Maximum Entropy (ME) model for predicting intonational phrases from unrestricted text. In addition, a hierarchical clustering algorithm is proposed for automatically generating feature templates, which minimizes the need for human supervision during ME model training. Comparative experiments show that, for intonational phrase prediction, the ME model clearly outperforms Classification And Regression Tree (CART). Compared with manual templates, the automatically generated templates not only improve the F-score of ME-based intonational phrase prediction by 3.18%, but also reduce the size of the ME model by up to 78.38%.

Key words: intonational phrase, feature template, Maximum Entropy (ME), Classification And Regression Tree (CART)
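
The abstract does not spell out the template format or the clustering algorithm, so the following is only a minimal sketch of two notions it relies on. It assumes a convention common in ME-based tagging: a feature template is a set of (attribute, relative offset) pairs that is instantiated at each candidate boundary to produce one feature string. The attribute name `pos`, the padding symbol `<PAD>`, and the function names `instantiate` and `f_score` are hypothetical and do not come from the paper. The F-score shown is the standard harmonic mean of precision and recall.

```python
from typing import Dict, List, Sequence, Tuple

# A template is a tuple of (attribute name, relative offset) pairs.
# This representation is an assumption made for illustration only.
Template = Tuple[Tuple[str, int], ...]


def instantiate(template: Template, tokens: Sequence[Dict[str, str]], i: int) -> str:
    """Instantiate a template at position i, padding out-of-range offsets."""
    parts: List[str] = []
    for attr, offset in template:
        j = i + offset
        value = tokens[j][attr] if 0 <= j < len(tokens) else "<PAD>"
        parts.append(f"{attr}[{offset}]={value}")
    return "|".join(parts)


def f_score(gold: Sequence[int], pred: Sequence[int]) -> float:
    """F-score (harmonic mean of precision and recall) over boundary labels (1 = boundary)."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Toy sentence with a hypothetical part-of-speech attribute per token.
    tokens = [{"pos": "n"}, {"pos": "v"}, {"pos": "n"}]
    template: Template = (("pos", 0), ("pos", 1))
    for i in range(len(tokens)):
        print(instantiate(template, tokens, i))
    print(f_score([1, 0, 1, 1], [1, 1, 0, 1]))
```

Under this assumed representation, a smaller template set yields fewer instantiated features, which is one plausible reading of why the reported model size shrinks; the paper itself should be consulted for the actual mechanism.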


LIU Fangzhou, TAO Jianhua. Automatic feature template generation for intonational phrase prediction[J]. Computer Engineering and Applications, 2011, 47(16): 19-21.


