计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (26): 138-140.DOI: 10.3778/j.issn.1002-8331.2008.26.042

• 数据库、信号与信息处理 • 上一篇    下一篇

英语语音合成中基于WordNet的多音词消歧算法

王永生1,李 梅2   

  1. 1.上海同济大学 留德预备部,上海 200092
    2.上海同济大学 外国语学院,上海 200092
  • 收稿日期:2007-10-31 修回日期:2008-01-18 出版日期:2008-09-11 发布日期:2008-09-11
  • 通讯作者: 王永生

Homograph disambiguation algorithm using WordNet in English speech synthesis

WANG Yong-sheng1,LI Mei2   

  1. 1.The German College,Tongji University,Shanghai 200092,China
    2.School of Foreign Languages,Tongji University,Shanghai 200092,China
  • Received:2007-10-31 Revised:2008-01-18 Online:2008-09-11 Published:2008-09-11
  • Contact: WANG Yong-sheng

摘要: 英语中的多音词分成两类,一是因词性不同而读音不同,一是因词义不同而读音不同。前者只需经词性标注,根据其词性标记就可判别其正确的读音。而后者则复杂得多,论文采用了一种基于WordNet语义信息的多音词消歧算法,该算法将多音词的语义信息与上下文中词的语义信息进行匹配,根据匹配结果来判别多音词的读音。

Abstract: English homograph has two types,one is polyphonic because of different part of speech,another is polyphonic because of different senses.The disambiguation of the former is easy to be handled after part-of-speech tagging,while the disambiguation of the latter is more difficult.In this paper,a homograph disambiguation algorithm is proposed using WordNet.In this algorithm,the authors extract semantic words from taxonomy of homograph of its senses and context words,and then compare the two semantic sets.The pronunciation with the maximum score is selected.