Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (26): 138-140.DOI: 10.3778/j.issn.1002-8331.2008.26.042
• 数据库、信号与信息处理 • Previous Articles Next Articles
WANG Yong-sheng1,LI Mei2
Received:
Revised:
Online:
Published:
Contact:
王永生1,李 梅2
通讯作者:
Abstract: English homograph has two types,one is polyphonic because of different part of speech,another is polyphonic because of different senses.The disambiguation of the former is easy to be handled after part-of-speech tagging,while the disambiguation of the latter is more difficult.In this paper,a homograph disambiguation algorithm is proposed using WordNet.In this algorithm,the authors extract semantic words from taxonomy of homograph of its senses and context words,and then compare the two semantic sets.The pronunciation with the maximum score is selected.
摘要: 英语中的多音词分成两类,一是因词性不同而读音不同,一是因词义不同而读音不同。前者只需经词性标注,根据其词性标记就可判别其正确的读音。而后者则复杂得多,论文采用了一种基于WordNet语义信息的多音词消歧算法,该算法将多音词的语义信息与上下文中词的语义信息进行匹配,根据匹配结果来判别多音词的读音。
WANG Yong-sheng1,LI Mei2. Homograph disambiguation algorithm using WordNet in English speech synthesis[J]. Computer Engineering and Applications, 2008, 44(26): 138-140.
王永生1,李 梅2. 英语语音合成中基于WordNet的多音词消歧算法[J]. 计算机工程与应用, 2008, 44(26): 138-140.
0 / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://cea.ceaj.org/EN/10.3778/j.issn.1002-8331.2008.26.042
http://cea.ceaj.org/EN/Y2008/V44/I26/138