计算机工程与应用 ›› 2010, Vol. 46 ›› Issue (19): 148-150.DOI: 10.3778/j.issn.1002-8331.2010.19.043

• 数据库、信号与信息处理 • 上一篇    下一篇

一种改进的词义排歧算法

郭志兵,黄广君,卢朝华   

  1. 河南科技大学电子信息工程学院,河南洛阳471003
  • 收稿日期:2008-12-30 修回日期:2009-03-06 出版日期:2010-07-01 发布日期:2010-07-01
  • 通讯作者: 郭志兵

Modified word sense disambiguation algorithm

GUO Zhi-bing,HUANG Guang-jun,LU Chao-hua   

  1. Electronic Information Engineering College,Henan University of Science & Technology,Luoyang,Henan 471003,China
  • Received:2008-12-30 Revised:2009-03-06 Online:2010-07-01 Published:2010-07-01
  • Contact: GUO Zhi-bing

摘要:

针对传统基于义原同现频率的汉语词义排歧算法的“盲目性”,提出一种“双距离”词义排歧算法,即在计算待排歧词各义项与特征词之间的相关系数时,考虑两个距离因素:特征词与待排歧词之间的空间距离;最近选择该义项的同形歧词与该待排歧词之间的空间距离。实验表明,改进的算法是有效的。

Abstract: There is the fault of blindness in traditionary Chinese word sense disambiguation algorithm based on primitive
co-occurrence data.This thesis puts forward a“Double-Distance”word sense disambiguation algorithm,which considering two
parameters of distance when calculates these relation-modulus between the maltivocal word and the character-words,the
space distance between the character-words and the maltivocal word,the space distance between the currently maltivocal
word and the same maltivocal word which has been selected sense at the latest.The experiment shows that the modified algorithm
are effective.

中图分类号: