Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (19): 142-145.

• Database, Signal and Information Processing •

Research on a Chinese word segmentation system for specialized search engines

WANG Shuo, YOU Feng, SHAN Lan, ZHAO Heng-yong

  1. College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
  • Received: 2008-02-01 Revised: 2008-04-18 Online: 2008-07-01 Published: 2008-07-01
  • Corresponding author: WANG Shuo

Abstract: Building on a study of existing Chinese word segmentation techniques, this paper proposes a Chinese word segmentation system for the chemical engineering domain. It first describes a dictionary mechanism that combines first-character hash indexing with binary search, and then an improved level-by-level shortest-path segmentation algorithm that incorporates a path selection mechanism. Experimental analysis shows that the system resolves ambiguity to some extent while maintaining segmentation efficiency.
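
To make the two mechanisms named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes a dictionary that hashes on the first character and binary-searches the sorted remainders, and a plain fewest-tokens shortest-path segmenter. The names `FirstCharDictionary`, `shortest_path_segment`, and `max_word_len`, as well as the sample words, are hypothetical. The level-by-level refinement and the path selection mechanism described in the paper are not reproduced here.

```python
from bisect import bisect_left
from collections import defaultdict


class FirstCharDictionary:
    """Toy dictionary: hash on the first character, binary search on the rest."""

    def __init__(self, words):
        buckets = defaultdict(set)
        for word in words:
            if word:
                buckets[word[0]].add(word[1:])
        # Each bucket becomes a sorted list of remainders, ready for binary search.
        self._buckets = {head: sorted(tails) for head, tails in buckets.items()}

    def __contains__(self, word):
        if not word:
            return False
        tails = self._buckets.get(word[0])  # hash lookup on the first character
        if tails is None:
            return False
        tail = word[1:]
        i = bisect_left(tails, tail)        # binary search within the bucket
        return i < len(tails) and tails[i] == tail


def shortest_path_segment(text, dictionary, max_word_len=4):
    """Split text into the fewest tokens (plain shortest path over a word graph).

    Single characters are always allowed as tokens, so a path always exists.
    Ties keep the first candidate found; no path selection heuristic is applied.
    """
    n = len(text)
    best = [0] + [n + 1] * n  # best[i]: fewest tokens covering text[:i]
    prev = [0] * (n + 1)      # prev[i]: start index of the last token ending at i
    for end in range(1, n + 1):
        for start in range(max(0, end - max_word_len), end):
            piece = text[start:end]
            if end - start == 1 or piece in dictionary:
                if best[start] + 1 < best[end]:
                    best[end] = best[start] + 1
                    prev[end] = start
    # Walk the back-pointers from the end of the text to recover the tokens.
    tokens = []
    end = n
    while end > 0:
        tokens.append(text[prev[end]:end])
        end = prev[end]
    return tokens[::-1]


# Hypothetical sample vocabulary from the chemical engineering domain.
words = ["化学", "化工", "化学工程", "工程", "反应", "反应器", "催化剂"]
dictionary = FirstCharDictionary(words)
print(shortest_path_segment("化学工程反应器", dictionary))
```

With this sample vocabulary the segmenter prefers the two-token path 化学工程 / 反应器 over longer paths such as 化学 / 工程 / 反应器. Choosing among paths of equal length is where a path selection mechanism like the one the abstract mentions would come in.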