Computer Engineering and Applications ›› 2007, Vol. 43 ›› Issue (23): 184-186.

• Database and Information Processing •

Full-mapping dictionary implemented by single array

WEI Jin,CHANG Chao-wen   

  1. Institute of Electronic Technology,PLA Information Engineering University,Zhengzhou 450004,China
  • Online:2007-08-11 Published:2007-08-11
  • Contact: WEI Jin

单数组全映射分词词典

魏 进,常朝稳   

  1. 解放军信息工程大学 电子技术学院,郑州 450004
  • 通讯作者: 魏 进

Abstract: By studying and analyzing four typical dictionary mechanisms currently used for Chinese word segmentation (binary-seek-by-word, TRIE indexing tree, binary-seek-by-characters, and double-character-hash-indexing), this paper proposes and implements a new dictionary mechanism named Single-Array-Full-Mapping (SAFM). The SAFM dictionary has a simple structure, high segmentation speed, and a small memory footprint.

Key words: Chinese information processing, Chinese word segmentation, dictionary mechanism for Chinese word segmentation, single array full mapping
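For readers unfamiliar with segmentation dictionaries, the following is a minimal sketch of one of the baseline mechanisms named in the abstract, a TRIE-indexed dictionary, used with forward maximum matching. It is not the SAFM structure, whose layout is not described on this page. The names `TrieDictionary`, `longest_match`, and `segment`, and the sample word list, are all illustrative assumptions rather than anything taken from the paper.

```python
# Illustrative sketch only: a TRIE-indexed dictionary with forward
# maximum matching. This is NOT the paper's SAFM structure.

_END = ""  # marker key; cannot collide with any single-character key


class TrieDictionary:
    """Hypothetical TRIE dictionary built from nested dicts."""

    def __init__(self, words=()):
        self.root = {}
        for word in words:
            self.add(word)

    def add(self, word):
        """Insert a word, one character per TRIE level."""
        node = self.root
        for ch in word:
            node = node.setdefault(ch, {})
        node[_END] = True  # mark that a complete word ends here

    def longest_match(self, text, start):
        """Return the length of the longest dictionary word that
        begins at text[start], or 0 if no word matches."""
        node = self.root
        best = 0
        for i in range(start, len(text)):
            node = node.get(text[i])
            if node is None:
                break
            if _END in node:
                best = i - start + 1
        return best


def segment(text, dictionary):
    """Forward maximum matching: repeatedly take the longest word
    found at the current position; unknown characters are emitted
    as single-character tokens."""
    tokens = []
    i = 0
    while i < len(text):
        n = dictionary.longest_match(text, i) or 1
        tokens.append(text[i:i + n])
        i += n
    return tokens


if __name__ == "__main__":
    # Assumed sample vocabulary, for demonstration only.
    d = TrieDictionary(["中文", "信息", "处理", "信息处理", "分词"])
    print(segment("中文信息处理", d))
```

Each lookup in this sketch walks one TRIE level per character, so a single pass finds every dictionary word starting at a given position. The abstract compares SAFM against this kind of structure on segmentation speed and memory use.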
