Computer Engineering and Applications ›› 2012, Vol. 48 ›› Issue (3): 146-150.

• 数据库、信号与信息处理 • Previous Articles     Next Articles

Reaserch of semantic retrieval method for domain knowledge documents

QI Baoyuan1,2, CAO Cungen2, ZHENG Yufei2, YUE Jinpeng2,3   

  1. 1.Joint Institute of Computer Science, Capital Normal University, Beijing 100037, China
    2.Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China
    3.Graduate University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:1900-01-01 Revised:1900-01-01 Online:2012-01-21 Published:2012-01-21

领域知识文档的语义检索方法研究

齐保元1,2,曹存根2,郑宇飞2,岳金朋2,3   

  1. 1.首都师范大学 计算机科学联合研究院,北京 100037
    2.中国科学院 计算技术研究所 智能信息处理重点实验室,北京 100080
    3.中国科学院 研究生院,北京 100049

Abstract: Semantic retrieval is a very potential approach in improving the accuracy of information retrieval and satisfying the customized requirements. This paper annotates the documents with thesaurus and then builds a bi-level indexing structure which from thesaurus element to thesaurus and from thesaurus to document. It makes a conversion from users’ query to the thesaurus, then calculates the semantic similarity in the thesaurus network. After doing that, the documents will be sorted in order. The system has been deployed and applied in a company, making the top 5 results hit 90% of users’ queries. Experimental results show that the method is effective for semantic retrieval.

Key words: sementic retrieval, knowledge document, thesaurus, similarity

摘要: 语义检索是解决信息检索中准确度、人性化要求的一个非常有潜力的方法。通过对知识文档进行主题词标注,然后建立从词元→主题词→知识文档的二级索引结构;对用户的检索,进行查询词到主题词的转化,计算语义相似度,按照语义相似度算法进行排序文档。目前基于知识文档的语义检索系统已经在某集团公司进行部署和应用,取得了前5项结果命中用户总查询90%的效果,说明这种方法是语义检索的一种有效途径。

关键词: 语义检索, 知识文档, 主题词, 相似度