计算机工程与应用 ›› 2009, Vol. 45 ›› Issue (34): 147-151.DOI: 10.3778/j.issn.1002-8331.2009.34.045

• 数据库、信号与信息处理 • 上一篇    下一篇

基于场景信息融合的中文姓名识别方法研究

张腾飞,王晓磊,王保云   

  1. 南京邮电大学 自动化学院,南京 210046
  • 收稿日期:2009-09-17 修回日期:2009-11-16 出版日期:2009-12-01 发布日期:2009-12-01
  • 通讯作者: 张腾飞

Research of Chinese name identification method based on scene information fusion

ZHANG Teng-fei,WANG Xiao-lei,WANG Bao-yun   

  1. College of Automation,Nanjing University of Posts and Telecommunications,Nanjing 210046,China
  • Received:2009-09-17 Revised:2009-11-16 Online:2009-12-01 Published:2009-12-01
  • Contact: ZHANG Teng-fei

摘要: 为克服传统的先分词再识别方法的缺点,提出了一种基于场景信息融合的姓名识别方法。该方法结合中文姓名的特点,综合考虑上下文信息、词本身信息、词典信息和姓名自身信息等场景资源对中文名实体的影响,将它们作为姓名识别的依据,同时引入了证据理论,通过场景资源信息的融合,最终识别出人名。通过对互联网上随机抽取的大规模真实语料的开放测试表明,该方法可以取得较高的召回率并同时保证较高的准确率。

Abstract: To overcome the defects of traditional name identification algorithms with automatic segmentation at first,a name identification method based on scene information fusion is presented.Combining the characteristics of Chinese names,the scene information,such as the context,word,dictionary,names,is used as the basis of name identification.And then,the evidence theory is introduced,and the names are identified by scene information fusion.The open tests on real data sets randomly selected from the internet show that it is an effective method to improve the result of the identification with high recall rate and accuracy rate are guaranteed.

中图分类号: