计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (25): 143-145.DOI: 10.3778/j.issn.1002-8331.2008.25.043

• 数据库、信号与信息处理 • 上一篇    下一篇

基于框架语义标注的自由文本信息抽取研究

牛之贤,白鹏洲,段 富   

  1. 太原理工大学 计算机与软件学院,太原 030024
  • 收稿日期:2007-10-30 修回日期:2008-01-28 出版日期:2008-09-01 发布日期:2008-09-01
  • 通讯作者: 牛之贤

Free text information extraction based on frame semantic tagging

NIU Zhi-xian,BAI Peng-zhou,DUAN Fu   

  1. College of Computer and Software,Taiyuan University of Technology,Taiyuan 030024,China
  • Received:2007-10-30 Revised:2008-01-28 Online:2008-09-01 Published:2008-09-01
  • Contact: NIU Zhi-xian

摘要: 信息抽取是从自由文本语料库构建数据库,实现信息自动收集的有效途径之一。提出了一种以框架语义标注为基础构建信息抽取规则的信息抽取方法。基于框架语义标注的信息抽取是用统一的方法来指导信息抽取过程。这种方法具有较细的处理粒度,对语义规则性强的领域有一定的普遍适用性。设计了基于框架语义的BAIE(图书内容简介信息抽取)系统,并对图书的内容简介试行信息抽取。抽取结果表明,基于框架语义的信息抽取方式有一定的可行性和适用性。

关键词: 信息抽取, 框架语义, 抽取规则

Abstract: Information extraction is a main approach for constructing database from free text corpus and for automatic collecting information.Frame semantic tagging is suggested to be the base for rule-building in information extraction.Information extraction based on frame semantic tagging uses a uniform approach to guide the process of information extraction.Processing at a finer granularity level,the method has a universal appeal for information extraction in domains showing strong semantic rules.A system called BAIE(Book Abstract Information Extraction system),which is based on frame semantic,is designed and used to extract information from book abstract.The result shows that the approach is feasible and has practical promise.

Key words: information extraction, frame semantic, extraction rules