计算机工程与应用 ›› 2007, Vol. 43 ›› Issue (2): 180-180.

• 数据库与信息处理 • 上一篇    下一篇

多文档文摘评价标准的研究

魏继增,孙济洲,秦兵   

  1. 天津大学电子信息工程学院计算机系IBM中心
  • 收稿日期:2006-01-24 修回日期:1900-01-01 出版日期:2007-01-11 发布日期:2007-01-11
  • 通讯作者: 魏继增 weijizeng

The Research of the Standard of the Evaluation of Multi-Document Summarization

JiZeng Wei,,   

  1. 天津大学电子信息工程学院计算机系IBM中心
  • Received:2006-01-24 Revised:1900-01-01 Online:2007-01-11 Published:2007-01-11
  • Contact: JiZeng Wei

摘要: 多文档自动文摘是自然语言处理领域的一个重要研究方向。但对于多文档文摘的评价方法仍然存在方法单一,缺乏统一标准的问题。本文针对这些问题,就多文档文摘信息覆盖度尝试性地提出一套标准。该标准将涉及以下几个重要参数:改进BLEU参数(改进召回率),与原文档有效词覆盖度,高频词覆盖度。实验证明利用该标准能准确反映出文摘系统在信息覆盖度方面的优劣,并且接近人工评价结果。

关键词: BLEU, 高频词覆盖度, 有效词覆盖度, 召回率

Abstract: Multi-Document Automatic Summarization is an important branch of Natural Language Understanding. But the methods of evaluation of the Multi-Document Automatic Summarization also have many problems which are single and lack of uniform standard. The investigative point in this text is to attempt to give a standard aiming at the covered rate of information of Multi-Document Automatic Summarization. This standard will use a few of parameters: improved BLEU parameter (recall), covered rate of effective phrase with original documents, high frequency phrase covered rate. The experiments have indicated this standard can reflect the covered rate of information of summarization system good or bad, and whether it is near to artificial evaluation results.

Key words: BLEU, high frequency phrase covered rate, covered rate of effective phrase, recall