多文档文摘评价标准的研究

计算机工程与应用 ›› 2007, Vol. 43 ›› Issue (2): 180-180.

多文档文摘评价标准的研究

魏继增,孙济洲,秦兵

天津大学电子信息工程学院计算机系IBM中心

收稿日期:2006-01-24 修回日期:1900-01-01 出版日期:2007-01-11 发布日期:2007-01-11
通讯作者: 魏继增 weijizeng

The Research of the Standard of the Evaluation of Multi-Document Summarization

JiZeng Wei,,

天津大学电子信息工程学院计算机系IBM中心

Received:2006-01-24 Revised:1900-01-01 Online:2007-01-11 Published:2007-01-11
Contact: JiZeng Wei

摘要/Abstract

摘要： 多文档自动文摘是自然语言处理领域的一个重要研究方向。但对于多文档文摘的评价方法仍然存在方法单一，缺乏统一标准的问题。本文针对这些问题，就多文档文摘信息覆盖度尝试性地提出一套标准。该标准将涉及以下几个重要参数：改进BLEU参数（改进召回率），与原文档有效词覆盖度，高频词覆盖度。实验证明利用该标准能准确反映出文摘系统在信息覆盖度方面的优劣，并且接近人工评价结果。

关键词: BLEU, 高频词覆盖度, 有效词覆盖度, 召回率

Abstract: Multi-Document Automatic Summarization is an important branch of Natural Language Understanding. But the methods of evaluation of the Multi-Document Automatic Summarization also have many problems which are single and lack of uniform standard. The investigative point in this text is to attempt to give a standard aiming at the covered rate of information of Multi-Document Automatic Summarization. This standard will use a few of parameters: improved BLEU parameter (recall), covered rate of effective phrase with original documents, high frequency phrase covered rate. The experiments have indicated this standard can reflect the covered rate of information of summarization system good or bad, and whether it is near to artificial evaluation results.

Key words: BLEU, high frequency phrase covered rate, covered rate of effective phrase, recall

魏继增,孙济洲,秦兵. 多文档文摘评价标准的研究[J]. 计算机工程与应用, 2007, 43(2): 180-180.

JiZeng Wei,,. The Research of the Standard of the Evaluation of Multi-Document Summarization[J]. Computer Engineering and Applications, 2007, 43(2): 180-180.

[1]	郑行家，钟宝江. 图像直线段检测算法综述与测评[J]. 计算机工程与应用, 2019, 55(17): 9-19.
[2]	肖文强，姚世军，吴善明. 基于用户谱聚类的Top-N协同过滤推荐算法[J]. 计算机工程与应用, 2018, 54(7): 138-143.
[3]	周法国1，吴锡坤1，孙泰2，孙镇2. 基于转移学习的中文命名实体识别[J]. 计算机工程与应用, 2018, 54(5): 117-121.
[4]	刘颖，姜巍. 统计机器翻译中翻译规则抽取[J]. 计算机工程与应用, 2012, 48(32): 98-101.
[5]	郑继明1，张萍2. 基于小波变换的音频分割[J]. 计算机工程与应用, 2011, 47(7): 139-142.
[6]	宋军涛¹,周铜²,杜庆灵¹. 支持向量机和蚁群算法的网页分类研究[J]. 计算机工程与应用, 2009, 45(17): 122-124.
[7]	于屏方,杜家利. 文本排歧语义图式的自动获取与选择[J]. 计算机工程与应用, 2007, 43(31): 169-171.
[8]	齐振宇,许静,李吉屹,史广顺. 掌纹鉴别自动性能评估[J]. 计算机工程与应用, 2007, 43(28): 214-216.
[9]	林海霞原福永陈金森刘俊峰. 一种改进的主题网络蜘蛛搜索算法 [J]. 计算机工程与应用, 2007, 43(10): 174-176.