计算机工程与应用 ›› 2010, Vol. 46 ›› Issue (34): 95-98.DOI: 10.3778/j.issn.1002-8331.2010.34.030

• 网络、通信、安全 • 上一篇    下一篇

应用精确代价因子的两层邮件过滤模型

王 涛1,裘国永1,冯 涛2   

  1. 1.陕西师范大学 计算机科学学院,西安 710062
    2.陕西师范大学 杂志社,西安 710062
  • 收稿日期:2009-04-14 修回日期:2009-06-12 出版日期:2010-12-01 发布日期:2010-12-01
  • 通讯作者: 王 涛

Bi-layer email filtering model based on smart cost factor

WANG Tao1,QIU Guo-yong1,FENG Tao2   

  1. 1.School of Computer Science,Shaanxi Normal University,Xi’an 710062,China
    2.Magazine Press of Shaanxi Normal University,Xi’an 710062,China
  • Received:2009-04-14 Revised:2009-06-12 Online:2010-12-01 Published:2010-12-01
  • Contact: WANG Tao

摘要: 分析了一种基于直线几何分割的朴素贝叶斯邮件过滤模型LGDNBF,用更为精确的代价因子描述了分类器误判的代价。定义了高风险决策区域,对高风险决策区域中的邮件引入SVM方法进行二次分类,提出了基于精确代价因子的两层邮件过滤模型。在中文邮件语料集上的实验结果证明了这一两层过滤模型的分类效果较之朴素贝叶斯邮件过滤模型有明显的改进。

Abstract: A new Naïve Bayes Filtering model based on Line Geometry Division(LGDNBF) is analyzed.Cost of false positive classification is described by smarter cost factor.With definition of the high risk area,the paper introduces SVM to classify second time,and puts forward a bi-layer email filtering model based on smart cost factor.The test on Chinese email corpus demonstrates that the new model has better performance than Naïve Bayes Filtering model.

中图分类号: