Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (33): 105-107.DOI: 10.3778/j.issn.1002-8331.2008.33.032
• 网络、通信、安全 • Previous Articles Next Articles
YAN Peng1,2,ZHENG Xue-feng1,LI Ming-xiang1,CHEN Song-hua2
Received:
Revised:
Online:
Published:
Contact:
闫 鹏1,2,郑雪峰1,李明祥1,陈松华2
通讯作者:
Abstract: FS(Feature Selection) is a basal but crucial step within anti-spam classifiers based on ML(Machine Learning) algorithms.Nowadays FS based on Mutual Information(MI) is widely used.In this paper,by analyzing characteristic of spam emails,a new FS approach based on Bayes reasoning is presented.Experiments show that it can achieve much higher performance and efficiency than MI approach.
摘要: 在各种基于机器学习的垃圾邮件过滤系统中,特征选择是基础且非常关键的一个环节,它对整个系统的性能和效率都有直接的影响。通过对垃圾邮件特点的分析,提出了一种基于贝叶斯推理的特征选择评估函数方法。新方法运算开销较小,且能够区分出不同的特征词在体现垃圾邮件特征时所存在的差异性,因而在进行特征选择时较其它常用方法更具针对性,非常利于提高过滤系统的准确性和运行效率。
YAN Peng1,2,ZHENG Xue-feng1,LI Ming-xiang1,CHEN Song-hua2. Feature selection approach based on Bayes reasoning in anti-spam classifier[J]. Computer Engineering and Applications, 2008, 44(33): 105-107.
闫 鹏1,2,郑雪峰1,李明祥1,陈松华2. 关于贝叶斯推理的垃圾邮件特征选择评估函数[J]. 计算机工程与应用, 2008, 44(33): 105-107.
0 / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://cea.ceaj.org/EN/10.3778/j.issn.1002-8331.2008.33.032
http://cea.ceaj.org/EN/Y2008/V44/I33/105