Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (35): 126-128. DOI: 10.3778/j.issn.1002-8331.2009.35.038
• Database, Signal and Information Processing •
CHENG Chun-hui, HE Qin-ming
程春惠,何钦铭
Abstract: Based on the characteristics of criminal case text, this paper proposes a targeted text preprocessing method and compares two effective feature selection methods. Because criminal case categories are unevenly distributed, an improved model based on the multi-variate Bernoulli model is proposed. Experiments indicate that the improved multi-variate Bernoulli model effectively raises the accuracy of case text classification.
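For readers unfamiliar with the baseline, the following is a minimal sketch of a standard multi-variate Bernoulli Naive Bayes classifier, in which each document is represented only by the presence or absence of vocabulary terms. The abstract does not describe the authors' improvement, so none is reproduced here. The function names, the toy data, and the `uniform_prior` flag are assumptions made for illustration; the flag shows one generic way to reduce bias toward large classes and should not be read as the method proposed in the paper.

```python
import math
from collections import defaultdict


def train_bernoulli_nb(docs, labels, vocabulary, uniform_prior=False):
    """Fit a multi-variate Bernoulli Naive Bayes model.

    docs: list of sets of terms (one set per document)
    labels: list of class labels, aligned with docs
    vocabulary: set of terms kept after feature selection
    uniform_prior: hypothetical option that replaces the empirical class
        prior with a uniform one (an illustrative choice, not the paper's)
    """
    classes = sorted(set(labels))
    class_counts = {c: 0 for c in classes}
    term_counts = {c: defaultdict(int) for c in classes}
    for terms, label in zip(docs, labels):
        class_counts[label] += 1
        for t in terms & vocabulary:
            term_counts[label][t] += 1

    model = {}
    for c in classes:
        if uniform_prior:
            prior = 1.0 / len(classes)
        else:
            prior = class_counts[c] / len(docs)
        # Laplace-smoothed estimate of P(term present | class).
        cond = {
            t: (term_counts[c][t] + 1) / (class_counts[c] + 2)
            for t in vocabulary
        }
        model[c] = (math.log(prior), cond)
    return model


def classify(model, terms):
    """Return the class with the highest log-posterior for a set of terms."""
    best_class, best_score = None, -math.inf
    for c, (log_prior, cond) in model.items():
        score = log_prior
        for t, p in cond.items():
            # Absent terms contribute too: that is what makes it Bernoulli.
            score += math.log(p) if t in terms else math.log(1.0 - p)
        if score > best_score:
            best_class, best_score = c, score
    return best_class
```

Unlike the multinomial model, every vocabulary term contributes to the score whether or not it occurs in the document, which is why term absence is modeled explicitly with `1.0 - p`.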
CLC Number: TP301.6
CHENG Chun-hui, HE Qin-ming. Naive Bayes based criminal text classification of unbalanced classes[J]. Computer Engineering and Applications, 2009, 45(35): 126-128.
程春惠,何钦铭. 面向不均衡类别朴素贝叶斯犯罪案件文本分类[J]. 计算机工程与应用, 2009, 45(35): 126-128.
URL: http://cea.ceaj.org/EN/10.3778/j.issn.1002-8331.2009.35.038
http://cea.ceaj.org/EN/Y2009/V45/I35/126