一种基于支持向量阈值控制的优化增量SVM算法

计算机工程与应用 ›› 2015, Vol. 51 ›› Issue (3): 124-128.

• 数据库、数据挖掘、机器学习 • 上一篇下一篇

一种基于支持向量阈值控制的优化增量SVM算法

刘伟，谢兴生，肖超峰

中国科学技术大学自动化系，合肥 230027

出版日期:2015-02-01 发布日期:2015-01-28

Optimized incremental SVM algorithm based on support vector threshold control

LIU Wei, XIE Xingsheng, XIAO Chaofeng

Department of Automation, University of Science and Technology of China, Hefei 230027, China

Online:2015-02-01 Published:2015-01-28

摘要/Abstract

摘要： 针对I-SVM算法在文本分类中训练时间较长和分类效率低的问题，提出了一种基于支持向量（SV）阀值控制的优化I-SVM算法（TI-SVM）。由于在增量训练样本集中存在大量的非SV，TI-SVM算法根据历史训练模型和KKT条件对新增样本集和历史样本集进行预处理，剔除大部分的非SV，根据预处理后的样本集进行训练新的SVM模型，利用文本的相似度和预设SV的阀值对模型中的冗余SV进一步处理，以提高分类性能。经过对一组客户新闻分类的实验表明，该算法在保证分类精度的同时有效提高了模型的训练和分类效率。

关键词: 支持向量机, 机器学习, 文本分类, 分类模型, KKT条件

Abstract: With information constantly updating and sample collecting, the classification performance and accuracy of initial training model using I-SVM is of low efficiency and costs long time. To solve this problem, this paper describes a growing Incremental Supported Vector Machine algorithm（I-SVM） based on support vector threshold control optimization. The TI-SVM algorithm removes most of the non-support vector which aims at new sample sets and the historical sample set that are based on historical training model and the KKT conditions pretreatment. According to the sample after the pretreatment set, this algorithm trains a new SVM model. It takes vantage of the similarity of the text and the default threshold of support vector system to give a further treatment to the redundancy of support vector and to improve the classification performance. The theoretical analysis and experimental results show that the algorithm is effective with a high classification accuracy.

Key words: Support Vector Machine（SVM）, machine learning, text classification, model of classification, Karush-Kuhn-Tucker（KKT）

刘伟，谢兴生，肖超峰. 一种基于支持向量阈值控制的优化增量SVM算法[J]. 计算机工程与应用, 2015, 51(3): 124-128.

LIU Wei, XIE Xingsheng, XIAO Chaofeng. Optimized incremental SVM algorithm based on support vector threshold control[J]. Computer Engineering and Applications, 2015, 51(3): 124-128.

[1]	冉蓉，徐兴华，邱少华，崔小鹏，欧阳斌. 基于深度卷积神经网络的裂纹检测方法综述[J]. 计算机工程与应用, 2021, 57(9): 23-35.
[2]	高一锴，彭力，徐龙壮. 改进AFSA算法优化TWSVM的火焰识别方法[J]. 计算机工程与应用, 2021, 57(8): 204-213.
[3]	韦佶宏，郑荣锋，刘嘉勇. 基于混合神经网络的恶意TLS流量识别研究[J]. 计算机工程与应用, 2021, 57(7): 107-114.
[4]	韩卫宇，程龙生. 结合马田系统-SVM的滚动轴承故障模式分类研究[J]. 计算机工程与应用, 2021, 57(6): 239-246.
[5]	霍光煜，张勇，孙艳丰，尹宝才. 基于语义的档案数据智能分类方法研究[J]. 计算机工程与应用, 2021, 57(6): 247-253.
[6]	张晓丽，张魁星，江梅，魏本征，丛金玉. 淋巴瘤图像分类技术研究综述[J]. 计算机工程与应用, 2021, 57(6): 1-9.
[7]	韩东方，吐尔地·托合提，艾斯卡尔·艾木都拉. 问答系统中问句分类方法研究综述[J]. 计算机工程与应用, 2021, 57(6): 10-21.
[8]	黄金杰，蔺江全，何勇军，何瑾洁，王雅君. 局部语义与上下文关系的中文短文本分类算法[J]. 计算机工程与应用, 2021, 57(6): 94-100.
[9]	万梦翔，姚寒冰. 面向恶意网页训练数据生成的GAN模型[J]. 计算机工程与应用, 2021, 57(6): 124-130.
[10]	杨晔民，张慧军，张小龙. 随机森林的可解释性可视分析方法研究[J]. 计算机工程与应用, 2021, 57(6): 168-175.
[11]	雷恒林，古兰拜尔·吐尔洪，买日旦·吾守尔，张东梅. 新奇检测综述[J]. 计算机工程与应用, 2021, 57(5): 47-55.
[12]	温杰彬，杨文忠，马国祥，张志豪，李海磊. 基于Apex帧光流和卷积自编码器的微表情识别[J]. 计算机工程与应用, 2021, 57(4): 127-133.
[13]	郑诚，董春阳，黄夏炎. 基于BTM图卷积网络的短文本分类方法[J]. 计算机工程与应用, 2021, 57(4): 155-160.
[14]	徐可文，许波，吴英，徐浩然. 机器学习在超声图像中的应用综述[J]. 计算机工程与应用, 2021, 57(4): 11-17.
[15]	王振东，张林，李大海. 基于机器学习的物联网入侵检测系统综述[J]. 计算机工程与应用, 2021, 57(4): 18-27.

一种基于支持向量阈值控制的优化增量SVM算法

Optimized incremental SVM algorithm based on support vector threshold control

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics