SVM accelerated training algorithm based on border sample selection

Computer Engineering and Applications, 2017, Vol. 53, Issue (3): 169-173. DOI: 10.3778/j.issn.1002-8331.1507-0245

HU Xiaosheng, ZHONG Yong

College of Electronic and Information Engineering, Foshan University, Foshan, Guangdong 528000, China

Online:2017-02-01 Published:2017-05-11

Abstract

Abstract: Support Vector Machine (SVM) is a powerful tool for pattern classification, but on large-scale data sets it suffers from long training times, high computational cost, and degraded generalization. To address these problems, an accelerated SVM training algorithm based on boundary sample selection is proposed. First, unsupervised K-means clustering is applied to the initial training data. Then, within each cluster, the K-nearest neighbor algorithm is used to discard non-boundary samples; two cluster factors, the degree of mixing and the degree of support, determine the boundary width. The samples that remain in the class boundary regions are used to train the SVM classifier. Experiments on benchmark data sets show that the proposed method significantly reduces model training time while preserving the classification generalization ability of the traditional SVM.

Key words: Support Vector Machine (SVM), large-scale classification, boundary samples, clustering
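
The sample selection step outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define the mixing and support factors, so the sketch assumes a much simpler rule (a fixed neighbor count, and a sample is kept only if one of its nearest neighbors inside the same cluster carries a different label). The function names `kmeans` and `select_boundary`, their parameters, and the toy data are all hypothetical.

```python
import math
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's K-means; returns a cluster index for every point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    assign = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by Euclidean distance.
        assign = [min(range(k), key=lambda c: math.dist(p, centers[c]))
                  for p in points]
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:
                centers[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return assign


def select_boundary(points, labels, n_clusters=2, n_neighbors=2, seed=0):
    """Return sorted indices of samples assumed to lie near a class boundary.

    Assumption (not from the paper): a sample is a boundary sample if any of
    its n_neighbors nearest neighbors within its own cluster has another label.
    """
    assign = kmeans(points, n_clusters, seed=seed)
    keep = []
    for c in range(n_clusters):
        idx = [i for i, a in enumerate(assign) if a == c]
        for i in idx:
            # Nearest neighbors of sample i, restricted to its own cluster.
            near = sorted((j for j in idx if j != i),
                          key=lambda j: math.dist(points[i], points[j]))
            if any(labels[j] != labels[i] for j in near[:n_neighbors]):
                keep.append(i)
    return sorted(keep)


# Toy data: two well-separated groups on a line, each split between two classes.
xs = list(range(10)) + list(range(100, 110))
points = [(float(x), 0.0) for x in xs]
labels = [0 if x % 100 < 5 else 1 for x in xs]

kept = select_boundary(points, labels)
print(kept)  # [4, 5, 14, 15]: only the samples next to a label change remain
```

In this toy run 4 of the 20 samples survive, and only those would be passed to the SVM trainer (for example `sklearn.svm.SVC().fit(...)` on the kept rows). Note that under this simplified rule a cluster containing a single class contributes no samples at all, which is one reason a real implementation would need cluster-level factors such as the ones the abstract mentions.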

HU Xiaosheng, ZHONG Yong. SVM accelerated training algorithm based on border sample selection[J]. Computer Engineering and Applications, 2017, 53(3): 169-173.
