SVM accelerated training algorithm based on border sample selection

doi:10.3778/j.issn.1002-8331.1507-0245

Abstract

Abstract: The support vector machine (SVM) is a powerful tool for pattern classification, but it is poorly suited to large-scale data because of its slow training, high computational cost, and reduced generalization ability. An accelerated SVM training algorithm is proposed that uses only the training samples lying close to the separating boundary. First, K-means clustering is performed on the initial training data. Then, within each cluster, the K-nearest neighbor algorithm selects the boundary samples; two cluster factors, the degree of mixing and the support, are defined to determine the boundary width. These boundary samples are then used to train the SVM classifier. Experiments on several benchmark datasets show that the proposed method reduces computational complexity while preserving the classification performance of the traditional SVM.

Key words: Support Vector Machine (SVM), large-scale classification, boundary samples, clustering

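Abstract: To address the long training time and degraded generalization ability of the support vector machine (SVM) on large-scale datasets, an accelerated SVM algorithm based on boundary sample selection is proposed. First, unsupervised K-means clustering is performed. Then, within each cluster, the K-nearest neighbor algorithm is applied, guided by the cluster's degree of mixing and support, to remove non-boundary samples; the remaining samples from the class boundary region are used to train the SVM model. Experimental results on standard datasets show that the algorithm significantly reduces model training time while maintaining the classification generalization ability of the traditional SVM.

Key words: support vector machine, large-scale classification, boundary samples, clustering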
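
The selection step described in the abstract (cluster first, then keep only samples whose nearest neighbors inside the cluster include another class) might be sketched roughly as follows. This is an illustrative sketch and not the authors' implementation: the abstract does not give the definitions of the degree of mixing or the support, so they are not reproduced here. Instead, a cluster containing a single class is simply skipped, and a fixed neighbor count stands in for the boundary width. The names `kmeans`, `select_border_samples`, `n_clusters`, and `n_neighbors` are all hypothetical.

```python
import math
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns a cluster index for every point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    assign = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center.
        assign = [min(range(k), key=lambda c: math.dist(p, centers[c]))
                  for p in points]
        # Move each center to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old center if a cluster empties out
                centers[c] = tuple(sum(col) / len(members)
                                   for col in zip(*members))
    return assign


def select_border_samples(points, labels, n_clusters=2, n_neighbors=3, seed=0):
    """Return sorted indices of samples judged to lie near the class boundary."""
    assign = kmeans(points, n_clusters, seed=seed)
    keep = []
    for c in range(n_clusters):
        idx = [i for i, a in enumerate(assign) if a == c]
        # Assumption: a single-class cluster is treated as far from the boundary.
        if len({labels[i] for i in idx}) < 2:
            continue
        for i in idx:
            # Other members of the same cluster, nearest first.
            others = sorted((j for j in idx if j != i),
                            key=lambda j: math.dist(points[i], points[j]))
            # Keep the sample if any of its nearest neighbors has another label.
            if any(labels[j] != labels[i] for j in others[:n_neighbors]):
                keep.append(i)
    return sorted(keep)


# Toy data: two classes on a line, meeting between x = 4 and x = 5.
points = [(float(x), 0.0) for x in range(10)]
labels = [0] * 5 + [1] * 5
border = select_border_samples(points, labels, n_clusters=1, n_neighbors=2)
print(border)  # [4, 5]
```

The returned indices would then be used to build the reduced training set passed to an SVM trainer of choice. Note that under the simplification used here, a dataset whose clusters are all single-class yields no samples at all, a case that a complete method would need to handle.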

HU Xiaosheng, ZHONG Yong. SVM accelerated training algorithm based on border sample selection[J]. Computer Engineering and Applications, 2017, 53(3): 169-173.

胡小生，钟勇. 基于边界样本选择的支持向量机加速算法[J]. 计算机工程与应用, 2017, 53(3): 169-173.

