Improved fast classifier based on SVM and density clustering

doi:10.3778/j.issn.1002-8331.2011.02.042

Computer Engineering and Applications, 2011, Vol. 47, Issue (2): 136-138. DOI: 10.3778/j.issn.1002-8331.2011.02.042

• Database, Signal and Information Processing •

ZHANG Zhenzhen, DONG Cailin, CHEN Zengzhao, HE Xiuling

School of Mathematics and Statistics, Huazhong Normal University, Wuhan 430079, China

Received: 2009-05-07 Revised: 2009-07-06 Online: 2011-01-11 Published: 2011-01-11
Contact: ZHANG Zhenzhen

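改进的结合密度聚类的SVM快速分类方法 (An improved fast SVM classification method combined with density clustering)

张珍珍, 董才林, 陈增照, 何秀玲

School of Mathematics and Statistics, Huazhong Normal University, Wuhan 430079

Corresponding author: 张珍珍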

Abstract

Abstract: To address the difficulty of applying SVM to large-scale data sets in practice, this paper proposes a method that speeds up training by reducing the data set. The algorithm has two steps. First, density clustering finds samples that can each represent a local region, and an SVM trained on this reduced data set yields a set of support vectors. Second, a new training set is assembled from those support vectors together with the samples that belong to the regions they represent. Simulations show that the proposed algorithm improves classification speed while keeping the accuracy rate acceptable.

Key words: density clustering, Support Vector Machine (SVM) algorithm, fast classification, large data sets

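Abstract (translated from the Chinese version): To address the problem that the optimization SVM must solve becomes too large when classifying large-scale data, a method is proposed that reduces the data set in order to speed up training. In the first step, a density-based method roughly locates the points that can represent a local region, and an SVM is then trained on the reduced data to obtain a set of support vectors. In the second step, the training data consist of the support vectors and the sample points they represent. Simulation experiments show that the algorithm effectively improves classification speed while preserving classification accuracy.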

Key words (translated from the Chinese version): density clustering, SVM algorithm, fast classification, large data sets
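The two steps described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not specify the density clustering variant or the SVM solver, so the code assumes a greedy neighbour-count reduction in place of the paper's density clustering and a linear SVM trained by batch subgradient descent in place of the paper's solver. All function names (`density_regions`, `train_linear_svm`, `two_stage_svm`), the parameters `eps` and `tol`, and the toy grid data are hypothetical.

```python
import math

def density_regions(points, eps):
    """Greedy density reduction (assumed stand-in for density clustering):
    the unassigned point with the most unassigned neighbours within eps
    becomes the representative of those neighbours."""
    unassigned = set(range(len(points)))
    regions = {}  # representative index -> indices of the samples it represents
    while unassigned:
        order = sorted(unassigned)
        def density(i):
            return sum(math.dist(points[i], points[j]) <= eps for j in order)
        rep = max(order, key=density)
        regions[rep] = [j for j in order if math.dist(points[rep], points[j]) <= eps]
        unassigned -= set(regions[rep])
    return regions

def dot(w, x):
    return sum(a * c for a, c in zip(w, x))

def train_linear_svm(X, y, lam=0.01, lr=0.01, epochs=500):
    """Batch subgradient descent on the L2-regularised hinge loss (labels +1/-1)."""
    n, w, b = len(X), [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        # Samples that currently violate the margin contribute to the gradient.
        bad = [(xi, yi) for xi, yi in zip(X, y) if yi * (dot(w, xi) + b) < 1]
        w = [wk - lr * (lam * wk - sum(yi * xi[k] for xi, yi in bad) / n)
             for k, wk in enumerate(w)]
        b += lr * sum(yi for _, yi in bad) / n
    return w, b

def two_stage_svm(X, y, eps, tol=0.05):
    # Step 1: pick density representatives per class and train on them only.
    regions = {}
    for cls in sorted(set(y)):
        idx = [i for i, yi in enumerate(y) if yi == cls]
        for rep, members in density_regions([X[i] for i in idx], eps).items():
            regions[idx[rep]] = [idx[j] for j in members]
    reps = list(regions)
    w, b = train_linear_svm([X[i] for i in reps], [y[i] for i in reps])
    # Approximate support vectors: representatives on or inside the margin.
    # max(...) keeps the closest one even if the solver has not fully converged.
    margins = {i: y[i] * (dot(w, X[i]) + b) for i in reps}
    limit = max(1.0, min(margins.values())) + tol
    sv = [i for i in reps if margins[i] <= limit]
    # Step 2: retrain on the support vectors plus the samples they represent.
    keep = sorted({j for i in sv for j in regions[i]})
    w, b = train_linear_svm([X[i] for i in keep], [y[i] for i in keep])
    return w, b, keep

def predict(w, b, x):
    return 1 if dot(w, x) + b >= 0 else -1

# Toy data (invented for illustration): two mirrored 6x6 grids.
pos = [(float(i), float(j)) for i in range(1, 7) for j in range(1, 7)]
neg = [(-a, -c) for a, c in pos]
X, y = pos + neg, [1] * len(pos) + [-1] * len(neg)
w, b, keep = two_stage_svm(X, y, eps=1.5)
accuracy = sum(predict(w, b, x) == t for x, t in zip(X, y)) / len(X)
print(len(X), len(keep), accuracy)
```

The final classifier is trained only on the samples in `keep`, the regions nearest the class boundary, which is where the speed-up claimed in the abstract would come from. How much accuracy is retained on real data depends on the choice of `eps` and on the kernel, neither of which the abstract states.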

CLC Number: O235

ZHANG Zhenzhen, DONG Cailin, CHEN Zengzhao, HE Xiuling. Improved fast classifier based on SVM and density clustering[J]. Computer Engineering and Applications, 2011, 47(2): 136-138.

张珍珍，董才林，陈增照，何秀玲. 改进的结合密度聚类的SVM快速分类方法[J]. 计算机工程与应用, 2011, 47(2): 136-138.
