计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (19): 166-168.

• 数据库、信号与信息处理 • 上一篇    下一篇

基于超球支持向量机的兼类文本分类算法研究

秦玉平1,2,王秀坤1,李祥纳2,王春立1   

  1. 1.大连理工大学 电子与信息工程学院,辽宁 大连 116024
    2.渤海大学 信息科学与工程学院,辽宁 锦州 121000
  • 收稿日期:2007-09-27 修回日期:2007-11-30 出版日期:2008-07-01 发布日期:2008-07-01
  • 通讯作者: 秦玉平

Study on multi-class text classification algorithm based on hyper-sphere support vector machines

QIN Yu-ping1,2,WANG Xiu-kun1,LI Xiang-na2,WANG Chun-li1   

  1. 1.School of Electronic and Information Engineering,Dalian University of Technology,Dalian,Liaoning 116024,China
    2.College of Information Science and Technology,Bohai University,Jinzhou,Liaoning 121000,China
  • Received:2007-09-27 Revised:2007-11-30 Online:2008-07-01 Published:2008-07-01
  • Contact: QIN Yu-ping

摘要: 针对兼类文本,提出了一种分类算法。对属于同一类别的文本,利用超球支持向量机在特征空间中求得一个能包围该类尽可能多文本的最小超球,使各类文本之间通过超球分隔开,达到分类效果。对待分类文本,计算它到各超球球心的距离,根据距离判定该文本所属的类别。实验结果证明,该算法不仅具有较快的分类速度,而且具有较高的分类精度。

关键词: 支持向量机, 超球, 兼类, 分类

Abstract: To multi-class text,a classification algorithm based on hyper-sphere support vector machines is proposed in this paper.Hyper-sphere support vector machine is used to get the smallest hyper-sphere in feature space that contains most texts of a class,which can divide the class texts from others.For the text to be classified,the distances from it to the centre of every hyper-sphere are used to confirm the classes that the text belongs to.The experimental results show that the algorithm not only has a faster performance on classification speed,but also has a higher performance on classification precision.

Key words: support vector machines, hyper-sphere, multi-class, classification