Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (4): 162-164. DOI: 10.3778/j.issn.1002-8331.2009.04.046

• Database, Signal and Information Processing •

Unsupervised feature selection algorithm based on center distance ratio principle

YE Fei, LUO Jing-qing, YU Zhi-fu

  1. PLA Electronic Engineering Institute, Hefei 230037, China
  • Received: 2008-01-08 Revised: 2008-04-16 Online: 2009-02-01 Published: 2009-02-01
  • Contact: YE Fei

Abstract: Feature selection is an important component of pattern recognition. For sample sets whose class labels are unknown, an unsupervised feature selection algorithm based on the center distance ratio principle is proposed. The algorithm uses the mountain method to determine the range of the number of clusters and to estimate the initial cluster centers, then applies the K-means clustering algorithm to determine the optimal number of classes for each feature subset. The center distance ratio principle is then used to evaluate the classification performance of each feature subset, and a correlation analysis among the features is carried out, so that features with good classification performance and low mutual correlation are selected to form the feature subset.
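The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions. The abstract does not define the center distance ratio, so `center_distance_ratio` below uses a hypothetical form: the mean distance of samples to their own cluster center divided by the smallest distance between two centers, where smaller means better separated. The mountain method and the search over cluster numbers are omitted; the sketch fixes two clusters and seeds K-means with each feature's minimum and maximum. The function names and the `max_corr` threshold are illustrative only.

```python
import math
from statistics import mean


def kmeans(points, centers, iters=50):
    """Plain Lloyd's K-means from given initial centers; returns (centers, labels)."""
    centers = [tuple(c) for c in centers]
    labels = []
    for _ in range(iters):
        # Assign every point to its nearest center.
        labels = [min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
                  for p in points]
        new_centers = []
        for j, old in enumerate(centers):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # Keep the old center if a cluster ends up empty.
            new_centers.append(tuple(mean(axis) for axis in zip(*members)) if members else old)
        if new_centers == centers:
            break
        centers = new_centers
    return centers, labels


def center_distance_ratio(points, centers, labels):
    """ASSUMED criterion (not taken from the paper): mean within-cluster
    distance divided by the smallest center-to-center distance."""
    within = mean(math.dist(p, centers[lab]) for p, lab in zip(points, labels))
    between = min(math.dist(a, b)
                  for i, a in enumerate(centers) for b in centers[i + 1:])
    return math.inf if between == 0 else within / between


def pearson(x, y):
    """Pearson correlation coefficient; returns 0.0 for a constant feature."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return 0.0 if sx * sy == 0 else cov / (sx * sy)


def select_features(rows, max_corr=0.8):
    """Rank single features by the assumed ratio, then greedily keep those
    whose absolute correlation with every kept feature is below max_corr."""
    columns = list(zip(*rows))
    scores = []
    for idx, col in enumerate(columns):
        pts = [(v,) for v in col]
        # Simplification: two clusters seeded at the feature's min and max
        # (stands in for the mountain-method initialization).
        centers, labels = kmeans(pts, [(min(col),), (max(col),)])
        scores.append((center_distance_ratio(pts, centers, labels), idx))
    selected = []
    for _, idx in sorted(scores):
        if all(abs(pearson(columns[idx], columns[k])) < max_corr for k in selected):
            selected.append(idx)
    return selected


# Usage: each row is one sample, each column one feature.
example_rows = [
    (0.0, 0.0, 5.0), (0.1, 0.3, 1.0), (0.2, 0.4, 9.0),
    (10.0, 20.0, 2.0), (10.1, 20.1, 8.0), (10.2, 20.5, 4.0),
]
chosen = select_features(example_rows)
```

In this toy data the second feature is almost a scaled copy of the first, so the correlation filter would be expected to drop one of the two, while the weakly separated third feature survives only because it is uncorrelated with the rest. A fuller version would also compare candidate cluster numbers, as the abstract describes.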