Computer Engineering and Applications ›› 2014, Vol. 50 ›› Issue (6): 112-117.

Previous Articles     Next Articles

Dictionary learning based on spectral clustering with Nystr?m

WANG Xin1, LI Lu1, WANG Xiaofang2   

  1. 1.Department of Mathematics and Physics, Anhui Jianzhu University, Hefei 230601, China
    2.Department of Computer, Army Officer Academy of PLA, Hefei 230031, China
  • Online:2014-03-15 Published:2015-05-12

基于Nystr?m谱聚类的词典学习

王  鑫1,李  璐1,王晓芳2   

  1. 1.安徽建筑大学 数理系,合肥 230601
    2.解放军陆军军官学院 计算机教研室,合肥 230031

Abstract: Construction of visual vocabulary is a crucial step in popular Bag-of-Visual-Words(BOVW) model. Currently, K-means clustering is generally applied to constructing the visual vocabulary. However, the visual dictionary tends to be of low discrimination due to limitation of K-means clustering and complexity of high dimensional spatial structure of samples. Under the frame of spectral clustering, a dictionary learning algorithm with stronger discriminative capability is proposed. In order to reduce degradation of descriptors discriminative power during quantization and the inherent problems of storage and calculation in spectral clustering, the training samples are divided into sub-sample sets according to the label information of category. Centers of each data set are obtained based on spectral clustering with Nystr?m algorithm and then the final compact visual dictionary is generated. Experimental results in Scene-15 dataset verify the correctness and effectiveness of the proposed algorithm. Especially when the training samples are limited, the visual dictionary via the algorithm can obtain better performance.

Key words: image classification, bag of visual words, visual vocabulary, spectral clustering

摘要: 构建视觉词典是BOVW模型中关键的一个步骤,目前大多数视觉词典是基于K-means聚类方式构建。然而由于K-means聚类的局限性以及样本空间结构的复杂性与高维性,这种方式构建的视觉词典往往区分性能较差。在谱聚类的框架下,提出一种区分性能更强的视觉词典学习算法,为了减少特征在量化过程中区分性能的降低以及谱聚类固有的存储计算问题,算法根据训练样本的类别标签对训练数据进行划分,基于Nystr?m谱聚类得到各子样本数据集的中心并得到最终的视觉词典。在Scene-15数据集上的实验结果验证了算法的正确性和有效性。特别当训练样本有限时,采用该算法生成的视觉词典性能较优。

关键词: 图像分类, 视觉词袋模型, 视觉词典, 谱聚类