Computer Engineering and Applications ›› 2017, Vol. 53 ›› Issue (7): 197-201.DOI: 10.3778/j.issn.1002-8331.1509-0174

Previous Articles     Next Articles

Object recognition based on multi-direction spatial Bags of Words model

QI Mei1, HU Min2   

  1. 1.Anhui Open University, Hefei 230022, China
    2.HeFei University of Technology, Hefei 230022, China
  • Online:2017-04-01 Published:2017-04-01


齐  梅1,胡  敏2   

  1. 1.安徽广播电视大学,合肥 230022
    2.合肥工业大学,合肥 230022

Abstract: To alleviate the problem that spatial position information is completely ignored in Bags of Words representation, an optimized method based on multi-direction spatial Bags of Words is proposed. Firstly, the representation of spatial sub region of an image is modeled by spatial pyramid. Then, projection method is applied to local features of image blocks in horizontal, vertical and inclined [±45°], spatial structure information is well expressed in multi-direction. Furthermore, it reduces redundant effects of different object and the dimension of features by means of samples visual codebook. Finally, the proposed method is evaluated on two object databases, the results of comparative experiment show that the proposed algorithm has inspiring performance.

Key words: object recognition, multi-direction projection, spatial information, samples codebook, Bags of Words

摘要: 针对词袋模型完全忽略空间位置信息的问题,提出了一种多方向空间词袋模型的物体识别方法。该算法通过空间金字塔划分,形成图像的空间子区域特征表达;分别在水平、垂直和倾斜[±45°]上对图像局部特征向量进行投影,得到图像在多方向上的空间结构信息;采用样本视觉词典方法,既减少了不同物体类别样本带来的冗余影响,又降低了特征维数。在Caltech101和Caltech256物体库上进行了对比实验,实验结果验证了算法的有效性。

关键词: 物体识别, 多方向投影, 空间信息, 样本词典, 词袋模型