计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (31): 148-150.

• 图形、图像、模式识别 • 上一篇    下一篇

利用模型选择确定视觉词袋模型中词汇数目

许 明,韩军伟,郭 雷,尹文杰   

  1. 西北工业大学 自动化学院,西安 710129
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2011-11-01 发布日期:2011-11-01

Determine word number of Visual Bag-of-Words model by model selection method

XU Ming,HAN Junwei,GUO Lei,YIN Wenjie   

  1. School of Automation,Northwestern Polytechnical University,Xi’an 710129,China
  • Received:1900-01-01 Revised:1900-01-01 Online:2011-11-01 Published:2011-11-01

摘要: 视觉词袋(Visual Bag-of-Words)模型在图像分类、检索和识别等计算机视觉领域有了广泛的应用,但是视觉词袋模型中词汇数目往往是根据经验确定或者采用有监督的交叉学习选取。提出一种确定视觉词袋模型中词汇数目的无监督方法,利用模型选择的思想来解决问题。使用高斯混合模型描述具有不同词汇数目的视觉词袋,计算各模型贝叶斯信息准则的值,选取贝叶斯信息准则最小值对应的词汇数目。与交叉验证的监督学习在图像分类实验的对比结果说明该方法准确有效。

关键词: 视觉词袋模型, 模型选择, 高斯混合模型, 贝叶斯信息准则

Abstract: Visual Bag-of-Words model has been widely used in image classification,retrieval and recognition.However,its word number usually is selected by user experience or determined using the supervised cross-validation scheme.In this paper,an unsupervised method is proposed to infer the word number of Visual Bag-of-Words model(BoW) based on the idea of model selection.Firstly,Gaussian Mixture Models(GMM) are built accounting for BoWs with different word number.Afterwards,Bayesian Information Criterion(BIC) is adopted to select the best model that has the minimum BIC value.Compared with cross-validation approach using image classification,the result demonstrates the effectiveness of the proposed approach.

Key words: Visual Bag-of-Words, model selection, Gaussian Mixture Model(GMM), Bayesian information criterion