计算机工程与应用 ›› 2019, Vol. 55 ›› Issue (16): 191-196.DOI: 10.3778/j.issn.1002-8331.1805-0169

• 图形图像处理 • 上一篇    下一篇

结合改进CNN和双线性模型的CBIR方法

蔡鹏飞,叶剑锋   

  1. 1.河南工学院 计算机科学技术系,河南 新乡 453002
    2.南京航空航天大学 机电学院,南京 210016
  • 出版日期:2019-08-15 发布日期:2019-08-13

CBIR Method Based on Improved CNN and Bilinear Model

CAI Pengfei, YE Jianfeng   

  1. 1.Department of Computer Science and Technology, Henan Institute of Technology, Xinxiang, Henan 453002, China
    2.College of Mechanical and Electrical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
  • Online:2019-08-15 Published:2019-08-13

摘要: 针对现有基于内容的图像检索(Content-Based Image Retrieval,CBIR)方法中图像特征维度较大等问题,提出一种结合改进卷积神经网络(Convolutional Neural Network,CNN)和双线性模型的CBIR方法。采用一种低维度池化方法代替传统CNN中的池化过程,以此降低图像特征映射的维度。基于双线性模型的思想,使用两个特征提取器进行特征提取,并在每个图像位置上对两个特征进行内积,以形成最终的图像描述符。通过计算图像间的曼哈顿距离度量来评估相似性,获得相关图像及其排序。实验结果表明,该方法能够准确检索出相关图像,并具有较低的检索时间和内存消耗。

关键词: 基于内容的图像检索(CBIR), 卷积神经网络(CNN), 双线性模型, 低维度图像表示, 曼哈顿距离

Abstract: For the issue of the large image feature dimension in existing Content-Based Image Retrieval(CBIR) methods, a CBIR method based on improved Convolutional Neural Network(CNN) and bilinear model is proposed. Firstly, a low-dimensional pooling method is used to replace the pooling process in the traditional CNN to reduce the dimension of the image features. Then, two feature extractors are used for feature extraction based on the idea of bilinear model, and these two features are inner product at each image location to form the final image descriptor. Finally, the similarity is evaluated by calculating the Manhattan distance between images, so as to obtain the related images and their sorting. Experimental results show that the proposed method can accurately retrieve the relevant images, and has lower retrieval time and memory consumption.

Key words: Content-Based Image Retrieval(CBIR), Convolutional Neural Network(CNN), bilinear model, low-dimensional image representation, Manhattan distance