Computer Engineering and Applications ›› 2020, Vol. 56 ›› Issue (20): 152-157.DOI: 10.3778/j.issn.1002-8331.1908-0100

Previous Articles     Next Articles

Research on Impact of Different Temporal Modeling Methods on Video-Based Person Re-identification

XIANG Jun, LIN Ranran, HUANG Ziyuan, HOU Jianhua   

  1. College of Electronics and Information Engineering, South-Central University for Nationalities, Wuhan 430074, China
  • Online:2020-10-15 Published:2020-10-13

时域模型对视频行人重识别性能影响的研究

项俊,林染染,黄子源,侯建华   

  1. 中南民族大学 电子信息工程学院,武汉 430074

Abstract:

As an important task in computer vision field, video-based person re-identification(Re-ID) has attracted significant attention in recent years due to the increasing demand of video surveillance. Typically, a video-based person Re-ID system is composed of an image-level feature extractor(e.g.CNN), a temporal modeling method to capture temporal information and a loss function. In this paper, it explores the impact of different temporal modeling methods on video-based person Re-ID by fixing the image-level feature extractor and the loss function to be the same. Three temporal models are considered, including temporal pooling, temporal attention and Recurrent Neural Network(RNN). Experimental results conducted on the MARS dataset demonstrate that compared with the image-based Re-ID baseline, both temporal pooling and temporal attention models can improve the recognition accuracy, and RNN’s performance has dropped to some extent. The conclusions achieved in the paper can provide helpful insight for the design of video-based person re-identification algorithms.

Key words: video-based person re-identification, deep neural networks, feature extracting, temporal model

摘要:

行人重识别是计算机视觉领域一个重要的研究方向。近年来,随着视频监控需求的日益增长,基于视频序列的行人重识别研究受到了广泛的关注。典型的视频序列行人重识别系统由三部分构成:图片特征提取器(例如卷积神经网络)、提取时域信息的时域模型、损失函数。在固定特征提取器和损失函数的前提下,研究不同时域模型对视频行人重识别算法性能的影响,包括时域池化、时域注意力、循环神经网络。在Mars数据集上的实验结果表明:与基于图像的行人重识别基准算法相比,采用时域池化模型、时间注意力模型可以有效改善识别精度,但采用循环神经网络后识别效果比基准算法有所下降。

关键词: 视频行人重识别, 深度神经网络, 特征提取, 时域模型