Computer Engineering and Applications ›› 2023, Vol. 59 ›› Issue (9): 159-166.DOI: 10.3778/j.issn.1002-8331.2201-0024

• Pattern Recognition and Artificial Intelligence • Previous Articles     Next Articles

Weakly Supervised Person Search Combining Dual-Path Network and Multi-Label Classification

ZHANG Jianhe, JIANG Xiaoyan   

  1. School of Electric and Electronic Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
  • Online:2023-05-01 Published:2023-05-01

结合双路网络和多标签分类的弱监督行人搜索

张建贺,姜晓燕   

  1. 上海工程技术大学 电子电气工程学院,上海 201620

Abstract: Supervised person search relies entirely on person bounding boxes and person identity labels. It is easy to annotate person bounding boxes in large-scale datasets, but it’s extremely difficult to collect person identity association information cross multi-camera. In order to get rid of the dependence on person identity label, a weakly supervised person search combining dual-path network and multi-label classification with only person bounding box label method is proposed. In order to reduce the background information interference caused by person detection error, the combination of panoramic image branch and the cutting image branch is used to study the dual-path person instance feature, and to enhance the representation of the semantic information of the person area by minimizing the feature of the same instances in the two paths. At the same time, for the learning of the person re-identification feature, the single class label is assigned to each instance, then prediction multi-label by feature similarity threshold and mutual neighbor methods, and learning feature by multi-label based on the non-parametric classifier. The experimental results show that the mAP and top-1 of CUHK-SYSU dataset are 84.2% and 86.0%, respectively, and the mAP and top-1 of PRW dataset are 38.8% and 85.1%, respectively, showing excellent performance compared with the latest method.

Key words: person search, weakly supervised learning, person re-identification, multi-label classification

摘要: 有监督的行人搜索方法依赖于行人框和行人身份的精细标记,而大规模数据集下行人框的标注较易实现,但跨图像的行人身份标记却非常困难。为了摆脱对行人身份标签的依赖,只借助行人框标注,设计了结合双路网络和多标签分类的弱监督行人搜索方法,同时对行人定位和再识别任务进行联合优化。为减少行人定位误差引起的背景信息干扰,融合全景图像分支和裁剪图像分支进行双路特征学习,通过最小化两分支中同行人实例的特征差异来增强网络对行人区域语义信息的表征能力。同时,为解决无身份标签监督下行人可辨识特征的学习问题,设计了在线多标签预测,通过相似度阈值和互近邻原则来提升标签的可靠性。最后利用基于特征存储的非参数化分类器进行多标签分类学习,鼓励相似度高的特征聚合,相似度低的特征分离。实验评估在CUHK-SYSU数据集的mAP和top-1分别达到84.2%和86.0%,在PRW数据集的mAP和top-1分别达到38.8%和85.1%,与最新的方法相比性能表现突出。

关键词: 行人搜索, 弱监督学习, 行人再识别, 多标签分类