Semi-automatic Video Target Annotation by Combining Detection and Tracking

doi:10.3778/j.issn.1002-8331.2004-0386

Abstract

To address the redundancy of targets across consecutive video frames and the time and labor cost of manual annotation, a semi-automatic video target annotation framework that combines detection and tracking is proposed. First, manually annotated samples are used to train an improved YOLOv3 detection model offline, and this model serves as the detector during online annotation. Then, during online annotation, the target position and label are specified manually in the first frame; in subsequent frames, the target position is determined automatically from the IOU (Intersection-Over-Union) of the detection box and the tracking box, and the tracker's response output is used to judge whether the target has disappeared, so that annotation of the current target stops automatically. Finally, a key frame extraction algorithm based on target saliency selects the key frames. A detection performance comparison of the improved YOLOv3 is carried out on a self-built ship target data set, and the effectiveness of the proposed semi-automatic annotation method is verified on a ship video sequence. Experimental results show that the method significantly improves annotation efficiency, generates annotated data quickly, and is suitable for video target annotation tasks in scenes such as sea-surface ship videos.
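
The abstract does not state how the IOU value is turned into a decision, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes boxes in (x1, y1, x2, y2) form, that the detection overlapping the tracking box most is accepted when its IOU reaches a hypothetical threshold `iou_thresh`, that the tracking box is kept otherwise, and that annotation of the target stops when the tracker response falls below a hypothetical `response_thresh`. The function names and both default threshold values are illustrative.

```python
from typing import Optional, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), assumed convention


def iou(a: Box, b: Box) -> float:
    """Intersection-Over-Union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def annotate_frame(
    track_box: Box,
    detections: Sequence[Box],
    response: float,
    iou_thresh: float = 0.5,       # hypothetical value
    response_thresh: float = 0.2,  # hypothetical value
) -> Optional[Box]:
    """Return the box to annotate in this frame, or None to stop annotating."""
    # Assumption: a weak tracker response means the target has disappeared.
    if response < response_thresh:
        return None
    # Assumption: pick the detection that overlaps the tracking box most.
    best = max(detections, key=lambda d: iou(d, track_box), default=None)
    if best is not None and iou(best, track_box) >= iou_thresh:
        return best
    # No detection agrees with the tracker, so keep the tracking box.
    return track_box
```

Under these assumptions, a caller would pass the tracker output and the detector output for each frame after the first, and end the annotation of the current target as soon as `annotate_frame` returns `None`.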

Key words: video image, target annotation, target detection, target tracking, key frame extraction

CHEN Qinglin, GU Yu, SONG Zhonghao, NIE Shengdong. Semi-automatic Video Target Annotation by Combining Detection and Tracking[J]. Computer Engineering and Applications, 2021, 57(14): 223-230.

陈庆林，谷雨，宋忠浩，聂圣东. 融合检测与跟踪的半自动视频目标标注[J]. 计算机工程与应用, 2021, 57(14): 223-230.
