Computer Engineering and Applications ›› 2023, Vol. 59 ›› Issue (1): 207-212.DOI: 10.3778/j.issn.1002-8331.2106-0241

• Pattern Recognition and Artificial Intelligence • Previous Articles     Next Articles

Research on Offline Data Augmentation Method Jointed with Cannikin’s Law

DENG Xue, ZHAO Hao, ZHANG Jing, MEI Boping, ZHANG Hua   

  1. 1.School of Information Engineering, Southwest University of Science and Technology, Mianyang, Sichuan 621010, China
    2.School of Information Science and Technology, University of Science and Technology of China, Hefei 230026, China
  • Online:2023-01-01 Published:2023-01-01

结合Cannikin’s Law的离线数据增广方法研究

邓雪,赵皓,张静,梅菠萍,张华   

  1. 1.西南科技大学 信息工程学院,四川 绵阳 621010
    2.中国科学技术大学 信息科学技术学院,合肥 230026

Abstract: Data augmentation is one of the effective methods to improve the performance of a deep learning model. Aiming at the imbalance of detection performance in multi-class object detection tasks, an offline data augmentation method for “shortboard categories”(categories whose detection performance is far lower than the average detection performance of the model) is proposed. Inspired by Cannikin’s Law, a scene diversity augmentation method based on copy-paste mechanism is adopted. The instance area of the “shortboard category” in the training dataset is randomly collected, and the augmented target samples in the training set are selected by a similarity measurement mechanism for random pasting. In order to reduce the occlusion problem caused by random pasting, an augmented method based on the cut-replace mechanism is used to improve the occlusion representation ability of the model. By intercepting the sample’s own region, the most significant region of feature expression is self-occluded. Experimental results show that the mean average precision(mAP) of FCOS object detection framework on PASCAL VOC data is improved from 79.10% to 83.90%, and the shortboard category is more significant, which is improved by 20.8?percentage points. In MS COCO data, the average detection accuracy is improved by 0.9?percentage points.

Key words: data augmentation, Cannikin’s Law, similarity measurement mechanism, self-occlusion, object detection

摘要: 数据增广是提升深度学习模型性能的有效方法之一。针对多类别目标检测任务中检测性能不平衡问题,提出一种针对“短板类别”(检测性能远低于模型平均检测性能的类别)的离线数据增广方法。受Cannikin’s Law的启发,采用基于复制粘贴(copy-paste)机制的场景多样性增广方法。随机采集训练集中“短板类别”实例区域,通过相似性度量机制选取训练集中增广目标样本进行随机粘贴。为了降低随机粘贴导致的遮挡问题,采用基于自遮挡(cut-replace)机制的增广方法提升模型遮挡表达能力。通过截取样本自身区域,对特征表达最显著区域进行遮挡。实验表明,FCOS目标检测框架在PASCAL VOC数据上的平均检测精度(mean average precision,mAP)从79.10%提升到83.90%,其中短板类别更为显著,提升了20.8个百分点。在MS-COCO数据上平均检测精度提升了0.9个百分点。

关键词: 数据增广, Cannikin’s Law, 相似性度量机制, 自遮挡, 目标检测