计算机工程与应用 ›› 2023, Vol. 59 ›› Issue (1): 228-235.DOI: 10.3778/j.issn.1002-8331.2107-0516

• 图形图像处理 • 上一篇    下一篇

面向样本扩充的新型风格迁移网络研究

田敏,刘名果,陈立家,韩宗桓,兰天翔,梁倩   

  1. 河南大学 物理与电子学院,河南 开封 475000
  • 出版日期:2023-01-01 发布日期:2023-01-01

Research on New Style Transfer Network for Sample Expansion

TIAN Min, LIU Mingguo, CHEN Lijia, HAN Zonghuan, LAN Tianxiang, LIANG Qian   

  1. School of Physics and Electronics, Henan University, Kaifeng, Henan 475000, China
  • Online:2023-01-01 Published:2023-01-01

摘要: 全监督语义分割网络在训练时需要耗费大量的人力与时间成本来标注样本。所以减少人工标注样本的时间,同时提升语义分割效果,对于深度学习网络的快速部署和应用推广具有重要意义。提出一种基于改进图像风格迁移网络(CycleGAN-AD)的样本扩充方法。以CycleGAN为基础,在生成器中引入注意力机制并将深度残差网络改为密集连接卷积网络。利用计算机批量产生自带标签的模拟样本,使用CycleGAN-AD网络将模拟样本风格迁移成为真实样本风格(标签不变),并用于扩充训练样本。对石墨电极的钢印字符进行语义分割的实验结果表明,采用CycleGAN-AD网络进行样本扩充后,其分割效果得到显著提升,MIoU值最高升至0.826?0。可见,提出的样本扩充方法有希望在显著减少人工标注工作量的同时,获得高质量的训练样本。

关键词: 语义分割, 样本扩充, CycleGAN, 风格迁移

Abstract: The training of fully supervised semantic segmentation network needs a lot of manpower and time cost to label samples. Therefore, reducing the time of manually labeling samples and improving the effect of semantic segmentation are of great significance for the rapid deployment and application promotion of deep learning network. A sample expansion method based on improved image style transfer network(CycleGAN-AD) is proposed. Based on CycleGAN, the attention mechanism is introduced into the generator, and the depth residual network is changed into a densely connected convolution network. The computer is used to batch generate the simulation samples with their own labels, and the CycleGAN-AD network is used to transfer the simulation sample style to the real sample style(the label remains unchanged), which is used to expand the training samples. The experimental results of semantic segmentation of steel seal characters on graphite electrode show that the segmentation effect is significantly improved after sample expansion with CycleGAN-AD network, and the MIoU value is up to 0.826?0. It can be seen that the sample expansion method proposed is hopeful to obtain high-quality training samples while significantly reducing the workload of manual annotation.

Key words: semantic segmentation, sample expansion, CycleGAN, style transfer