计算机工程与应用 ›› 2024, Vol. 60 ›› Issue (5): 240-249.DOI: 10.3778/j.issn.1002-8331.2210-0234

• 图形图像处理 • 上一篇    下一篇

基于组残差块生成对抗网络的面部表情生成

林本旺,赵光哲,王雪平,李昊   

  1. 北京建筑大学 电气与信息工程学院,北京 102616
  • 出版日期:2024-03-01 发布日期:2024-03-01

Facial Expression Generation Based on Group Residual Block Generative Adversarial Netxwork

LIN Benwang, ZHAO Guangzhe, WANG Xueping, LI Hao   

  1. School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 102616, China
  • Online:2024-03-01 Published:2024-03-01

摘要: 面部表情生成是通过某种表情计算方法生成带有表情的人脸图像,在人脸编辑、影视制作和数据扩增等方面应用广泛。随着生成对抗网络的出现,面部表情生成取得了显著的进步,但是生成的面部表情图像会出现重叠、模糊等现象,缺乏真实感。为了解决上述问题,提出了一种带有混合注意力机制组残差块的生成对抗网络(group residuals with attention mechanism-generative adversarial network,GRA-GAN)用于生成高质量的面部表情图像。在生成网络进行下采样前和上采样后,分别嵌入混合注意力机制来自适应地学习关键区域特征,增强对图像关键区域的学习。将分组的思想融入到残差网络中,提出了带有混合注意力机制的组残差块来实现更好的生成效果。在公开数据集RaFD进行了实验验证。实验结果表明,GRA-GAN模型在定性评估和定量分析指标上均优于相关方法。

关键词: 生成对抗网络, 表情生成, 注意力机制, 组残差块

Abstract: Facial expression generation is the generation of facial images with expressions through a certain expression calculation method, which is widely used in face editing, film and television production, and data augmentation. With the advent of generative adversarial network (GAN), facial expression generation has made significant progress, but problems such as overlapping, blurring, and lack of realism still occur in facial expression generation images. In order to address the above issues, group residuals with attention mechanism generative adversarial network (GRA-GAN) is proposed to generate high-quality facial expressions. Firstly, an adaptive mixed attention mechanism (MAT) is embedded in the generative network before downsampling and after upsampling to adaptively learn the key region features and enhance the learning of key regions of the image. Secondly, the idea of grouping is integrated into the residual network, and the group residuals block with attention mechanism (GRA) module is proposed to achieve better generation effect. Finally, the experimental verification is carried out on the public dataset RaFD. The experimental results show that the proposed GRA-GAN outperforms the related methods in both qualitative and quantitative analysis.

Key words: generative adversarial network (GAN), expression generation, attention mechanism, group residual block