计算机工程与应用 ›› 2024, Vol. 60 ›› Issue (2): 254-263.DOI: 10.3778/j.issn.1002-8331.2208-0344

• 图形图像处理 • 上一篇    下一篇

权重推断与标签平滑的轻量级人脸表情识别

刘劲,罗晓曙,徐照兴   

  1. 1.广西师范大学 电子工程学院,广西 桂林 541004
    2.江西服装学院 大数据学院,南昌 330000
  • 出版日期:2024-01-15 发布日期:2024-01-15

Weight Inference and Label Smoothing for Lightweight Facial Expression Recognition

LIU Jin, LUO Xiaoshu, XU Zhaoxing   

  1. 1.School of Electronic Engineering, Guangxi Normal University, Guilin, Guangxi 541004, China
    2.School of Big Data, Jiangxi Institute of Fashion Technology, Nanchang 330000, China
  • Online:2024-01-15 Published:2024-01-15

摘要: 针对轻量级网络在复杂环境下对面部表情的特征提取不够充分、模型参数存在冗余以及单标签数据集无法有效描述复杂情感倾向所带来的歧义表情等问题,提出了一种结合改进ShuffleNet与标签平滑学习的人脸表情识别方法。通过对原始网络的分析与剪裁,得到了改进后更紧凑的K5_Light_ShuffleNet,不仅优化了网络参数,还提高了模型的表征能力。为了增强模型对人脸表情图像局部细节特征的提取能力,抑制非表情特征,在模型中嵌入了设计的轻量化通道空间关键权重推断模块。通过标签平滑学习方法,在不引入额外信息的前提下,利用软标签分布监督网络的学习,以减少由于歧义表情对识别性能所带来的不利影响。实验结果表明,在公开的RAF-DB、AffectNet-7和AffectNet-8数据集上分别达到了86.91%、61.80%和58.75%的表情识别准确率,相较于目前其他人脸表情识别方法,其识别率有一定提高,同时模型参数量和计算量均保持在较低水平,利于其在实际中的应用。

关键词: 人脸表情识别, 关键权重推断, 轻量化, 标签平滑学习, 歧义表情

Abstract: Aiming at the problems of insufficient feature extraction of facial expressions by lightweight networks in complex environments, redundant model parameters, and the inability of single-label datasets to effectively describe the ambiguous expressions caused by complex emotional tendencies, a facial expression recognition method combining improved ShuffleNet and label smoothing learning is proposed. Firstly, by analyzing and tailoring the original network, the improved and more compact K5_Light_ShuffleNet is obtained, which not only optimizes the network parameters, but also improves the representation ability of the model. Secondly, in order to enhance the ability of the model to extract local detail features of facial expression images and suppress non-expression features, a designed lightweight channel-space key weight inference module is embedded in the model. Finally, through the label smoothing learning method, the learning of the network is supervised by the soft label distribution without introducing additional information, so as to reduce the adverse effect on the recognition performance due to ambiguous expressions. The experimental results show that the recognition accuracy rates of 86.91%, 61.80% and 58.75% are achieved on the RAF-DB, AffectNet-7 and AffectNet-8 datasets, respectively. Compared with other current facial expression recognition methods, the recognition rate has been improved to a certain extent. At the same time, the amount of model parameters and FLOPs are kept at a low level, which is beneficial to its practical application.

Key words: facial expression recognition, key weight inference, lightweight, label smoothing learning, ambiguous expression