融合关键点属性与注意力表征的人脸表情识别

doi:10.3778/j.issn.1002-8331.2107-0542

摘要/Abstract

摘要： 人脸的表情变化非常细微，通常表现在图像中某些局部点区域的改变，现有的人脸表情识别方法难以捕捉到表情的细微变化，对非表情区域干扰不具有鲁棒性。为了获得描述人脸表情变化的高效特征表示，提出了一种融合关键点属性与注意力表征的人脸表情识别方法。通过添加通道注意力和空间注意力的神经网络提取人脸图像中的关键点信息，实现不同维度和位置的权重分配，有效避免非表情区域的干扰，捕获图像中局部关键点的特征表征。引入Transformer模块学习不同关键点之间的相关联系，引导网络构建对表情类型更具分辨力的特征表示，从而实现精准识别。通过在CK+、JAFFE、FER2013三种公开数据集上进行实验的结果表明：提出算法的识别准确率分别达到了99.22%、96.57%、73.37%。

关键词: 人脸表情识别, 关键点属性表征, 注意力机制, 卷积神经网络, 学习特征图

Abstract: The change of facial expression is very subtle and usually manifested in the change of some local points and regions in the image. The existing facial expression recognition methods are difficult to capture the subtle changes of facial expression, which does not have robust to interference from non-expressive regions. To obtain an efficient feature representation to describe the changes of facial expression, a facial expression recognition method that integrating key point attributes and attention representation is proposed. Firstly, the key points in face image are extracted by the module with channel attention and spatial attention, which realizes the weight distribution of different dimensions and positions, effectively avoids the interference of non-expressive regions, and obtain the feature representation of the local key points in the image. Then, transformer module is introduced to learn the correlation between different key points and guide the network to build a more distinguishing feature representation, so as to achieve accurate recognition. Finally, the experimental results on CK+, JAFFE and FER2013 public datasets show that the recognition accuracy of the proposed algorithm is up to 99.22%, 96.57% and 73.37% respectively.

Key words: facial expression recognition, key point attributes representation, attention mechanism, convolutional neural network, learning feature map

高红霞, 郜伟. 融合关键点属性与注意力表征的人脸表情识别[J]. 计算机工程与应用, 2023, 59(3): 118-126.

GAO Hongxia, GAO Wei. Facial Expression Recognition Integrating Key Point Attributes and Attention Representation[J]. Computer Engineering and Applications, 2023, 59(3): 118-126.

参考文献

[1] 李珊，邓伟洪.深度人脸表情识别研究进展[J].中国图象图形学报，2020，25（11）：2306-2320.
LI S，DENG W H.Deep facial expression recognition：a survey[J].Journal of Image and Graphics，2020，25（11）：2306-2320.
[2] EKMAN P，FRIESEN W V.“Nonverbal Behavior”[J].Journal of Communication and Social Interaction，1977：37-46.
[3] 石聪聪，田媚.类别均衡与局部中值损失联合监督的自然场景中人脸表情识别[J].计算机辅助设计与图形学学报，2020，32（9）：1484-1491.
SHI C C，TIAN M.Class-balanced and local median loss jointly supervised for wild facial expression recognition[J].Journal of Computer-Aided Design & Computer Graphics，2020，32（9）：1484-1491.
[4] 徐峰，张军平.人脸微表情识别综述[J].自动化学报，2017，43（3）：333-348.
XU F，ZHANG J P.Facial microexpression recognition：a survey[J].Acta Automatica Sinica，2017，43（3）：333-348.
[5] 张延良，卢冰，蒋涵笑，等.微表情类别与区域间关联度的分析方法研究[J].计算机工程与应用，2020，56（19）：146-151.
ZHANG Y L，LU B，JIANG H X，et.al.Study on investigating correlation between categories of micro-expression and corresponded regions[J].Computer Engineering and Applications，2020，56（19）：146-151.
[6] 张哲源，张灵，陈云华.结合分块LBP与投影字典对学习的表情识别[J].计算机工程与应用，2019，55（12）：149-154.
ZHANG Z Y，ZHANG L，CHEN Y H.Facial expression recognition combined with block LBP and projective dictionary pair learning[J].Computer Engineering and Applications，2019，55（12）：149-154.
[7] REVINA I M，EMMANUEL W S.Face expression recognition using ldn and dominant gradient local ternary pattern descriptors[J].Journal of King Saud University-Computer and Information Sciences，2018，33（4）：392-398.
[8] SHI Y，LV Z，BI N，et al.An improved SIFT algorithm for robust emotion recognition under various face poses and illuminations[J].Neural Computing and Applications，2020，32（13）：9267-9281.
[9] MENG H，YUAN F，WU Y，et al.Facial expression recognition algorithm based on fusion of transformed multilevel features and improved weighted voting SVM[J].Mathematical Problems in Engineering，2021（9）：1-17.
[10] 李勇，林小竹，蒋梦莹.基于跨连接LeNet-5网络的面部表情识别[J].自动化学报，2018，44（1）：176-182.
LI Y，LIN X Z，JIANG M Y.Facial expression recognition with cross-connect LeNet-5 network[J].Acta Automatica Sinica，2018，44（1）：176-182.
[11] 姜月武，张玉金，施建新.结合关键点与权重分配残差网络的表情识别[J].计算机工程与应用，2022，58（17）：181-188.
JIANG Y W，ZHANG J Y，SHI J X.Expression recognition method combining key points and residual network of weight allocation[J].Computer Engineering and Applications，2022，58（17）：181-188.
[12] LU G，ZHU H，HAO Q，et al.Facial expression recognition based on deep residual network[J].Journal of Data Acquisition and Processing，2019.
[13] SHI C，TAN C，WANG L.A facial expression recognition method based on a multibranch cross-connection convolutional neural network[J].IEEE Access，2021，9：39255-39274.
[14] 崔子越，皮家甜，陈勇，等.结合改进VGGNet和Focal Loss的人脸表情识别[J].计算机工程与应用，2021，57（19）：171-178.
CUI Z Y，PI J T，CHEN Y，et al.Facial expression recognition combined with improved VGGNet and Focal Loss[J].Computer Engineering and Applications，2021，57（19）：171-178.
[15] 梁华刚，王亚茹，张志伟.基于Res-Bi-LSTM的人脸表情识别[J].计算机工程与应用，2020，56（13）：204-209.
LIANG H G，WANG Y R，ZHANG Z W.Facial expression recognition based on Res-Bi-LSTM[J].Computer Engineering and Applications，2020，56（13）：204-209.
[16] LI K，JIN Y，AKRAM M W，et al.Facial expression recognition with convolutional neural networks via a new face cropping and rotation strategy[J].The Visual Computer，2020，36（2）：391-404.
[17] 钱勇生，邵洁，季欣欣，等.基于改进卷积神经网络的多视角人脸表情识别[J].计算机工程与应用，2018，54（24）：12-19.
QIAN Y S，SHAO J，JI X X，et al.Multi-view facial expression recognition based on improved convolutional neural network[J].Computer Engineering and Applications，2018，54（24）：12-19.
[18] 亢洁，李思禹.基于注意力机制的人脸表情识别迁移学习方法[J].计算机工程与设计，2021，42（3）：797-804.
KANG J，LI S Y.Transfer learning method for facial expression based on attention mechanism[J].Computer Engineering and Design，2021，42（3）：797-804.
[19] 程换新，成凯，蒋泽芹.基于注意力机制的CNN人脸表情识别[J].电子测量技术，2021，44（10）：128-132.
CHENG H X，CHENG K，JIANG Z Q.CNN facial expression recognition based on attention mechanism[J].Electronic Measurement Technology，2021，44（10）：128-132.
[20] 乔伟涛，黄海燕，王珊.基于Transformer编码器的语义相似度算法研究[J].计算机工程与应用，2021，57（14）：158-163.
QIAO W T，HUANG H Y，WANG S.Semantic similarity calculation based on transformer encoder[J].Computer Engineering and Applications，2021，57（14）：158-163.
[21] DOSOVITSKIY A，BEYER L，KOLESNIKOV A，et al.An image is worth 16×16 words：transformers for image recognition at scale[J].arXiv：2010.11929，2020.
[22] 高涛，邵倩，张亚南，等.基于深度残差网络的人脸表情识别研究[J].电子设计工程，2020，28（23）：101-104.
GAO T，SHAO Q，ZHANG Y N，et al.Research on facial expression recognition based on deep residual network[J].Electronic Design Engineering，2020，28（23）：101-104.
[23] LIU C，HIROTA K，MA J，et al.Facial expression recognition using hybrid features of pixel and geometry[J].IEEE Access，2021，9：18876-18889.