计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (7): 245-254.DOI: 10.3778/j.issn.1002-8331.2311-0323

• 模式识别与人工智能 • 上一篇    下一篇

深度度量注意力混合模型表情识别方法

姚丽莎   

  1. 安徽新华学院 大数据与人工智能学院,合肥 230088
  • 出版日期:2025-04-01 发布日期:2025-04-01

Facial Expression Recognition Method Based on Mixed Model of Measure Depth Attention

YAO Lisha   

  1. School of Big Data and Artificial Intelligence, Anhui Xinhua University, Hefei 230088, China
  • Online:2025-04-01 Published:2025-04-01

摘要: 深度学习网络在人脸表情识别中已广泛采用,但因表情图像复杂多变,受光照、个体差异等各个因素的影响,现有方法的识别效果有待提高。为了提高深度学习网络的表达能力,在深度学习网络中,结合面部关键区域的位置特征,提出融合位置信息的深层注意力反馈机制卷积神经网络模型。同时,由于表情特征的类间差异小,为了提高分类器的分类学习能力,引入度量学习方法增强特征的判别性,使同类之间的距离减小,异类之间的距离加大。通过度量学习将面部表情图像的特征映射到具有表情判别性的新的特征空间中,由此判断各表情样本的表情类别。对原图进行人脸检测,确定人脸裁剪出人脸关键区域,去除头发、背景等因素的干扰;通过深层注意力反馈机制的CNN模型对人脸关键区域进行特征学习,学习获得面部表情深度特征,之后引入判别性度量学习方法,通过度量矩阵将特征向量映射为新的学习后的特征向量;将提取的样本表情特征送入全连接层并通过Softmax分类器识别划分到预先定义好的7种基本表情。在CK+和RAF-DB数据库的实验表明,该方法取得了98.69%和87.68%的平均识别率,提高了分类器的分类学习能力。

关键词: 深度注意力, 表情识别, 卷积神经网络, 度量学习

Abstract: Deep learning network has been widely used in facial expression recognition, but the recognition effect of existing methods needs to be improved due to the complexity and variety of expression images and the influence of various factors such as illumination and individual differences. In order to improve the expressive ability of deep learning network, a deep attention feedback mechanism convolutional neural network (CNN) model is proposed in the deep learning network, which combines the location features of key facial areas. At the same time, due to the small differences between different classes of expression features, in order to improve the classification learning ability of the classifier, metric learning method is introduced to enhance the discrimination of features, which reduces the distance between the same kind and increases the distance between different kinds. Through metric learning, the features of facial expression images are mapped into a new feature space with discriminative expression, so as to judge the expression categories of each expression sample. Firstly, face detection is carried out on the original image, and the key areas of human face are determined, and the interference of hair, background and other factors is removed. Secondly, the CNN model with deep attention feedback mechanism is used to learn the features of key facial areas, and the facial expression depth features are obtained by learning. Then, the discriminant metric learning method is introduced, and the feature vectors are mapped into new learned feature vectors by metric matrix. Finally, the extracted sample expression features are sent to the full connection layer, and identified by Softmax classifier and divided into seven predefined basic expressions. Experiments on CK+ and RAF-DB databases show that the average recognition rate of this method is 98.69% and 87.68%. The classification learning ability of the classifier is improved.

Key words: depth attention, expression recognition, convolutional neural network, metric learning