计算机工程与应用 ›› 2020, Vol. 56 ›› Issue (24): 144-150.DOI: 10.3778/j.issn.1002-8331.1909-0229

• 模式识别与人工智能 • 上一篇    下一篇

结合改进的损失函数与多重范数的人脸识别

张飞翔,余学儒,何卫锋,李琛   

  1. 1.上海交通大学 电子信息与电气工程学院 微纳电子学系,上海 200240
    2.上海集成电路研发中心有限公司 AI部,上海 201203
  • 出版日期:2020-12-15 发布日期:2020-12-15

Face Recognition with Improved Loss Function and Multiple-Norm

ZHANG Feixiang, YU Xueru, HE Weifeng, LI Chen   

  1. 1.School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
    2.Department of Artificial Intelligence, Shanghai Integrated Circuit Research and Development Center Co., Ltd., Shanghai 201203, China
  • Online:2020-12-15 Published:2020-12-15

摘要:

针对在卷积神经网络中定义损失函数为余弦裕度损失函数(Cosineface)后导致收敛变慢以及在实际使用过程中使用L2范数衡量特征相似度存在缺陷的问题,提出了斜率可变的余弦裕度损失函数(Kcosine)和多重范数计算特征相似度的方法。该方法通过在余弦裕度损失函数的基础上添加余弦斜率因子,使得损失函数类内约束随着余弦值的增大而逐渐增强,显式地缩小类内距离,同时利用L2范数和L∞范数构建人脸特征相似度向量,并通过支撑向量机(SVM)实现分类,修正L2范数空间衡量的不稳定性。在LFW和Agedb的数据库上1∶1验证实验表明,改进的损失函数不仅加快了训练的收敛速度,并且将类内距离减少15%以上,同时通过使用多重范数特征代替L2范数,可以将识别率均值提升0.1%左右,标准差也有所降低。

关键词: 损失函数, L2范数, 多重范数, 人脸识别

Abstract:

To solve the problem of slow convergence after defining loss function as Cosine Margin Loss Function(Cosineface) in convolutional neural network and the defect of measuring feature similarity with L2 norm in practical use, a variable slope Cosine Margin Loss Function (Kcosine) and a multiple-norm method calculating feature similarity are proposed. By adding the cosine slope factor on the basis of cosine margin loss function, intra-class constraints increase with the increase of cosine value, and the intra-class distance is reduced explicitly. At the same time, the face feature similarity vector is constructed by using L2 norm and L∞ norm, and the classification is realized by Support Vector Machine(SVM) so that modifying instability of L2 norm space measurement. The 1∶1 validation experiments on LFW and Agedb databases show that the improved loss function not only accelerates the convergence speed of training, but also reduces the intra-class distance by more than 15%. At the same time, by using multiple-norm features instead of L2 norm, the average recognition rate can be increased by about 0.1%, and the standard deviation of recognition rate can also be reduced.

Key words: loss function, L2-norm, multiple-norm, face recognition