Multi-scale unsupervised feature learning for face recognition

Computer Engineering and Applications ›› 2016, Vol. 52 ›› Issue (14): 136-141.

YIN Xiaoyan¹, FENG Zhiyong¹, XU Chao²

1. Institute of Knowledge Science and Engineering, School of Computer Science and Technology, Tianjin University, Tianjin 300072, China
2. School of Computer Software, Tianjin University, Tianjin 300072, China

Online: 2016-07-15  Published: 2016-07-18

Abstract

To make full use of the latent information in face images, this paper proposes the Multi-Scale Convolutional Auto-Encoder (MSCAE), a method that obtains multi-scale image features by using convolution kernels of different sizes. The features extracted at different scales reflect the essential information of a face and allow the face image to be reconstructed more faithfully. The feature-extraction framework is a hierarchy of alternating convolution and subsampling layers, which makes the features highly invariant to rotation, translation, and scaling. MSCAE is trained in an encoder-decoder fashion to obtain the feature extractor; the extracted multi-scale features are then fused into a single feature vector for classification. Classification experiments with a BP neural network on the ORL and Yale face databases show that multi-scale features outperform single-scale features in both recognition rate and efficiency. In addition, fusing MSCAE features with Histograms of Oriented Gradients (HOG) features yields a higher recognition rate than either feature alone.

Key words: unsupervised feature learning, multi-scale, convolutional auto-encoder, deep learning
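
The multi-scale extraction and fusion step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the kernel sizes (3, 5, 7), the number of kernels per scale, the sigmoid activation, the 2x2 max pooling, and all function names are assumptions made for illustration, and the kernels are random stand-ins for filters that MSCAE would learn through encoder-decoder training.

```python
import numpy as np

def conv2d_valid(image, kernel):
    """Valid 2-D cross-correlation of a single-channel image with one kernel."""
    kh, kw = kernel.shape
    # windows has shape (H - kh + 1, W - kw + 1, kh, kw)
    windows = np.lib.stride_tricks.sliding_window_view(image, (kh, kw))
    return np.einsum("ijkl,kl->ij", windows, kernel)

def max_pool(fmap, size=2):
    """Non-overlapping max pooling; rows/columns that do not fill a window are dropped."""
    h, w = fmap.shape
    h2, w2 = h // size, w // size
    trimmed = fmap[:h2 * size, :w2 * size]
    return trimmed.reshape(h2, size, w2, size).max(axis=(1, 3))

def multi_scale_features(image, kernels_by_scale, pool=2):
    """Convolve, squash, and pool at every scale, then fuse into one vector."""
    parts = []
    for kernels in kernels_by_scale:      # one array of kernels per kernel size
        for kernel in kernels:
            response = conv2d_valid(image, kernel)
            fmap = 1.0 / (1.0 + np.exp(-response))   # sigmoid (assumed activation)
            parts.append(max_pool(fmap, pool).ravel())
    return np.concatenate(parts)          # fused multi-scale feature vector

rng = np.random.default_rng(0)
image = rng.random((32, 32))              # stand-in for a grayscale face image
kernel_sizes = (3, 5, 7)                  # hypothetical scales
# Random kernels: placeholders for filters learned by the auto-encoder.
kernels_by_scale = [rng.normal(scale=0.1, size=(4, s, s)) for s in kernel_sizes]
features = multi_scale_features(image, kernels_by_scale)
print(features.shape)
```

Under these assumptions, each scale contributes feature maps of a different size, and concatenation yields a single vector that a classifier such as a BP neural network could consume. Fusion with HOG features would amount to appending a HOG descriptor to the same vector.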

YIN Xiaoyan, FENG Zhiyong, XU Chao. Multi-scale unsupervised feature learning for face recognition[J]. Computer Engineering and Applications, 2016, 52(14): 136-141.
