计算机工程与应用 ›› 2021, Vol. 57 ›› Issue (24): 61-73.DOI: 10.3778/j.issn.1002-8331.2109-0196

• 热点与综述 • 上一篇    下一篇

唇语识别的深度学习方法综述

马金林,朱艳彬,马自萍,巩元文,陈德光,刘宇灏   

  1. 1.北方民族大学 计算机科学与工程学院,银川 750021
    2.图像图形智能信息处理国家民委重点实验室,银川 750021
    3.北方民族大学 数学与信息科学学院,银川 750021
  • 出版日期:2021-12-15 发布日期:2021-12-13

Review of Deep Learning Methods for Lip Recognition

MA Jinlin, ZHU Yanbin, MA Ziping, GONG Yuanwen, CHEN Deguang, LIU Yuhao   

  1. 1.School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China
    2.Key Laboratory for Intelligent Processing of Computer Images and Graphics of the State Ethnic Affairs Commission of PRC, Yinchuan 750021, China
    3.School of Mathematics and Information Science, North Minzu University, Yinchuan 750021, China
  • Online:2021-12-15 Published:2021-12-13

摘要:

随着深度学习的不断发展,唇语识别领域的研究取得了重大进展,涌现了许多唇语识别的深度学习算法。依据识别对象的连续性,将唇语识别分为孤立唇语识别和连续唇语识别,并对各识别任务的深度学习方法进行了详细和深入的分析总结。从孤立唇语识别的深度学习方法和连续唇语识别的深度方法两个方面介绍了主流唇语识别方法,并对各方法的优缺点和性能进行比较;对不同数据集下代表性方法的特点和性能进行比较,对两类方法的优缺点和适用范围进行阐述;讨论了唇语识别方法存在的问题和挑战,并对唇语识别方法的研究趋势进行了展望。

关键词: 唇语识别, 深度学习, 卷积神经网络, 注意力机制

Abstract:

With the continuous development of deep learning, significant progress has been made in the field of lip recognition, and many deep learning algorithms for lip recognition have emerged. According to the continuity of the recognition object, this paper divides lip recognition into isolated lip recognition and continuous lip recognition, analyzes and summarizes the deep learning methods of each recognition task in detail and in-depth. Firstly, it introduces the mainstream lip recognition methods from the two aspects of the deep learning method of isolated lip recognition and the deep method of continuous lip recognition, and compares the advantages, disadvantages and performance of each method. Secondly, the characteristics and performance of representative methods for different data sets are compared, and the advantages, disadvantages and application scope of the two methods are expounded. Finally, the problems and challenges of lip recognition methods are discussed, and the research trend of lip recognition methods is prospected.

Key words: lip recognition, deep learning, convolutional neural networks, attentional mechanisms