计算机工程与应用 ›› 2024, Vol. 60 ›› Issue (12): 18-33.DOI: 10.3778/j.issn.1002-8331.2309-0497

• 热点与综述 • 上一篇    下一篇

基于深度学习的视线估计方法综述

温铭淇,任路乾,陈镇钦,杨卓,战荫伟   

  1. 广东工业大学 计算机学院,广州 510006
  • 出版日期:2024-06-15 发布日期:2024-06-14

Survey of Deep Learning Based Approaches for Gaze Estimation

WEN Mingqi, REN Luqian, CHEN Zhenqin, YANG Zhuo, ZHAN Yinwei   

  1. School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
  • Online:2024-06-15 Published:2024-06-14

摘要: 视线估计是一种预测人眼注视位置或注视方向的技术,在人机交互和计算机视觉的应用中发挥重要作用。近几年,深度学习的飞速发展改变了许多计算机视觉任务,利用深度学习进行基于外观的视线估计已成为关注热点。围绕深度学习模型的训练流程,从视线数据预处理、视线特征提取、视线学习策略、视线估计模型结构四个方面对近年基于深度学习的视线估计方法进行了综述和分析;然后介绍视线估计领域主流公开数据集,并对常用数据集分别进行2D和3D视线估计方法的对比分析。最后,探讨了当前视线估计领域的研究难点与挑战,并对未来的发展趋势进行总结与展望。

关键词: 计算机视觉, 深度学习, 视线估计, 眼动跟踪, 人机交互

Abstract: Gaze estimation is a technique for predicting the gaze position or gaze direction of the human eye and plays an important role in human-computer interaction and computer vision applications. The recent development of deep learning has revolutionized many computer vision tasks, and using deep learning for appearance-based gaze estimation has also become a hot topic. Focusing on the training process of the deep learning model, this paper analyzes state-of-the-art gaze estimation methods from four perspectives: gaze data preprocessing, gaze feature extraction, gaze learning strategies, and deep gaze model structures. In addition, the mainstream public datasets are summarized, and the performance evaluation and analysis of 2D and 3D gaze estimation methods are carried out on several popular datasets. Finally, the challenges faced by the existing gaze estimation methods are discussed, and the future development directions are prospected.

Key words: computer vision, deep learning, gaze estimation, eye tracking, human-computer interaction