Computer Engineering and Applications (计算机工程与应用), 2025, Vol. 61, Issue 24: 68-85. DOI: 10.3778/j.issn.1002-8331.2503-0238

• Hot Topics and Reviews •

视觉手语翻译技术研究综述

吕佳威,李绍彬+,朱若琳   

  1. 中国传媒大学 信息与通信学院,北京 100024
  • 出版日期:2025-12-15 发布日期:2025-12-15

Review of Research on Visual Sign Language Translation Technology

LYU Jiawei, LI Shaobin+, ZHU Ruolin   

  1. School of Information and Communication Engineering, Communication University of China, Beijing 100024, China
  • Online: 2025-12-15 Published: 2025-12-15

摘要: 视觉手语翻译技术是连接听障人群与健听人群的重要桥梁,近年来在计算机视觉与深度学习技术的驱动下取得显著进展。该技术旨在将手语视频动作自动转化为自然语言文本,从而实现两个群体的无障碍沟通。为便于研究者全面系统地了解视觉手语翻译任务,分别从三个方面展开综述研究:梳理并分类视觉手语翻译相关研究成果,并探讨其方法特点与技术演进;阐述手语数据的采集设备、多语言公开手语数据集以及常用评价指标;从当前手语技术的研究现状与应用实践出发,探讨该领域面临的挑战,并提出相应的展望和建议。

关键词: 视频理解, 手语翻译, 计算机视觉, 深度学习

Abstract: Visual sign language translation (VSLT) serves as a crucial bridge between the deaf and hearing communities. Driven by advances in computer vision and deep learning, VSLT has made significant progress in recent years. It aims to automatically convert sign language video into natural language text, thereby enabling barrier-free communication between the two communities. To give researchers a comprehensive and systematic view of the VSLT task, this review examines the field from three perspectives. First, it surveys and categorizes existing VSLT research, discussing the characteristics of the methods and their technological evolution. Second, it describes sign language data acquisition equipment, publicly available multilingual sign language datasets, and commonly used evaluation metrics. Third, starting from the current state of research and practical applications, it identifies the challenges facing the field and offers a corresponding outlook and recommendations.

Key words: video understanding, sign language translation, computer vision, deep learning