计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (19): 12-42.DOI: 10.3778/j.issn.1002-8331.2502-0072

• 热点与综述 • 上一篇    下一篇

无人驾驶深度强化学习决策模型性能评测方法综述

顾同成,徐东伟,孙成巨   

  1. 1.浙江工业大学 信息工程学院,杭州 310014
    2.浙江工业大学 网络空间安全研究院,杭州 310014
  • 出版日期:2025-10-01 发布日期:2025-09-30

Review of Performance Evaluation Methods for Deep Reinforcement Learning Decision Models in Autonomous Driving

GU Tongcheng, XU Dongwei, SUN Chengju   

  1. 1.College of Information Engineering, Zhejiang University of Technology, Hangzhou 310014, China
    2.Institute of Cyberspace Security, Zhejiang University of Technology, Hangzhou 310014, China
  • Online:2025-10-01 Published:2025-09-30

摘要: 目前,以深度强化学习(deep reinforcement learning,DRL)为主要决策方法的端到端无人驾驶技术在典型交通驾驶任务中的表现取得显著进展。但是,由于DRL“试错”交互的独特学习方式,使其在应用到真实驾驶环境之前,必须经过严格的多维评测过程。因此,性能评测成为DRL无人驾驶决策模型向真实世界迁移的一个关键且不可或缺的步骤。梳理分析当前无人驾驶领域主流的技术实现方法;聚焦DRL方法,综述其在无人驾驶决策中的研究模式与最新成果,探讨其在处理无人驾驶任务时所面临的问题与瓶颈;面向端到端DRL无人驾驶决策模型,从安全性、鲁棒性、舒适性、效率、可靠性五个方面全面综述性能评测方法,分析影响因素并梳理性能评测流程;对比总结目前常用且开源的无人驾驶虚拟仿真平台的特点及适用场景;概述性能评测存在的开放性问题及对未来评测方法的研究展望,为相关研究和模型应用部署提供理论支持和参考依据。

关键词: 智能交通, 无人驾驶, 深度强化学习, 评测方法, 决策性能, 端到端控制

Abstract: Recently, end-to-end autonomous driving technology, using deep reinforcement learning (DRL) as the primary decision-making method, has achieved significant progress in complex dynamic environments. However, due to the unique trial-and-error learning mechanism of DRL, it must undergo a strict multi-dimensional evaluation process before being applied to real-world driving environments. Thus, performance evaluation becomes a critical and indispensable step for transferring DRL-based autonomous driving decision models to the real world. Firstly, this paper reviews the mainstream technical implementation methods in the field of autonomous driving. Next, the focus is placed on DRL methods, summarizing their research paradigms and the latest advances in autonomous driving decision-making, and discussing the issues and bottlenecks they face when addressing autonomous driving tasks. Following this, a comprehensive review of performance evaluation methods for end-to-end DRL autonomous driving decision models is provided, covering aspects such as safety, robustness, comfort, efficiency, and reliability, analyzing influencing factors, and summarizing the performance evaluation process. Subsequently, commonly used and open-source virtual simulation platforms for autonomous driving are compared and summarized regarding their features and applicable scenarios. Finally, open issues in performance evaluation and prospects for future research on evaluation methods are presented, providing theoretical support and reference for related research and model deployment.

Key words: smart transportation, autonomous driving, deep reinforcement learning, evaluation methods, decision performance, end-to-end control