计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (4): 1-24.DOI: 10.3778/j.issn.1002-8331.2407-0034

• 热点与综述 • 上一篇    下一篇

多智能体深度强化学习及可扩展性研究进展

刘延飞,李超,王忠,王杰铃   

  1. 火箭军工程大学 基础部,西安 710025
  • 出版日期:2025-02-15 发布日期:2025-02-14

Research Progress on Multi-Agent Deep Reinforcement Learning and Scalability

LIU Yanfei, LI Chao, WANG Zhong, WANG Jieling   

  1. Department of Basic Courses, Rocket Force University of Engineering, Xi’an 710025, China
  • Online:2025-02-15 Published:2025-02-14

摘要: 多智能体深度强化学习近年来在解决智能体协作、竞争和通信问题上展现出巨大潜力。然而伴随着其在更多领域的应用,可扩展性问题备受关注,是理论研究到大规模工程应用的重要问题。回顾了强化学习理论和深度强化学习的典型算法,介绍了多智能体深度强化学习三类学习范式及其代表算法,并简要整理出当前主流的开源实验平台。详细探讨了多智能体深度强化学习在数量和场景上的可扩展性研究进展,分析了各自面临的核心问题并给出了现有的解决思路。展望了多智能体深度强化学习的应用前景和发展趋势,为推动该领域的进一步研究提供参考和启示。

关键词: 多智能体系统, 强化学习, 深度强化学习, 可扩展性

Abstract: Multi-agent deep reinforcement learning has shown great potential in solving agent collaboration, competition, and communication problems in recent years. However, as its application expands across more domains, scalability has become a focal concern, which is an important problem from theoretical research to large-scale engineering applications. This paper reviews the reinforcement learning theory and typical algorithms of deep reinforcement learning, introduces three learning paradigms of multi-agent deep reinforcement learning and their representative algorithms, and briefly summarizes the current mainstream open-source experimental platforms. Then, this paper delves into the research progress on the scalability of the number and scenarios in multi-agent deep reinforcement learning, analyzes the main problems faced by each method and providing existing solutions. Finally, the application prospect and development trend of multi-agent deep reinforcement learning are prospected, providing references and inspiration to further advance research in this field.

Key words: multi-agent system, reinforcement learning, deep reinforcement learning, scalability