计算机工程与应用 ›› 2015, Vol. 51 ›› Issue (17): 8-13.

• 博士论坛 • 上一篇    下一篇

基于多智能体增强学习的公交驻站控制方法

陈春晓1,陈治亚1,2,陈维亚1   

  1. 1.中南大学 交通运输工程学院,长沙 410075
    2.西安电子科技大学,西安 710071
  • 出版日期:2015-09-01 发布日期:2015-09-14

Bus holding control method in public transit systems with multi-agent reinforcement learning

CHEN Chunxiao1, CHEN Zhiya1,2, CHEN Weiya1   

  1. 1.School of Traffic and Transportation Engineering, Central South University, Changsha  410075, China
    2.Xidian University, Xi’an 710071, China
  • Online:2015-09-01 Published:2015-09-14

摘要: 车辆驻站是减少串车现象和改善公交服务可靠性的常用且有效控制策略,其执行过程需要在随机交互的系统环境中进行动态决策。考虑实时公交运营信息的可获得性,研究智能体完全合作环境下公交车辆驻站增强学习控制问题,建立基于多智能体系统的单线公交控制概念模型,描述学习框架下包括智能体状态、动作集、收益函数、协调机制等主要元素,采用hysteretic Q-learning算法求解问题。仿真实验结果表明该方法能有效防止串车现象并保持单线公交服务系统车头时距的均衡性。

关键词: 驻站, 多智能体增强学习, 多智能体系统, 控制策略

Abstract: Vehicle holding is a commonly used strategy among a variety of control strategies in transit operation for improving transit service reliability, whose implementation needs dynamic decision-making in an interactive and stochastic system environment. This paper introduces a novel use of a reinforcement learning framework to obtain vehicle holding autonomous control strategy in cooperative multi-agent system. Transit operation control model is developed based on multi-agent system. In the multi-agent reinforcement learning framework, each bus is modeled as an  independent agent with learning abilities, for which the state, actions and reward are defined and a coordination mechanism for multiple bus agents is designed to obtain a joint holding actions. The hysteretic Q-learning algorithm is used to solve this holding problem. From the simulation experiments, the results illustrate that the proposed approach is able to prevent buses from bunching and regulate bus headway.

Key words: bus holding, multi-agent reinforcement learning, multi-agent system, control strategy