Bus holding control method in public transit systems with multi-agent reinforcement learning

Abstract: Vehicle holding is a commonly used control strategy in transit operations for improving service reliability, and implementing it requires dynamic decision-making in an interactive, stochastic system environment. This paper introduces a novel use of a reinforcement learning framework to obtain an autonomous vehicle holding control strategy in a cooperative multi-agent system. A transit operation control model is developed based on the multi-agent system. In the multi-agent reinforcement learning framework, each bus is modeled as an independent agent with learning abilities, for which the state, actions, and reward are defined, and a coordination mechanism for the bus agents is designed to obtain joint holding actions. The hysteretic Q-learning algorithm is used to solve the holding problem. Simulation experiments show that the proposed approach can prevent bus bunching and regulate bus headways.

Key words: bus holding, multi-agent reinforcement learning, multi-agent system, control strategy
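
The hysteretic Q-learning update named in the abstract can be sketched as follows. This is a minimal illustration of the general algorithm, not the paper's implementation: the state labels, the holding-time action set `HOLD_ACTIONS`, the reward values, and the parameter values `alpha`, `beta`, and `gamma` are all assumptions, since the abstract does not specify them. The idea is that each agent uses a larger learning rate for positive temporal-difference errors than for negative ones, so that it is not overly penalized for poor outcomes caused by other agents' exploration.

```python
from collections import defaultdict

# Hypothetical holding times in seconds; the abstract does not give the action set.
HOLD_ACTIONS = (0, 30, 60)


def hysteretic_q_update(q, state, action, reward, next_state, actions,
                        alpha=0.1, beta=0.01, gamma=0.9):
    """Apply one hysteretic Q-learning update in place and return the TD error.

    alpha is the learning rate for non-negative TD errors and beta (< alpha)
    is the rate for negative ones. All default values are illustrative.
    """
    # Greedy value estimate of the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    # Temporal-difference error of the observed transition.
    delta = reward + gamma * best_next - q[(state, action)]
    # Optimistic asymmetry: increase estimates quickly, decrease them slowly.
    rate = alpha if delta >= 0 else beta
    q[(state, action)] += rate * delta
    return delta


# One Q-table per bus agent; unseen (state, action) pairs default to 0.0.
q_table = defaultdict(float)
hysteretic_q_update(q_table, "headway_short", 30, 1.0, "headway_ok", HOLD_ACTIONS)
```

In a multi-bus setting, each bus agent would keep its own table and call the update after every holding decision. The coordination mechanism that the paper uses to select joint holding actions is not described in the abstract and is not modeled here.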

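Abstract (translated from Chinese): Vehicle holding at stops is a common and effective control strategy for reducing bus bunching and improving transit service reliability, and carrying it out requires dynamic decision-making in a stochastic, interactive system environment. Considering the availability of real-time bus operation information, this paper studies reinforcement learning control of bus holding in a fully cooperative agent setting, builds a conceptual control model for a single bus line based on a multi-agent system, describes the main elements of the learning framework, including agent states, action sets, the reward function, and the coordination mechanism, and solves the problem with the hysteretic Q-learning algorithm. Simulation results show that the method effectively prevents bunching and keeps headways balanced in a single-line bus service system.

Key words (translated from Chinese): holding, multi-agent reinforcement learning, multi-agent system, control strategy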

CHEN Chunxiao1, CHEN Zhiya1,2, CHEN Weiya1. Bus holding control method in public transit systems with multi-agent reinforcement learning[J]. Computer Engineering and Applications, 2015, 51(17): 8-13.

陈春晓1，陈治亚1，2，陈维亚1. 基于多智能体增强学习的公交驻站控制方法[J]. 计算机工程与应用, 2015, 51(17): 8-13.

