Multiagent Q-learning based on ant colony algorithm and roulette algorithm

doi:10.3778/j.issn.1002-8331.2009.16.016

Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (16): 60-62.DOI: 10.3778/j.issn.1002-8331.2009.16.016

• 研究、探讨 • Previous Articles Next Articles

Multiagent Q-learning based on ant colony algorithm and roulette algorithm

MENG Xiang-ping¹,WANG Sheng-bin²,WANG Xin-xin²

1.Department of Electrical Engineering，Changchun Institute of Technology，Changchun 130012，China
2.Department of Computer Engineering，Northeast Dianli University，Jilin 132012，China

Received:2008-04-11 Revised:2008-06-18 Online:2009-06-01 Published:2009-06-01
Contact: MENG Xiang-ping

基于蚁群算法和轮盘算法的多Agent Q学习

孟祥萍¹,王圣镔²,王欣欣²

1.长春工程学院电气与信息学院，长春 130012
2.东北电力大学信息工程学院，吉林 132012

通讯作者: 孟祥萍

Abstract

Abstract: Authors present a novel Multiagent Reinforcement Learning Algorithm based on Q-Learning，ant colony algorithm and roulette algorithm.In reinforcement learning algorithm，when the number of agents is large enough，all of the action selection methods will be failed：the speed of learning is decreased sharply.Besides，as the Agent makes use of the Q value to choose the next action，the next action is restrainted seriously by the high Q value，in the prophase.So，authors combine the ant conlony algorithm，roulette algorithm with Q-learning，hope that the problems will be resolved with the algorithm proposed.At last，the theory analysis and experiment result both demonstrate that the improved Q-learning is feasible and increases the learning efficiency.

Key words: multiagent reinforcement learning algorithm, ant colony algorithm, roulette algorithm

摘要： 提出了一种新颖的基于Q-学习、蚁群算法和轮盘赌算法的多Agent强化学习。在强化学习算法中，当Agent数量增加到足够大时，就会出现动作空间灾难性问题，即：其学习速度骤然下降。另外，Agent是利用Q值来选择下一步动作的，因此，在学习早期，动作的选择严重束缚于高Q值。把蚁群算法、轮盘赌算法和强化学习三者结合起来，期望解决上述提出的问题。最后，对新算法的理论分析和实验结果都证明了改进的Q学习是可行的，并且可以有效地提高学习效率。

关键词: 多Agent强化学习算法, 蚁群算法, 轮盘赌算法

MENG Xiang-ping¹,WANG Sheng-bin²,WANG Xin-xin². Multiagent Q-learning based on ant colony algorithm and roulette algorithm[J]. Computer Engineering and Applications, 2009, 45(16): 60-62.

孟祥萍¹,王圣镔²,王欣欣². 基于蚁群算法和轮盘算法的多Agent Q学习[J]. 计算机工程与应用, 2009, 45(16): 60-62.

[1]	SHI Chuntian, ZENG Yanyang, HOU Shouming. Summary of Application of Swarm Intelligence Algorithms in Image Segmentation [J]. Computer Engineering and Applications, 2021, 57(8): 36-47.
[2]	ZHANG Songcan, PU Jiexin, SI Yanna, SUN Lifan. Adaptive Improved Ant Colony Algorithm Based on Population Similarity and Its Application [J]. Computer Engineering and Applications, 2021, 57(8): 70-77.
[3]	BU Guannan, LIU Jianhua, JIANG Lei, ZHANG Dongyang. Ant Colony Algorithm with Adaptive Grouping [J]. Computer Engineering and Applications, 2021, 57(6): 67-73.
[4]	MA Xianghua, ZHANG Qian. Research on Improved Ant Colony Algorithm in Robots Path Planning [J]. Computer Engineering and Applications, 2021, 57(5): 210-215.
[5]	WANG Xiaoguang, YANG Peibei. Design and Effect Analysis of Digital Transformation of Shipping Logistics Enterprises [J]. Computer Engineering and Applications, 2021, 57(21): 241-247.
[6]	ZHANG Ziran, HUANG Weihua, CHEN Yang, ZHANG Zheng, LI Ziyuan. Improved Ant Colony Path Planning Algorithm Based on Bidirectional Search [J]. Computer Engineering and Applications, 2021, 57(21): 270-277.
[7]	LI Erchao, QI Kuankuan. Improved Bidirectional Ant Colony Algorithm Mobile Robot Path Planning [J]. Computer Engineering and Applications, 2021, 57(18): 281-288.
[8]	FU Zhaohui, LIU Changshi. Research on Multi-depot Vehicle Routing Problem Based on Joint Distribution Mode [J]. Computer Engineering and Applications, 2021, 57(16): 291-298.
[9]	ZHANG Suying, GUO Baoliang, CHEN Lingzhi, LIU Huixian. Path Planning of Intelligent Fire Evacuation Map Based on Bidirectional Ant Colony Algorithm [J]. Computer Engineering and Applications, 2021, 57(14): 259-266.
[10]	FU Zhaohui, LIU Changshi. Research on Open Time-Dependent Vehicle Routing Problem of Fresh Food E-commerce Distribution [J]. Computer Engineering and Applications, 2021, 57(1): 271-278.
[11]	ZHANG Songcan, PU Jiexin, SI Yanna, SUN Lifan. Survey on Application of Ant Colony Algorithm in Path Planning of Mobile Robot [J]. Computer Engineering and Applications, 2020, 56(8): 10-19.
[12]	CHEN Xi, GAO Junwei, GUAN Sheng. Path Planning for Wave Glider Based on Artificial Pathfinder Ant [J]. Computer Engineering and Applications, 2020, 56(4): 241-246.
[13]	XU Chuanglai, HU Jiankun, HUANG Youfang. Robust Optimization of Express Delivery Network with Uncertain Demand [J]. Computer Engineering and Applications, 2020, 56(3): 272-278.
[14]	ZHANG Xiaoli, YANG Yaxin, XIE Yongcheng. Application of Improved Ant Colony Algorithm in Robot Path Planning [J]. Computer Engineering and Applications, 2020, 56(2): 29-34.
[15]	SONG Fangzhen, XU Yanyan, TANG Xin, PAN Shaoming. Emergency Communication Network Routing Protocol Based on Improved Ant Colony Algorithm [J]. Computer Engineering and Applications, 2020, 56(18): 90-96.

Multiagent Q-learning based on ant colony algorithm and roulette algorithm

基于蚁群算法和轮盘算法的多Agent Q学习

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics