Cooperative Multi-Robot Foraging Based on Reinforcement Learning in Unknown Environment

Computer Engineering and Applications ›› 2007, Vol. 43 ›› Issue (10): 19-21.

• Doctoral Forum •


Zhao Jie, Jiang Jian, Zang Xizhe

Received: 2006-12-25 Online: 2007-04-01 Published: 2007-04-01
Contact: Jiang Jian


Affiliation (all three authors): Multi-Sensor Integration and Control Laboratory, Robotics Institute, Harbin Institute of Technology


Abstract

Abstract: To reduce the learning state space of complex cooperative multi-robot foraging tasks and to speed up learning, a two-layer hierarchical reinforcement learning algorithm with a shared zone is presented. The algorithm learns state-action pairs at the lower layer and condition-behavior pairs at the higher layer. Learning condition-behavior pairs at the higher layer avoids the combinatorial explosion of the learning space, and the shared zone strengthens the robots' ability to learn cooperatively. Simulation results show that the algorithm speeds up the robots' learning and meets the time requirements of complex multi-robot foraging tasks in an unknown environment.

Key words: foraging task, multi-robot systems, reinforcement learning, cooperation
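
The abstract does not give the update rule or say what the shared zone stores, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes standard tabular Q-learning at both layers and assumes the shared zone is a single table of higher-layer values that every robot reads and writes. The class name, action names, behavior names, condition labels, and the values of ALPHA and GAMMA are all hypothetical.

```python
from collections import defaultdict

ALPHA = 0.1  # learning rate (assumed value, not from the paper)
GAMMA = 0.9  # discount factor (assumed value, not from the paper)


def q_update(table, key, choice, reward, next_key, choices):
    """One tabular Q-learning step: Q <- Q + ALPHA * (target - Q)."""
    best_next = max(table[(next_key, c)] for c in choices)
    target = reward + GAMMA * best_next
    table[(key, choice)] += ALPHA * (target - table[(key, choice)])
    return table[(key, choice)]


class TwoLayerLearner:
    """Hypothetical robot with a lower and a higher learning layer."""

    ACTIONS = ("forward", "left", "right")     # assumed primitive actions
    BEHAVIORS = ("search", "grab", "deliver")  # assumed high-level behaviors

    def __init__(self, shared_zone):
        # Lower layer: private state-action values for this robot only.
        self.low_q = defaultdict(float)
        # Higher layer: condition-behavior values kept in the shared zone
        # (assumption: all robots update the same table).
        self.high_q = shared_zone

    def learn_low(self, state, action, reward, next_state):
        return q_update(self.low_q, state, action, reward,
                        next_state, self.ACTIONS)

    def learn_high(self, condition, behavior, reward, next_condition):
        return q_update(self.high_q, condition, behavior, reward,
                        next_condition, self.BEHAVIORS)


shared_zone = defaultdict(float)
robot_a = TwoLayerLearner(shared_zone)
robot_b = TwoLayerLearner(shared_zone)

# Robot A learns a condition-behavior pair; robot B sees the same value
# because both robots hold a reference to the same shared table.
robot_a.learn_high("target_seen", "grab", 1.0, "holding_target")
value_seen_by_b = robot_b.high_q[("target_seen", "grab")]
```

In this sketch the higher layer has only a few conditions and behaviors, so its table stays small no matter how many raw sensor states the lower layer distinguishes, which is one plausible reading of how the higher layer avoids combinatorial explosion.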


Zhao Jie, Jiang Jian, Zang Xizhe. Cooperative Multi-Robot Foraging Based on Reinforcement Learning in Unknown Environment[J]. Computer Engineering and Applications, 2007, 43(10): 19-21.


