Multi-robot cooperative carrying in dynamic environment

Abstract

Abstract: In the multi-robot cooperative carrying process, traditional reinforcement learning only uses numerical analysis and ignored reasoning approach. To solve this problem, independence reinforcement learning for multi-robot combines with Belief-Desire-Intention（BDI） model, which makes reinforcement learning link logical reasoning capabilities. And the distance nearest principle is employed which means that the nearest robot ranged from obstacles is the leader robot to control other robots move. Evaluation function which changes with the location of multi-robot and the barriers is proposed, and it combines with the behavior weight based on reinforcement learning which becomes more and more optimized through constantly interacting with the environment. Simulation results show that this method is feasible, and the cooperative carrying process can be successfully achieved.

Key words: multi-robot, reinforcement learning, cooperative carrying, obstacle avoidance

摘要： 在多机器人协同搬运过程中，针对传统的强化学习算法仅使用数值分析却忽略了推理环节的问题，将多机器人的独立强化学习与“信念-愿望-意向”（BDI）模型相结合，使得多机器人系统拥有了逻辑推理能力，并且，采用距离最近原则将离障碍物最近的机器人作为主机器人，并指挥从机器人运动，提出随多机器人系统位置及最近障碍物位置变化的评价函数，同时将其与基于强化学习的行为权重结合运用，在多机器人通过与环境不断交互中，使行为权重逐渐趋向最佳。仿真实验表明，该方法可行，能够成功实现协同搬运过程。

关键词: 多机器人, 强化学习, 协同搬运, 避障

CAO Jie, ZHU Ningning. Multi-robot cooperative carrying in dynamic environment[J]. Computer Engineering and Applications, 2013, 49(23): 252-256.

曹洁，朱宁宁. 动态环境中的多机器人协同搬运[J]. 计算机工程与应用, 2013, 49(23): 252-256.

[1]	LIAO Liefa, LI Haohan, LI Shuai, ZHU Helong, LI Zhijun. Research on Control Strategy of Soccer Robot Combined with Winner-Take-All [J]. Computer Engineering and Applications, 2021, 57(7): 136-143.
[2]	WANG Xiao, TANG Lun, HE Xiaoyu, CHEN Qianbin. Multi-dimensional Resource Optimization of Service Function Chain Based on Deep Reinforcement Learning [J]. Computer Engineering and Applications, 2021, 57(4): 68-76.
[3]	LAI Jun, WEI Jingyi, CHEN Xiliang. Overview of Hierarchical Reinforcement Learning [J]. Computer Engineering and Applications, 2021, 57(3): 72-79.
[4]	MA Zhihao, ZHU Xiangbin. Research on Quasi-hyperbolic Momentum Gradient for Adversarial Deep Reinforcement Learning [J]. Computer Engineering and Applications, 2021, 57(24): 90-99.
[5]	YANG Lingyao, ZHANG Aihua, ZHANG Jie, SONG Jiqiang. Real-Time Path Planning of Velocity Potential for Robot in Grid Map Environment [J]. Computer Engineering and Applications, 2021, 57(24): 290-295.
[6]	LI Baoshuai, YE Chunming. Job Shop Scheduling Problem Based on Deep Reinforcement Learning [J]. Computer Engineering and Applications, 2021, 57(23): 248-254.
[7]	WANG Jun, CAO Lei, CHEN Xiliang, LAI Jun, ZHANG Legui. Overview on Reinforcement Learning of Multi-agent Game [J]. Computer Engineering and Applications, 2021, 57(21): 1-13.
[8]	CHENG Yi, HAO Mimi. Path Planning for Indoor Mobile Robot with Improved Deep Reinforcement Learning [J]. Computer Engineering and Applications, 2021, 57(21): 256-262.
[9]	KUANG Liqun, LI Siyuan, FENG Li, HAN Xie, XU Qingyu. Application of Deep Reinforcement Learning Algorithm on Intelligent Military Decision System [J]. Computer Engineering and Applications, 2021, 57(20): 271-278.
[10]	LI Hao, NING Haoyu, KANG Yan, LIANG Wentao, HUO Wen. SMRFGAN Model for Text Emotion Transfer [J]. Computer Engineering and Applications, 2021, 57(2): 170-176.
[11]	KONG Songtao, LIU Chichi, SHI Yong, XIE Yi, WANG Kun. Review of Application Prospect of Deep Reinforcement Learning in Intelligent Manufacturing [J]. Computer Engineering and Applications, 2021, 57(2): 49-59.
[12]	SONG Haonan, ZHAO Gang, WANG Xingfen. Knowledge Reasoning Method Combining Knowledge Representation with Deep Reinforcement Learning [J]. Computer Engineering and Applications, 2021, 57(19): 189-197.
[13]	ZHANG Rongxia, WU Changxu, SUN Tongchao, ZHAO Zengshun. Progress on Deep Reinforcement Learning in Path Planning [J]. Computer Engineering and Applications, 2021, 57(19): 44-56.
[14]	YANG Xueyu, CHEN Jianping, FU Qiming, LU You, WU Hongjie. Deep Deterministic Policy Gradient Algorithm Based on Stochastic Variance Reduction Method [J]. Computer Engineering and Applications, 2021, 57(19): 104-111.
[15]	WANG Keyin, SHI Zhen, YANG Zhengcai, YANG Yahui, WANG Sishan. Path Planning for Mobile Robot Using Improved Reinforcement Learning Algorithm [J]. Computer Engineering and Applications, 2021, 57(18): 270-274.

Multi-robot cooperative carrying in dynamic environment

动态环境中的多机器人协同搬运

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics