Research on Design and Control Strategy of Universal Joint Snake-Like Robot

doi:10.3778/j.issn.1002-8331.2302-0250

Abstract

Abstract: To address the complex structure and limited flexibility of existing snake-like robots, a snake-like robot with cross-shaft universal joints is designed. The robot consists of six modular units, each fitted with passive wheels. In each module a motor drives the slider on a ball screw, and a connecting rod converts the slider's travel into deflection of the universal joint, producing serpentine (meandering) locomotion. The multiple degrees of freedom permitted by the universal-joint limit mechanism keep the robot's motion flexible. Because snake-like robots are difficult to model, a control strategy based on deep reinforcement learning is also proposed: an interactive learning environment is built with the MuJoCo physics engine, and the proximal policy optimization (PPO) algorithm is used to train an optimal motion policy that guides the robot's actions. Simulation results obtained with the designed robot model show that the PPO-trained policy completes a straight-line forward task under different friction coefficients, indicating that the robot has a degree of adaptability to different terrain. Finally, experiments on a physical prototype verify the feasibility and stability of the scheme.

Key words: snake-like robot, universal structure, reinforcement learning, proximal policy optimization (PPO)
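
The abstract names PPO as the training algorithm but gives no hyperparameters, reward definition, or network details. As an illustrative sketch only, the clipped surrogate objective introduced with PPO can be written in plain Python as below. The function name `ppo_clip_objective`, the clip range of 0.2, and the toy inputs are assumptions made for this example and are not taken from the paper.

```python
import math


def ppo_clip_objective(new_logps, old_logps, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to maximize).

    new_logps / old_logps: log-probabilities of the sampled actions under
    the current policy and the policy that collected the data.
    advantages: advantage estimates for the same samples.
    clip_eps: clip range; 0.2 is an assumed, commonly used default.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_logps, old_logps, advantages):
        # Probability ratio pi_new(a|s) / pi_old(a|s).
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


if __name__ == "__main__":
    # Toy data: two samples whose action probabilities moved from 0.5
    # to 0.75 and from 0.5 to 0.25.
    old = [math.log(0.5), math.log(0.5)]
    new = [math.log(0.75), math.log(0.25)]
    adv = [1.0, -1.0]
    print(ppo_clip_objective(new, old, adv))  # approximately 0.2
```

In the toy call, the first ratio (1.5) is clipped to 1.2 and the second term takes the clipped value -0.8 rather than -0.5. This shows how the objective removes any incentive to move the policy far from the one that gathered the data in a single update.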

LI Yaxin, LU Yunfei, HE Ziwei, ZHOU Zhenghui. Research on Design and Control Strategy of Universal Joint Snake-Like Robot[J]. Computer Engineering and Applications, 2023, 59(16): 143-149.
