Mobile robot navigation based on support vector machine and Q-learning

Computer Engineering and Applications ›› 2011, Vol. 47 ›› Issue (23): 242-244.

• 工程与应用 • Previous Articles Next Articles

Mobile robot navigation based on support vector machine and Q-learning

HOU Yanli

Department of Computer，Shangqiu Teachers College，Shangqiu，Henan 476000，China

Received:1900-01-01 Revised:1900-01-01 Online:2011-08-11 Published:2011-08-11

基于支持向量机和Q学习的移动机器人导航

侯艳丽

商丘师范学院计算机科学系，河南商丘 476000

Abstract

Abstract: Continuous Q-learning algorithm based on neural has been used in robotic navigation domain for its simplicity and well-developed theory.Aiming at the neural easily falling into local minimum，a new mobile robot navigation method using Q-learning based on a Support Vector Machine（SVM） is proposed.According to the developed mobile robot CASIA-I and its working environment，an approach is proposed，used to determine the reward/penalty function of Q-learning.A SVM is used to estimate the Q-value of state-action pair on-line，at the same time，in order to decrease the on-time learning time of SVM，a sliding time-window is introduced.Experimental results are included to show that the action policy obtained through Q-learning based on SVM can make the mobile robot reach the destination without obstacle collision.

Key words: mobile robot, Q-learning, support vector machine, navigation, on-line learning

摘要： 基于神经网络的连续状态空间Q学习已应用在机器人导航领域。针对神经网络易陷入局部极小，提出了将支持向量机与Q学习相结合的移动机器人导航方法。首先以研制的CASIA-I移动机器人和它的工作环境为实验平台，确定出Q学习的回报函数;然后利用支持向量机对Q学习的状态——动作对的Q值进行在线估计，同时，为了提高估计速度，引入滚动时间窗机制;最后对所提方法进行了实验，实验结果表明所提方法能够使机器人无碰撞的到达目的地。

关键词: 移动机器人, Q学习, 支持向量机, 导航, 在线学习

HOU Yanli. Mobile robot navigation based on support vector machine and Q-learning[J]. Computer Engineering and Applications, 2011, 47(23): 242-244.

侯艳丽. 基于支持向量机和Q学习的移动机器人导航[J]. 计算机工程与应用, 2011, 47(23): 242-244.

[1]	GAO Yikai, PENG Li, XU Longzhuang. Flame Recognition Method Using TWSVM with Improved Artificial Fish Swarm Algorithm [J]. Computer Engineering and Applications, 2021, 57(8): 204-213.
[2]	HAN Weiyu, CHENG Longsheng. Research on Roling Bearing Failure Mode Classification Based on MTS and SVM [J]. Computer Engineering and Applications, 2021, 57(6): 239-246.
[3]	WANG Di, LI Caihong, GUO Na, LIU Guoming, GAO Tengteng. Local Path Planning of Mobile Robot Based on Fuzzy Potential Field Method [J]. Computer Engineering and Applications, 2021, 57(6): 212-218.
[4]	JIANG Lin, FANG Dongjun, LEI Bin, LI Weigang. Research Status and Trend of Navigation Algorithm for Mobile Robot with Monocular Vision [J]. Computer Engineering and Applications, 2021, 57(5): 1-9.
[5]	LEI Henglin, Gulanbaier Tuerhong, Mairidan Wushouer, ZHANG Dongmei. Review of Novelty Detection [J]. Computer Engineering and Applications, 2021, 57(5): 47-55.
[6]	WEN Jiebin, YANG Wenzhong, MA Guoxiang, ZHANG Zhihao, LI Hailei. Micro-expression Recognition Based on Apex Frame Optical Flow and Convolutional Autoencoder [J]. Computer Engineering and Applications, 2021, 57(4): 127-133.
[7]	ZHANG Junjie, ZHANG Cong, ZHAO Hanjie. Dueling Deep Q Network Algorithm with State Value Reuse [J]. Computer Engineering and Applications, 2021, 57(4): 134-140.
[8]	LI Yuqi, LIU Zhiqian, CHENG Ningyi, WANG Yingying, ZHU Chunli. Path Planning of UAV Under Multi-constraint Conditions [J]. Computer Engineering and Applications, 2021, 57(4): 225-230.
[9]	XU Xianfeng, CAI Lulu, ZHANG Li. Photovoltaic Power Generation Prediction Algorithm Based on MLP and DBN [J]. Computer Engineering and Applications, 2021, 57(3): 266-272.
[10]	LI Junxia, ZHANG Qin, ZHENG Guimei. Overview of Human Posture Recognition by Ultra-wideband Radar [J]. Computer Engineering and Applications, 2021, 57(3): 14-23.
[11]	YANG Lingyao, ZHANG Aihua, ZHANG Jie, SONG Jiqiang. Real-Time Path Planning of Velocity Potential for Robot in Grid Map Environment [J]. Computer Engineering and Applications, 2021, 57(24): 290-295.
[12]	CHEN Fujian, XIE Weixin, XIA Ting. Adaptive Anti-occlusion Target Tracking Algorithm Based on LCT+ [J]. Computer Engineering and Applications, 2021, 57(22): 190-198.
[13]	XIAO Liejun, BAO Wenjing, GAO Qingji. Modeling Research of Rotorcraft Integrated Positioning Based on CPS Framework [J]. Computer Engineering and Applications, 2021, 57(21): 287-294.
[14]	ZHANG Ziran, HUANG Weihua, CHEN Yang, ZHANG Zheng, LI Ziyuan. Improved Ant Colony Path Planning Algorithm Based on Bidirectional Search [J]. Computer Engineering and Applications, 2021, 57(21): 270-277.
[15]	YANG Quan. SVM Algorithm for N1+N2 Structure Syntax Relation Determination [J]. Computer Engineering and Applications, 2021, 57(20): 104-108.

Mobile robot navigation based on support vector machine and Q-learning

基于支持向量机和Q学习的移动机器人导航

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics