Journal of Guangdong University of Technology ›› 2019, Vol. 36 ›› Issue (01): 51-56,62.doi: 10.12052/gdutxb.180044

Previous Articles     Next Articles

Path Planning of Opposite Q Learning Robot Based on Virtual Sub-Target in Unknown Environment

Wang Sheng-min, Lin Wei, Zeng Bi   

  1. School of Computers, Guangdong University of Technology, Guangzhou 510006, China
  • Received:2018-03-16 Online:2019-01-25 Published:2018-12-29

Abstract: Aiming at the problem that in Q learning algorithm Q value is slow in updating speed in complex unknown environment and the dimensionality disaster is easy to occur, a path planning algorithm based on virtual subtarget for Q learning robot in unknown environment is proposed. According to the state trajectory explored by the mobile robot, two state chains are established to record the state-action pair and the state-reverse action pair respectively. The Q value of each single chain current state is fed back to the Q value of the previous state in turn till it affects the head of a single chain. Meanwhile, the problem that Q learning is prone to dimensionality disaster in large-scale environment is solved by finding the optimal virtual subtarget in the local detection domain. The experimental results show that the algorithm can effectively accelerate the convergence of the algorithm learning, improve the learning efficiency and complete the robot navigation task with a better path in the complex unknown environment.

Key words: mobile robot, virtual subtarget, opposite Q learning, unknown environment

CLC Number: 

  • TP242.6
[1] Wang Dong, Huang Rui-yuan, Li Wei-zheng, Huang Zhi-feng. A Research on Docking Position Optimization Method of Mobile Robot for Grasping Task [J]. Journal of Guangdong University of Technology, 2021, 38(06): 53-61.
[2] Ye Pei-chu, Li Dong, Zhang Yun. Direct Sparse Visual Odometer Based on Enhanced Stereo-Camera Constraints [J]. Journal of Guangdong University of Technology, 2021, 38(04): 65-70.
[3] Liu Rui-xue, Zeng Bi, Wang Ming-hui, Lu Zhi-liang. An Autonomous Mapping Method for Robot Based on Efficient Frontier Exploration [J]. Journal of Guangdong University of Technology, 2020, 37(05): 38-45.
[4] Wu Yun-xiong, Zeng Bi. Trajectory Tracking and Dynamic Obstacle Avoidance of Mobile Robot Based on Deep Reinforcement Learning [J]. Journal of Guangdong University of Technology, 2019, 36(01): 42-50.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!