Autonomous Gait Learning of Hexapod Robot Based on Reward Shaping

ZHU Xiaoqing; CHEN Lu; RAN Dengyu; QIAN Yizhao; WANG Mingchao; BIE Tong

doi:10.11936/bjutxb2019110007

Journal of Beijing University of Technology > 2021 > 47(2): 120-126. > DOI: 10.11936/bjutxb2019110007

ZHU Xiaoqing, CHEN Lu, RAN Dengyu, QIAN Yizhao, WANG Mingchao, BIE Tong. Autonomous Gait Learning of Hexapod Robot Based on Reward Shaping[J]. Journal of Beijing University of Technology, 2021, 47(2): 120-126. DOI: 10.11936/bjutxb2019110007

Citation:

PDF (4608 KB)

Autonomous Gait Learning of Hexapod Robot Based on Reward Shaping

1.
Faulty of Information Technology, Beijing University of Technology, Beijing 100124, China
2.
Fan Gongxiu Honors College, Beijing University of Technology, Beijing 100124, China

More Information

Received Date: November 11, 2019
Available Online: August 03, 2022
Published Date: February 09, 2021

Graphical Abstract

Abstract

Abstract

To improve the autonomous learning ability of robots, a novel autonomous gait learning algorithm based on reward shaping was proposed. First, the mechanical structure of gait synchronous hexapod robot was introduced, with unique multi-layer steel leg design and special mechanical synchronous transmission scheme, which laid a good foundation for robots' gait learning research. Second, a robot gait learning algorithm based on reward shaping was presented, allowing the robot to learn skills in an unknown environment. A concise reward and punishment function, which was based on the principle of simplicity, was designed. Finally, experiments performed on MATLAB and ADAMS joint simulation platform verified the proposed algorithm. Two reasonable convergent results were presented when speed and height, respectively, were the main evaluating indicators. Moreover, the hexapod robot independently learned the difficult skills to move over the blocks with the proposed algorithm.
- reinforcement learning,
- hexapod robot,
- autonomous Learning,
- reward shaping,
- neural network,
- gait learning

FullText(HTML)

References (19)

References

[1]	刘庆运, 景甜甜.六足步行机器人及其步态规划研究进展[J].重庆理工大学学报(自然科学), 2015, 29(7): 87-94. https://www.cnki.com.cn/Article/CJFDTOTAL-CGGL201507016.htm LIU Q Y, JING T T. Survey on hexapod walking robot and gait planning[J]. Journal of Chongqing University of Technology (Natural Science), 2015, 29(7): 87-94. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-CGGL201507016.htm
[2]	李满宏, 张明路, 张建华.六足机器人关键技术综述[J].机械设计, 2015(10): 1-8. https://www.cnki.com.cn/Article/CJFDTOTAL-JXSJ201510001.htm LI M H, ZHANG M L, ZHANG J H. Review on key technologies of hexapod robot[J]. Journal of Machine Design, 2015(10): 1-8. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-JXSJ201510001.htm
[3]	郭建, 谢鑫, 蒋品.仿生六足机器人的结构设计及运动分析[J].机床与液压, 2019, 47(3): 21-26. https://www.cnki.com.cn/Article/CJFDTOTAL-JCYY201903005.htm GUO J, XIE X, JIANG P. Structure design and motion analysis of hexapod bionic robot[J]. Machine Tool & Hydraulics, 2019, 47(3): 21-26. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-JCYY201903005.htm
[4]	ZHANG H, LIU Y, ZHAO J, et al. Development of a bionic hexapod robot for walking on unstructured terrain[J]. Journal of Bionic Engineering, 2014, 11(2): 176-187. doi: 10.1016/S1672-6529(14)60041-X
[5]	SCHILLING M, CRUSE H, ARENA P. Hexapod walking: an expansion to walknet dealing with leg amputations and force oscillations[J]. Biological Cybernetics, 2007, 96(3): 323-340. doi: 10.1007/s00422-006-0117-1
[6]	CHEN X, WANG L, YE X, et al. Prototype development and gait planning of biologically inspired multi-legged crablike robot[J]. Mechatronics, 2013, 23(4): 429-444. doi: 10.1016/j.mechatronics.2013.03.006
[7]	HAUSER K, BRETL T, LATOMBE J, et al. Motion planning for legged robots on varied terrain[J]. International Journal of Robotics Research, 2008, 27(11/12): 1325-1349.
[8]	KENNEDY B, OKON A, AGHAZARIAN H, et al. Lemur Ⅱb: a robotic system for steep terrain access[J]. Industrial Robot, 2006, 33(4): 265-269. doi: 10.1108/01439910610667872
[9]	丁良宏, 王润孝, 冯华山, 等.浅析BigDog四足机器人[J].中国机械工程, 2012, 23(5): 505-514. doi: 10.3969/j.issn.1004-132X.2012.05.001 DING L H, WANG R X, FENG H S, et al. Brief analysis of a bigdog quadruped robot[J]. China Mechanical Engineering, 2012, 23(5): 505-514. (in Chinese) doi: 10.3969/j.issn.1004-132X.2012.05.001
[10]	李满宏, 张明路, 张建华, 等.基于离散化的六足机器人自由步态生成算法[J].机械工程学报, 2016, 52(3): 18-25. https://www.cnki.com.cn/Article/CJFDTOTAL-JXXB201603003.htm LI M H, ZHANG M L, ZHANG J H, et al. Free gait generation algorithm for a hexapod robot based on discretization[J]. Journal of Mechanical Engineering, 2016, 52(3): 18-25. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-JXXB201603003.htm
[11]	ERDEN M S, KEMAL L. Free gait generation with reinforcement learning for a six-legged robot[J]. Robotics and Autonomous Systems, 2008, 56(3): 199-212. doi: 10.1016/j.robot.2007.08.001
[12]	周礼广.基于机器学习的六足机器人波动步态自由切换控制研究[D].杭州: 浙江工业大学, 2016. ZHOU L G. Research on free transition control of hexapod robot wave gait based on machine learning[D]. Hangzhou: Zhejiang University of Technology, 2016. (in Chinese)
[13]	ZHANG H, LIU Y B, ZHAO J, et al. Development of a bionic hexapod robot for walking on unstructured terrain[J]. Journal of Bionic Engineering, 2014, 11(2): 176-187. doi: 10.1016/S1672-6529(14)60041-X
[14]	李满宏, 张明路, 张建华, 等.基于增强学习的六足机器人自由步态规划[J].机械工程学报, 2019, 55(5): 36-44. https://www.cnki.com.cn/Article/CJFDTOTAL-JXXB201905006.htm LI M H, ZHANG M L, ZHANG J H, et al. Free gait planning for a hexapod robot based on reinforcement learning[J]. Journal of Mechanical Engineering, 2019, 55(5): 36-44. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-JXXB201905006.htm
[15]	SANTOS C P, MATOS V. Gait transition and modulation in a quadruped robot: a brainstem-like modulation approach[J]. Robotics and Autonomous Systems, 2011, 59(9): 620-634. doi: 10.1016/j.robot.2011.05.003
[16]	毛勇, 李实, 王家廞, 等.基于再励学习的被动动态步行机器人[J].清华大学学报(自然科学版), 2008(1): 92-96. doi: 10.3321/j.issn:1000-0054.2008.01.025 MAO Y, LI S, WANG J X, et al. Reinforcement learning for passive dynamic walking robot[J]. Journal of Tsinghua University (Science and Technology), 2008(1): 92-96. (in Chinese) doi: 10.3321/j.issn:1000-0054.2008.01.025
[17]	SUTTON R, BARTO A. Reinforcement learning: an introduction[M]. Cambridge: MIT Press, 1998.
[18]	CHALMERS P. Lobster special[J]. Mechanical Engineering, 2000, 122(9): 82-84. doi: 10.1115/1.2000-SEP-5
[19]	SARANLI U, BUEHLER M, KODITSCHEK D E. RHex: a simple and highly mobile hexapod robot[J]. The International Journal of Robotics Research, 2001, 20(10): 616-631.

Cited By

Cited by

Periodical cited type(2)

1.	孙文革. 基于X86平台和RSI的工业机器人步态自动控制系统设计. 计算机测量与控制. 2024(05): 122-128 .
2.	王雨佳，夏宏博. 基于嵌入式的教育机器人设计. 自动化与仪器仪表. 2022(02): 120-124 .

Other cited types(3)

Get Citation

PDF

XML

Article views (282) PDF downloads (66) Cited by(5)

Turn off MathJax

Article Contents

Abstract

References

Autonomous Gait Learning of Hexapod Robot Based on Reward Shaping

Abstract

References

Cited by

Periodical cited type(2)

Other cited types(3)

Catalog

Export File

Citation

Format

Content