基于奖励引导的六足机器人自主步态学习

    Autonomous Gait Learning of Hexapod Robot Based on Reward Shaping

    • 摘要: 为提高机器人自主学习能力,提出一种基于奖励引导的机器人自主步态学习算法.首先给出自主设计的机械式步态同步六足机器人的机械结构,其独特的多层钢堆叠腿部设计方案和特有的机械同步传动方案为机器人后续步态学习研究奠定良好基础;接下来提出一种基于奖励引导的机器人步态学习算法,该算法可以让机器人在未知环境中进行技能学习;最后,在MATLAB和ADAMS联合仿真平台上进行算法验证实验,实验结果显示以速度和高度分别为主评价指标会呈现出2种合理的收敛结果,且在该算法下六足机器人可以自主学习到翻越木块的高难度技能.

       

      Abstract: To improve the autonomous learning ability of robots, a novel autonomous gait learning algorithm based on reward shaping was proposed. First, the mechanical structure of gait synchronous hexapod robot was introduced, with unique multi-layer steel leg design and special mechanical synchronous transmission scheme, which laid a good foundation for robots' gait learning research. Second, a robot gait learning algorithm based on reward shaping was presented, allowing the robot to learn skills in an unknown environment. A concise reward and punishment function, which was based on the principle of simplicity, was designed. Finally, experiments performed on MATLAB and ADAMS joint simulation platform verified the proposed algorithm. Two reasonable convergent results were presented when speed and height, respectively, were the main evaluating indicators. Moreover, the hexapod robot independently learned the difficult skills to move over the blocks with the proposed algorithm.

       

    /

    返回文章
    返回