基于自监督循环卷积神经网络的位姿估计方法

    Pose Estimation Method Based on Self-supervised Recurrent Convolutional Neural Networks

    • 摘要: 针对基于监督学习的视觉里程计需要数据集提供真实的位姿数据,但实际上符合条件的样本数量又较少的问题,提出了一种基于自监督循环卷积神经网络的位姿估计方法.该方法以图像序列为输入,首先通过卷积神经网络提取与运动相关的特征,然后使用卷积长短期记忆网络进行时序建模,建立多帧之间的运动约束,最后输出六自由度的位姿.该模型使用了一种基于对极几何的损失函数以自监督学习方式优化网络参数.将模型在KITTI数据集上进行实验,并与其他4种算法进行对比.结果表明,该方法在位姿估计准确性上优于其他单目算法,并且具有不错的泛化能力.

       

      Abstract: To solve the problem that the visual odometry based on supervised learning requires the real pose data of dataset and in fact the number of qualified samples is small, a pose estimation method was proposed based on self-supervised convolutional neural network and convolutional long short-term memory. First, image sequences were taken as input, and the features related to motion were extracted through convolutional neural network. Then, convolutional long short term memory network was used for sequential modeling. Finally, the pose with 6 degrees of freedom was output. The model used a loss function based on epipolar geometry to optimize network parameters by self-supervised learning. The model was tested on KITTI dataset and compared with other four algorithms. Results show that the proposed method is superior to other monocular algorithms in accuracy of the pose estimation, and it also has good generalization ability.

       

    /

    返回文章
    返回