基于深度神经网络的自动驾驶场景三维目标检测算法

陆慧敏; 杨朔

doi:10.11936/bjutxb2021100027

基于深度神经网络的自动驾驶场景三维目标检测算法

陆慧敏,
杨朔

Three-dimensional Object Detection Algorithm Based on Deep Neural Networks for Automatic Driving

摘要

摘要: 在自动驾驶场景中，使用激光雷达相机获取精确度较高、可感知距离的点云数据，因此，有效利用点云数据，实现目标检测是完成自动驾驶任务的关键技术. 点云数据本身具有稀疏性、无序性和数据量较大的问题，传统的深度学习目标检测方法难以有效处理提取点云特征和满足准确度要求. 针对这一现状，提出一种体素化卷积网络与多层感知机模型融合的三维目标检测方法，利用体素化卷积网络提取点云数据的全局特征，结合多层感知机所提取点云数据的局部特征与距离关系，再利用候选三维区域检测方法，可以改进三维目标分类与位置预测的精度和速度. 采用德国卡尔斯鲁厄理工学院提供的KITTI自动驾驶数据集，对所提出的方法与经典的方法进行对比实验. 结果表明，本研究所提出的方法比以往的方法在精度上有较大提升.

Abstract: In the automatic driving scene, LiDAR camera is usually used to obtain the point cloud data with high accuracy and perceptible distance. Therefore, achieving object detection by effectively using point cloud data is the key technology to complete the automatic driving task. Point cloud has the problems of sparsity, disorder, and large amount. Traditional deep learning object detection method is difficult to effectively extract features map and meet the accuracy requirements. This paper proposed a 3D object detection method based on the fusion of voxel convolution network and multi-layer perception model. The voxel convolution network was used to extract the global features of point cloud, combined with the local features and distance relationship of point cloud extracted by multi-layer perception. It can improve the accuracy and speed of 3D object classification and position prediction. In this paper, the KITTI dataset was used to compare the proposed method with the classical method. According to the experimental results, the accuracy of the proposed method is significantly improved than the previous methods.

HTML全文

参考文献(26)

施引文献

资源附件(0)