面向助老行为识别的三维卷积神经网络设计

Design of 3D Convolutional Neural Network for Action Recognition for Helping the Aged

摘要: 针对室内老人跌倒问题，提出一种室内人体跌倒行为识别方法.首先，提出基于卷积核分解与分组卷积的轻量化3D网络；之后融合浅层2D子网络与轻量化3D子网络，并采用随机滑动组合采样策略改进3D卷积行为识别网络.为进一步提高网络泛化性能，对视频帧进行视觉显著性检测，通过加强背景纹理与人物行为之间关联性提高真实场景识别准确度.实验结果表明：该网络参数量为6.9×10⁶，时间复杂度降低至8.04×10⁹；实现算法在室内跌倒行为识别任务上达到81.5%的准确度.

Abstract: To solve the problem of action recognition in indoor environment, a method for human falling recognition in indoor environment was proposed. First, a lightweight 3D network, which uses grouping convolution and factorization to lighten the network structure for action classification, was proposed. Then 2D subnetworks and lightweight 3D sub-networks were fused to improve behavior recognition network based on the 3D convolution. Finally, visual saliency detection was performed on video frames to improve the accuracy of real scene recognition by enhancing the correlation between background texture and human behavior. Results show that the network's parameter is reduced to 6.9×10⁶ and the floating point of operations is reduced to 6.9×10⁹. The algorithm achieves 81.5% accuracy in the task of indoor fall behavior recognition.