监控场景下基于机器注意的多任务行人属性识别

    Multi-task Pedestrian Attribute Recognition Based on Machine Attention in Surveillance Scenarios

    • 摘要: 行人属性作为一种高级语义特征,对照明、姿势改变等具有鲁棒性,已经被广泛用于行人重识别和视频分析中.为了提高监控场景下行人属性识别(pedestrian attribute recognition,PAR)的准确性,结合最新的人工智能(artificial intelligence,AI)解释性研究,从机器注意的角度出发,综合考虑属性间的互斥性以及依赖性,提出一种限制神经网络注意的多任务行人属性识别方法.首先,根据神经网络对各属性的不同关注区域将行人属性识别划分为多个子任务;然后,通过端到端网络模型和辅助分类损失函数的设计,控制各子任务之间的信息共享,鼓励子任务内部属性相互竞争,不同子任务之间的属性相互促进;最后,融合各子任务的信息进行行人属性的预测.经过实验,该方法在2个监控场景中的公开数据集上取得了最佳的精度,证明了该方法的有效性.

       

      Abstract: As a high-level semantic feature, pedestrian attributes have been widely used in person re-identification(Re-ID) and video analysis due to their robustness to lighting and posture changes. To improve the accuracy of pedestrian attribute recognition (PAR) in monitoring scene, the latest artificial intelligence (AI) explanatory research was referred, and the perspective of the machine attention was considered from. Considering comprehensively the mutual exclusion and dependence of attributes, a multi-task attribute recognition method limiting the attention of the neural network was proposed. First, according to the different areas of the neural network for each attribute, the pedestrian attribute recognition was divided into multiple sub-tasks. Then through the design of the end-to-end network model and the auxiliary classification loss function, the shared information among the subtasks was controlled, and the internal attributes of the subtasks to compete with each other was encouraged. On the contrary, the attributes between different subtasks were improved each other. Finally, the pedestrian attributes were predicted by fusing the information of each subtask. Through experiments, the proposed method has achieved the best accuracy on two public datasets in monitoring scenarios, which proves the effectiveness of the method.

       

    /

    返回文章
    返回