基于双通道混合3D-2D RBM模型的手势识别

李敬华; 淮华瑞; 孔德慧; 王立春; 孙艳丰

doi:10.11936/bjutxb2017090018

基于双通道混合3D-2D RBM模型的手势识别

Dynamic Hand Gesture Recognition Based on Two-channel Hybrid 3D-2D RBM

摘要

摘要: 为了挖掘基于视频的动态手势识别问题中手势的固有时空表示，提出一种3D-2D受限玻尔兹曼机（restricted Boltzmann machine，RBM）模型，以便建模手势视频数据的时空相关信息.特别地，为了更好地描述动态手势的时空特征，提出传统手工定义特征与3D-2D RBM结合的混合特征表示方法，该方法首先提取Canny-2D HOG表观特征以及光流-2D HOG运动特征，然后基于3D-2D RBM进一步学习动态手势潜在的高层时空语义特征，提升动态手势的特征描述力.融合手势外观判别和运动判别的双通道融合判别改进了单通道分类的能力.在公开的剑桥手势数据集上的实验验证了所提方法的有效性和优越性.

Abstract: To explore the intrinsic spatio-temporal representation of dynamic hand gesture in the video-based hand gesture recognition, this paper proposed a 3D-2D restricted Boltzmann machine (RBM) model, which is able to model the spatio-temporal correlation of hand gesture video data. Especially, a method combining traditional hand-defined feature with 3D-2D RBM was proposed to describe hand gesture better. The proposed hybrid 3D-2D RBM model consists of three phases. First, Canny-2D HOG and optical flow 2D HOG were used to describe the spatial and temporal feature, respectively. A 3D-2D RBM was then adopted to learn the latent high-level semantics. Finally, the two-channel discrimination results were fused together for recognition. The experimental results on the public Cambridge Hand Gesture Data set show that the proposed hybrid 3D-2D RBM outperforms the state-of-the-art.

HTML全文

参考文献(19)

施引文献

资源附件(0)