Action recognition using linear dynamic systems

作者：

Highlights：

•

摘要

In this paper, we propose a novel approach based on Linear Dynamic Systems (LDSs) for action recognition. Our main contributions are two-fold. First, we introduce LDSs to action recognition. LDSs describe the dynamic texture which exhibits certain stationarity properties in time. They are adopted to model the spatiotemporal patches which are extracted from the video sequence, because the spatiotemporal patch is more analogous to a linear time invariant system than the video sequence. Notably, LDSs do not live in the Euclidean space. So we adopt the kernel principal angle to measure the similarity between LDSs, and then the multiclass spectral clustering is used to generate the codebook for the bag of features representation. Second, we propose a supervised codebook pruning method to preserve the discriminative visual words and suppress the noise in each action class. The visual words which maximize the inter-class distance and minimize the intra-class distance are selected for classification. Our approach yields the state-of-the-art performance on three benchmark datasets. Especially, the experiments on the challenging UCF Sports and Feature Films datasets demonstrate the effectiveness of the proposed approach in realistic complex scenarios.

论文关键词：Linear dynamic system,Kernel principal angle,Multiclass spectral clustering,Supervised codebook pruning,Action recognition

论文评审过程：Received 20 April 2012, Revised 26 November 2012, Accepted 1 December 2012, Available online 12 December 2012.

论文官网地址：https://doi.org/10.1016/j.patcog.2012.12.001