Structured learning of local features for human action classification and localization

作者:

Highlights:

摘要

Human action recognition is a promising yet non-trivial computer vision field with many potential applications. Current advances in bag-of-feature approaches have brought significant insights into recognizing human actions within complex context. It is, however, a common practice in literature to consider action as merely an orderless set of local salient features. This representation has been shown to be oversimplified, which inherently limits traditional approaches from robust deployment in real-life scenarios. In this work, we propose and show that, by taking into account global configuration of local features, we can greatly improve recognition performance. We first introduce a novel feature selection process called Sparse Hierarchical Bayes Filter to select only the most contributive features of each action type based on neighboring structure constraints. We then present the application of structured learning in human action analysis. That is, by representing human action as a complex set of local features, we can incorporate different spatial and temporal feature constraints into the learning tasks of human action classification and localization. In particular, we tackle the problem of action localization in video using structured learning with two alternatives: one is Dynamic Conditional Random Field from probabilistic perspective; the other is Structural Support Vector Machine from max-margin point of view. We evaluate our modular classification-localization framework on various testbeds, in which our proposed framework is proven to be highly effective and robust compared against bag-of-feature methods.

论文关键词:Action recognition,Action localization,Structured Learning,Local spatio-temporal features,Hierarchical sparse Bayesian filter,Support vector machine,Dynamic conditional random fields,Structural support vector machine

论文评审过程:Received 5 April 2011, Revised 2 November 2011, Accepted 16 December 2011, Available online 29 December 2011.

论文官网地址:https://doi.org/10.1016/j.imavis.2011.12.006