Robust sequence alignment for actor–object interaction recognition: Discovering actor–object states

作者：

Highlights：

•

摘要

In this paper, we address the problem of recognizing atomic human–object interactions from videos. Our method is based on the observation that, at the moment of physical contact with the object, both the motion and appearance (i.e., shape) of the interacting person are constrained by the target object. We introduce the concept of actor–object states as the instantaneous configuration of actor and object that usually corresponds to the moment of physical contact. We argue that the information content in frames belonging to the actor–object states is descriptive of the specific interaction. We use the actor–object state concept to propose an approach in which human–object interactions are represented by a combination of image patches and velocity information extracted along tracked body-point trajectories. However, determining the set of video frames corresponding to actor–object states is challenging as, before and after physical contact, human motion and appearance may vary significantly for the same interaction type. We address this issue by means of a robust sequence-matching algorithm that discovers actor–object states by matching pairs of misaligned sequences of features. We then show how these discovered actor–object states can be used for the recognition of basic interactions with objects. Finally, we evaluate the proposed concept on classification tasks performed on a new dataset of atomic human–object interactions.

论文关键词：

论文评审过程：Received 19 June 2009, Accepted 19 November 2010, Available online 8 December 2010.

论文官网地址：https://doi.org/10.1016/j.cviu.2010.11.012