Monitoring human behavior from video taken in an office environment

Authors:

Abstract

In this paper, we describe a system that automatically recognizes human actions from video sequences of a room. These actions include entering the room, using a computer terminal, opening a cabinet, and picking up a phone. The system recognizes these actions by using prior knowledge about the layout of the room. Action recognition is modeled by a state machine consisting of ‘states’ and ‘transitions’ between states. A transition from one state to another is triggered by the position of a person, by scene change detection, or by an object being tracked. In addition to generating a textual description of the recognized actions, the system generates a set of key frames from the video sequences, which is essentially content-based video compression. The system has been tested on several video sequences and has performed well; a representative set of results is presented in this paper. The ideas presented here are applicable to automated security.
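
The state-machine formulation in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the state names, the event names (standing in for the outputs of person-position tests, scene change detection, and object tracking), the transition table, and the `ActionRecognizer` class. The sketch only shows how transitions could drive both a textual description and a list of key frames.

```python
from dataclasses import dataclass, field

# Hypothetical transition table (not from the paper):
# (current state, event) -> (next state, recognized action or None)
TRANSITIONS = {
    ("start", "person_in_entrance_region"): ("in_room", "entered the room"),
    ("in_room", "person_near_phone"): ("near_phone", None),
    ("near_phone", "scene_change_at_phone"): ("on_phone", "picked up the phone"),
    ("on_phone", "scene_change_at_phone"): ("near_phone", "put down the phone"),
    ("near_phone", "person_left_phone"): ("in_room", None),
    ("in_room", "person_in_entrance_region"): ("start", "left the room"),
}


@dataclass
class ActionRecognizer:
    """Toy state machine; events are assumed to come from upstream vision modules."""
    state: str = "start"
    log: list = field(default_factory=list)         # textual descriptions
    key_frames: list = field(default_factory=list)  # frame indices worth keeping

    def step(self, frame_index, event):
        """Apply one event; return the recognized action, or None."""
        key = (self.state, event)
        if key not in TRANSITIONS:
            return None  # event is irrelevant in the current state
        self.state, action = TRANSITIONS[key]
        if action is not None:
            # A recognized action yields a text entry and a key frame.
            self.log.append(f"frame {frame_index}: person {action}")
            self.key_frames.append(frame_index)
        return action


# Usage with a made-up event stream of (frame index, event) pairs.
recognizer = ActionRecognizer()
events = [
    (12, "person_in_entrance_region"),
    (80, "person_near_phone"),
    (95, "scene_change_at_phone"),
    (300, "scene_change_at_phone"),
]
for frame_index, event in events:
    recognizer.step(frame_index, event)

print("\n".join(recognizer.log))
print("key frames:", recognizer.key_frames)
```

In this sketch only the frames at which an action is recognized are retained, which is one simple way to read the abstract's remark that key-frame extraction amounts to content-based video compression.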

Keywords: Video, Action recognition, Key frames, Context

Review history: Received 11 May 1999, Revised 18 December 2000, Accepted 20 January 2001, Available online 20 September 2001.

Paper URL: https://doi.org/10.1016/S0262-8856(01)00047-6