Segmentation and tracking of multiple video objects

Abstract

This paper describes a technique that produces a content-based representation of a video shot composed of a still background mosaic and one or more moving foreground objects. Segmentation of moving objects is based on ego-motion compensation and on background modelling using tools from robust statistics. Region matching is carried out by an algorithm that operates on the Mahalanobis distance between region descriptors in two consecutive frames and uses singular value decomposition to compute a set of correspondences satisfying both the principle of proximity and the principle of exclusion. The sequence is represented as a layered graph, and specific techniques are introduced to cope with crossing and occlusion. Examples of MPEG-4 (main profile) encoding are reported.
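
The region-matching step can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so the code below assumes a common SVD-based correspondence scheme in the style of Scott and Longuet-Higgins: build a Gaussian proximity matrix from squared Mahalanobis distances, replace its singular values with ones, and keep a pair only when its entry dominates both its row and its column. The function name `match_regions`, the shared covariance `cov`, and the scale `sigma` are hypothetical choices made for illustration, not taken from the paper.

```python
import numpy as np

def match_regions(desc_a, desc_b, cov, sigma=1.0):
    """Illustrative SVD-based matching of region descriptors (assumed scheme).

    desc_a: (m, d) descriptors from frame t
    desc_b: (n, d) descriptors from frame t+1
    cov:    (d, d) descriptor covariance (assumed known and invertible)
    sigma:  scale of the Gaussian proximity weighting (hypothetical parameter)
    """
    a = np.asarray(desc_a, dtype=float)
    b = np.asarray(desc_b, dtype=float)
    cov_inv = np.linalg.inv(np.asarray(cov, dtype=float))

    # Squared Mahalanobis distance between every pair of descriptors.
    diff = a[:, None, :] - b[None, :, :]                    # shape (m, n, d)
    d2 = np.einsum("ijk,kl,ijl->ij", diff, cov_inv, diff)   # shape (m, n)

    # Proximity matrix: close pairs get weights near 1, distant pairs near 0.
    G = np.exp(-d2 / (2.0 * sigma ** 2))

    # Replace the singular values of G with ones to get the pairing matrix.
    U, _, Vt = np.linalg.svd(G, full_matrices=False)
    P = U @ Vt

    # Exclusion: accept (i, j) only if P[i, j] is the largest entry in both
    # row i and column j, so each region is matched at most once.
    matches = []
    for i in range(P.shape[0]):
        j = int(np.argmax(P[i]))
        if int(np.argmax(P[:, j])) == i:
            matches.append((i, j))
    return matches

# Toy usage: three regions whose order is shuffled in the next frame.
frame_t = [[0.0, 0.0], [10.0, 10.0], [20.0, 0.0]]
frame_t1 = [[20.5, 0.2], [0.1, -0.1], [10.2, 9.9]]
pairs = match_regions(frame_t, frame_t1, np.eye(2), sigma=2.0)
```

In this sketch the Gaussian weighting encodes proximity and the mutual row and column maximum test encodes exclusion. Regions that enter or leave the scene simply receive no match, which is one plausible way to leave crossing and occlusion to a later stage.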

Keywords: Content-based representation, MPEG, Video coding, Video sequence analysis, Mosaicing, Motion segmentation

Review history: Received 7 June 2005, Revised 23 February 2006, Accepted 14 July 2006, Available online 28 September 2006.

Official paper URL: https://doi.org/10.1016/j.patcog.2006.07.008