Hallucinating uncertain motion and future for static image action recognition

作者:

Highlights:

摘要

Static image action recognition, aiming to recognize human action in a single image, is a challenging task due to the lack of motion and temporal information in static images. Therefore, some previous works have leveraged freely available unlabeled videos to assist image action recognition, which can be categorized into the following two research lines: predict motion or future information of static images using the predictor learned from unlabeled videos. In this paper, following the above two research lines, we propose a novel Multi-modal Motion feature Generator (MMG) and a novel Multi-modal Future feature Generator (MFG) to hallucinate multiple plausible motion features and future visual features for a static image, which could significantly facilitate the image action recognition task. Extensive experiments on two video datasets and two static action image datasets demonstrate the effectiveness of our methods.

论文关键词:

论文评审过程:Received 6 May 2021, Revised 4 December 2021, Accepted 9 December 2021, Available online 18 December 2021, Version of Record 28 December 2021.

论文官网地址:https://doi.org/10.1016/j.cviu.2021.103337