Incremental human action recognition with dual memory

作者:

Highlights:

摘要

Incremental learning is a topic of great interest in the current state of machine learning research. Real-world problems often require a classifier to incorporate new knowledge while preserving what was learned before. One of the most challenging problems in computer vision is Human Action Recognition (HAR) in videos. However, most of the existing works approach HAR from a non-incremental point of view. This work proposes a framework for performing HAR in the incremental learning scenario called Incremental Human Action Recognition with Dual Memory (IHAR-DM). IHAR-DM contains three main components: a 3D convolutional neural network for capturing Spatio-temporal features; a Triplet Network to perform metric learning; and the dual-memory Extreme Value Machine, which is introduced in this work. The proposed method is compared with 10 other state-of-the-art incremental learning models. We propose five experimental settings containing different numbers of tasks and classes using two widely known HAR datasets: UCF-101 and HMDB51. Our results show superior performance in terms of Normalized Mutual Information (NMI) and Inter-task Intransigence (ITI), which is a new metric proposed in this work. Overall results show the feasibility of the proposal for real HAR problems, which mostly present the requirements imposed by incremental learning.

论文关键词:Incremental learning,Human Action Recognition,Metric Learning,Triplet Networks,Dual-memory Extreme Value Machine

论文评审过程:Received 26 July 2021, Revised 21 September 2021, Accepted 22 September 2021, Available online 1 October 2021, Version of Record 9 October 2021.

论文官网地址:https://doi.org/10.1016/j.imavis.2021.104313