Visual appearance based person retrieval in unconstrained environment videos

作者:

Highlights:

摘要

Visual appearance-based person retrieval is a challenging problem in surveillance. It uses attributes like height, cloth color, cloth type and gender to describe a human. Such attributes are known as soft biometrics. This paper proposes person retrieval from surveillance video using height, torso cloth type, torso cloth color and gender. The approach introduces an adaptive torso patch extraction and bounding box regression to improve the retrieval. The algorithm uses fine-tuned Mask R-CNN and DenseNet-169 for person detection and attribute classification respectively. The performance is analyzed on AVSS 2018 challenge II dataset and it achieves 11.35% improvement over state-of-the-art based on average Intersection over Union measure.

论文关键词:Linear filtering,Person retrieval,Semantic description,Soft biometrics,Video surveillance

论文评审过程:Received 21 September 2019, Accepted 6 October 2019, Available online 24 October 2019, Version of Record 7 November 2019.

论文官网地址:https://doi.org/10.1016/j.imavis.2019.10.002