A robust and fast multispectral pedestrian detection deep network

作者:

Highlights:

摘要

Multispectral pedestrian detection is a difficult task, especially with pedestrian images of different sizes. In most convolutional neural network (CNN) models, the shared receptive fields of each layer are of the same size, which constrains detection results of multiple scales pedestrians. In this paper, we propose a dynamic selection scheme to adaptive adjust receptive field size in multispectral pedestrian detection. Specifically, a network in network (NIN) is employed to combine visible and thermal information. Selective kernel networks (SKNets) which uses selective kernel unit with different kernel size are employed. To effectively fuse the feature representation in each layer, a build block is designed, in which different features are fused. Feature pyramid is employed to integrate feature information in each layer. We empirically show that our method outperforms existing 8 state-of-the-art methods on one multispectral dataset and 4 state-of-the-art methods on another multispectral dataset. Detailed analyses show that our proposed method can capture multispectral pedestrian detection of different scales, which confirms the effective of SKNets for adaptively resizing the receptive field sizes. In addition, our method can operate at 14 frames per second (fps) on a GPU.

论文关键词:Multispectral pedestrian detection,SKNets,SSD,Convolutional neural network,Network in network

论文评审过程:Received 25 June 2020, Revised 23 March 2021, Accepted 24 March 2021, Available online 26 March 2021, Version of Record 8 June 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.106990