ATCC: Accurate tracking by criss-cross location attention

作者:

Highlights:

摘要

In recent years, discriminative correlation filters (DCF) and Siamese networks based trackers have significantly advanced the performance in tracking. However, the problem of accurate target state estimation is not fully solved yet. Therefore, in this paper, we propose a Criss-Cross Location Attention (CCLA) module, which pays more concerns to global and local contextual information and is used for the adaptation of IoU-Net based trackers. Besides, our CCLA module has capability of high computational efficiency with a slight increase of network parameters. Then, we present our tracker called ATCC, a Siamese architecture with CCLA. Finally, we evaluate our tracker on OTB100, VOT-2018, LaSOT, and TrackingNet benchmark datasets. Experimental results show that our tracker performs favorably against other state-of-the-art trackers, while operating at 30 FPS on single GPU. We will release the code and models at https://github.com/yongwuSHU/atcc.

论文关键词:Visual tracking,Target state estimation,Criss-Cross location attention

论文评审过程:Received 9 February 2021, Accepted 21 April 2021, Available online 24 April 2021, Version of Record 29 April 2021.

论文官网地址:https://doi.org/10.1016/j.imavis.2021.104188