Real-time 6D pose estimation from a single RGB image

作者:

Highlights:

摘要

We propose an end-to-end deep learning architecture for simultaneously detecting objects and recovering 6D poses in an RGB image. Concretely, we extend the 2D detection pipeline with a pose estimation module to indirectly regress the image coordinates of the object's 3D vertices based on 2D detection results. Then the object's 6D pose can be estimated using a Perspective-n-Point algorithm without any post-refinements. Moreover, we elaborately design a backbone structure to maintain spatial resolution of low level features for pose estimation task. Compared with state-of-the-art RGB based pose estimation methods, our approach achieves competitive or superior performance on two benchmark datasets at an inference speed of 25 fps on a GTX 1080Ti GPU, which is capable of real-time processing.

论文关键词:6D pose estimation,Real-time processing,Coordinate localization,Backbone design

论文评审过程:Received 6 December 2018, Accepted 29 June 2019, Available online 9 July 2019, Version of Record 26 July 2019.

论文官网地址:https://doi.org/10.1016/j.imavis.2019.06.013