Modeling visual and word-conditional semantic attention for image captioning

作者:

Highlights:

• A new dual temporal model is proposed for image captioning.

• Word-conditional semantic attention is proposed for functional-words

• generation.

• A self-balancing model is exploited to balance the visual and semantic attention.

摘要

•A new dual temporal model is proposed for image captioning.•Word-conditional semantic attention is proposed for functional-words•generation.•A self-balancing model is exploited to balance the visual and semantic attention.

论文关键词:Image captioning,Word-conditional semantic attention,Visual attention,Attention variation

论文评审过程:Received 1 December 2017, Revised 28 May 2018, Accepted 4 June 2018, Available online 15 June 2018, Version of Record 21 June 2018.

论文官网地址:https://doi.org/10.1016/j.image.2018.06.002