Multi-Modal fusion with multi-level attention for Visual Dialog

作者:

Highlights:

• We propose a novel visual dialog method with multi-level attention.

• Three high-level attention modules are devised to select important words.

• We also use attention to select relevant regions in the image.

• We show the multi-level attention is effective in the visual dialog.

摘要

•We propose a novel visual dialog method with multi-level attention.•Three high-level attention modules are devised to select important words.•We also use attention to select relevant regions in the image.•We show the multi-level attention is effective in the visual dialog.

论文关键词:Visual Dialog,Multi-Modal,Multi-Level,Attention mechanism,00-01,99-00

论文评审过程:Received 10 July 2019, Revised 7 September 2019, Accepted 24 October 2019, Available online 11 November 2019, Version of Record 6 May 2020.

论文官网地址:https://doi.org/10.1016/j.ipm.2019.102152