Multi-Modal fusion with multi-level attention for Visual Dialog

作者：

Highlights：

• We propose a novel visual dialog method with multi-level attention.

• Three high-level attention modules are devised to select important words.

• We also use attention to select relevant regions in the image.

• We show the multi-level attention is effective in the visual dialog.

摘要

•We propose a novel visual dialog method with multi-level attention.•Three high-level attention modules are devised to select important words.•We also use attention to select relevant regions in the image.•We show the multi-level attention is effective in the visual dialog.

论文关键词：Visual Dialog,Multi-Modal,Multi-Level,Attention mechanism,00-01,99-00

论文评审过程：Received 10 July 2019, Revised 7 September 2019, Accepted 24 October 2019, Available online 11 November 2019, Version of Record 6 May 2020.

论文官网地址：https://doi.org/10.1016/j.ipm.2019.102152

原文链接
谷歌学术
必应学术
百度学术