VQA as a factoid question answering problem: A novel approach for knowledge-aware and explainable visual question answering

作者:

Highlights:

• The proposed model is a free form, open ended and knowledge aware VQA model.

• VQA modeled as an explainable, end to end factoid question answering problem.

• Model capable of leveraging granular details, correlate inter-related details in scenes.

• Model capable of leveraging external world knowledge to answer questions.

• Model capable of predicting likely explanations to justify the predicted answers.

摘要

Highlights•The proposed model is a free form, open ended and knowledge aware VQA model.•VQA modeled as an explainable, end to end factoid question answering problem.•Model capable of leveraging granular details, correlate inter-related details in scenes.•Model capable of leveraging external world knowledge to answer questions.•Model capable of predicting likely explanations to justify the predicted answers.

论文关键词:Visual question answering,Factoid question answering,Knowledge based reasoning,Explainable VQA

论文评审过程:Received 2 September 2020, Revised 6 October 2021, Accepted 7 October 2021, Available online 24 October 2021, Version of Record 1 November 2021.

论文官网地址:https://doi.org/10.1016/j.imavis.2021.104328