VIsual TRAnslator: Linking perceptions and natural language descriptions

作者:Gerd Herzog, Peter Wazinski

摘要

Despite the fact that image understanding and natural language processing constitute two major areas of AI, there have only been a few attempts toward the integration of computer vision and the generation of natural language expressions for the description of image sequences. In this contribution we will report on practical experience gained in the projectVitra (VIsual TRAnslator) concerning the design and construction of integrated knowledge-based systems capable of translating visual information into natural language descriptions. InVitra different domains, like traffic scenes and short sequences from soccer matches, have been investigated.

论文关键词:computer vision, high-level scene analysis, natural language access

论文评审过程:

论文官网地址:https://doi.org/10.1007/BF00849073