Conceptual representations between video signals and natural language descriptions

Authors:

Abstract

An artificial cognitive vision system associates video signals with conceptual descriptions of the time-varying scene they depict. This link is mediated by knowledge representation formalisms. An experimental implementation of this approach yielded initial results for the conceptual description of videos recorded at inner-city traffic scenes; see [M. Haag, H.-H. Nagel, Incremental recognition of traffic situations from video image sequences, Image and Vision Computing 18 (2) (2000) 137–153]. Accumulated experience with this system approach, and with its extension to generating natural language texts from videos, led us to redesign both the overall computer vision system and the knowledge representation formalisms utilised within it.

Keywords: Cognitive vision, Knowledge representation

Article history: Received 13 July 2004, Revised 17 June 2005, Accepted 21 July 2005, Available online 24 April 2006.

Paper URL: https://doi.org/10.1016/j.imavis.2005.07.026