Interactive visual dialog

作者:

Highlights:

摘要

In this paper we propose a paradigm called the interactive visual dialog (IVD) as a means of facilitating a system's ability to recognize objects presented to it by a human. The presentation centers around a supermarket checkout scenario in which an operator presents an item to be tallied to a stationary television camera. An active vision approach is used to provide feedback to the operator in the form of an image (or images) depicting what the system thinks the operator is most likely holding, shown in a viewpoint that suggests how the object should next be presented to improve the certainty of interpretation. Interaction proceeds iteratively until the system converges on the correct interpretation. We show how the IVD can be implemented using an entropy-based gaze planning strategy and a sequential Bayes recognition system using optical flow as input. Experimental results show that the system does, in practice, improve recognition accuracy, leading to convergence to a correct solution in a minimal number of iterations.

论文关键词:Interactive visual dialog,Entropy map

论文评审过程:Received 10 June 2001, Revised 4 March 2002, Accepted 19 March 2002, Available online 28 May 2002.

论文官网地址:https://doi.org/10.1016/S0262-8856(02)00053-7