CoMMA: a framework for integrated multimedia mining using multi-relational associations

作者：Ankur M. Teredesai, Muhammad A. Ahmad, Juveria Kanodia, Roger S. Gaborski

摘要

Generating captions or annotations automatically for still images is a challenging task. Traditionally, techniques involving higher-level (semantic) object detection and complex feature extraction have been employed for scene understanding. On the basis of this understanding, corresponding text descriptions are generated for a given image. In this paper, we pose the auto-annotation problem as that of multi-relational association rule mining where the relations exist between image-based features, and textual annotations. The central idea is to combine low-level image features such as color, orientation, intensity, etc. and corresponding text annotations to generate association rules across multiple tables using multi-relational association mining. Subsequently, we use these association rules to auto-annotate test images.

论文关键词：Image captioning, Multimedia data mining, Auto-annotation, Multi-relational association rule mining, FP-Growth, Multi-relational FP-Growth, Text-based image retrieval

论文评审过程：

论文官网地址：https://doi.org/10.1007/s10115-005-0221-x