CoMMA: a framework for integrated multimedia mining using multi-relational associations
Authors: Ankur M. Teredesai, Muhammad A. Ahmad, Juveria Kanodia, Roger S. Gaborski
Abstract
Automatically generating captions or annotations for still images is a challenging task. Traditionally, scene understanding has relied on higher-level (semantic) object detection and complex feature extraction, and text descriptions for a given image are then generated from that understanding. In this paper, we pose the auto-annotation problem as one of multi-relational association rule mining, where the relations exist between image-based features and textual annotations. The central idea is to combine low-level image features, such as color, orientation, and intensity, with the corresponding text annotations to generate association rules across multiple tables using multi-relational association mining. Subsequently, we use these association rules to auto-annotate test images.
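As a rough illustration of the idea in the abstract, the sketch below mines rules of the form "image feature items imply annotation word" from two toy relations joined on an image id, then applies them to annotate an unseen image. It is not the paper's method: it uses brute-force co-occurrence counting rather than FP-Growth or multi-relational FP-Growth, and the data, the discretized feature values, the thresholds, and the function names `mine_rules` and `annotate` are all assumptions made for this example.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: two relations (tables) keyed by image id.
# Feature values are assumed to be already discretized into items.
features = {
    "img1": {"color=blue", "intensity=high"},
    "img2": {"color=blue", "intensity=high", "orientation=horizontal"},
    "img3": {"color=green", "intensity=low"},
    "img4": {"color=green", "intensity=low", "orientation=vertical"},
}
annotations = {
    "img1": {"sky"},
    "img2": {"sky", "sea"},
    "img3": {"grass"},
    "img4": {"grass", "tree"},
}


def mine_rules(features, annotations, min_support=2, min_confidence=0.8, max_len=2):
    """Mine rules (feature itemset -> annotation word) by exhaustive counting."""
    antecedent_counts = Counter()
    rule_counts = Counter()
    for image_id, items in features.items():
        # Join the two relations on the shared image id.
        words = annotations.get(image_id, set())
        for size in range(1, max_len + 1):
            for antecedent in combinations(sorted(items), size):
                antecedent_counts[antecedent] += 1
                for word in words:
                    rule_counts[(antecedent, word)] += 1
    rules = []
    for (antecedent, word), count in rule_counts.items():
        confidence = count / antecedent_counts[antecedent]
        if count >= min_support and confidence >= min_confidence:
            rules.append((frozenset(antecedent), word, count, confidence))
    return rules


def annotate(image_items, rules):
    """Return words whose rule antecedent is contained in the image's items."""
    scores = {}
    for antecedent, word, _support, confidence in rules:
        if antecedent <= image_items:
            scores[word] = max(scores.get(word, 0.0), confidence)
    # Highest confidence first, ties broken alphabetically.
    return sorted(scores, key=lambda w: (-scores[w], w))


rules = mine_rules(features, annotations)
print(annotate({"color=blue", "intensity=high"}, rules))
```

Exhaustive enumeration of itemsets is only workable on tiny inputs like this one; a frequent-pattern algorithm such as FP-Growth, as named in the keywords, avoids generating every candidate itemset.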
Keywords: Image captioning, Multimedia data mining, Auto-annotation, Multi-relational association rule mining, FP-Growth, Multi-relational FP-Growth, Text-based image retrieval
Paper URL: https://doi.org/10.1007/s10115-005-0221-x