Beyond pixels: Exploiting camera metadata for photo classification

作者:

Highlights:

摘要

Semantic scene classification based only on low-level vision cues has had limited success on unconstrained image sets. On the other hand, camera metadata related to capture conditions provide cues independent of the captured scene content that can be used to improve classification performance. We consider three problems, indoor–outdoor classification, sunset detection, and manmade–natural classification. Analysis of camera metadata statistics for images of each class revealed that metadata fields, such as exposure time, flash fired, and subject distance, are most discriminative for each problem. A Bayesian network is employed to fuse content-based and metadata cues in the probability domain and degrades gracefully even when specific metadata inputs are missing (a practical concern). Finally, we provide extensive experimental results on the three problems using content-based and metadata cues to demonstrate the efficacy of the proposed integrated scene classification scheme.

论文关键词:Semantic scene classification,Low-level cues,Camera metadata,Exposure time,Flash fired,Subject distance

论文评审过程:Revised 9 August 2004, Available online 17 January 2005.

论文官网地址:https://doi.org/10.1016/j.patcog.2004.11.013