Embedding spatial information into image content description for scene retrieval

Authors:

Highlights:

Abstract

This article presents Δ-TSR, an image content representation that describes spatial layout through triangular relationships between visual entities, which can be symbolic objects or low-level visual features. A semi-local implementation of Δ-TSR is also proposed, making the description robust to viewpoint changes. We evaluate Δ-TSR for image retrieval under the query-by-example paradigm, on contents represented with interest points in a bag-of-features model: it improves on state-of-the-art techniques in both retrieval quality and execution time, and it is scalable. Finally, its effectiveness is evaluated on a topical scenario dedicated to scene retrieval in datasets of city landmarks.

Keywords: CBIR, Spatial relationships, Local image features, Scalability

Article history: Received 30 October 2009, Revised 21 March 2010, Accepted 25 March 2010, Available online 8 April 2010.

Official paper URL: https://doi.org/10.1016/j.patcog.2010.03.024