Large-scale document image retrieval and classification with runlength histograms and binary embeddings

Authors:

Highlights:

Abstract

We present a new document image descriptor based on multi-scale runlength histograms. This descriptor does not rely on layout analysis and can be computed efficiently. We show how this descriptor can achieve state-of-the-art results on two very different public datasets in classification and retrieval tasks. Moreover, we show how we can compress and binarize these descriptors to make them suitable for large-scale applications. We can achieve state-of-the-art results in classification using binary descriptors of as few as 16–64 bits.
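To make the idea of a runlength histogram concrete, here is a minimal sketch in Python. It is not the authors' implementation: the abstract does not give the bin edges, scan directions, or multi-scale layout, so the logarithmic `BIN_EDGES`, the horizontal-only scan, the single-region histogram, and the mean-threshold `binarize` helper are all assumptions made purely for illustration. In particular, `binarize` is a naive stand-in for the binary embeddings mentioned above, not the method used in the paper.

```python
from itertools import groupby

# Assumed bin edges: run lengths quantised on a roughly logarithmic scale.
BIN_EDGES = (1, 2, 4, 8, 16, 32, 64, 128)


def bin_index(length):
    """Return the index of the largest bin edge that is <= length."""
    idx = 0
    for i, edge in enumerate(BIN_EDGES):
        if length >= edge:
            idx = i
    return idx


def run_lengths(row):
    """Yield (pixel_value, run_length) for each run of identical pixels."""
    for value, group in groupby(row):
        yield value, sum(1 for _ in group)


def runlength_histogram(image):
    """L1-normalised histogram of horizontal runs in a 0/1 image.

    The first len(BIN_EDGES) bins count runs of 0 pixels,
    the remaining bins count runs of 1 pixels.
    """
    n = len(BIN_EDGES)
    hist = [0.0] * (2 * n)
    for row in image:
        for value, length in run_lengths(row):
            hist[value * n + bin_index(length)] += 1
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def binarize(descriptor):
    """Naive illustration only: one bit per dimension, set if above the mean."""
    mean = sum(descriptor) / len(descriptor)
    return [1 if d > mean else 0 for d in descriptor]


# Toy 3x8 binary image (1 = ink, 0 = background).
image = [
    [0, 0, 1, 1, 1, 1, 0, 0],
    [1, 1, 1, 1, 1, 1, 1, 1],
    [0, 1, 0, 1, 0, 1, 0, 1],
]
hist = runlength_histogram(image)
bits = binarize(hist)
```

A multi-scale variant would, under the same assumptions, compute such histograms over several image regions or resolutions and concatenate them; the sketch covers only a single region and a single scan direction.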

Keywords: Visual document descriptor, Compression, Large-scale, Retrieval, Classification

Article history: Received 16 January 2012, Revised 26 November 2012, Accepted 10 December 2012, Available online 19 December 2012.

Paper URL: https://doi.org/10.1016/j.patcog.2012.12.004