Image-based form document retrieval

Authors:

Highlights:

Abstract

We address the problem of image-based form document retrieval. The core of this problem is defining a similarity measure that remains applicable in real situations, where query images may differ from the images stored in the database. Building on the definition of a form signature, we propose a similarity measure that is insensitive to translation, scaling, moderate skew (<5°), and variations in the geometric proportions of the form layout. The measure also tolerates line detection errors well. We have developed a prototype form retrieval system based on the proposed similarity measure. Preliminary experimental results on a database containing 100 different kinds of forms are encouraging.
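The abstract does not spell out the form signature or the similarity measure, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a form is summarized by the positions of its detected lines along one axis (for example, the y-coordinates of horizontal lines). The names `normalize`, `match_fraction`, `similarity`, and the tolerance `tol` are all hypothetical. Rescaling positions to the range [0, 1] cancels translation and uniform scaling, and tolerance-based matching lets the score degrade gradually when a line is missed. Skew and changes in layout proportions are not handled here.

```python
from typing import List, Sequence


def normalize(positions: Sequence[float]) -> List[float]:
    """Rescale line positions to [0, 1] so translation and uniform scaling cancel."""
    if not positions:
        return []
    lo, hi = min(positions), max(positions)
    span = hi - lo
    if span == 0:
        return [0.0 for _ in positions]
    return sorted((p - lo) / span for p in positions)


def match_fraction(a: Sequence[float], b: Sequence[float], tol: float) -> float:
    """Fraction of lines in `a` with a one-to-one counterpart in `b` within `tol`."""
    if not a:
        return 0.0
    unused = list(b)
    matched = 0
    for p in a:
        # Greedy nearest-neighbour matching (an assumption, chosen for brevity).
        best = min(unused, key=lambda q: abs(q - p), default=None)
        if best is not None and abs(best - p) <= tol:
            matched += 1
            unused.remove(best)
    return matched / len(a)


def similarity(query: Sequence[float], reference: Sequence[float],
               tol: float = 0.02) -> float:
    """Symmetric score in [0, 1]; 1.0 means every line has a counterpart."""
    q, r = normalize(query), normalize(reference)
    return 0.5 * (match_fraction(q, r, tol) + match_fraction(r, q, tol))


# Hypothetical line positions (in pixels) for a stored form.
reference = [100.0, 200.0, 400.0, 500.0]
# The same form scanned at twice the scale and shifted by 37 pixels.
shifted_scaled = [2 * p + 37 for p in reference]
# The same scan with one line missed by the line detector.
one_line_missed = [237.0, 437.0, 1037.0]

score_exact = similarity(shifted_scaled, reference)
score_missed = similarity(one_line_missed, reference)
```

In this sketch a shifted and rescaled copy of the form scores the same as the original, while a single missed line lowers the score without driving it to zero. A real system would need a two-dimensional signature and an explicit treatment of skew.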

Keywords: Document analysis, Form processing, Form recognition, Image database, Line detection, Similarity measure, Document retrieval

Review history: Received 29 September 1998, Accepted 22 February 1999, Available online 7 June 2001.

Official paper URL: https://doi.org/10.1016/S0031-3203(99)00066-7