Text image matching without language model using a Hausdorff distance

作者:

Highlights:

摘要

In this paper, we propose a text matching method for document image retrieval without any language model. Two word images are first normalized to an appropriate size and image features are extracted using the local crowdedness method. Similarity between the two features is then measured by calculating a Hausdorff distance. We performed three experiments. The first experiment proves the effectiveness of the proposed method for text matching, and the other two experiments verify the language independence and font size independence of the proposed method.

论文关键词:Document image retrieval,Image-to-image matching,Hausdorff distance,Local crowdedness

论文评审过程:Received 30 July 2007, Revised 22 November 2007, Accepted 25 November 2007, Available online 28 January 2008.

论文官网地址:https://doi.org/10.1016/j.ipm.2007.11.004