An improved binarization algorithm based on a water flow model for document image with inhomogeneous backgrounds

作者:

Highlights:

摘要

A segmentation algorithm using a water flow model [Kim et al., Pattern Recognition 35 (2002) 265–277] has already been presented where a document image can be efficiently divided into two regions, characters and background, due to the property of locally adaptive thresholding. However, this method has not decided when to stop the iterative process and required long processing time. Plus, characters on poor contrast backgrounds often fail to be separated successfully. Accordingly, to overcome the above drawbacks to the existing method, the current paper presents an improved approach that includes extraction of regions of interest (ROIs), an automatic stopping criterion, and hierarchical thresholding. Experimental results show that the proposed method can achieve a satisfactory binarization quality, especially for document images with a poor contrast background, and is significantly faster than the existing method.

论文关键词:Water flow model,Document image,Adaptive thresholding,Binarization,Speed-up

论文评审过程:Received 11 March 2004, Revised 23 November 2004, Accepted 23 November 2004, Available online 8 September 2005.

论文官网地址:https://doi.org/10.1016/j.patcog.2004.11.025