A binarization method with learning-built rules for document images produced by cameras

作者:

Highlights:

摘要

In this paper, we propose a novel binarization method for document images produced by cameras. Such images often have varying degrees of brightness and require more careful treatment than merely applying a statistical method to obtain a threshold value. To resolve the problem, the proposed method divides an image into several regions and decides how to binarize each region. The decision rules are derived from a learning process that takes training images as input. Tests on images produced under normal and inadequate illumination conditions show that our method yields better visual quality and better OCR performance than three global binarization methods and four locally adaptive binarization methods.

论文关键词:Document image binarization,Global threshold,Image processing,Local threshold,Multi-label problem,Non-uniform brightness,Support vector machine

论文评审过程:Received 7 July 2006, Revised 27 June 2008, Accepted 23 October 2009, Available online 1 November 2009.

论文官网地址:https://doi.org/10.1016/j.patcog.2009.10.016