Segmentation of page images having artifacts of photocopying and scanning

Authors:

Highlights:

Abstract

The analysis of scanned documents is important in the construction of digital libraries and paperless offices. One significant challenge is coping with artifacts of photocopying and scanning. We present a series of simple techniques for handling these difficulties. Using 125 images of the University of Washington scanned documents database, we demonstrate the effectiveness of these methods in preparing the images for segmentation by a multiresolution algorithm.

Keywords: Document analysis, Artifact elimination, Segmentation, Print-through, Marginal artifact, Partial extra page, Digital library

Review history: Received 18 June 1999, Revised 16 February 2001, Accepted 26 February 2001, Available online 11 February 2002.

Official paper URL: https://doi.org/10.1016/S0031-3203(01)00082-6