Digital geometric methods in document image analysis

作者：

Highlights：

•

摘要

One of the main tasks of digital image analysis is to recognize the properties of real objects based on their digital images. These images are obtained by some sampling device, like a CCD camera, and are represented as finite sets of points that are assigned some value in a gray level or color scale. A fundamental question in image understanding is which features in the digital image correspond, under a given set of conditions, to certain properties of the underlying objects. In many practical applications this question is answered empirically by visually inspecting the digital images. In this paper, we present a mathematically comprehensive answer to this question with respect to topological properties. In particular, we derive conditions relating properties of real objects to the grid size of the sampling device which guarantee that a real object and its digital image are topologically equivalent. These conditions also imply that two digital images of a given object are topologically equivalent. Moreover, we prove that a topologically invariant digitization must result in well-composed or strongly connected sets and that only certain local neighborhoods are realizable for such a digitization. Using the derived topological model of a well-composed digital image, we demonstrate the effectiveness of this model with respect to the digitization, thresholding, correction, and compression of digital document images.

论文关键词：Digital topology,Digital geometry,Topologically invariant digitization,Document image analysis,Well-composed sets,Topological compression

论文评审过程：Received 30 September 1997, Revised 16 June 1998, Available online 7 June 2001.

论文官网地址：https://doi.org/10.1016/S0031-3203(98)00102-2