Chinese document layout analysis based on adaptive split-and-merge and qualitative spatial reasoning

作者:

Highlights:

摘要

The ultimate goal of automatic document processing is to understand the semantics of a document. Towards such an end, one of the primary enabling steps has been to first reason about the layout of the document by means of page segmentation and segment spatial reasoning or labeling. This, in turn, allows for the derivation of document logical organization. This paper describes a generic document segmentation and geometric relation labeling method with applications to Chinese document analysis. Unlike the previous document segmentation methods where text spacing, border lines, and/or a priori layout models based on template matching processing are performed, the present method begins with a hierarchy of partitioned image layers where inhomogeneous higher-level regions are recursively partitioned into lower-level rectangular subregions and at the same time lower-level smaller homogeneous regions are merged into larger homogeneous regions. Furthermore, the derived segment data structure readily enables efficient search for geometric relationships between identified document segments.

论文关键词:Chinese document processing,Geometric structure,Adaptive split-and-merge,Segment spatial reasoning

论文评审过程:Received 16 July 1996, Revised 21 October 1996, Available online 7 June 2001.

论文官网地址:https://doi.org/10.1016/S0031-3203(96)00165-3