Rectifying perspective views of text in 3D scenes using vanishing points

作者:

Highlights:

摘要

Documents may be captured at any orientation when viewed with a hand-held camera. Here, a method of recovering fronto-parallel views of perspectively skewed text documents in single images is presented, useful for ‘point-and-click’ scanning or when generally seeking regions of text in a scene. We introduce a novel extension to the commonly used 2D projection profiles in document recognition to locate the horizontal vanishing point of the text plane. Following further analysis, we segment the lines of text to determine the style of justification of the paragraphs. The change in line spacings exhibited due to perspective is then used to locate the document's vertical vanishing point. No knowledge of the camera focal length is assumed. Using the vanishing points, a fronto-parallel view is recovered which is then suitable for OCR or other high-level recognition. We provide results demonstrating the algorithm's performance on documents over a wide range of orientations.

论文关键词:Document perspective recovery,Paragraph format,Vanishing point detection,Document analysis and recognition

论文评审过程:Received 1 October 2002, Accepted 2 April 2003, Available online 30 May 2003.

论文官网地址:https://doi.org/10.1016/S0031-3203(03)00132-8