PRPN: Progressive region prediction network for natural scene text detection

作者:

Highlights:

摘要

With the development of deep learning, scene text detection methods have made great progress in recent years. Most text detection methods are based on bounding box prediction with a 0-1 discrete distribution; thus, separating adjacent text instances is difficult. Direct prediction of the bounding box also renders difficult the detection various shapes of text, such as quadrangular text and curved text. In this work, we design a 2D progressive kernel for describing the progressive variety of text regions. It transforms the original ground truth (GT) of bounding boxes into the GT of a 0-1 progressive probability distribution. We also propose a novel progressive region prediction network (PRPN) with directional pooling for predicting the probability distributions of text regions. Then, a postprocessing algorithm is used to transform the probability distributions of the text regions into bounding box output for text detection. Experiments on standard datasets, including ICDAR 2013, ICDAR 2015, MSRA-TD500, and SCUT-CTW1500, demonstrate that the proposed method outperforms state-of-the-art methods in terms of accuracy and robustness. The method obtains an F-measure of 86.0% on ICDAR 2015 and 81.4% on SCUT-CTW1500. The code is available at https://github.com/xinyu-ch/ProgressiveTextDetection.

论文关键词:Natural scene text detection,Progressive region,Pixel prediction

论文评审过程:Received 4 January 2021, Revised 12 November 2021, Accepted 14 November 2021, Available online 27 November 2021, Version of Record 11 December 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107767