MTRNet++: One-stage mask-based scene text eraser

作者:

Highlights:

摘要

A precise, controllable, interpretable and easily trainable text removal approach is necessary for both user-specific and large-scale text removal applications. To achieve this, we propose a one-stage mask-based text inpainting network, MTRNet++. It has a novel architecture that includes mask-refine, coarse-inpainting and fine-inpainting branches, and attention blocks. With this architecture, MTRNet++ can remove text either with or without an external mask. It achieves state-of-the-art results on both the Oxford and SCUT datasets without using external ground-truth masks. The results of ablation studies demonstrate that the proposed multi-branch architecture with attention blocks is effective and essential. It also demonstrates controllability and interpretability.

论文关键词:

论文评审过程:Received 18 December 2019, Revised 4 June 2020, Accepted 17 August 2020, Available online 21 August 2020, Version of Record 26 August 2020.

论文官网地址:https://doi.org/10.1016/j.cviu.2020.103066