Fast and robust text detection in images and video frames

作者:

Highlights:

摘要

Text in images and video frames carries important information for visual content understanding and retrieval. In this paper, by using multiscale wavelet features, we propose a novel coarse-to-fine algorithm that is able to locate text lines even under complex background. First, in the coarse detection, after the wavelet energy feature is calculated to locate all possible text pixels, a density-based region growing method is developed to connect these pixels into regions which are further separated into candidate text lines by structural information. Secondly, in the fine detection, with four kinds of texture features extracted to represent the texture pattern of a text line, a forward search algorithm is applied to select the most effective features. Finally, an SVM classifier is used to identify true text from the candidates based on the selected features. Experimental results show that this approach can fast and robustly detect text lines under various conditions.

论文关键词:Text detection,Multiscale wavelet feature,Feature combination,SVM classification

论文评审过程:Received 12 April 2004, Revised 12 October 2004, Accepted 14 January 2005, Available online 11 April 2005.

论文官网地址:https://doi.org/10.1016/j.imavis.2005.01.004