Text detection in images using sparse representation with discriminative dictionaries

作者:

Highlights:

摘要

Text detection is important in the retrieval of texts from digital pictures, video databases and webpages. However, it can be very challenging since the text is often embedded in a complex background. In this paper, we propose a classification-based algorithm for text detection using a sparse representation with discriminative dictionaries. First, the edges are detected by the wavelet transform and scanned into patches by a sliding window. Then, candidate text areas are obtained by applying a simple classification procedure using two learned discriminative dictionaries. Finally, the adaptive run-length smoothing algorithm and projection profile analysis are used to further refine the candidate text areas. The proposed method is evaluated on the Microsoft common test set, the ICDAR 2003 text locating set, and an image set collected from the web. Extensive experiments show that the proposed method can effectively detect texts of various sizes, fonts and colors from images and videos.

论文关键词:Text detection,Sparse representation,Discriminative dictionary

论文评审过程:Received 23 August 2009, Revised 31 March 2010, Accepted 6 April 2010, Available online 18 April 2010.

论文官网地址:https://doi.org/10.1016/j.imavis.2010.04.002