Novel harmony search-based algorithms for part-of-speech tagging

作者:Rana Forsati, Mehrnoush Shamsfard

摘要

As a fast and high-quality tagger algorithm is a crucial task in natural language processing, this paper presents novel language-independent algorithms based on harmony search (HS) optimization method for handling the part-of-speech (PoS) tagging problem. The first proposed algorithm is a framework for applying HS to PoS-tagging which is called HSTAGger. By modifying HS algorithm and proposing more efficient objective functions, two improved versions of the HSTAGger are also introduced. In addition, a novel class of problematic words called erroneous as well as a method of handling them is proposed for the first time to the best of our knowledge. To demonstrate the effectiveness of the proposed algorithms, we have applied them on standard annotated corpus and compare them with other evolutionary-based and classical PoS-tagging approaches. Experimental results indicate that the proposed algorithms outperform the other taggers previously presented in the literature in terms of average precision.

论文关键词:Natural language processing (NLP), Part-of-speech (PoS) tagging, Harmony search algorithm, Evolutionary algorithms

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-013-0719-6