On the performance of phonetic algorithms in microtext normalization

Authors:

Highlights:

• First extensive comparison study of this type.

• 17(+10) phonetic algorithms are analysed in the context of microtext normalization.

• Performance proved heavily dependent on the subsequent candidate selection process.

• A small candidate set may limit the overall performance of the system.
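To make the role of a phonetic algorithm in candidate generation concrete, here is a minimal sketch in Python. It uses classic American Soundex purely as a familiar example; the text above does not say which 17 (+10) algorithms were evaluated, so this choice is an assumption. The function names `soundex`, `build_index` and `candidates`, as well as the toy `LEXICON`, are hypothetical and only serve the illustration.

```python
from collections import defaultdict

# Classic American Soundex consonant classes (an illustrative choice,
# not necessarily one of the algorithms studied in the paper).
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word):
    """Return the 4-character Soundex key of `word` ("" if it has no ASCII letters)."""
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    key = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w are ignored and do not separate equal codes
        code = _CODES.get(ch, "")
        if code and code != prev:
            key += code
        prev = code  # a vowel resets prev, so a repeated code is kept after it
    return (key + "000")[:4]


def build_index(lexicon):
    """Group in-vocabulary words by their phonetic key."""
    index = defaultdict(list)
    for word in lexicon:
        index[soundex(word)].append(word)
    return index


def candidates(token, index):
    """Return every lexicon word that shares the token's phonetic key."""
    return index.get(soundex(token), [])


# Hypothetical toy lexicon; a real system would use a full dictionary.
LEXICON = ["tomorrow", "timer", "tamer", "today", "tonight"]
INDEX = build_index(LEXICON)
```

With this toy lexicon, `candidates("tmrw", INDEX)` returns `["tomorrow", "timer", "tamer"]`: the intended word is in the set, but so are two unrelated ones, so a later selection step still has to pick the right candidate. A stricter key would shrink the set and risk dropping the correct word altogether, which is the trade-off the highlights point to.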

Keywords: Microtext normalization, Phonetic algorithm, Fuzzy matching, Twitter, Texting

Review history: Received 4 December 2017, Revised 5 July 2018, Accepted 5 July 2018, Available online 7 July 2018, Version of Record 20 July 2018.

Official paper URL: https://doi.org/10.1016/j.eswa.2018.07.016