Using evolutionary computation for discovering spam patterns from e-mail samples

Authors:

Highlights:

• Reveal how to take advantage of text patterns to filter spam e-mails.

• Review of existing strategies to automatically discover regular expressions from a dataset.

• Introduction of a novel genetic programming method to automatically find regular expressions.

• Our proposal is compared against another popular method introduced by Eric Conrad (SANS Institute).

Keywords: Genetic programming, Regular expressions, Automatic generation, E-mail, Spam filtering

Review history: Received 27 April 2017, Revised 2 November 2017, Accepted 1 December 2017, Available online 12 December 2017, Version of Record 12 December 2017.

Official paper URL: https://doi.org/10.1016/j.ipm.2017.12.001