An empirical study of three machine learning methods for spam filtering

作者:

Highlights:

摘要

The increasing volumes of unsolicited bulk e-mail (also known as spam) are bringing more annoyance for most Internet users. Using a classifier based on a specific machine-learning technique to automatically filter out spam e-mail has drawn many researchers’ attention. This paper is a comparative study the performance of three commonly used machine learning methods in spam filtering. On the other hand, we try to integrate two spam filtering methods to obtain better performance. A set of systematic experiments has been conducted with these methods which are applied to different parts of an e-mail. Experiments show that using the header only can achieve satisfactory performance, and the idea of integrating disparate methods is a promising way to fight spam.

论文关键词:Spam filtering,Machine learning

论文评审过程:Received 10 July 2005, Accepted 1 May 2006, Available online 5 September 2006.

论文官网地址:https://doi.org/10.1016/j.knosys.2006.05.016