Link-based web spam detection using weight properties

作者:Kwang Leng Goh, Ravi Kumar Patchmuthu, Ashutosh Kumar Singh

摘要

Link spam is created with the intention of boosting one target’s rank in exchange of business profit. This unethical way of deceiving Web search engines is known as Web spam. Since then many anti-link spam detection techniques have constantly being proposed. Web spam detection is a crucial task due to its devastation towards Web search engines and global cost of billion dollars annually. In this paper, we proposed a novel technique by incorporating weight properties to enhance the Web spam detection algorithms. Weight properties can be defined as the influences of one Web node towards another Web node. We modified existing Web spam detection algorithms with our novel technique to evaluate the performances on a large public Web spam dataset – WEBSPAM-UK2007. The overall performance have shown that the modified algorithms outperform the benchmark algorithms up to 30.5 % improvement at host level and 6.11 % improvement at page level.

论文关键词:Host level, Link spam, Adversarial information retrieval, Weight properties, Web spam

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-014-0310-y