Scalable Detection of Server-Side Polymorphic Malware

作者:

Highlights:

摘要

Server-side polymorphism is used by malware distributors in order to evade detection by anti-virus (AV) scanners. It is difficult for traditional AVs to detect this type of malware because the transformation code is not visible for security analysis. Using a tera-scale dataset consisting of antivirus telemetry reports pertaining to more than half a billion files, we conduct what is, to the best of our knowledge, the most wide-scale analysis of the properties of web-borne polymorphic malware done to date.We cluster the files population based on their locality-sensitive hash (LSH) values and analyze the resulting LSH clusters. Using ground truth labels, we identify benign and malicious clusters and analyse the differences between them in terms of the distributions of cluster-size, file download numbers and activity period, and in terms of their web domain utilization patterns. The results of this analysis are then leveraged for devising SPADE - a scalable Server-side Polymorphic mAlware DEtector that provides high-quality detection of both malicious files and malicious web domains.

论文关键词:Malware Detection,Server-Side Polymorphism,Locality-Sensitive Hashing

论文评审过程:Received 9 January 2018, Revised 13 May 2018, Accepted 18 May 2018, Available online 19 May 2018, Version of Record 4 June 2018.

论文官网地址:https://doi.org/10.1016/j.knosys.2018.05.024