BinDeep: A deep learning approach to binary code similarity detection

Authors:

Highlights:

• We propose a novel deep-learning-based solution for binary code similarity detection.

• We use an instruction embedding model to vectorize the extracted instructions.

• We apply a deep learning classification model to identify the types of the functions to be compared.

• We use a hybrid Siamese neural network to measure binary code similarity.

• We conduct extensive experiments to evaluate the effectiveness of our approach.
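
The Siamese idea named in the highlights can be illustrated with a minimal sketch: both functions pass through the same encoder, and the two resulting vectors are compared. Everything below is an assumption for illustration and is not taken from the paper: the random embedding table stands in for a trained instruction embedding model, the mean-pooling `encode` stands in for the LSTM/CNN encoders, and cosine similarity is one common choice of comparison. All function names are hypothetical.

```python
import math
import random


def make_embedding_table(vocab, dim=8, seed=0):
    """Hypothetical stand-in for a trained instruction embedding model:
    assigns each instruction token a fixed pseudo-random vector."""
    rng = random.Random(seed)
    return {tok: [rng.uniform(-1.0, 1.0) for _ in range(dim)] for tok in vocab}


def encode(instructions, table):
    """Shared encoder (assumption: mean pooling instead of an LSTM/CNN).
    Maps a sequence of instruction tokens to one fixed-size vector."""
    dim = len(next(iter(table.values())))
    total = [0.0] * dim
    for ins in instructions:
        vec = table[ins]
        for i in range(dim):
            total[i] += vec[i]
    n = max(len(instructions), 1)
    return [x / n for x in total]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def siamese_similarity(func_a, func_b, table):
    """Siamese comparison: the SAME encoder and table are applied to both
    inputs, then the two encodings are compared."""
    return cosine_similarity(encode(func_a, table), encode(func_b, table))


if __name__ == "__main__":
    vocab = ["push", "mov", "add", "sub", "xor", "ret"]
    table = make_embedding_table(vocab)
    f1 = ["push", "mov", "add", "ret"]
    f2 = ["push", "mov", "sub", "ret"]
    print(siamese_similarity(f1, f2, table))
```

Because the two branches share one encoder, the score is symmetric in its arguments and a function compared with itself scores 1. In a trained model the encoder weights would be learned so that functions compiled from the same source score high and unrelated functions score low.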

Keywords: Binary code, Deep learning, Similarity comparison, Siamese neural network, LSTM, CNN

Review history: Received 6 December 2019, Revised 30 June 2020, Accepted 17 November 2020, Available online 3 December 2020, Version of Record 11 December 2020.

Official paper URL: https://doi.org/10.1016/j.eswa.2020.114348