Detection of idea plagiarism using syntax–semantic concept extractions with genetic algorithm

Authors:

Highlights:

• A syntax-semantic concept extraction technique with a genetic algorithm is proposed for idea plagiarism detection.

• Concepts are extracted at different structural levels of the document to capture its idea.

• The technique is evaluated on the PAN summary obfuscation set, which represents idea plagiarism cases.

• Comparison with state-of-the-art systems shows considerable improvement.

Abstract

A syntax-semantic concept extraction technique with a genetic algorithm is proposed for idea plagiarism detection. Concepts are extracted at different structural levels of the document to capture its idea. The technique is evaluated on the PAN summary obfuscation set, which represents idea plagiarism cases. Comparison with state-of-the-art systems shows considerable improvement.

Keywords: Idea plagiarism, Concept extraction, Syntax-semantic, Genetic algorithm, Summary obfuscation

Article history: Received 12 August 2016, Revised 14 December 2016, Accepted 16 December 2016, Available online 18 December 2016, Version of Record 24 December 2016.

Official paper URL: https://doi.org/10.1016/j.eswa.2016.12.022