Candidate document retrieval for cross-lingual plagiarism detection using two-level proximity information

作者:

Highlights:

• Proposing a candidate retrieval model for cross-lingual plagiarism detection

• The method relies on using two levels of proximity information

• Proposing a topic-based text segmentation method

• Comparing the method with other cross-lingual plagiarism detection approaches

• Showing improvements using text segmentation and positional language models

摘要

•Proposing a candidate retrieval model for cross-lingual plagiarism detection•The method relies on using two levels of proximity information•Proposing a topic-based text segmentation method•Comparing the method with other cross-lingual plagiarism detection approaches•Showing improvements using text segmentation and positional language models

论文关键词:Candidate document retrieval,Cross-language plagiarism detection,Text segmentation,Proximity-based retrieval

论文评审过程:Received 21 February 2015, Revised 11 April 2016, Accepted 18 April 2016, Available online 29 April 2016, Version of Record 28 September 2016.

论文官网地址:https://doi.org/10.1016/j.ipm.2016.04.006