SBBS: A sliding blocking algorithm with backtracking sub-blocks for duplicate data detection

作者:

Highlights:

• It clearly analyzes inserting and deleting operations in the traditional SB algorithm.

• It proposes the concept of matching-failed segments due to above operations.

• It proposes an efficient sliding blocking algorithm with backtracking sub-blocks.

• SBBS can detect duplicate data as many as possible in matching-failed segments.

摘要

•It clearly analyzes inserting and deleting operations in the traditional SB algorithm.•It proposes the concept of matching-failed segments due to above operations.•It proposes an efficient sliding blocking algorithm with backtracking sub-blocks.•SBBS can detect duplicate data as many as possible in matching-failed segments.

论文关键词:Data deduplication,Duplicate data detection,Sliding blocking algorithm,Backtracking,SBBS,Content-defined chunking algorithm

论文评审过程:Available online 7 October 2013.

论文官网地址:https://doi.org/10.1016/j.eswa.2013.09.040