Xandy: A scalable change detection technique for ordered XML documents using relational databases

作者:

Highlights:

摘要

Previous work in change detection to XML documents is not suitable for detecting the changes to large XML documents as it requires a lot of memory to keep the two versions of XML documents in the memory. In this article, we take a more conservative yet novel approach of using traditional relational database engines for detecting the changes to large ordered XML documents. To this end, we have implemented a prototype system called Xandy that converts XML documents into relational tuples and detects the changes from these tuples by using SQL queries. Our experimental results show that the relational-based approach has better scalability compared to published algorithm like X-Diff. It has comparable efficiency and result quality compared to X-Diff in some cases. Our experimental results also show that, generally, Xandy has better result quality than XyDiff.

论文关键词:XML,Change detection,RDBMS,Schema-unconscious approach,Performance,Result quality

论文评审过程:Received 24 January 2005, Revised 20 May 2005, Accepted 29 June 2005, Available online 2 November 2005.

论文官网地址:https://doi.org/10.1016/j.datak.2005.06.006