GrouPeer: Dynamic clustering of P2P databases

作者:

Highlights:

摘要

Sharing structured data in a P2P network is a challenging problem, especially in the absence of a mediated schema. The standard practice of answering a consecutively rewritten query along the propagation path often results in significant loss of information. On the opposite, the use of mediated schemas requires human interaction and global agreement, both during creation and maintenance. In this paper we present GrouPeer, an adaptive, automated approach to both issues in the context of unstructured P2P database overlays. By allowing peers to individually choose which rewritten version of a query to answer and evaluate the received answers, information-rich sources left hidden otherwise are discovered. Gradually, the overlay is restructured as semantically similar peers are clustered together. Experimental results show that our technique produces very accurate answers and builds clusters that are very close to the optimal ones by contacting a very small number of nodes in the overlay.

论文关键词:Peer-to-Peer databases,Structured data in unstructured P2P overlays,Semantics in P2P query answering,Query reformulation in P2P databases

论文评审过程:Received 9 February 2007, Revised 13 February 2008, Accepted 16 April 2008, Available online 3 May 2008.

论文官网地址:https://doi.org/10.1016/j.is.2008.04.002