Communication-Efficient Distributed Mining of Association Rules
作者:Assaf Schuster, Ran Wolff
摘要
Mining for associations between items in large transactional databases is a central problem in the field of knowledge discovery. When the database is partitioned among several share-nothing machines, the problem can be addressed using distributed data mining algorithms. One such algorithm, called CD, was proposed by Agrawal and Shafer and was later enhanced by the FDM algorithm of Cheung, Han et al. The main problem with these algorithms is that they do not scale well with the number of partitions. They are thus impractical for use in modern distributed environments such as peer-to-peer systems, in which hundreds or thousands of computers may interact.
论文关键词:Association Rules, data mining, distributed algorithms, communication-efficient
论文评审过程:
论文官网地址:https://doi.org/10.1023/B:DAMI.0000015870.80026.6a