Using Information Retrieval techniques for supporting data mining

作者:

Highlights:

摘要

The classic two-stepped approach of the Apriori algorithm and its descendants, which consisted of finding all large itemsets and then using these itemsets to generate all association rules has worked well for certain categories of data. Nevertheless for many other data types this approach shows highly degraded performance and proves rather inefficient.We argue that we need to search all the search space of candidate itemsets but rather let the database unveil its secrets as the customers use it. We propose a system that does not merely scan all possible combinations of the itemsets, but rather acts like a search engine specifically implemented for making recommendations to the customers using techniques borrowed from Information Retrieval.

论文关键词:Knowledge discovery,E-commerce,Itemsets recommendations,Indexing,Boolean-ranked queries

论文评审过程:Received 5 May 2004, Accepted 21 July 2004, Available online 21 August 2004.

论文官网地址:https://doi.org/10.1016/j.datak.2004.07.004