Database classification for multi-database mining

Abstract

Many large organizations have multiple databases distributed across different branches, so multi-database mining is an important task in data mining. To reduce the cost of searching the data in all databases, we need to identify which databases are most likely to be relevant to a data mining application. This is referred to as database selection. In real-world applications, database selection has to be carried out multiple times to identify the databases relevant to different applications. Moreover, a mining task may not refer to any specific application at all. In this paper, we present an efficient approach for classifying multiple databases based on their similarity to one another. Our approach is application-independent.
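
As an illustration of what application-independent, similarity-based classification of databases could look like, here is a minimal Python sketch. It is not taken from the paper: the abstract does not specify the similarity measure or the grouping rule, so the Jaccard similarity over item sets, the `threshold` parameter, the single-link grouping, and the function names `jaccard` and `classify` are all assumptions made for illustration.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two item sets (an assumed measure, not the paper's)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def classify(databases, threshold):
    """Group databases whose pairwise similarity reaches `threshold`.

    `databases` maps a database name to the set of items it contains.
    Grouping is single-link (connected components), an assumption for this sketch.
    """
    names = list(databases)
    parent = {name: name for name in names}

    def find(x):
        # Follow parent links to the representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the groups of every pair that is similar enough.
    for a, b in combinations(names, 2):
        if jaccard(databases[a], databases[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in names:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical branch databases, described only by the items they hold.
branches = {
    "D1": {"a", "b", "c"},
    "D2": {"a", "b", "d"},
    "D3": {"x", "y"},
}
print(classify(branches, threshold=0.5))
```

Because the grouping depends only on the contents of the databases and not on any query, it can be computed once and reused across mining applications, which is the sense of "application-independent" this sketch is meant to convey.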

Keywords: Database selection, Classification, Multi-database mining

Article history: Received 29 October 2002, Revised 1 June 2003, Accepted 14 October 2003, Available online 14 November 2003.

Official URL: https://doi.org/10.1016/j.is.2003.10.001