A Multistrategy Approach to Relational Knowledge Discovery in Databases

作者:Katharina Morik, Peter Brockhausen

摘要

When learning from very large databases, the reduction of complexity is extremely important. Two extremes of making knowledge discovery in databases (KDD) feasible have been put forward. One extreme is to choose a very simple hypothesis language, thereby being capable of very fast learning on real-world databases. The opposite extreme is to select a small data set, thereby being able to learn very expressive (first-order logic) hypotheses. A multistrategy approach allows one to include most of these advantages and exclude most of the disadvantages. Simpler learning algorithms detect hierarchies which are used to structure the hypothesis space for a more complex learning algorithm. The better structured the hypothesis space is, the better learning can prune away uninteresting or losing hypotheses and the faster it becomes.

论文关键词:Knowledge discovery in databases, inductive logic programming, functional dependencies, numerical intervals, background knowledge

论文评审过程:

论文官网地址:https://doi.org/10.1023/A:1007317925872