Computationally efficient induction of classification rules with the PMCRI and J-PMCRI frameworks

作者:

Highlights:

摘要

In order to gain knowledge from large databases, scalable data mining technologies are needed. Data are captured on a large scale and thus databases are increasing at a fast pace. This leads to the utilisation of parallel computing technologies in order to cope with large amounts of data. In the area of classification rule induction, parallelisation of classification rules has focused on the divide and conquer approach, also known as the Top Down Induction of Decision Trees (TDIDT). An alternative approach to classification rule induction is separate and conquer which has only recently been in the focus of parallelisation. This work introduces and evaluates empirically a framework for the parallel induction of classification rules, generated by members of the Prism family of algorithms. All members of the Prism family of algorithms follow the separate and conquer approach.

论文关键词:Parallel computing,Parallel rule induction,Modular classification rule induction,PMCRI,J-PMCRI,Prism

论文评审过程:Received 28 January 2011, Revised 10 April 2012, Accepted 11 April 2012, Available online 21 April 2012.

论文官网地址:https://doi.org/10.1016/j.knosys.2012.04.014