Binary classification of imbalanced datasets: The case of CoIL challenge 2000

作者:

Highlights:

• The prediction task of CoIL challenge 2000 is addressed in the paper.

• Three different methods are proposed for direct mailing problem of CoIL challenge 2000.

• The proposed methods outperform the method proposed by the winner of the challenge.

• The proposed methods overcome, also, the unbalanced dataset issue of the problem.

摘要

•The prediction task of CoIL challenge 2000 is addressed in the paper.•Three different methods are proposed for direct mailing problem of CoIL challenge 2000.•The proposed methods outperform the method proposed by the winner of the challenge.•The proposed methods overcome, also, the unbalanced dataset issue of the problem.

论文关键词:Direct mail,Data mining,Classification,Insurance,Sampling,Cost sensitive learning

论文评审过程:Received 19 July 2018, Revised 13 March 2019, Accepted 13 March 2019, Available online 14 March 2019, Version of Record 30 March 2019.

论文官网地址:https://doi.org/10.1016/j.eswa.2019.03.024