Cost sensitive ν-support vector machine with LINEX loss

作者:

Highlights:

摘要

Support vector machine (SVM) is a fundamental machine learning algorithm, while the traditional SVMs have limitations for massive and ubiquitous class-imbalanced data in practice. To address this issue, we propose a novel cost sensitive learning model called ν-CSSVM for imbalanced classification, which simultaneously inherits the advantages of ν-SVM and the asymmetric LINEX loss function. On one hand, the introduction of the LINEX loss function allocates different costs to each instance, demonstrating the high efficiency of realizing cost sensitive learning at the instance level. On the other hand, ν in the objective function is used to constrain the model sparsity. Compared with other studies lacking theoretical analysis, we provide detailed proof of ν-CSSVM generalization error bound through Rademacher complexity. Taking into full consideration of the intrinsic and potential properties including the smoothness, convexity and differentiability of ν-CSSVM, the alternating direction method of multipliers (ADMM) and gradient descent (GD) approach are designed. Considerable experiments validate that ν-CSSVM is more competitive than the benchmarks in class imbalance learning. The statistical test further confirms this conclusion.

论文关键词:Class imbalance,Cost sensitive,Linear-exponential loss,ν-support vector machine,ADMM,GD

论文评审过程:Received 16 June 2021, Revised 29 September 2021, Accepted 21 October 2021, Available online 16 November 2021, Version of Record 16 November 2021.

论文官网地址:https://doi.org/10.1016/j.ipm.2021.102809