Multiple-kernel SVM based multiple-task oriented data mining system for gene expression data analysis

作者:

Highlights:

摘要

Gene expression profiling using DNA microarray technique has been shown as a promising tool to improve the diagnosis and treatment of cancer. Recently, many computational methods have been used to discover maker genes, make class prediction and class discovery based on gene expression data of cancer tissue. However, those techniques fall short on some critical areas. These included (a) interpretation of the solution and extracted knowledge. (b) Integrating various sources data and incorporating the prior knowledge into the system. (c) Giving a global understanding of biological complex systems by a complete knowledge discovery framework. This paper proposes a multiple-kernel SVM based data mining system. Multiple tasks, including feature selection, data fusion, class prediction, decision rule extraction, associated rule extraction and subclass discovery, are incorporated in an integrated framework. ALL-AML Leukemia dataset is used to demonstrate the performance of this system.

论文关键词:Support vector machine,Multiple-kernel learning,Feature selection,Data fusion,Decision rule,Associated rule,Subclass discovery,Gene expression

论文评审过程:Available online 23 March 2011.

论文官网地址:https://doi.org/10.1016/j.eswa.2011.03.025