Two-phase network generation towards within-network classifiers evaluation

作者:

Highlights:

摘要

Within-network classifiers have been widely used to predict unknown data in networks. In order to evaluate the performance of existing classifiers, it is essential to generate synthetic networks with various properties. However, conventional network generation methods become ineffective under this scenario, since they are unable to produce node labels, exert topological constraints, or provide stable generation performance. In this paper, we propose a novel network generation method for evaluating within-network classifiers, which consists of two generation phases. In the first phase of topology generation, network topology can be obtained by incorporating any existing topology generation models. In the second phase of label generation, we model the problem as a multi-objective optimization. Specifically, we prove that generating node labels over an existing topology conforming homophily constraint is NP-hard, and devise a genetic algorithm based strategy for node label generation. Extensive experiments demonstrate that our method can produce synthetic networks with stable properties, and ensure that the network topology is fixed and label parameters take effect independently, thus making it sufficient for evaluating the sensitivity of classifiers against different parameters.

论文关键词:Network generation,Genetic algorithm,Within-network classifiers

论文评审过程:Received 6 July 2016, Revised 29 December 2016, Accepted 1 January 2017, Available online 4 January 2017, Version of Record 15 February 2017.

论文官网地址:https://doi.org/10.1016/j.knosys.2017.01.004