Scalable Semi-Supervised Classification via Neumann Series

Authors: Chen Gong, Keren Fu, Lei Zhou, Jie Yang, Xiangjian He

Abstract

Traditional graph-based semi-supervised learning (GBSSL) algorithms usually scale poorly because of their heavy computational burden. The main bottleneck is that they must compute the inverse of a huge matrix. To alleviate this problem, this paper proposes a Neumann series approximation (NSA) that explicitly approximates the matrix inversion required by conventional GBSSL methodologies, making them computationally tractable for relatively large datasets. It is proved that the deviation between the approximation and the direct inverse is bounded. On real-world datasets for handwritten digit recognition, speech recognition and text classification, the experimental results reveal that NSA accelerates computation significantly without sacrificing much precision. We also show empirically that NSA outperforms other scalable approaches such as the Nyström method, the Takahashi equation, Lanczos-process-based SVD and AnchorGraph regularization, in terms of both efficiency and accuracy.
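The core idea, replacing a direct matrix inverse with a truncated Neumann series, can be sketched as follows. This is a minimal illustration under common GBSSL assumptions (a symmetrically normalized affinity matrix, a damping factor alpha in (0, 1), and a fixed truncation length), not the authors' implementation; all names and parameter values below are illustrative.

```python
import numpy as np

def neumann_inverse_apply(S, Y, alpha=0.9, n_terms=50):
    """Approximate F = (I - alpha*S)^{-1} @ Y with a truncated Neumann series.

    The series sum_{k=0}^{K} (alpha*S)^k Y converges to (I - alpha*S)^{-1} Y
    when the spectral radius of alpha*S is below 1 (e.g. S is a symmetrically
    normalized graph affinity matrix and 0 < alpha < 1).
    """
    F = Y.astype(float).copy()        # k = 0 term
    term = Y.astype(float).copy()
    for _ in range(n_terms):
        term = alpha * (S @ term)     # (alpha*S)^k @ Y, built iteratively
        F += term
    return F

# Toy usage: 5-node graph, 2 classes, one labeled node per class.
rng = np.random.default_rng(0)
W = rng.random((5, 5)); W = (W + W.T) / 2; np.fill_diagonal(W, 0)
d = W.sum(axis=1)
S = W / np.sqrt(np.outer(d, d))       # symmetrically normalized affinity
Y = np.zeros((5, 2)); Y[0, 0] = 1; Y[4, 1] = 1

F_approx = neumann_inverse_apply(S, Y)
F_exact = np.linalg.solve(np.eye(5) - 0.9 * S, Y)
# Difference shrinks geometrically as n_terms grows.
print(np.max(np.abs(F_approx - F_exact)))
```

Because each iteration only needs a (sparse) matrix-vector product instead of a full inversion, the cost per step scales with the number of graph edges, which is what makes this style of approximation attractive for larger datasets.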

Keywords: Semi-supervised learning, Scalability, Neumann series, Error bound

Paper URL: https://doi.org/10.1007/s11063-014-9351-z