Construction of a large-scale test set for author disambiguation

作者:

Highlights:

摘要

Author disambiguation resolves same-name author occurrences in the bibliographic data into namesakes. This enables author-centered searches and high-quality social network analysis. As an attempt to promote much research in author disambiguation, KISTI have constructed a new large-scale test set for this field. This article describes its semi-manual creation procedures, characteristics especially in terms of author ambiguities and name diversities. In addition, the baseline performance of author clustering against the test set is provided.

论文关键词:Test set construction,Author disambiguation,Author ambiguity

论文评审过程:Received 6 November 2009, Revised 31 May 2010, Accepted 5 October 2010, Available online 1 November 2010.

论文官网地址:https://doi.org/10.1016/j.ipm.2010.10.001