Reddit entity linking dataset

作者:

Highlights:

• We release a new entity linking dataset taken from Reddit.

• Human annotators perform well at the task even when given a broad definition of the goal.

• Thorough evaluation found that existing entity linkers perform poorly on this new dataset.

• New models are needed to extract information from social media commentary.

摘要

•We release a new entity linking dataset taken from Reddit.•Human annotators perform well at the task even when given a broad definition of the goal.•Thorough evaluation found that existing entity linkers perform poorly on this new dataset.•New models are needed to extract information from social media commentary.

论文关键词:Entity linking,Dataset,Natural language processing

论文评审过程:Received 3 September 2020, Revised 10 December 2020, Accepted 17 December 2020, Available online 5 February 2021, Version of Record 5 February 2021.

论文官网地址:https://doi.org/10.1016/j.ipm.2020.102479