A novel bundling learning paradigm for named entity recognition

作者:

Highlights:

摘要

Multi-task learning (MTL) takes advantage of the information gained from multiple related NLP tasks in order to improve performance across these tasks. MTL-based models for named entity recognition (NER) have traditionally included relation extraction and (or) coreference resolution, which requires additional data annotations in NER corpora, whereas these annotations are often unavailable. Indeed, we generally model the NER task using either a sequence labeling-based or span-based approach. Motivated by MTL, we propose a novel Bundling Learning (BL) paradigm for the NER task, which is achieved by bundling sequence labeling-based and span-based NER models together, thus allowing us to model the task from both token- and span-level perspectives. In addition, BL does not require additional data annotations compared to MTL. In experiments on NER and RE tasks, it is shown that BL consistently improves the performance of the two tasks across several benchmark datasets. Detailed analyses further confirm the effectiveness of BL.

论文关键词:Bundling learning,Named entity recognition,Relation extraction,Span,Sequence labeling

论文评审过程:Received 8 August 2021, Revised 12 April 2022, Accepted 13 April 2022, Available online 26 April 2022, Version of Record 9 May 2022.

论文官网地址:https://doi.org/10.1016/j.knosys.2022.108825