Mining arguments in scientific abstracts with discourse-level embeddings

作者:

Highlights:

摘要

Argument mining consists in the automatic identification of argumentative structures in texts. In this work we leverage existing discourse-level annotations to facilitate the identification of argumentative components and relations in scientific texts, which has been recognized as a particularly challenging task. We propose a new annotation schema and use it to augment a corpus of computational linguistics abstracts that had previously been annotated with discourse units and relations. Our initial experiments with the enriched corpus confirm the potential value of incorporating discourse information in argument mining tasks. In order to tackle the limitations posed by the lack of corpora containing both discourse and argumentative annotations we explore two transfer learning approaches in which discourse parsing is used as an auxiliary task when training argument mining models. In this case, as no discourse information is used as input, the resulting models could be used to predict the argumentative structure of unannotated texts.

论文关键词:

论文评审过程:Received 31 January 2020, Revised 30 June 2020, Accepted 27 July 2020, Available online 1 August 2020, Version of Record 30 September 2020.

论文官网地址:https://doi.org/10.1016/j.datak.2020.101840