How well do pre-trained contextual language representations recommend labels for GitHub issues?

作者：

Highlights：

•

摘要

Motivation:Open-source organizations use issues to collect user feedback, software bugs, and feature requests in GitHub. Many issues do not have labels, which makes labeling time-consuming work for the maintainers. Recently, some researchers used deep learning to improve the performance of automated tagging for software objects. However, these researches use static pre-trained word vectors that cannot represent the semantics of the same word in different contexts. Pre-trained contextual language representations have been shown to achieve outstanding performance on lots of NLP tasks.

论文关键词：Deep learning,Issue labeling,Data analysis,Language model

论文评审过程：Received 2 March 2021, Revised 18 August 2021, Accepted 6 September 2021, Available online 10 September 2021, Version of Record 15 September 2021.

论文官网地址：https://doi.org/10.1016/j.knosys.2021.107476