How well do pre-trained contextual language representations recommend labels for GitHub issues?
作者:
Highlights:
•
摘要
Motivation:Open-source organizations use issues to collect user feedback, software bugs, and feature requests in GitHub. Many issues do not have labels, which makes labeling time-consuming work for the maintainers. Recently, some researchers used deep learning to improve the performance of automated tagging for software objects. However, these researches use static pre-trained word vectors that cannot represent the semantics of the same word in different contexts. Pre-trained contextual language representations have been shown to achieve outstanding performance on lots of NLP tasks.
论文关键词:Deep learning,Issue labeling,Data analysis,Language model
论文评审过程:Received 2 March 2021, Revised 18 August 2021, Accepted 6 September 2021, Available online 10 September 2021, Version of Record 15 September 2021.
论文官网地址:https://doi.org/10.1016/j.knosys.2021.107476