Assessing the quality factors found in in-line documentation written in natural language: The JavadocMiner

Abstract

Source code documentation is an important software engineering artifact that developers and maintainers rely on for software comprehension and maintenance. It provides the insight software engineers need when performing a task, so ensuring its quality is extremely important. In-line documentation is at the forefront of explaining a programmer's original intentions for a given implementation. Because this documentation is written in natural language, its quality has so far had to be assessed manually. In this paper, we present an effective, automated approach for assessing the quality of in-line documentation using a set of heuristics that target both the quality of the language and the consistency between the source code and its comments. Our evaluation has two parts. First, we apply the JavadocMiner tool to the modules of two open source applications (ArgoUML and Eclipse) to automatically assess their intrinsic comment quality. Second, we correlate the analysis results with the defects reported for the individual modules to examine connections between natural language documentation and source code quality.
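
To make the idea of a code/comment consistency heuristic concrete, the following is a minimal sketch of one such check: comparing the `@param` tags in a Javadoc comment against the parameter names in the method signature. It is an illustration only and is not taken from JavadocMiner; the function name `param_tag_mismatches` and the simple string-based parsing are assumptions made for this example, and a real tool would work on a parsed syntax tree.

```python
import re


def param_tag_mismatches(javadoc: str, signature: str) -> dict:
    """Hypothetical heuristic: compare Javadoc @param tags with the
    parameter names declared in a Java method signature (given as text)."""
    # Parameter names that the comment documents.
    documented = re.findall(r"@param\s+(\w+)", javadoc)

    # Text between the first '(' and the last ')' of the signature.
    inner = signature[signature.index("(") + 1 : signature.rindex(")")]

    # Strip generic type arguments (innermost first) so that commas inside
    # them, as in Map<String, Integer>, do not split a single parameter.
    while re.search(r"<[^<>]*>", inner):
        inner = re.sub(r"<[^<>]*>", "", inner)

    # The last token of each comma-separated part is the parameter name.
    declared = []
    for part in inner.split(","):
        tokens = part.split()
        if tokens:
            declared.append(tokens[-1])

    return {
        # Declared in code but missing from the comment.
        "undocumented": [p for p in declared if p not in documented],
        # Mentioned in the comment but no longer present in code.
        "stale": [p for p in documented if p not in declared],
    }


doc = """/**
 * Adds a node.
 * @param name the node name
 * @param parent the parent node
 */"""
sig = "public void addNode(String name, Map<String, Integer> weights)"
print(param_tag_mismatches(doc, sig))
# {'undocumented': ['weights'], 'stale': ['parent']}
```

This sketch does not handle annotations that contain commas or array brackets attached to the parameter name; those cases would need a proper Java parser.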

Keywords: Text mining, Software engineering, Source code comments, Automated quality analysis, Javadoc, Ontology

Review history: Received 31 December 2010, Revised 26 September 2011, Accepted 14 February 2013, Available online 13 March 2013.

Official paper URL: https://doi.org/10.1016/j.datak.2013.02.001