STACC: Code Comment Classification using SentenceTransformers

More Info
expand_more

Abstract

Code comments are a key resource for information about software artefacts. Depending on the use case, only some types of comments are useful. Thus, automatic approaches to clas-sify these comments have been proposed. In this work, we address this need by proposing, STACC, a set of SentenceTransformers- based binary classifiers. These lightweight classifiers are trained and tested on the NLBSE Code Comment Classification tool competition dataset, and surpass the baseline by a significant margin, achieving an average Fl score of 0.74 against the baseline of 0.31, which is an improvement of 139%. A replication package, as well as the models themselves, are publicly available.

Files

NLBSE_Code_Comment_Classificat... (pdf)
(pdf | 0.302 Mb)

Download not available

STACC_Code_Comment_Classificat... (pdf)
(pdf | 0.363 Mb)
- Embargo expired in 29-01-2024