Journal article
Distilling Quality Enhancing Comments From Code Reviews to Underpin Reviewer Recommendation
IEEE Transactions on Software Engineering, Vol.50(7), pp.1658-1674
07/2024
Metrics
19 Record Views
Abstract
Code review is an important practice in software development. One of its main objectives is for the assurance of code quality. For this purpose, the efficacy of code review is subject to the credibility of reviewers, i.e., reviewers who have demonstrated strong evidence of previously making quality-enhancing comments are more credible than those who have not. Code reviewer recommendation (CRR) is designed to assist in recommending suitable reviewers for a specific objective and, in this context, assurance of code quality. Its performance is susceptible to the relevance of its training dataset to this objective, composed of all reviewers’ historical review comments, which, however, often contains a plethora of comments that are irrelevant to the enhancement of code quality. Furthermore, recommendation accuracy has been adopted as the sole metric to evaluate a recommender's performance, which is inadequate as it does not take reviewers’ relevant credibility into consideration. These two issues form the ground truth problem in CRR as they both originate from the relevance of dataset used to train and evaluate CRR algorithms. To tackle this problem, we first propose the concept of Quality-Enhancing Review Comments ( QERC ), which includes three types of comments - change-triggering inline comments, informative general comments, and approve-to-merge comments. We then devise a set of algorithms and procedures to obtain a distilled dataset by applying QERC to the original dataset. We finally introduce a new metric – reviewer's credibility for quality enhancement (RCQE) – as a complementary metric to recommendation accuracy for evaluating the performance of recommenders. To validate the proposed QERC-based approach to CRR, we conduct empirical studies using real data from seven projects containing over 82K pull requests and 346K review comments. Results show that: (a) QERC can effectively address the ground truth problem by distilling quality-enhancing comments from the dataset containing original code reviews, (b) QERC can assist recommenders in finding highly credible reviewers at a slight cost of recommendation accuracy, and (c) even “wrong” recommendations using the distilled dataset are likely to be more credible than those using the original dataset.
Details
- Title
- Distilling Quality Enhancing Comments From Code Reviews to Underpin Reviewer Recommendation
- Creators
- Guoping Rong (Corresponding Author) - Nanjing UniversityYongda Yu (Researcher) - Nanjing UniversityYifan Zhang (Researcher) - Nanjing UniversityHe Zhang (Contributor) - Nanjing UniversityHaifeng Shen - Australian Catholic UniversityDong Shah (Contributor) - Nanjing UniversityHongyu Kuang (Contributor) - Nanjing UniversityMin Wang (Contributor) - Tencent Technology Cooperation Ltd.Zhao Wei (Contributor) - Tencent Technology Cooperation Ltd.Yong Xu (Contributor) - Tencent Technology Cooperation Ltd.Juhong Wang (Contributor) - Tencent Technology Cooperation Ltd.
- Publication Details
- IEEE Transactions on Software Engineering, Vol.50(7), pp.1658-1674
- Publisher
- Institute of Electrical and Electronics Engineers
- Grant note
- National Natural Science Foundation of China: 62072227, 62202219, 62302210 Jiangsu Provincial Key Research and Development Program: BE2021002-2 Innovation Project and Overseas Open Project of State Key Laboratory for Novel Software Technology (Nanjing University): ZZKT2022A25, KFKT2022A09, KFKT2023A09, KFKT2023A10
This work was supported in part by the National Natural Science Foundation of China under Grant 62072227, Grant 62202219, and Grant 62302210, in part by the Tencent Rhino-Bird Focus Research Program of Basic Platform Technology, in part by the Jiangsu Provincial Key Research and Development Program under Grant BE2021002-2, and in part by the Innovation Project and Overseas Open Project of State Key Laboratory for Novel Software Technology (Nanjing University) under Grant ZZKT2022A25, Grant KFKT2022A09, Grant KFKT2023A09, and Grant KFKT2023A10.
- Identifiers
- 991013222313402368
- Academic Unit
- Faculty of Science and Engineering
- Language
- English
- Resource Type
- Journal article