Fine-Tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code Review

Yongda Yu; Guoping Rong; Haifeng Shen; He Zhang; Dong Shao; Min Wang; Zhao Wei; Yong Xu; Juhong Wang

doi:10.1145/3695993

Back

Fine-Tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code Review

Journal article

Open access

Peer reviewed

Fine-Tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code Review

Yongda Yu, Guoping Rong, Haifeng Shen, He Zhang, Dong Shao, Min Wang, Zhao Wei, Yong Xu and Juhong Wang

ACM transactions on software engineering and methodology, Vol.34(1), pp.1-26

31/01/2025

DOI: https://doi.org/10.1145/3695993

Appears in Recent Faculty of Science and Engineering Publications

Files and links (1)

pdf

Fine-tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code Review3.43 MBDownload View

AcceptedAuthor Accepted ManuscriptFree to Read, Open Access

Metrics

12 File views/ downloads

31 Record Views

Abstract

Automated Code Review

Human-machine Collaboration

LLM

LORA

Software engineering

Computer systems

As code review is a tedious and costly software quality practice, researchers have proposed several machine learning-based methods to automate the process. The primary focus has been on accuracy, that is, how accurately the algorithms are able to detect issues in the code under review. However, human intervention still remains inevitable since results produced by automated code review are not 100% correct. To assist human reviewers in making their final decisions on automatically generated review comments, the comprehensibility of the comments underpinned by accurate localization and relevant explanations for the detected issues with repair suggestions is paramount. However, this has largely been neglected in the existing research. Large language models (LLMs) have the potential to generate code review comments that are more readable and comprehensible by humans thanks to their remarkable processing and reasoning capabilities. However, even mainstream LLMs perform poorly in detecting the presence of code issues because they have not been specifically trained for this binary classification task required in code review. In this paper, we contribute Carllm (Comprehensibility of Automated Code Review using Large Language Models), a novel fine-tuned LLM that has the ability to improve not only the accuracy but, more importantly, the comprehensibility of automated code review, as compared to state-of-the-art pre-trained models and general LLMs.

Details

Title: Fine-Tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code Review
Creators: Yongda Yu (Researcher) - Nanjing University
Guoping Rong (Corresponding Author) - Nanjing University
Haifeng Shen (Contributor) - Southern Cross University, Faculty of Science and Engineering
He Zhang (Contributor) - Nanjing University
Dong Shao (Contributor) - Nanjing University
Min Wang (Contributor) - Tencent Technology Cooperation Ltd.
Zhao Wei (Contributor) - Tencent Technology Cooperation Ltd.
Yong Xu (Contributor) - Tencent Technology Cooperation Ltd.
Juhong Wang (Contributor) - Tencent Technology Cooperation Ltd.
Publication Details: ACM transactions on software engineering and methodology, Vol.34(1), pp.1-26
Publisher: Association for Computing Machinery
Number of pages: 26
Identifiers: 991013222313202368
Academic Unit: Faculty of Science and Engineering
Language: English
Resource Type: Journal article

Fine-Tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code Review

Files and links (1)

Metrics

Abstract

Details

Southern Cross University Social media