Amalgamation of Classical and Large Language Models for Duplicate Bug Detection: A Comparative Study

Sai Venkata; Sukhjit Sehra; Sumeet Sehra; Jaiteg Singh

doi:10.32604/cmc.2025.057792

Open Access

ARTICLE

Sai Venkata Akhil Ammu¹, Sukhjit Singh Sehra¹,*, Sumeet Kaur Sehra², Jaiteg Singh³

1 Department of Physics and Computer Science, Wilfrid Laurier University, Waterloo, N2L 3C5, Canada
2 Applied Computer Science and Information Technology, Conestoga College, Waterloo, N2J 2W2, Canada
3 Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, 140401, India

* Corresponding Author: Sukhjit Singh Sehra. Email: email

Computers, Materials & Continua 2025, 83(1), 435-453. https://doi.org/10.32604/cmc.2025.057792

Received 27 August 2024; Accepted 13 January 2025; Issue published 26 March 2025

Abstract

Duplicate bug reporting is a critical problem in the mining of software repositories. Duplicate bug reports can lead to redundant effort, wasted resources, and delayed software releases, so their accurate identification is essential for streamlining the bug triage process. Several researchers have explored classical information retrieval, natural language processing, text and data mining, and machine learning approaches. The emergence of large language models (LLMs) (e.g., ChatGPT and Hugging Face) has introduced a new line of models for semantic textual similarity (STS). Although LLMs have shown remarkable advancements, there remains a need for longitudinal studies to determine whether performance improvements are due to the scale of the models or to the unique embeddings they produce compared with classical encoding models. This study systematically investigates this issue by comparing classical word embedding techniques against LLM-based embeddings for duplicate bug detection. We propose an amalgamation of models that detects duplicate bug reports using both textual and non-textual information about the reports. The empirical evaluation was performed on open-source datasets using established metrics: mean reciprocal rank (MRR), mean average precision (MAP), and recall rate. The experimental results show that combined LLMs (recall-rate@k = 68%–74%) can outperform individual models for duplicate bug detection. These findings highlight the effectiveness of amalgamating multiple techniques in improving the accuracy of duplicate bug report detection.
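For readers unfamiliar with the three ranking metrics named in the abstract, the following is a minimal sketch of how they are commonly computed. It is not taken from the article: the function names, the toy bug identifiers, and the choice to count a query as a hit when any true duplicate appears in the top k are illustrative assumptions, and the authors' exact definitions may differ.

```python
from typing import List, Sequence, Set, Tuple

# Each query is (ranked candidate ids, set of true duplicate ids).
# This data layout is an assumption made for illustration.
Query = Tuple[Sequence[str], Set[str]]


def reciprocal_rank(ranked: Sequence[str], relevant: Set[str]) -> float:
    """1 / position of the first true duplicate, or 0.0 if none is retrieved."""
    for position, candidate in enumerate(ranked, start=1):
        if candidate in relevant:
            return 1.0 / position
    return 0.0


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Mean of precision values taken at each position holding a true duplicate."""
    hits = 0
    total = 0.0
    for position, candidate in enumerate(ranked, start=1):
        if candidate in relevant:
            hits += 1
            total += hits / position
    return total / len(relevant) if relevant else 0.0


def mean_reciprocal_rank(queries: List[Query]) -> float:
    return sum(reciprocal_rank(r, rel) for r, rel in queries) / len(queries)


def mean_average_precision(queries: List[Query]) -> float:
    return sum(average_precision(r, rel) for r, rel in queries) / len(queries)


def recall_rate_at_k(queries: List[Query], k: int) -> float:
    """Fraction of queries with at least one true duplicate in the top k."""
    found = sum(
        1 for ranked, relevant in queries
        if any(candidate in relevant for candidate in ranked[:k])
    )
    return found / len(queries)


# Hypothetical toy data: three query reports with ranked candidate lists.
toy_queries: List[Query] = [
    (["b1", "b2", "b3"], {"b2"}),
    (["b4", "b5", "b6"], {"b4", "b6"}),
    (["b7", "b8"], {"b9"}),
]
```

On this reading, a recall-rate@k of 68%–74% would mean that for roughly seven in ten duplicate reports, a matching earlier report appears among the top k retrieved candidates.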

Keywords

Duplicate bug detection; large language models; information retrieval

Cite This Article

APA Style

Ammu, S.V.A., Sehra, S.S., Sehra, S.K., Singh, J. (2025). Amalgamation of Classical and Large Language Models for Duplicate Bug Detection: A Comparative Study. Computers, Materials & Continua, 83(1), 435–453. https://doi.org/10.32604/cmc.2025.057792

Vancouver Style

Ammu SVA, Sehra SS, Sehra SK, Singh J. Amalgamation of Classical and Large Language Models for Duplicate Bug Detection: A Comparative Study. Comput Mater Contin. 2025;83(1):435–453. https://doi.org/10.32604/cmc.2025.057792

IEEE Style

S. V. A. Ammu, S. S. Sehra, S. K. Sehra, and J. Singh, “Amalgamation of Classical and Large Language Models for Duplicate Bug Detection: A Comparative Study,” Comput. Mater. Contin., vol. 83, no. 1, pp. 435–453, 2025. https://doi.org/10.32604/cmc.2025.057792


Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
