Open Access iconOpen Access

ARTICLE

Amalgamation of Classical and Large Language Models for Duplicate Bug Detection: A Comparative Study

Sai Venkata Akhil Ammu1, Sukhjit Singh Sehra1,*, Sumeet Kaur Sehra2, Jaiteg Singh3

1 Department of Physics and Computer Science, Wilfrid Laurier University, Waterloo, N2L 3C5, Canada
2 Appilied Computer Science and Information Technology, Conestoga College, Waterloo, N2J 2W2, Canada
3 Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, 140401, India

* Corresponding Author: Sukhjit Singh Sehra. Email: email

Computers, Materials & Continua 2025, 83(1), 435-453. https://doi.org/10.32604/cmc.2025.057792

Abstract

Duplicate bug reporting is a critical problem in the software repositories’ mining area. Duplicate bug reports can lead to redundant efforts, wasted resources, and delayed software releases. Thus, their accurate identification is essential for streamlining the bug triage process mining area. Several researchers have explored classical information retrieval, natural language processing, text and data mining, and machine learning approaches. The emergence of large language models (LLMs) (ChatGPT and Huggingface) has presented a new line of models for semantic textual similarity (STS). Although LLMs have shown remarkable advancements, there remains a need for longitudinal studies to determine whether performance improvements are due to the scale of the models or the unique embeddings they produce compared to classical encoding models. This study systematically investigates this issue by comparing classical word embedding techniques against LLM-based embeddings for duplicate bug detection. In this study, we have proposed an amalgamation of models to detect duplicate bug reports using textual and non-textual information about bug reports. The empirical evaluation has been performed on the open-source datasets and evaluated based on established metrics using the mean reciprocal rank (MRR), mean average precision (MAP), and recall rate. The experimental results have shown that combined LLMs can outperform (recall-rate@k = 68%–74%) other individual models for duplicate bug detection. These findings highlight the effectiveness of amalgamating multiple techniques in improving the duplicate bug report detection accuracy.

Keywords

Duplicate bug detection; large language models; information retrieval

Cite This Article

APA Style
Ammu, S.V.A., Sehra, S.S., Sehra, S.K., Singh, J. (2025). Amalgamation of classical and large language models for duplicate bug detection: A comparative study. Computers, Materials & Continua, 83(1), 435–453. https://doi.org/10.32604/cmc.2025.057792
Vancouver Style
Ammu SVA, Sehra SS, Sehra SK, Singh J. Amalgamation of classical and large language models for duplicate bug detection: A comparative study. Comput Mater Contin. 2025;83(1):435–453. https://doi.org/10.32604/cmc.2025.057792
IEEE Style
S. V. A. Ammu, S. S. Sehra, S. K. Sehra, and J. Singh, “Amalgamation of Classical and Large Language Models for Duplicate Bug Detection: A Comparative Study,” Comput. Mater. Contin., vol. 83, no. 1, pp. 435–453, 2025. https://doi.org/10.32604/cmc.2025.057792



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 186

    View

  • 93

    Download

  • 0

    Like

Share Link