Compiler IR-Based Program Encoding Method for Software Defect Prediction

Chen, Yong; Xu, Chao; He, Jing Selena; Xiao, Sheng; Shen, Fanfan

doi:10.32604/cmc.2022.026750

Open Access icon Open Access

ARTICLE

Compiler IR-Based Program Encoding Method for Software Defect Prediction

by Yong Chen¹, Chao Xu^1,*, Jing Selena He², Sheng Xiao³, Fanfan Shen¹

1 School of Information Engineering, Nanjing Audit University, Nanjing, 211815, China
2 Department of Computer Science, Kennesaw State University, Kennesaw, 30144-5588, USA
3 Information Science and Engineering Department, Hunan First Normal University, Changsha, 410205, China

* Corresponding Author: Chao Xu. Email: email

Computers, Materials & Continua 2022, 72(3), 5251-5272. https://doi.org/10.32604/cmc.2022.026750

Received 04 January 2022; Accepted 23 February 2022; Issue published 21 April 2022

Abstract

With the continuous expansion of software applications, people's requirements for software quality are increasing. Software defect prediction is an important technology to improve software quality. It often encodes the software into several features and applies the machine learning method to build defect prediction classifiers, which can estimate the software areas is clean or buggy. However, the current encoding methods are mainly based on the traditional manual features or the AST of source code. Traditional manual features are difficult to reflect the deep semantics of programs, and there is a lot of noise information in AST, which affects the expression of semantic features. To overcome the above deficiencies, we combined with the Convolutional Neural Networks (CNN) and proposed a novel compiler Intermediate Representation (IR) based program encoding method for software defect prediction (CIR-CNN). Specifically, our program encoding method is based on the compiler IR, which can eliminate a large amount of noise information in the syntax structure of the source code and facilitate the acquisition of more accurate semantic information. Secondly, with the help of data flow analysis, a Data Dependency Graph (DDG) is constructed on the compiler IR, which helps to capture the deeper semantic information of the program. Finally, we use the widely used CNN model to build a software defect prediction model, which can increase the adaptive ability of the method. To evaluate the performance of the CIR-CNN, we use seven projects from PROMISE datasets to set up comparative experiments. The experiments results show that, in WPDP, with our CIR-CNN method, the prediction accuracy was improved by 12% for the AST-encoded CNN-based model and by 20.9% for the traditional features-based LR model, respectively. And in CPDP, the AST-encoded DBN-based model was improved by 9.1% and the traditional features-based TCA+ model by 19.2%, respectively.

Keywords

Compiler IR; CNN; data dependency graph; defect prediction

Cite This Article

APA Style

Chen, Y., Xu, C., He, J.S., Xiao, S., Shen, F. (2022). Compiler ir-based program encoding method for software defect prediction. Computers, Materials & Continua, 72(3), 5251–5272. https://doi.org/10.32604/cmc.2022.026750

Vancouver Style

Chen Y, Xu C, He JS, Xiao S, Shen F. Compiler ir-based program encoding method for software defect prediction. Comput Mater Contin. 2022;72(3):5251–5272. https://doi.org/10.32604/cmc.2022.026750

IEEE Style

Y. Chen, C. Xu, J. S. He, S. Xiao, and F. Shen, “Compiler IR-Based Program Encoding Method for Software Defect Prediction,” Comput. Mater. Contin., vol. 72, no. 3, pp. 5251–5272, 2022. https://doi.org/10.32604/cmc.2022.026750

BibTex EndNote RIS

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Compiler IR-Based Program Encoding Method for Software Defect Prediction

Abstract

Keywords

Cite This Article

1629

775

2

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link