Open Access iconOpen Access

ARTICLE

crossmark

Performance Enhancement of XML Parsing Using Regression and Parallelism

Muhammad Ali, Minhaj Ahmad Khan*

Department of Computer Science, Bahauddin Zakariya University, Multan, 60000, Pakistan

* Corresponding Author: Minhaj Ahmad Khan. Email: email

Computer Systems Science and Engineering 2024, 48(2), 287-303. https://doi.org/10.32604/csse.2023.043010

Abstract

The Extensible Markup Language (XML) files, widely used for storing and exchanging information on the web require efficient parsing mechanisms to improve the performance of the applications. With the existing Document Object Model (DOM) based parsing, the performance degrades due to sequential processing and large memory requirements, thereby requiring an efficient XML parser to mitigate these issues. In this paper, we propose a Parallel XML Tree Generator (PXTG) algorithm for accelerating the parsing of XML files and a Regression-based XML Parsing Framework (RXPF) that analyzes and predicts performance through profiling, regression, and code generation for efficient parsing. The PXTG algorithm is based on dividing the XML file into n parts and producing n trees in parallel. The profiling phase of the RXPF framework produces a dataset by measuring the performance of various parsing models including StAX, SAX, DOM, JDOM, and PXTG on different cores by using multiple file sizes. The regression phase produces the prediction model, based on which the final code for efficient parsing of XML files is produced through the code generation phase. The RXPF framework has shown a significant improvement in performance varying from 9.54% to 32.34% over other existing models used for parsing XML files.

Keywords


Cite This Article

APA Style
Ali, M., Khan, M.A. (2024). Performance enhancement of XML parsing using regression and parallelism. Computer Systems Science and Engineering, 48(2), 287-303. https://doi.org/10.32604/csse.2023.043010
Vancouver Style
Ali M, Khan MA. Performance enhancement of XML parsing using regression and parallelism. Comput Syst Sci Eng. 2024;48(2):287-303 https://doi.org/10.32604/csse.2023.043010
IEEE Style
M. Ali and M.A. Khan, “Performance Enhancement of XML Parsing Using Regression and Parallelism,” Comput. Syst. Sci. Eng., vol. 48, no. 2, pp. 287-303, 2024. https://doi.org/10.32604/csse.2023.043010



cc Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 917

    View

  • 454

    Download

  • 0

    Like

Share Link