Detecting XSS with Random Forest and Multi-Channel Feature Extraction

Qin, Qiurong; Li, Yueqin; Mi, Yajie; Shen, Jinhui; Wu, Kexin; Wang, Zhenzhao

doi:10.32604/cmc.2024.051769

Open Access icon Open Access

ARTICLE

Detecting XSS with Random Forest and Multi-Channel Feature Extraction

by Qiurong Qin, Yueqin Li^*, Yajie Mi, Jinhui Shen, Kexin Wu, Zhenzhao Wang

Smart City College, Beijing Union University, Beijing, 100101, China

* Corresponding Author: Yueqin Li. Email: email

Computers, Materials & Continua 2024, 80(1), 843-874. https://doi.org/10.32604/cmc.2024.051769

Received 14 March 2024; Accepted 13 May 2024; Issue published 18 July 2024

Abstract

In the era of the Internet, widely used web applications have become the target of hacker attacks because they contain a large amount of personal information. Among these vulnerabilities, stealing private data through cross-site scripting (XSS) attacks is one of the most commonly used attacks by hackers. Currently, deep learning-based XSS attack detection methods have good application prospects; however, they suffer from problems such as being prone to overfitting, a high false alarm rate, and low accuracy. To address these issues, we propose a multi-stage feature extraction and fusion model for XSS detection based on Random Forest feature enhancement. The model utilizes Random Forests to capture the intrinsic structure and patterns of the data by extracting leaf node indices as features, which are subsequently merged with the original data features to form a feature set with richer information content. Further feature extraction is conducted through three parallel channels. Channel I utilizes parallel one-dimensional convolutional layers (1D convolutional layers) with different convolutional kernel sizes to extract local features at different scales and perform multi-scale feature fusion; Channel II employs maximum one-dimensional pooling layers (max 1D pooling layers) of various sizes to extract key features from the data; and Channel III extracts global information bi-directionally using a Bi-Directional Long-Short Term Memory Network (Bi-LSTM) and incorporates a multi-head attention mechanism to enhance global features. Finally, effective classification and prediction of XSS are performed by fusing the features of the three channels. To test the effectiveness of the model, we conduct experiments on six datasets. We achieve an accuracy of 100% on the UNSW-NB15 dataset and 99.99% on the CICIDS2017 dataset, which is higher than that of the existing models.

Keywords

Random forest; feature enhancement; three-channel parallelism; XSS detection

Cite This Article

APA Style

Qin, Q., Li, Y., Mi, Y., Shen, J., Wu, K. et al. (2024). Detecting XSS with random forest and multi-channel feature extraction. Computers, Materials & Continua, 80(1), 843-874. https://doi.org/10.32604/cmc.2024.051769

Vancouver Style

Qin Q, Li Y, Mi Y, Shen J, Wu K, Wang Z. Detecting XSS with random forest and multi-channel feature extraction. Comput Mater Contin. 2024;80(1):843-874 https://doi.org/10.32604/cmc.2024.051769

IEEE Style

Q. Qin, Y. Li, Y. Mi, J. Shen, K. Wu, and Z. Wang, “Detecting XSS with Random Forest and Multi-Channel Feature Extraction,” Comput. Mater. Contin., vol. 80, no. 1, pp. 843-874, 2024. https://doi.org/10.32604/cmc.2024.051769

BibTex EndNote RIS

Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Detecting XSS with Random Forest and Multi-Channel Feature Extraction

Abstract

Keywords

Cite This Article

511

267

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link