Detecting XSS with Random Forest and Multi-Channel Feature Extraction

Qiurong Qin, Yueqin Li^*, Yajie Mi, Jinhui Shen, Kexin Wu, Zhenzhao Wang
Smart City College, Beijing Union University, Beijing, 100101, China
* Corresponding Author: Yueqin Li. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2024.051769

Received 14 March 2024; Accepted 13 May 2024; Published online 24 June 2024

Download PDF

Abstract

In the era of the Internet, widely used web applications have become the target of hacker attacks because they contain a large amount of personal information. Among these vulnerabilities, stealing private data through cross-site scripting (XSS) attacks is one of the most commonly used attacks by hackers. Currently, deep learning-based XSS attack detection methods have good application prospects; however, they suffer from problems such as being prone to overfitting, a high false alarm rate, and low accuracy. To address these issues, we propose a multi-stage feature extraction and fusion model for XSS detection based on Random Forest feature enhancement. The model utilizes Random Forests to capture the intrinsic structure and patterns of the data by extracting leaf node indices as features, which are subsequently merged with the original data features to form a feature set with richer information content. Further feature extraction is conducted through three parallel channels. Channel I utilizes parallel one-dimensional convolutional layers (1D convolutional layers) with different convolutional kernel sizes to extract local features at different scales and perform multi-scale feature fusion; Channel II employs maximum one-dimensional pooling layers (max 1D pooling layers) of various sizes to extract key features from the data; and Channel III extracts global information bi-directionally using a Bi-Directional Long-Short Term Memory Network (Bi-LSTM) and incorporates a multi-head attention mechanism to enhance global features. Finally, effective classification and prediction of XSS are performed by fusing the features of the three channels. To test the effectiveness of the model, we conduct experiments on six datasets. We achieve an accuracy of 100% on the UNSW-NB15 dataset and 99.99% on the CICIDS2017 dataset, which is higher than that of the existing models.

Keywords

Random forest; feature enhancement; three-channel parallelism; XSS detection

Downloads
- Full-Text PDF
Citation Tools
- BibTex
- EndNote
- RIS

149

View
28

Download
0

Like

An Empirical Comparison on Multi-Target Regression Learning
Xuefeng Xi, Victor S. Sheng, Binqi...
A Novel Ensemble Learning Algorithm Based on D-S Evidence Theory for IoT Security
Changting Shi
A Privacy-Preserving Algorithm for Clinical Decision-Support Systems Using Random Forest
Alia Alabdulkarim, Mznah Al-Rodhaan,...
Key Process Protection of High Dimensional Process Data in Complex Production
He Shi, Wenli Shang, Chunyu Chen,...
MalDetect: A Structure of Encrypted Malware Traffic Detection
Jiyuan Liu, Yingzhi Zeng, Jiangyong...

All issues

Online First

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

Detecting XSS with Random Forest and Multi-Channel Feature Extraction

Abstract

Keywords

149

28

0

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link