A Hierarchical Two-Level Feature Fusion Approach for SMS Spam Filtering

Hussein Al-Kabbi; Mohammad-Reza Feizi-Derakhshi; Saeed Pashazadeh

doi:10.32604/iasc.2024.050452

Open Access icon Open Access

ARTICLE

A Hierarchical Two-Level Feature Fusion Approach for SMS Spam Filtering

Hussein Alaa Al-Kabbi^1,2, Mohammad-Reza Feizi-Derakhshi^1,*, Saeed Pashazadeh³

1 Computerized Intelligence Systems Laboratory, Department of Computer Engineering, University of Tabriz, Tabriz, 51368, Iran
2 Ministry of Education Iraq, General Direction of Vocational Education, Al-Najaf, 54001, Iraq
3 Department of Computer Engineering, University of Tabriz, Tabriz, 51368, Iran

* Corresponding Author: Mohammad-Reza Feizi-Derakhshi. Email: email

Intelligent Automation & Soft Computing 2024, 39(4), 665-682. https://doi.org/10.32604/iasc.2024.050452

Received 07 February 2024; Accepted 27 May 2024; Issue published 06 September 2024

Abstract

SMS spam poses a significant challenge to maintaining user privacy and security. Recently, spammers have employed fraudulent writing styles to bypass spam detection systems. This paper introduces a novel two-level detection system that utilizes deep learning techniques for effective spam identification to address the challenge of sophisticated SMS spam. The system comprises five steps, beginning with the preprocessing of SMS data. RoBERTa word embedding is then applied to convert text into a numerical format for deep learning analysis. Feature extraction is performed using a Convolutional Neural Network (CNN) for word-level analysis and a Bidirectional Long Short-Term Memory (BiLSTM) for sentence-level analysis. The two-level feature extraction enables a complete understanding of individual words and sentence structure. The novel part of the proposed approach is the Hierarchical Attention Network (HAN), which fuses and selects features at two levels through an attention mechanism. The HAN can deal with words and sentences to focus on the most pertinent aspects of messages for spam detection. This network is productive in capturing meaningful features, considering both word-level and sentence-level semantics. In the classification step, the model classifies the messages into spam and ham. This hybrid deep learning method improve the feature representation, and enhancing the model’s spam detection capabilities. By significantly reducing the incidence of SMS spam, our model contributes to a safer mobile communication environment, protecting users against potential phishing attacks and scams, and aiding in compliance with privacy and security regulations. This model’s performance was evaluated using the SMS Spam Collection Dataset from the UCI Machine Learning Repository. Cross-validation is employed to consider the dataset’s imbalanced nature, ensuring a reliable evaluation. The proposed model achieved a good accuracy of 99.48%, underscoring its efficiency in identifying SMS spam.

Keywords

SMS spam detection; hierarchical attention network; text classification; natural language processing

Cite This Article

APA Style

Al-Kabbi, H.A., Feizi-Derakhshi, M., Pashazadeh, S. (2024). A Hierarchical Two-Level Feature Fusion Approach for SMS Spam Filtering. Intelligent Automation & Soft Computing, 39(4), 665–682. https://doi.org/10.32604/iasc.2024.050452

Vancouver Style

Al-Kabbi HA, Feizi-Derakhshi M, Pashazadeh S. A Hierarchical Two-Level Feature Fusion Approach for SMS Spam Filtering. Intell Automat Soft Comput. 2024;39(4):665–682. https://doi.org/10.32604/iasc.2024.050452

IEEE Style

H. A. Al-Kabbi, M. Feizi-Derakhshi, and S. Pashazadeh, “A Hierarchical Two-Level Feature Fusion Approach for SMS Spam Filtering,” Intell. Automat. Soft Comput., vol. 39, no. 4, pp. 665–682, 2024. https://doi.org/10.32604/iasc.2024.050452

BibTex EndNote RIS

Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

A Hierarchical Two-Level Feature Fusion Approach for SMS Spam Filtering

Abstract

Keywords

Cite This Article

894

373

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link