Multiclass Classification for Cyber Threats Detection on Twitter

Adnan Hussein; Abdulwahab Almazroi

doi:10.32604/cmc.2023.040856

Open Access

ARTICLE

Adnan Hussein¹, Abdulwahab Ali Almazroi²,*

1 College of Computer Science and Engineering, Department of Computer Science, AL-Ahgaff University, Mukalla, Yemen
2 College of Computing and Information Technology at Khulais, Department of Information Technology, University of Jeddah, Jeddah, Saudi Arabia

* Corresponding Author: Abdulwahab Ali Almazroi. Email: email

Computers, Materials & Continua 2023, 77(3), 3853-3866. https://doi.org/10.32604/cmc.2023.040856

Received 01 April 2023; Accepted 25 July 2023; Issue published 26 December 2023

Abstract

Advances in technology have increased the use of internet systems, and cybersecurity issues have become more common as a result. Cyber threats are one of the main problems in cybersecurity. Detecting them is not a trivial task, and its importance has made it a focus for many researchers. This study analyzes Twitter data to detect cyber threats using a multiclass classification approach. The data is passed through several tasks to prepare it for analysis. Term Frequency-Inverse Document Frequency (TF-IDF) features are extracted to vectorize the cleaned data, and several machine learning algorithms are used to classify the Twitter posts into multiple classes of cyber threats. The results are evaluated using metrics including precision, recall, F-score, and accuracy. The experiments showed promising results, with the Random Forest (RF) algorithm achieving an F-score of 81%. This result outperformed existing studies in the field of cyber threat detection and showed the importance of detecting cyber threats in social media posts. This work contributes to the cybersecurity research area. More investigation of multiclass classification is needed to achieve more accurate results. As future work, this study suggests applying feature representations other than TF-IDF, such as Word2Vec, and adding a feature selection phase that chooses the optimal feature subset to improve the accuracy of the detection process.
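
The pipeline outlined in the abstract (clean the posts, vectorize them with TF-IDF, evaluate predictions with precision, recall, F-score, and accuracy) can be illustrated with a minimal, standard-library-only Python sketch. This is not the authors' implementation. The cleaning rules, the TF-IDF variant (tf = count / document length, idf = log(N / df)), macro averaging of the metrics, and the function names `clean_tweet`, `tfidf`, and `macro_scores` are all assumptions made for illustration. The classifier itself (Random Forest in the study) is omitted.

```python
import math
import re
from collections import Counter


def clean_tweet(text):
    """Lowercase a post and drop URLs, @mentions, and non-letter characters.

    These cleaning rules are illustrative assumptions, not the paper's.
    """
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)   # remove URLs
    text = re.sub(r"@\w+", " ", text)           # remove user mentions
    text = re.sub(r"[^a-z\s]", " ", text)       # keep letters and whitespace
    return text.split()


def tfidf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / len(doc), idf = log(N / df).
    """
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors


def macro_scores(y_true, y_pred):
    """Macro-averaged precision, recall, and F-score, plus accuracy."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    precisions, recalls, f_scores = [], [], []
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        precisions.append(prec)
        recalls.append(rec)
        f_scores.append(f1)
    return {
        "precision": sum(precisions) / len(labels),
        "recall": sum(recalls) / len(labels),
        "f_score": sum(f_scores) / len(labels),
        "accuracy": sum(t == p for t, p in pairs) / len(pairs),
    }
```

In a multiclass setting, macro averaging weights every threat class equally regardless of how many posts it contains. The abstract does not state which averaging the study used, so a reported score may not be directly comparable to the output of this sketch.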

Keywords

Cybersecurity; cyber threat detection; artificial intelligence; machine learning; Twitter

Cite This Article

APA Style

Hussein, A., Almazroi, A.A. (2023). Multiclass Classification for Cyber Threats Detection on Twitter. Computers, Materials & Continua, 77(3), 3853–3866. https://doi.org/10.32604/cmc.2023.040856

Vancouver Style

Hussein A, Almazroi AA. Multiclass Classification for Cyber Threats Detection on Twitter. Comput Mater Contin. 2023;77(3):3853–3866. https://doi.org/10.32604/cmc.2023.040856

IEEE Style

A. Hussein and A. A. Almazroi, “Multiclass Classification for Cyber Threats Detection on Twitter,” Comput. Mater. Contin., vol. 77, no. 3, pp. 3853–3866, 2023. https://doi.org/10.32604/cmc.2023.040856


Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
