Imbalanced Data Classification Using SVM Based on Improved Simulated Annealing Featuring Synthetic Data Generation and Reduction

Hussein Hussein; Said Anwar; Muhammad Ahmad

doi:10.32604/cmc.2023.036025

Open Access icon Open Access

ARTICLE

Imbalanced Data Classification Using SVM Based on Improved Simulated Annealing Featuring Synthetic Data Generation and Reduction

Hussein Ibrahim Hussein¹, Said Amirul Anwar^2,*, Muhammad Imran Ahmad²

1 Department of Computer Techniques Engineering, AlSafwa University College, Karbala, 56001, Iraq
2 Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, Arau, 02600, Perlis, Malaysia

* Corresponding Author: Said Amirul Anwar. Email: email

Computers, Materials & Continua 2023, 75(1), 547-564. https://doi.org/10.32604/cmc.2023.036025

Received 14 September 2022; Accepted 19 November 2022; Issue published 06 February 2023

Abstract

Imbalanced data classification is one of the major problems in machine learning. This imbalanced dataset typically has significant differences in the number of data samples between its classes. In most cases, the performance of the machine learning algorithm such as Support Vector Machine (SVM) is affected when dealing with an imbalanced dataset. The classification accuracy is mostly skewed toward the majority class and poor results are exhibited in the prediction of minority-class samples. In this paper, a hybrid approach combining data pre-processing technique and SVM algorithm based on improved Simulated Annealing (SA) was proposed. Firstly, the data pre-processing technique which primarily aims at solving the resampling strategy of handling imbalanced datasets was proposed. In this technique, the data were first synthetically generated to equalize the number of samples between classes and followed by a reduction step to remove redundancy and duplicated data. Next is the training of a balanced dataset using SVM. Since this algorithm requires an iterative process to search for the best penalty parameter during training, an improved SA algorithm was proposed for this task. In this proposed improvement, a new acceptance criterion for the solution to be accepted in the SA algorithm was introduced to enhance the accuracy of the optimization process. Experimental works based on ten publicly available imbalanced datasets have demonstrated higher accuracy in the classification tasks using the proposed approach in comparison with the conventional implementation of SVM. Registering at an average of 89.65% of accuracy for the binary class classification has demonstrated the good performance of the proposed works.

Keywords

Imbalanced data; resampling technique; data reduction; support vector machine; simulated annealing

Cite This Article

APA Style

Hussein, H.I., Anwar, S.A., Ahmad, M.I. (2023). Imbalanced Data Classification Using SVM Based on Improved Simulated Annealing Featuring Synthetic Data Generation and Reduction. Computers, Materials & Continua, 75(1), 547–564. https://doi.org/10.32604/cmc.2023.036025

Vancouver Style

Hussein HI, Anwar SA, Ahmad MI. Imbalanced Data Classification Using SVM Based on Improved Simulated Annealing Featuring Synthetic Data Generation and Reduction. Comput Mater Contin. 2023;75(1):547–564. https://doi.org/10.32604/cmc.2023.036025

IEEE Style

H. I. Hussein, S. A. Anwar, and M. I. Ahmad, “Imbalanced Data Classification Using SVM Based on Improved Simulated Annealing Featuring Synthetic Data Generation and Reduction,” Comput. Mater. Contin., vol. 75, no. 1, pp. 547–564, 2023. https://doi.org/10.32604/cmc.2023.036025

BibTex EndNote RIS

Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Imbalanced Data Classification Using SVM Based on Improved Simulated Annealing Featuring Synthetic Data Generation and Reduction

Abstract

Keywords

Cite This Article

2683

1199

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link