Open Access iconOpen Access

ARTICLE

crossmark

ABMRF: An Ensemble Model for Author Profiling Based on Stylistic Features Using Roman Urdu

by Muhammad Arshad1, Bilal Khan1, Khalil Khan2, Ali Mustafa Qamar3,*, Rehan Ullah Khan4

1 Department of Computer Science, City University of Science and Information Technology, Peshawar, Pakistan
2 Department of Computer Science, School of Engineering and Digital Sciences, Nazarbayev University, Astana, Kazakhstan
3 Department of Computer Science, College of Computer, Qassim University, Buraydah, Saudi Arabia
4 Department of Information Technology, College of Computer, Qassim University, Buraydah, Saudi Arabia

* Corresponding Author: Ali Mustafa Qamar. Email: email

(This article belongs to the Special Issue: Applying Computational Intelligence to Social Science Research)

Intelligent Automation & Soft Computing 2024, 39(2), 301-317. https://doi.org/10.32604/iasc.2024.045402

Abstract

This study explores the area of Author Profiling (AP) and its importance in several industries, including forensics, security, marketing, and education. A key component of AP is the extraction of useful information from text, with an emphasis on the writers’ ages and genders. To improve the accuracy of AP tasks, the study develops an ensemble model dubbed ABMRF that combines AdaBoostM1 (ABM1) and Random Forest (RF). The work uses an extensive technique that involves text message dataset pretreatment, model training, and assessment. To evaluate the effectiveness of several machine learning (ML) algorithms in classifying age and gender, including Composite Hypercube on Random Projection (CHIRP), Decision Trees (J48), Naïve Bayes (NB), K Nearest Neighbor, AdaboostM1, NB-Updatable, RF, and ABMRF, they are compared. The findings demonstrate that ABMRF regularly beats the competition, with a gender classification accuracy of 71.14% and an age classification accuracy of 54.29%, respectively. Additional metrics like precision, recall, F-measure, Matthews Correlation Coefficient (MCC), and accuracy support ABMRF’s outstanding performance in age and gender profiling tasks. This study demonstrates the usefulness of ABMRF as an ensemble model for author profiling and highlights its possible uses in marketing, law enforcement, and education. The results emphasize the effectiveness of ensemble approaches in enhancing author profiling task accuracy, particularly when it comes to age and gender identification.

Keywords


Cite This Article

APA Style
Aiman, , Arshad, M., Khan, B., Khan, K., Qamar, A.M. et al. (2024). ABMRF: an ensemble model for author profiling based on stylistic features using roman urdu. Intelligent Automation & Soft Computing, 39(2), 301-317. https://doi.org/10.32604/iasc.2024.045402
Vancouver Style
Aiman , Arshad M, Khan B, Khan K, Qamar AM, Khan RU. ABMRF: an ensemble model for author profiling based on stylistic features using roman urdu. Intell Automat Soft Comput . 2024;39(2):301-317 https://doi.org/10.32604/iasc.2024.045402
IEEE Style
Aiman, M. Arshad, B. Khan, K. Khan, A. M. Qamar, and R. U. Khan, “ABMRF: An Ensemble Model for Author Profiling Based on Stylistic Features Using Roman Urdu,” Intell. Automat. Soft Comput. , vol. 39, no. 2, pp. 301-317, 2024. https://doi.org/10.32604/iasc.2024.045402



cc Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1167

    View

  • 330

    Download

  • 0

    Like

Share Link