Open Access iconOpen Access

ARTICLE

crossmark

A Big Data Approach to Black Friday Sales

by Mazhar Javed Awan1,2,*, Mohd Shafry Mohd Rahim2, Haitham Nobanee3,4,5, Awais Yasin6, Osamah Ibrahim Khalaf7, Umer Ishfaq2

1 Department of Software Engineering, University of Management and Technology, Lahore, Pakistan
2 School of Computing, Faculty of Engineering, University Teknologi Malaysia, Johor, Malaysia
3Collage of Business, Abu Dhabi University, Abu Dhabi, United Arab Emirates
4 Oxford Center for Islamic Studies, the University of Oxford, Marston Road, Oxford, UK
5 The University of Liverpool Management School, the University of Liverpool, Liverpool, UK
6 Department of Computer Engineering, National University of Technology, Islamabad, Pakistan
7 AlNahrain Nanorenewable Energy Research Centre, Al-Nahrain University, Baghdad, Iraq

* Corresponding Author: Mazhar Javed Awan. Email: email

Intelligent Automation & Soft Computing 2021, 27(3), 785-797. https://doi.org/10.32604/iasc.2021.014216

Abstract

Retail companies recognize the need to analyze and predict their sales and customer behavior against their products and product categories. Our study aims to help retail companies create personalized deals and promotions for their customers, even during the COVID-19 pandemic, through a big data framework that allows them to handle massive sales volumes with more efficient models. In this paper, we used Black Friday sales data taken from a dataset on the Kaggle website, which contains nearly 550,000 observations analyzed with 10 features: qualitative and quantitative. The class label is purchases and sales (in U.S. dollars). Because the predictor label is continuous, regression models are suited in this case. Using the Apache Spark big data framework, which uses the MLlib machine learning library, we trained two machine learning models: linear regression and random forest. These machine learning algorithms were used to predict future pricing and sales. We first implemented a linear regression model and a random forest model without using the Spark framework and achieved accuracies of 68% and 74%, respectively. Then, we trained these models on the Spark machine learning big data framework where we achieved an accuracy of 72% for the linear regression model and 81% for the random forest model.

Keywords


Cite This Article

APA Style
Javed Awan, M., Rahim, M.S.M., Nobanee, H., Yasin, A., Khalaf, O.I. et al. (2021). A big data approach to black friday sales. Intelligent Automation & Soft Computing, 27(3), 785-797. https://doi.org/10.32604/iasc.2021.014216
Vancouver Style
Javed Awan M, Rahim MSM, Nobanee H, Yasin A, Khalaf OI, Ishfaq U. A big data approach to black friday sales. Intell Automat Soft Comput . 2021;27(3):785-797 https://doi.org/10.32604/iasc.2021.014216
IEEE Style
M. Javed Awan, M. S. M. Rahim, H. Nobanee, A. Yasin, O. I. Khalaf, and U. Ishfaq, “A Big Data Approach to Black Friday Sales,” Intell. Automat. Soft Comput. , vol. 27, no. 3, pp. 785-797, 2021. https://doi.org/10.32604/iasc.2021.014216

Citations




cc Copyright © 2021 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 4994

    View

  • 2488

    Download

  • 2

    Like

Share Link