Data Analytics for the Identification of Fake Reviews Using Supervised Learning

Alsubari, Saleh Nagi; Deshmukh, Sachin N.; Alqarni, Ahmed Abdullah; Alsharif, Nizar; Aldhyani, Theyazn H. H.; Alsaade, Fawaz Waselallah; Khalaf, Osamah I.

doi:10.32604/cmc.2022.019625

Open Access

ARTICLE

by Saleh Nagi Alsubari¹, Sachin N. Deshmukh¹, Ahmed Abdullah Alqarni², Nizar Alsharif³, Theyazn H. H. Aldhyani⁴,*, Fawaz Waselallah Alsaade⁵, Osamah I. Khalaf⁶

1 Department of Computer Science & Information Technology, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, India
2 Department of Computer Sciences and Information Technology, Albaha University, Saudi Arabia
3 Department of Computer Engineering and Science, Albaha University, Saudi Arabia
4 Community College of Abqaiq, King Faisal University, Al-Ahsa, Saudi Arabia
5 College of Computer Science and Information Technology, King Faisal University, Al-Ahsa, Saudi Arabia
6 Al-Nahrain University, Baghdad, Iraq

* Corresponding Author: Theyazn H. H. Aldhyani. Email: email

(This article belongs to the Special Issue: Application of Big Data Analytics in the Management of Business)

Computers, Materials & Continua 2022, 70(2), 3189-3204. https://doi.org/10.32604/cmc.2022.019625

Received 20 April 2021; Accepted 15 June 2021; Issue published 27 September 2021

Abstract

Fake reviews, also known as deceptive opinions, are written to mislead readers and have attracted growing attention with the rapid increase in online commerce. E-commerce platforms let customers post reviews and comments about a product or service after purchasing it, and new customers usually read these reviews before making a purchase decision. The challenge is that new customers cannot easily distinguish truthful reviews from fake ones, which deceive customers, inflict losses, and tarnish the reputation of companies. The present paper attempts to develop an intelligent system that can detect fake reviews on e-commerce platforms using n-grams of the review text and sentiment scores given by the reviewer. The methodology uses a standard fake hotel review dataset, data preprocessing methods, and a term frequency-inverse document frequency (TF-IDF) approach for feature extraction and representation. For detection and classification, n-grams of the review texts are fed into the constructed models, which label each review as fake or truthful. The experiments used four supervised machine-learning techniques, trained and tested on a dataset collected from the Trip Advisor website. Naïve Bayes (NB), support vector machine (SVM), adaptive boosting (AB), and random forest (RF) achieved 88%, 93%, 94%, and 95%, respectively, in terms of testing accuracy and F1-score. Compared with existing works that used the same dataset, the proposed methods achieved higher accuracy.
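
As a rough illustration of the feature representation named in the abstract, the sketch below computes TF-IDF weights over word n-grams using only the Python standard library. It is not the authors' implementation: the function names (word_ngrams, tfidf_vectors), the whitespace tokenizer, the unigram-plus-bigram range, the unsmoothed idf = log(N / df) variant, and the three sample reviews are all assumptions made for illustration.

```python
import math
from collections import Counter


def word_ngrams(text, n_min=1, n_max=2):
    """Return lowercase word n-grams for every n in [n_min, n_max]."""
    tokens = text.lower().split()  # naive whitespace tokenizer (assumption)
    grams = []
    for n in range(n_min, n_max + 1):
        for i in range(len(tokens) - n + 1):
            grams.append(" ".join(tokens[i:i + n]))
    return grams


def tfidf_vectors(docs, n_min=1, n_max=2):
    """Return one {ngram: weight} dict per document.

    tf  = n-gram count / total n-grams in the document
    idf = log(N / df), an unsmoothed textbook variant (assumption)
    """
    grams_per_doc = [word_ngrams(d, n_min, n_max) for d in docs]
    n_docs = len(docs)

    # Document frequency: in how many documents each n-gram occurs.
    df = Counter()
    for grams in grams_per_doc:
        df.update(set(grams))

    vectors = []
    for grams in grams_per_doc:
        counts = Counter(grams)
        total = len(grams)
        vectors.append(
            {g: (c / total) * math.log(n_docs / df[g]) for g, c in counts.items()}
        )
    return vectors


# Hypothetical sample reviews, not taken from the paper's dataset.
reviews = [
    "the room was clean and quiet",
    "the staff was rude",
    "my stay was absolutely amazing amazing",
]
vectors = tfidf_vectors(reviews)
```

In this variant, an n-gram that occurs in every review (here "was") gets weight zero, while n-grams confined to one review get the largest weights. Sparse vectors of this kind are what a supervised classifier such as NB, SVM, AB, or RF would then take as input.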

Keywords

E-commerce; fake reviews detection; methodologies; machine learning; hotel reviews

Cite This Article

APA Style

Alsubari, S.N., Deshmukh, S.N., Alqarni, A.A., Alsharif, N., Aldhyani, T.H.H. et al. (2022). Data analytics for the identification of fake reviews using supervised learning. Computers, Materials & Continua, 70(2), 3189-3204. https://doi.org/10.32604/cmc.2022.019625

Vancouver Style

Alsubari SN, Deshmukh SN, Alqarni AA, Alsharif N, Aldhyani THH, Alsaade FW, et al. Data analytics for the identification of fake reviews using supervised learning. Comput Mater Contin. 2022;70(2):3189-3204. https://doi.org/10.32604/cmc.2022.019625

IEEE Style

S. N. Alsubari et al., “Data Analytics for the Identification of Fake Reviews Using Supervised Learning,” Comput. Mater. Contin., vol. 70, no. 2, pp. 3189-3204, 2022. https://doi.org/10.32604/cmc.2022.019625

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
