Leveraging Pre-Trained Word Embedding Models for Fake Review Identification

Glody Muka; Patrick Mukala

doi:10.32604/jai.2024.049685

Open Access icon Open Access

ARTICLE

Leveraging Pre-Trained Word Embedding Models for Fake Review Identification

Glody Muka^1,*, Patrick Mukala^1,2,*

1 Department of Mathematics and Computer Science, National Pedagogical University, Kinshasa, P.O. Box 8815, Democratic Republic of Congo
2 School of Computer Science, University of Wollongong in Dubai, Dubai, P.O. Box 20183, United Arab Emirates

* Corresponding Authors: Glody Muka. Email: email ; Patrick Mukala. Email: email

Journal on Artificial Intelligence 2024, 6, 211-223. https://doi.org/10.32604/jai.2024.049685

Received 24 January 2024; Accepted 08 July 2024; Issue published 07 August 2024

Abstract

Reviews have a significant impact on online businesses. Nowadays, online consumers rely heavily on other people's reviews before purchasing a product, instead of looking at the product description. With the emergence of technology, malicious online actors are using techniques such as Natural Language Processing (NLP) and others to generate a large number of fake reviews to destroy their competitors’ markets. To remedy this situation, several researches have been conducted in the last few years. Most of them have applied NLP techniques to preprocess the text before building Machine Learning (ML) or Deep Learning (DL) models to detect and filter these fake reviews. However, with the same NLP techniques, machine-generated fake reviews are increasing exponentially. This work explores a powerful text representation technique called Embedding models to combat the proliferation of fake reviews in online marketplaces. Indeed, these embedding structures can capture much more information from the data compared to other standard text representations. To do this, we tested our hypothesis in two different Recurrent Neural Network (RNN) architectures, namely Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU), using fake review data from Amazon and TripAdvisor. Our experimental results show that our best-proposed model can distinguish between real and fake reviews with 91.44% accuracy. Furthermore, our results corroborate with the state-of-the-art research in this area and demonstrate some improvements over other approaches. Therefore, proper text representation improves the accuracy of fake review detection.

Keywords

Natural language processing; word embedding; deep learning; fake review detection

Cite This Article

APA Style

Muka, G., Mukala, P. (2024). Leveraging Pre-Trained Word Embedding Models for Fake Review Identification. Journal on Artificial Intelligence, 6(1), 211–223. https://doi.org/10.32604/jai.2024.049685

Vancouver Style

Muka G, Mukala P. Leveraging Pre-Trained Word Embedding Models for Fake Review Identification. J Artif Intell. 2024;6(1):211–223. https://doi.org/10.32604/jai.2024.049685

IEEE Style

G. Muka and P. Mukala, “Leveraging Pre-Trained Word Embedding Models for Fake Review Identification,” J. Artif. Intell., vol. 6, no. 1, pp. 211–223, 2024. https://doi.org/10.32604/jai.2024.049685

BibTex EndNote RIS

Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Leveraging Pre-Trained Word Embedding Models for Fake Review Identification

Abstract

Keywords

Cite This Article

1327

721

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link