Open Access iconOpen Access

ARTICLE

crossmark

Leveraging Pre-Trained Word Embedding Models for Fake Review Identification

Glody Muka1,*, Patrick Mukala1,2,*

1 Department of Mathematics and Computer Science, National Pedagogical University, Kinshasa, P.O. Box 8815, Democratic Republic of Congo
2 School of Computer Science, University of Wollongong in Dubai, Dubai, P.O. Box 20183, United Arab Emirates

* Corresponding Authors: Glody Muka. Email: email; Patrick Mukala. Email: email

Journal on Artificial Intelligence 2024, 6, 211-223. https://doi.org/10.32604/jai.2024.049685

Abstract

Reviews have a significant impact on online businesses. Nowadays, online consumers rely heavily on other people's reviews before purchasing a product, instead of looking at the product description. With the emergence of technology, malicious online actors are using techniques such as Natural Language Processing (NLP) and others to generate a large number of fake reviews to destroy their competitors’ markets. To remedy this situation, several researches have been conducted in the last few years. Most of them have applied NLP techniques to preprocess the text before building Machine Learning (ML) or Deep Learning (DL) models to detect and filter these fake reviews. However, with the same NLP techniques, machine-generated fake reviews are increasing exponentially. This work explores a powerful text representation technique called Embedding models to combat the proliferation of fake reviews in online marketplaces. Indeed, these embedding structures can capture much more information from the data compared to other standard text representations. To do this, we tested our hypothesis in two different Recurrent Neural Network (RNN) architectures, namely Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU), using fake review data from Amazon and TripAdvisor. Our experimental results show that our best-proposed model can distinguish between real and fake reviews with 91.44% accuracy. Furthermore, our results corroborate with the state-of-the-art research in this area and demonstrate some improvements over other approaches. Therefore, proper text representation improves the accuracy of fake review detection.

Keywords


Cite This Article

APA Style
Muka, G., Mukala, P. (2024). Leveraging pre-trained word embedding models for fake review identification. Journal on Artificial Intelligence, 6(1), 211-223. https://doi.org/10.32604/jai.2024.049685
Vancouver Style
Muka G, Mukala P. Leveraging pre-trained word embedding models for fake review identification. J Artif Intell . 2024;6(1):211-223 https://doi.org/10.32604/jai.2024.049685
IEEE Style
G. Muka and P. Mukala, “Leveraging Pre-Trained Word Embedding Models for Fake Review Identification,” J. Artif. Intell. , vol. 6, no. 1, pp. 211-223, 2024. https://doi.org/10.32604/jai.2024.049685



cc Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 402

    View

  • 182

    Download

  • 0

    Like

Share Link