PKME-MLM: A Novel Multimodal Large Model for Sarcasm Detection

Jian Luo; Yaling Li; Xueyu Li; Xuliang Hu

doi:10.32604/cmc.2025.061401

Open Access icon Open Access

ARTICLE

PKME-MLM: A Novel Multimodal Large Model for Sarcasm Detection

Jian Luo¹, Yaling Li¹, Xueyu Li¹, Xuliang Hu^2,*

1 College of Information Science and Engineering, Hunan Normal University, Changsha, 410000, China
2 Institute of Interdisciplinary Studies, Hunan Normal University, Changsha, 410000, China

* Corresponding Author: Xuliang Hu. Email: email

Computers, Materials & Continua 2025, 83(1), 877-896. https://doi.org/10.32604/cmc.2025.061401

Received 23 November 2024; Accepted 02 January 2025; Issue published 26 March 2025

Abstract

Sarcasm detection in Natural Language Processing (NLP) has become increasingly important, particularly with the rise of social media and non-textual emotional expressions, such as images. Existing methods often rely on separate image and text modalities, which may not fully utilize the information available from both sources. To address this limitation, we propose a novel multimodal large model, i.e., the PKME-MLM (Prior Knowledge and Multi-label Emotion analysis based Multimodal Large Model for sarcasm detection). The PKME-MLM aims to enhance sarcasm detection by integrating prior knowledge to extract useful textual information from images, which is then combined with text data for deeper analysis. This method improves the integration of image and text data, addressing the limitation of previous models that process these modalities separately. Additionally, we incorporate multi-label sentiment analysis, refining sentiment labels to improve sarcasm recognition accuracy. This design overcomes the limitations of prior models that treated sentiment classification as a single-label problem, thereby improving sarcasm recognition by distinguishing subtle emotional cues from the text. Experimental results demonstrate that our approach achieves significant performance improvements in multimodal sarcasm detection tasks, with an accuracy (Acc.) of 94.35%, and Macro-Average Precision and Recall reaching 93.92% and 94.21%, respectively. These results highlight the potential of multimodal models in improving sarcasm detection and suggest that further integration of modalities could advance future research. This work also paves the way for incorporating multimodal sentiment analysis into sarcasm detection.

Keywords

Sarcasm detection; multimodal large model; prior knowledge; multi-label fusion

Cite This Article

APA Style

Luo, J., Li, Y., Li, X., Hu, X. (2025). PKME-MLM: A novel multimodal large model for sarcasm detection. Computers, Materials & Continua, 83(1), 877–896. https://doi.org/10.32604/cmc.2025.061401

Vancouver Style

Luo J, Li Y, Li X, Hu X. PKME-MLM: A novel multimodal large model for sarcasm detection. Comput Mater Contin. 2025;83(1):877–896. https://doi.org/10.32604/cmc.2025.061401

IEEE Style

J. Luo, Y. Li, X. Li, and X. Hu, “PKME-MLM: A Novel Multimodal Large Model for Sarcasm Detection,” Comput. Mater. Contin., vol. 83, no. 1, pp. 877–896, 2025. https://doi.org/10.32604/cmc.2025.061401

BibTex EndNote RIS

Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

PKME-MLM: A Novel Multimodal Large Model for Sarcasm Detection

Abstract

Keywords

Cite This Article

238

93

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link