Open Access iconOpen Access

ARTICLE

PKME-MLM: A Novel Multimodal Large Model for Sarcasm Detection

Jian Luo1, Yaling Li1, Xueyu Li1, Xuliang Hu2,*

1 College of Information Science and Engineering, Hunan Normal University, Changsha, 410000, China
2 Institute of Interdisciplinary Studies, Hunan Normal University, Changsha, 410000, China

* Corresponding Author: Xuliang Hu. Email: email

Computers, Materials & Continua 2025, 83(1), 877-896. https://doi.org/10.32604/cmc.2025.061401

Abstract

Sarcasm detection in Natural Language Processing (NLP) has become increasingly important, particularly with the rise of social media and non-textual emotional expressions, such as images. Existing methods often rely on separate image and text modalities, which may not fully utilize the information available from both sources. To address this limitation, we propose a novel multimodal large model, i.e., the PKME-MLM (Prior Knowledge and Multi-label Emotion analysis based Multimodal Large Model for sarcasm detection). The PKME-MLM aims to enhance sarcasm detection by integrating prior knowledge to extract useful textual information from images, which is then combined with text data for deeper analysis. This method improves the integration of image and text data, addressing the limitation of previous models that process these modalities separately. Additionally, we incorporate multi-label sentiment analysis, refining sentiment labels to improve sarcasm recognition accuracy. This design overcomes the limitations of prior models that treated sentiment classification as a single-label problem, thereby improving sarcasm recognition by distinguishing subtle emotional cues from the text. Experimental results demonstrate that our approach achieves significant performance improvements in multimodal sarcasm detection tasks, with an accuracy (Acc.) of 94.35%, and Macro-Average Precision and Recall reaching 93.92% and 94.21%, respectively. These results highlight the potential of multimodal models in improving sarcasm detection and suggest that further integration of modalities could advance future research. This work also paves the way for incorporating multimodal sentiment analysis into sarcasm detection.

Keywords

Sarcasm detection; multimodal large model; prior knowledge; multi-label fusion

Cite This Article

APA Style
Luo, J., Li, Y., Li, X., Hu, X. (2025). PKME-MLM: A novel multimodal large model for sarcasm detection. Computers, Materials & Continua, 83(1), 877–896. https://doi.org/10.32604/cmc.2025.061401
Vancouver Style
Luo J, Li Y, Li X, Hu X. PKME-MLM: A novel multimodal large model for sarcasm detection. Comput Mater Contin. 2025;83(1):877–896. https://doi.org/10.32604/cmc.2025.061401
IEEE Style
J. Luo, Y. Li, X. Li, and X. Hu, “PKME-MLM: A Novel Multimodal Large Model for Sarcasm Detection,” Comput. Mater. Contin., vol. 83, no. 1, pp. 877–896, 2025. https://doi.org/10.32604/cmc.2025.061401



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 238

    View

  • 93

    Download

  • 0

    Like

Share Link