Home / Journals / CMC / Online First / doi:10.32604/cmc.2025.061359
Special Issues
Table of Content

Open Access

ARTICLE

Causal Representation Enhances Cross-Domain Named Entity Recognition in Large Language Models

Jiahao Wu1,2, Jinzhong Xu1, Xiaoming Liu1,*, Guan Yang1,3, Jie Liu4
1 School of Artificial Intelligence, Zhongyuan University of Technology, Zhengzhou, 450007, China
2 School of Computer Science, Zhongyuan University of Technology, Zhengzhou, 450007, China
3 Zhengzhou Key Laboratory of Text Processing and Image Understanding, Zhengzhou, 450007, China
4 School of Information Science and Technology, North China University of Technology, Beijing, 100144, China
* Corresponding Author: Xiaoming Liu. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.061359

Received 22 November 2024; Accepted 17 February 2025; Published online 19 March 2025

Abstract

Large language models cross-domain named entity recognition task in the face of the scarcity of large language labeled data in a specific domain, due to the entity bias arising from the variation of entity information between different domains, which makes large language models prone to spurious correlations problems when dealing with specific domains and entities. In order to solve this problem, this paper proposes a cross-domain named entity recognition method based on causal graph structure enhancement, which captures the cross-domain invariant causal structural representations between feature representations of text sequences and annotation sequences by establishing a causal learning and intervention module, so as to improve the utilization of causal structural features by the large language models in the target domains, and thus effectively alleviate the false entity bias triggered by the false relevance problem; meanwhile, through the semantic feature fusion module, the semantic information of the source and target domains is effectively combined. The results show an improvement of 2.47% and 4.12% in the political and medical domains, respectively, compared with the benchmark model, and an excellent performance in small-sample scenarios, which proves the effectiveness of causal graph structural enhancement in improving the accuracy of cross-domain entity recognition and reducing false correlations.

Keywords

Large language model; entity bias; causal graph structure
  • 95

    View

  • 27

    Download

  • 0

    Like

Share Link