Jing Yang, Bin Ji, Shasha Li*, Jun Ma, Jie Yu
CMC-Computers, Materials & Continua, Vol.78, No.3, pp. 4303-4315, 2024, DOI:10.32604/cmc.2023.035594
- 26 March 2024
Abstract Named entity recognition (NER) is a fundamental task of information extraction (IE), and it has attracted considerable research attention in recent years. The abundant annotated English NER datasets have significantly promoted the NER research in the English field. By contrast, much fewer efforts are made to the Chinese NER research, especially in the scientific domain, due to the scarcity of Chinese NER datasets. To alleviate this problem, we present a Chinese scientific NER dataset–SciCN, which contains entity annotations of titles and abstracts derived from 3,500 scientific papers. We manually annotate a total of 62,059 entities,… More >