Open Access iconOpen Access

ARTICLE

crossmark

MEIM: A Multi-Source Software Knowledge Entity Extraction Integration Model

Wuqian Lv1, Zhifang Liao1,*, Shengzong Liu2, Yan Zhang3

1 School of Computer Science and Engineering, Central South University, Changsha, 410075, China
2 School of Information Technology and Management, Hunan University of Finance and Economics, Changsha, 410205, China
3 School of Computing, Engineering and Built Environment, Glasgow Caledonian University, Glasgow, G4 0BA, UK

* Corresponding Author: Zhifang Liao. Email: email

Computers, Materials & Continua 2021, 66(1), 1027-1042. https://doi.org/10.32604/cmc.2020.012478

Abstract

Entity recognition and extraction are the foundations of knowledge graph construction. Entity data in the field of software engineering come from different platforms and communities, and have different formats. This paper divides multi-source software knowledge entities into unstructured data, semi-structured data and code data. For these different types of data, Bi-directional Long ShortTerm Memory (Bi-LSTM) with Conditional Random Field (CRF), template matching, and abstract syntax tree are used and integrated into a multi-source software knowledge entity extraction integration model (MEIM) to extract software entities. The model can be updated continuously based on user’s feedbacks to improve the accuracy. To deal with the shortage of entity annotation datasets, keyword extraction methods based on Term Frequency–Inverse Document Frequency (TF-IDF), TextRank, and K-Means are applied to annotate tasks. The proposed MEIM model is applied to the Spring Boot framework, which demonstrates good adaptability. The extracted entities are used to construct a knowledge graph, which is applied to association retrieval and association visualization.

Keywords


Cite This Article

APA Style
Lv, W., Liao, Z., Liu, S., Zhang, Y. (2021). MEIM: A multi-source software knowledge entity extraction integration model. Computers, Materials & Continua, 66(1), 1027-1042. https://doi.org/10.32604/cmc.2020.012478
Vancouver Style
Lv W, Liao Z, Liu S, Zhang Y. MEIM: A multi-source software knowledge entity extraction integration model. Comput Mater Contin. 2021;66(1):1027-1042 https://doi.org/10.32604/cmc.2020.012478
IEEE Style
W. Lv, Z. Liao, S. Liu, and Y. Zhang, “MEIM: A Multi-Source Software Knowledge Entity Extraction Integration Model,” Comput. Mater. Contin., vol. 66, no. 1, pp. 1027-1042, 2021. https://doi.org/10.32604/cmc.2020.012478

Citations




cc Copyright © 2021 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 3019

    View

  • 1625

    Download

  • 0

    Like

Share Link