Open Access
ARTICLE
Shallow Neural Network and Ontology-Based Novel Semantic Document Indexing for Information Retrieval
1 University School of Information, Communication & Technology, Guru Gobind Singh Indraprastha University, Delhi, 110078, India
2 Department of Computer Science and Engineering, Netaji Subhas University of Technology, Delhi, 110078, India
* Corresponding Author: Anil Sharma. Email:
Intelligent Automation & Soft Computing 2022, 34(3), 1989-2005. https://doi.org/10.32604/iasc.2022.026095
Received 15 December 2021; Accepted 15 February 2022; Issue published 25 May 2022
Abstract
Information Retrieval (IR) systems are developed to fetch the most relevant content matching the user’s information needs from a pool of information. A user expects to get IR results based on the conceptual contents of the query rather than keywords. But traditional IR approaches index documents based on the terms that they contain and ignore semantic descriptions of document contents. This results in a vocabulary gap when queries and documents use different terms to describe the same concept. As a solution to this problem and to improve the performance of IR systems, we have designed a Shallow Neural Network and ontology-based novel approach for semantic document indexing (SNNOntoSDI). The SNNOntoSDI approach identifies the concepts representing a document using the word2vec model (a Shallow Neural Network) and domain ontology. The relevance of a concept in the document is measured by assigning weight to the concept based on its statistical, semantic, and scientific Named Entity features. The parameters of these feature weights are calculated using the Analytic Hierarchy Process (AHP). Finally, concepts are ranked in order of relevance. To empirically evaluate the SNNOntoSDI approach, a series of experiments were carried out on five standard publicly available datasets. The results of experiments demonstrate that the SNNOntoSDI approach outperformed state-of-the-art methods, with an average improvement of 29% and 25% in average accuracy and F-measure respectively.Keywords
Cite This Article
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.