Open Access
ARTICLE
SF-CNN: Deep Text Classification and Retrieval for Text Documents
1 Computer Science and Engineering, Dhanalakshmi College of Engineering, Anna University, Chennai, India
2 R. M. D Engineering College, Anna University, Chennai, India
3 Computer Science and Engineering, Aalim Muhammed Salegh College of Engineering, Anna University, Chennai, India
* Corresponding Author: R. Sarasu. Email:
Intelligent Automation & Soft Computing 2023, 35(2), 1799-1813. https://doi.org/10.32604/iasc.2023.027429
Received 17 January 2022; Accepted 13 March 2022; Issue published 19 July 2022
Abstract
Researchers and scientists need rapid access to text documents such as research papers, source code, and dissertations. Many research documents are available on the Internet, and retrieving the exact documents that match a set of keywords is time-consuming. An efficient classification algorithm for keyword-based document retrieval is therefore required. Traditional algorithms perform poorly because they consider neither the polysemy of words nor the relationships among the words in the bag-of-words representation of the keywords. To solve this problem, Semantic Featured Convolution Neural Networks (SF-CNN) is proposed to capture the key relationships among the search keywords and to build a structure for matching words so that the correct text documents are retrieved. The proposed SF-CNN is based on a deep, semantic bag-of-words representation for document retrieval. Traditional deep learning methods such as the Convolutional Neural Network and the Recurrent Neural Network do not use a semantic representation for bag-of-words. Experiments are performed on different document datasets to evaluate the performance of the proposed SF-CNN method. SF-CNN classifies the documents with an accuracy of 94%, outperforming the traditional algorithms.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.