SF-CNN: Deep Text Classification and Retrieval for Text Documents

R. Sarasu; K. Thyagharajan; N. Shanker

doi:10.32604/iasc.2023.027429

Open Access

ARTICLE

R. Sarasu¹,*, K. K. Thyagharajan², N. R. Shanker³

1 Computer Science and Engineering, Dhanalakshmi College of Engineering, Anna University, Chennai, India
2 R. M. D Engineering College, Anna University, Chennai, India
3 Computer Science and Engineering, Aalim Muhammed Salegh College of Engineering, Anna University, Chennai, India

* Corresponding Author: R. Sarasu.

Intelligent Automation & Soft Computing 2023, 35(2), 1799-1813. https://doi.org/10.32604/iasc.2023.027429

Received 17 January 2022; Accepted 13 March 2022; Issue published 19 July 2022

Abstract

Researchers and scientists need rapid access to text documents such as research papers, source code and dissertations. Many research documents are available on the Internet, and retrieving the exact documents from keywords is time-consuming. An efficient classification algorithm is therefore required for keyword-based document retrieval. Traditional algorithms perform poorly because they consider neither the polysemy of words nor the relationships among the bag-of-words in the keywords. To solve this problem, Semantic Featured Convolution Neural Networks (SF-CNN) is proposed to capture the key relationships among the search keywords and to build a structure for matching words so that the correct text documents are retrieved. The proposed SF-CNN is based on a deep, semantic bag-of-words representation for document retrieval. Traditional deep learning methods such as the Convolutional Neural Network and the Recurrent Neural Network do not use a semantic representation for bag-of-words. Experiments are performed on different document datasets to evaluate the performance of the proposed SF-CNN method. SF-CNN classifies the documents with an accuracy of 94%, which is higher than that of the traditional algorithms.

Keywords

Semantic; classification; convolution neural networks; semantic enhancement

Cite This Article

APA Style

Sarasu, R., Thyagharajan, K.K., Shanker, N.R. (2023). SF-CNN: Deep Text Classification and Retrieval for Text Documents. Intelligent Automation & Soft Computing, 35(2), 1799–1813. https://doi.org/10.32604/iasc.2023.027429

Vancouver Style

Sarasu R, Thyagharajan KK, Shanker NR. SF-CNN: Deep Text Classification and Retrieval for Text Documents. Intell Automat Soft Comput. 2023;35(2):1799–1813. https://doi.org/10.32604/iasc.2023.027429

IEEE Style

R. Sarasu, K. K. Thyagharajan, and N. R. Shanker, “SF-CNN: Deep Text Classification and Retrieval for Text Documents,” Intell. Automat. Soft Comput., vol. 35, no. 2, pp. 1799–1813, 2023. https://doi.org/10.32604/iasc.2023.027429

Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
