Multi-Domain Deep Convolutional Neural Network for Ancient Urdu Text Recognition System

K. O. Mohammed Aarif; P. Sivakumar

doi:10.32604/iasc.2022.022805

Open Access

ARTICLE

K. O. Mohammed Aarif¹,*, P. Sivakumar²

1 Department of Electronics and Communication Engineering, C. Abdul Hakeem College of Engineering and Technology, Melvisharam, 632509, India
2 Department of Electronics and Communication Engineering, Dr. NGP Institute of Technology, Coimbatore, 614048, India

* Corresponding Author: K. O. Mohammed Aarif. Email: email

Intelligent Automation & Soft Computing 2022, 33(1), 275-289. https://doi.org/10.32604/iasc.2022.022805

Received 19 August 2021; Accepted 23 September 2021; Issue published 05 January 2022

Abstract

Deep learning has achieved remarkable success in pattern recognition, and in recent years Urdu character recognition systems have benefited significantly from deep convolutional neural networks (CNNs). Most research on Urdu text recognition concentrates on formal handwritten and printed Urdu documents. In this paper, we address the challenging problem of text recognition in ancient Urdu literature documents. Because of the script's cursiveness, complex word formation (ligatures), and context sensitivity, and because adequate benchmark datasets are lacking, Urdu text in literature documents is much harder to recognize than text in formal Urdu documents. In this work, we first generate a dataset by extracting the recurrent ligatures from an ancient Urdu fatawa book. Second, we categorize and augment the ligatures to generate batches of augmented images that improve training efficiency and classification accuracy. Finally, we propose a multi-domain deep convolutional neural network that integrates a spatial-domain CNN and a frequency-domain CNN and learns the modular relations between the features originating from the two domain networks to improve classification accuracy. The experimental results show that the proposed network, trained on the augmented dataset, achieves an average accuracy of 97.8%, outperforming the other CNN models in this class. The results also show that well-known benchmark datasets are not appropriate for recognizing ancient Urdu literature, a finding verified with our prepared dataset.

Keywords

Text recognition; deep learning; multi-domain CNN; ligatures; pattern recognition
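As a rough illustration of the two input domains mentioned in the abstract, the sketch below computes a spatial representation (raw pixels) and a frequency representation (2D discrete Fourier transform magnitudes) of the same small image and concatenates them. It does not reproduce the proposed network: the abstract does not state which frequency transform is used, so the DFT is an assumption, and the function names and the 4x4 patch are hypothetical stand-ins for a ligature image.

```python
import cmath
import math


def dft2_magnitude(image):
    """Return the magnitude of the 2D discrete Fourier transform of `image`.

    `image` is a list of equal-length rows of numbers. This naive O(N^4)
    loop is only suitable for tiny illustrative inputs.
    """
    rows, cols = len(image), len(image[0])
    out = [[0.0] * cols for _ in range(rows)]
    for u in range(rows):
        for v in range(cols):
            total = 0j
            for x in range(rows):
                for y in range(cols):
                    angle = -2.0 * math.pi * (u * x / rows + v * y / cols)
                    total += image[x][y] * cmath.exp(1j * angle)
            out[u][v] = abs(total)
    return out


def flatten(grid):
    """Flatten a list of rows into a single list, row by row."""
    return [value for row in grid for value in row]


def fused_features(image):
    """Concatenate spatial-domain pixels with frequency-domain magnitudes.

    Plain concatenation is an assumed, simplest-possible fusion; the paper's
    network learns relations between features from two CNN branches.
    """
    return flatten(image) + flatten(dft2_magnitude(image))


# Hypothetical 4x4 binary patch standing in for a ligature image.
patch = [
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
]
features = fused_features(patch)
```

In a full system, each representation would feed its own convolutional branch rather than being concatenated directly; the sketch only shows that both views are derived from the same image.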

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
