Vol.33, No.1, 2022, pp.275-289, doi:10.32604/iasc.2022.022805
OPEN ACCESS
ARTICLE
Multi-Domain Deep Convolutional Neural Network for Ancient Urdu Text Recognition System
  • K. O. Mohammed Aarif1,*, P. Sivakumar2
1 Department of Electronics and Communication Engineering, C. Abdul Hakeem College of Engineering and Technology, Melvisharam, 632509, India
2 Department of Electronics and Communication Engineering, Dr. NGP Institute of Technology, Coimbatore, 614048, India
* Corresponding Author: K. O. Mohammed Aarif. Email:
Received 19 August 2021; Accepted 23 September 2021; Issue published 05 January 2022
Abstract
Deep learning has achieved magnificent success in the field of pattern recognition. In recent years Urdu character recognition system has significantly benefited from the effectiveness of the deep convolutional neural network. Majority of the research on Urdu text recognition are concentrated on formal handwritten and printed Urdu text document. In this paper, we experimented the Challenging issue of text recognition in Urdu ancient literature documents. Due to its cursiveness, complex word formation (ligatures), and context-sensitivity, and inadequate benchmark dataset, recognition of Urdu text from the literature document is very difficult to process compared to the formal Urdu text document. In this work, first, we generated a dataset by extracting the recurrent ligatures from an ancient Urdu fatawa book. Secondly, we categorized and augment the ligatures to generate batches of augmented images that improvise the training efficiency and classification accuracy. Finally, we proposed a multi-domain deep Convolutional Neural Network which integrates a spatial domain and a frequency domain CNN to learn the modular relations between features originating from the two different domain networks to train and improvise the classification accuracy. The experimental results show that the proposed network with the augmented dataset achieves an averaged accuracy of 97.8% which outperforms the other CNN models in this class. The experimental results also show that for the recognition of ancient Urdu literature, well-known benchmark datasets are not appropriate which is also verified with our prepared dataset.
Keywords
Text recognition; deep learning; multi-domain CNN; ligatures; pattern recognition
Cite This Article
K. O. Mohammed Aarif and P. Sivakumar, "Multi-domain deep convolutional neural network for ancient urdu text recognition system," Intelligent Automation & Soft Computing, vol. 33, no.1, pp. 275–289, 2022.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.