Vol.67, No.3, 2021, pp.2831-2844, doi:10.32604/cmc.2021.015054
OPEN ACCESS
ARTICLE
Pashto Characters Recognition Using Multi-Class Enabled Support Vector Machine
  • Sulaiman Khan1, Shah Nazir1, Habib Ullah Khan2,*, Anwar Hussain1
1 Department of Computer Science, University of Swabi, Swabi, Pakistan
2 Department of Accounting and Information Systems, College of Business and Economics, Qatar University, Doha, Qatar
* Corresponding Author: Habib Ullah Khan. Email:
(This article belongs to this Special Issue: Deep Learning and Parallel Computing for Intelligent and Efficient IoT)
Received 04 November 2020; Accepted 16 December 2020; Issue published 01 March 2021
Abstract
During the last two decades significant work has been reported in the field of cursive language’s recognition especially, in the Arabic, the Urdu and the Persian languages. The unavailability of such work in the Pashto language is because of: the absence of a standard database and of significant research work that ultimately acts as a big barrier for the research community. The slight change in the Pashto characters’ shape is an additional challenge for researchers. This paper presents an efficient OCR system for the handwritten Pashto characters based on multi-class enabled support vector machine using manifold feature extraction techniques. These feature extraction techniques include, tools such as zoning feature extractor, discrete cosine transform, discrete wavelet transform, and Gabor filters and histogram of oriented gradients. A hybrid feature map is developed by combining the manifold feature maps. This research work is performed by developing a medium-sized dataset of handwritten Pashto characters that encapsulate 200 handwritten samples for each 44 characters in the Pashto language. Recognition results are generated for the proposed model based on a manifold and hybrid feature map. An overall accuracy rates of 63.30%, 65.13%, 68.55%, 68.28%, 67.02% and 83% are generated based on a zoning technique, HoGs, Gabor filter, DCT, DWT and hybrid feature maps respectively. Applicability of the proposed model is also tested by comparing its results with a convolution neural network model. The convolution neural network-based model generated an accuracy rate of 81.02% smaller than the multi-class support vector machine. The highest accuracy rate of 83% for the multi-class SVM model based on a hybrid feature map reflects the applicability of the proposed model.
Keywords
Pashto; multi-class support vector machine; handwritten characters database; zoning; and histogram of oriented gradients
Cite This Article
S. Khan, S. Nazir, H. U. Khan and A. Hussain, "Pashto characters recognition using multi-class enabled support vector machine," Computers, Materials & Continua, vol. 67, no.3, pp. 2831–2844, 2021.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.