Open Access iconOpen Access

ARTICLE

crossmark

A New Segmentation Framework for Arabic Handwritten Text Using Machine Learning Techniques

by Saleem Ibraheem Saleem1,*, Adnan Mohsin Abdulazeez1, Zeynep Orman2

1 Department of Information Technology, Technical Informatics College of Akre, Duhok Polytechnic University, Duhok, 42004, Kurdistan Region, Iraq
2 Department of Computer Engineering, Faculty of Engineering, Istanbul University-Cerrahpasa, Istanbul, 34320, Turkey

* Corresponding Author: Saleem Ibraheem Saleem. Email: email

(This article belongs to the Special Issue: AI, IoT, Blockchain Assisted Intelligent Solutions to Medical and Healthcare Systems)

Computers, Materials & Continua 2021, 68(2), 2727-2754. https://doi.org/10.32604/cmc.2021.016447

Abstract

The writer identification (WI) of handwritten Arabic text is now of great concern to intelligence agencies following the recent attacks perpetrated by known Middle East terrorist organizations. It is also a useful instrument for the digitalization and attribution of old text to other authors of historic studies, including old national and religious archives. In this study, we proposed a new affective segmentation model by modifying an artificial neural network model and making it suitable for the binarization stage based on blocks. This modified method is combined with a new effective rotation model to achieve an accurate segmentation through the analysis of the histogram of binary images. Also, propose a new framework for correct text rotation that will help us to establish a segmentation method that can facilitate the extraction of text from its background. Image projections and the radon transform are used and improved using machine learning based on a co-occurrence matrix to produce binary images. The training stage involves taking a number of images for model training. These images are selected randomly with different angles to generate four classes (0–90, 90–180, 180–270, and 270–360). The proposed segmentation approach achieves a high accuracy of 98.18%. The study ultimately provides two major contributions that are ranked from top to bottom according to the degree of importance. The proposed method can be further developed as a new application and used in the recognition of handwritten Arabic text from small documents regardless of logical combinations and sentence construction.

Keywords


Cite This Article

APA Style
Saleem, S.I., Abdulazeez, A.M., Orman, Z. (2021). A new segmentation framework for arabic handwritten text using machine learning techniques. Computers, Materials & Continua, 68(2), 2727-2754. https://doi.org/10.32604/cmc.2021.016447
Vancouver Style
Saleem SI, Abdulazeez AM, Orman Z. A new segmentation framework for arabic handwritten text using machine learning techniques. Comput Mater Contin. 2021;68(2):2727-2754 https://doi.org/10.32604/cmc.2021.016447
IEEE Style
S. I. Saleem, A. M. Abdulazeez, and Z. Orman, “A New Segmentation Framework for Arabic Handwritten Text Using Machine Learning Techniques,” Comput. Mater. Contin., vol. 68, no. 2, pp. 2727-2754, 2021. https://doi.org/10.32604/cmc.2021.016447



cc Copyright © 2021 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 2793

    View

  • 1681

    Download

  • 0

    Like

Share Link