Ra’ed M. Al-Khatib1,*, Taha Zerrouki2, Mohammed M. Abu Shquier3, Amar Balla4, Asef Al-Khateeb5
CMC-Computers, Materials & Continua, Vol.68, No.1, pp. 1255-1269, 2021, DOI:10.32604/cmc.2021.016155
- 22 March 2021
Abstract This paper introduces a new enhanced Arabic stemming algorithm for solving the information retrieval problem, especially in medical documents. Our proposed algorithm is a light stemming algorithm for extracting stems and roots from the input data. One of the main challenges facing the light stemming algorithm is cutting off the input word, to extract the initial segments. When initiating the light stemmer with strong initial segments, the final extracting stems and roots will be more accurate. Therefore, a new enhanced segmentation based on deploying the Direct Acyclic Graph (DAG) model is utilized. In addition to More >