A Novel Optimized Language-Independent Text Summarization Technique

Hanan A.; Alaaeldin Hafez

doi:10.32604/cmc.2022.031485

Open Access icon Open Access

ARTICLE

A Novel Optimized Language-Independent Text Summarization Technique

Hanan A. Hosni Mahmoud^1,*, Alaaeldin M. Hafez²

1 Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia
2 Department of Information Systems, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia

* Corresponding Author: Hanan A. Hosni Mahmoud. Email: email

Computers, Materials & Continua 2022, 73(3), 5121-5136. https://doi.org/10.32604/cmc.2022.031485

Received 19 April 2022; Accepted 30 May 2022; Issue published 28 July 2022

Abstract

A substantial amount of textual data is present electronically in several languages. These texts directed the gear to information redundancy. It is essential to remove this redundancy and decrease the reading time of these data. Therefore, we need a computerized text summarization technique to extract relevant information from group of text documents with correlated subjects. This paper proposes a language-independent extractive summarization technique. The proposed technique presents a clustering-based optimization technique. The clustering technique determines the main subjects of the text, while the proposed optimization technique minimizes redundancy, and maximizes significance. Experiments are devised and evaluated using BillSum dataset for the English language, MLSUM for German and Russian and Mawdoo3 for the Arabic language. The experiments are evaluated using ROUGE metrics. The results showed the effectiveness of the proposed technique compared to other language-dependent and language-independent summarization techniques. Our technique achieved better ROUGE metrics for all the utilized datasets. The technique accomplished an F-measure of 41.9% for Rouge-1, 18.7% for Rouge-2, 39.4% for Rouge-3, and 16.8% for Rouge-4 on average for all the dataset using all three objectives. Our system also exhibited an improvement of 26.6%, 35.5%, 34.65%, and 31.54% w.r.t. The recent model contributed in the summarization of BillSum in terms of ROUGE metric evaluation. Our model’s performance is higher than the compared models, especially in the metric results of ROUGE_2 which is bi-gram matching.

Keywords

Text summarization: language-independent summarization; ROUGE

Cite This Article

APA Style

Hosni Mahmoud, H.A., Hafez, A.M. (2022). A Novel Optimized Language-Independent Text Summarization Technique. Computers, Materials & Continua, 73(3), 5121–5136. https://doi.org/10.32604/cmc.2022.031485

Vancouver Style

Hosni Mahmoud HA, Hafez AM. A Novel Optimized Language-Independent Text Summarization Technique. Comput Mater Contin. 2022;73(3):5121–5136. https://doi.org/10.32604/cmc.2022.031485

IEEE Style

H. A. Hosni Mahmoud and A. M. Hafez, “A Novel Optimized Language-Independent Text Summarization Technique,” Comput. Mater. Contin., vol. 73, no. 3, pp. 5121–5136, 2022. https://doi.org/10.32604/cmc.2022.031485

BibTex EndNote RIS

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

A Novel Optimized Language-Independent Text Summarization Technique

Abstract

Keywords

Cite This Article

1736

1590

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link