Tech Science Press - Publisher of Open Access Journals

News & Announcements

30 January 2026
Tech Science Press Shares Integrity Insights on AI-Enabled Paper Mills at Charleston Asia Conference
27 January 2026
SDHM-Recommended: I3CSE 2026 in Guangzhou
26 January 2026
TSP Establishes Strategic Cooperation with Chinese Medical Association Publishing House (CMAPH)
05 January 2026
Prof. Lin Lu Appointed Editor-in-Chief of Energy Engineering
29 December 2025
Two More Tech Science Press Journals Now Indexed in Chemical Abstracts Service (CAS) Databases
24 December 2025
Oncologie Welcomes Dr. Lei Zheng as Editor-in-Chief

Title/Keywords
Author/Affliations
Journal
Article Type
Start Year
End Year

Update Searching Clear

Show export options

Articles
Online

Search Results (2)

Open Access

ARTICLE

Benchmarking Performance of Document Level Classification and Topic Modeling

Muhammad Shahid Bhatti^1,*, Azmat Ullah¹, Rohaya Latip², Abid Sohail¹, Anum Riaz¹, Rohail Hassan³

CMC-Computers, Materials & Continua, Vol.71, No.1, pp. 125-141, 2022, DOI:10.32604/cmc.2022.020083 - 03 November 2021

Abstract Text classification of low resource language is always a trivial and challenging problem. This paper discusses the process of Urdu news classiﬁcation and Urdu documents similarity. Urdu is one of the most famous spoken languages in Asia. The implementation of computational methodologies for text classiﬁcation has increased over time. However, Urdu language has not much experimented with research, it does not have readily available datasets, which turn out to be the primary reason behind limited research and applying the latest methodologies to the Urdu. To overcome these obstacles, a medium-sized dataset having six categories is… More >

View
3691

Download
2064
Open Access

ARTICLE

News Text Topic Clustering Optimized Method Based on TF-IDF Algorithm on Spark

Zhuo Zhou¹, Jiaohua Qin^1,*, Xuyu Xiang¹, Yun Tan¹, Qiang Liu¹, Neal N. Xiong²

CMC-Computers, Materials & Continua, Vol.62, No.1, pp. 217-231, 2020, DOI:10.32604/cmc.2020.06431

Abstract Due to the slow processing speed of text topic clustering in stand-alone architecture under the background of big data, this paper takes news text as the research object and proposes LDA text topic clustering algorithm based on Spark big data platform. Since the TF-IDF (term frequency-inverse document frequency) algorithm under Spark is irreversible to word mapping, the mapped words indexes cannot be traced back to the original words. In this paper, an optimized method is proposed that TF-IDF under Spark to ensure the text words can be restored. Firstly, the text feature is extracted by More >

View
4587

Download
2262

Cited by
20

Displaying 1-10 on page 1 of 2. Per Page

Benchmarking Performance of Document Level Classification and Topic Modeling

View

3691

Download

2064

News Text Topic Clustering Optimized Method Based on TF-IDF Algorithm on Spark

View

4587

Download

2262

Cited by

20

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: