Amir Yasseen Mahdi1,2,*, Siti Sophiayati Yuhaniz1
CMC-Computers, Materials & Continua, Vol.74, No.1, pp. 1375-1392, 2023, DOI:10.32604/cmc.2023.032671
- 22 September 2022
Abstract The extraction of features from unstructured clinical data of Covid-19 patients is critical for guiding clinical decision-making and diagnosing this viral disease. Furthermore, an early and accurate diagnosis of COVID-19 can reduce the burden on healthcare systems. In this paper, an improved Term Weighting technique combined with Parts-Of-Speech (POS) Tagging is proposed to reduce dimensions for automatic and effective classification of clinical text related to Covid-19 disease. Term Frequency-Inverse Document Frequency (TF-IDF) is the most often used term weighting scheme (TWS). However, TF-IDF has several developments to improve its drawbacks, in particular, it is not… More >