Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (5)
  • Open Access

    ARTICLE

    Analyzing COVID-19 Discourse on Twitter: Text Clustering and Classification Models for Public Health Surveillance

    Pakorn Santakij1, Samai Srisuay2,*, Pongporn Punpeng1

    Computer Systems Science and Engineering, Vol.48, No.3, pp. 665-689, 2024, DOI:10.32604/csse.2024.045066 - 20 May 2024

    Abstract Social media has revolutionized the dissemination of real-life information, serving as a robust platform for sharing life events. Twitter, characterized by its brevity and continuous flow of posts, has emerged as a crucial source for public health surveillance, offering valuable insights into public reactions during the COVID-19 pandemic. This study aims to leverage a range of machine learning techniques to extract pivotal themes and facilitate text classification on a dataset of COVID-19 outbreak-related tweets. Diverse topic modeling approaches have been employed to extract pertinent themes and subsequently form a dataset for training text classification models.… More >

  • Open Access

    ARTICLE

    ESG Discourse Analysis Through BERTopic: Comparing News Articles and Academic Papers

    Haein Lee1, Seon Hong Lee1, Kyeo Re Lee2, Jang Hyun Kim3,*

    CMC-Computers, Materials & Continua, Vol.75, No.3, pp. 6023-6037, 2023, DOI:10.32604/cmc.2023.039104 - 29 April 2023

    Abstract Environmental, social, and governance (ESG) factors are critical in achieving sustainability in business management and are used as values aiming to enhance corporate value. Recently, non-financial indicators have been considered as important for the actual valuation of corporations, thus analyzing natural language data related to ESG is essential. Several previous studies limited their focus to specific countries or have not used big data. Past methodologies are insufficient for obtaining potential insights into the best practices to leverage ESG. To address this problem, in this study, the authors used data from two platforms: LexisNexis, a platform… More >

  • Open Access

    ARTICLE

    Ensemble Deep Learning Framework for Situational Aspects-Based Annotation and Classification of International Student’s Tweets during COVID-19

    Shabir Hussain1, Muhammad Ayoub2, Yang Yu1, Junaid Abdul Wahid1, Akmal Khan3, Dietmar P. F. Moller4, Hou Weiyan1,*

    CMC-Computers, Materials & Continua, Vol.75, No.3, pp. 5355-5377, 2023, DOI:10.32604/cmc.2023.036779 - 29 April 2023

    Abstract As the COVID-19 pandemic swept the globe, social media platforms became an essential source of information and communication for many. International students, particularly, turned to Twitter to express their struggles and hardships during this difficult time. To better understand the sentiments and experiences of these international students, we developed the Situational Aspect-Based Annotation and Classification (SABAC) text mining framework. This framework uses a three-layer approach, combining baseline Deep Learning (DL) models with Machine Learning (ML) models as meta-classifiers to accurately predict the sentiments and aspects expressed in tweets from our collected Student-COVID-19 dataset. Using the… More >

  • Open Access

    ARTICLE

    Automated File Labeling for Heterogeneous Files Organization Using Machine Learning

    Sagheer Abbas1, Syed Ali Raza1,2, M. A. Khan3, Muhammad Adnan Khan4,*, Atta-ur-Rahman5, Kiran Sultan6, Amir Mosavi7,8,9

    CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 3263-3278, 2023, DOI:10.32604/cmc.2023.032864 - 31 October 2022

    Abstract File labeling techniques have a long history in analyzing the anthological trends in computational linguistics. The situation becomes worse in the case of files downloaded into systems from the Internet. Currently, most users either have to change file names manually or leave a meaningless name of the files, which increases the time to search required files and results in redundancy and duplications of user files. Currently, no significant work is done on automated file labeling during the organization of heterogeneous user files. A few attempts have been made in topic modeling. However, one major drawback More >

  • Open Access

    ARTICLE

    Benchmarking Performance of Document Level Classification and Topic Modeling

    Muhammad Shahid Bhatti1,*, Azmat Ullah1, Rohaya Latip2, Abid Sohail1, Anum Riaz1, Rohail Hassan3

    CMC-Computers, Materials & Continua, Vol.71, No.1, pp. 125-141, 2022, DOI:10.32604/cmc.2022.020083 - 03 November 2021

    Abstract Text classification of low resource language is always a trivial and challenging problem. This paper discusses the process of Urdu news classification and Urdu documents similarity. Urdu is one of the most famous spoken languages in Asia. The implementation of computational methodologies for text classification has increased over time. However, Urdu language has not much experimented with research, it does not have readily available datasets, which turn out to be the primary reason behind limited research and applying the latest methodologies to the Urdu. To overcome these obstacles, a medium-sized dataset having six categories is… More >

Displaying 1-10 on page 1 of 5. Per Page