Search Results (12)
  • Open Access

    ARTICLE

    Improved Data Stream Clustering Method: Incorporating KD-Tree for Typicality and Eccentricity-Based Approach

    Dayu Xu1,#, Jiaming Lü1,#, Xuyao Zhang2, Hongtao Zhang1,*

    CMC-Computers, Materials & Continua, Vol.78, No.2, pp. 2557-2573, 2024, DOI:10.32604/cmc.2024.045932 - 27 February 2024

    Abstract Data stream clustering is integral to contemporary big data applications. However, addressing the ongoing influx of data streams efficiently and accurately remains a primary challenge in current research. This paper aims to improve the efficiency and precision of data stream clustering. Leveraging the TEDA (Typicality and Eccentricity Data Analysis) algorithm as a foundation, we introduce improvements by integrating a nearest neighbor search algorithm to enhance both the efficiency and accuracy of the algorithm. The original TEDA algorithm, grounded in the concept of “Typicality and Eccentricity Data Analytics”, represents an evolving and recursive method that requires…

  • Open Access

    REVIEW

    Subspace Clustering in High-Dimensional Data Streams: A Systematic Literature Review

    Nur Laila Ab Ghani1,2,*, Izzatdin Abdul Aziz1,2, Said Jadid AbdulKadir1,2

    CMC-Computers, Materials & Continua, Vol.75, No.2, pp. 4649-4668, 2023, DOI:10.32604/cmc.2023.035987 - 31 March 2023

    Abstract Clustering high-dimensional data is challenging as data dimensionality increases the distance between data points, resulting in sparse regions that degrade clustering performance. Subspace clustering is a common approach for processing high-dimensional data by finding relevant features for each cluster in the data space. Subspace clustering methods extend traditional clustering to account for the constraints imposed by data streams. Data streams are not only high-dimensional, but also unbounded and evolving. This necessitates the development of subspace clustering algorithms that can handle high dimensionality and adapt to the unique characteristics of data streams. Although many articles…

  • Open Access

    ARTICLE

    Combined Effect of Concept Drift and Class Imbalance on Model Performance During Stream Classification

    Abdul Sattar Palli1,6,*, Jafreezal Jaafar1,2, Manzoor Ahmed Hashmani1,3, Heitor Murilo Gomes4,5, Aeshah Alsughayyir7, Abdul Rehman Gilal1

    CMC-Computers, Materials & Continua, Vol.75, No.1, pp. 1827-1845, 2023, DOI:10.32604/cmc.2023.033934 - 06 February 2023

    Abstract Every application in a smart city environment, such as the smart grid, health monitoring, security, and surveillance, generates non-stationary data streams. Because of this, the statistical properties of the data change over time, leading to class imbalance and concept drift issues. Both these issues cause model performance degradation. Most of the current work has focused on developing an ensemble strategy by training a new classifier on the latest data to resolve the issue. These techniques suffer while training the new classifier if the data is imbalanced. Also, the class imbalance ratio may change greatly from…

  • Open Access

    ARTICLE

    Drift Detection Method Using Distance Measures and Windowing Schemes for Sentiment Classification

    Idris Rabiu1,3,*, Naomie Salim2, Maged Nasser1,4, Aminu Da’u1, Taiseer Abdalla Elfadil Eisa5, Mhassen Elnour Elneel Dalam6

    CMC-Computers, Materials & Continua, Vol.74, No.3, pp. 6001-6017, 2023, DOI:10.32604/cmc.2023.035221 - 28 December 2022

    Abstract Textual data streams have been extensively used in practical applications where consumers of online products have expressed their views regarding online products. Due to changes in data distribution, commonly referred to as concept drift, mining this data stream is a challenging problem for researchers. The majority of the existing drift detection techniques are based on classification errors, which have higher probabilities of false-positive or missed detections. To improve classification accuracy, there is a need to develop more intuitive detection techniques that can identify a great number of drifts in the data streams. This paper presents…

  • Open Access

    ARTICLE

    Sentiment Drift Detection and Analysis in Real Time Twitter Data Streams

    E. Susi*, A. P. Shanthi

    Computer Systems Science and Engineering, Vol.45, No.3, pp. 3231-3246, 2023, DOI:10.32604/csse.2023.032104 - 21 December 2022

    Abstract Handling sentiment drifts in real-time Twitter data streams is a challenging task when performing sentiment classification, because the sentiments of Twitter users change over time. The growing volume of tweets with sentiment drifts has led to the need for devising an adaptive approach to detect and handle this drift in real time. This work proposes an adaptive learning algorithm-based framework, Twitter Sentiment Drift Analysis-Bidirectional Encoder Representations from Transformers (TSDA-BERT), which introduces a sentiment drift measure to detect drifts and a domain impact score to adaptively retrain the…

  • Open Access

    ARTICLE

    Clustered Single-Board Devices with Docker Container Big Stream Processing Architecture

    N. Penchalaiah1, Abeer S. Al-Humaimeedy2, Mashael Maashi3, J. Chinna Babu4,*, Osamah Ibrahim Khalaf5, Theyazn H. H. Aldhyani6

    CMC-Computers, Materials & Continua, Vol.73, No.3, pp. 5349-5365, 2022, DOI:10.32604/cmc.2022.029639 - 28 July 2022

    Abstract The expanding amount of information created by Internet of Things (IoT) devices places a strain on cloud computing, which is often used for data analysis and storage. This paper investigates a different approach based on edge cloud applications, which involves data filtering and processing before being delivered to a backup cloud environment. This paper suggests designing and implementing a low-cost, low-power cluster of Single Board Computers (SBC) for this purpose, reducing the amount of data that must be transmitted elsewhere, using Big Data ideas and technology. An Apache Hadoop and Spark Cluster that…

  • Open Access

    ARTICLE

    Incremental Learning Framework for Mining Big Data Stream

    Alaa Eisa1, Nora EL-Rashidy2, Mohammad Dahman Alshehri3,*, Hazem M. El-bakry1, Samir Abdelrazek1

    CMC-Computers, Materials & Continua, Vol.71, No.2, pp. 2901-2921, 2022, DOI:10.32604/cmc.2022.021342 - 07 December 2021

    Abstract Currently, data stream classification plays a key role in big data analytics due to its enormous growth. Most existing classification methods use ensemble learning, which is trustworthy, but these methods are not effective at learning from imbalanced big data, and they also assume that all data are pre-classified. Another weakness of current methods is that they take a long evaluation time when the target data stream contains a high number of features. The main objective of this research is to develop a new method for incremental learning…

  • Open Access

    ARTICLE

    Impact of Distance Measures on the Performance of AIS Data Clustering

    Marta Mieczyńska1,*, Ireneusz Czarnowski2

    Computer Systems Science and Engineering, Vol.36, No.1, pp. 69-82, 2021, DOI:10.32604/csse.2021.014327 - 23 December 2020

    Abstract Automatic Identification System (AIS) data stream analysis is based on AIS data describing the behaviours of different vessels, including the vessels’ routes. When the AIS data contain outliers or noise, or are incomplete, analysis of the vessels’ behaviours is impossible or limited. When the data contain outliers, it is not possible to automatically assign the AIS data to a particular vessel. In this paper, a clustering method is proposed to support the AIS data analysis, to qualify noise and outliers with respect to their suitability, and finally to aid the reconstruction…

  • Open Access

    ARTICLE

    FogMed: A Fog-Based Framework for Disease Prognosis Based Medical Sensor Data Streams

    Le Sun1,*, Qiandi Yu1, Dandan Peng1, Sudha Subramani2, Xuyang Wang1

    CMC-Computers, Materials & Continua, Vol.66, No.1, pp. 603-619, 2021, DOI:10.32604/cmc.2020.012515 - 30 October 2020

    Abstract Recently, an increasing number of works have started investigating the combination of fog computing and electronic health (ehealth) applications. However, there are still numerous unresolved issues worth exploring. For instance, there is a lack of investigation into disease prediction in fog environments, and only limited studies show how the Quality of Service (QoS) levels of fog services and the data stream mining techniques influence each other to improve the disease prediction performance (e.g., accuracy and time efficiency). To address these issues, we propose a fog-based framework for disease prediction based on medical sensor…

  • Open Access

    ARTICLE

    A Scalable Method of Maintaining Order Statistics for Big Data Stream

    Zhaohui Zhang*,1,2,3, Jian Chen1, Ligong Chen1, Qiuwen Liu1, Lijun Yang1, Pengwei Wang1,2,3, Yongjun Zheng4

    CMC-Computers, Materials & Continua, Vol.60, No.1, pp. 117-132, 2019, DOI:10.32604/cmc.2019.05325

    Abstract Recently, some online quantile algorithms have been proposed to analyze the order statistics of high-volume, high-velocity data streams. One drawback of these algorithms is that they are not scalable, because they take the GK algorithm as a subroutine, which is not known to be mergeable. Another drawback is that they cannot maintain correctness: the error increases as the window slides. In this paper, we use a novel data structure to store the sketch that maintains the order statistics over sliding windows. Therefore three algorithms have…

Displaying 1-10 of 12 results (page 1).