Home / Journals / CMC / Online First / doi:10.32604/cmc.2024.059403
Special Issues
Table of Content

Open Access

ARTICLE

An Arrhythmia Intelligent Recognition Method Based on a Multimodal Information and Spatio-Temporal Hybrid Neural Network Model

Xinchao Han1,2, Aojun Zhang1,2, Runchuan Li1,2,*, Shengya Shen3, Di Zhang1,2, Bo Jin1,2, Longfei Mao1,2, Linqi Yang1,2, Shuqin Zhang1,2
1 School of Computer Science, Zhongyuan University of Technology, Zhengzhou, 450007, China
2 International Joint Laboratory for AI Interpretability and Reasoning Applications, Zhengzhou, 450007, China
3 School of Foreign Languages, Zhengzhou University of Economics and Business, Zhengzhou, 450099, China
* Corresponding Author: Runchuan Li. Email: email
(This article belongs to the Special Issue: Emerging Trends and Applications of Deep Learning for Biomedical Signal and Image Processing)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2024.059403

Received 07 October 2024; Accepted 26 November 2024; Published online 19 December 2024

Abstract

Electrocardiogram (ECG) analysis is critical for detecting arrhythmias, but traditional methods struggle with large-scale Electrocardiogram data and rare arrhythmia events in imbalanced datasets. These methods fail to perform multi-perspective learning of temporal signals and Electrocardiogram images, nor can they fully extract the latent information within the data, falling short of the accuracy required by clinicians. Therefore, this paper proposes an innovative hybrid multimodal spatiotemporal neural network to address these challenges. The model employs a multimodal data augmentation framework integrating visual and signal-based features to enhance the classification performance of rare arrhythmias in imbalanced datasets. Additionally, the spatiotemporal fusion module incorporates a spatiotemporal graph convolutional network to jointly model temporal and spatial features, uncovering complex dependencies within the Electrocardiogram data and improving the model’s ability to represent complex patterns. In experiments conducted on the MIT-BIH arrhythmia dataset, the model achieved 99.95% accuracy, 99.80% recall, and a 99.78% F1 score. The model was further validated for generalization using the clinical INCART arrhythmia dataset, and the results demonstrated its effectiveness in terms of both generalization and robustness.

Keywords

Multimodal learning; spatio-temporal hybrid; graph convolutional network; data imbalance; ECG classification
  • 37

    View

  • 6

    Download

  • 0

    Like

Share Link