Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (130)
  • Open Access

    ARTICLE

    A Multimodal Sentiment Analysis Method Based on Multi-Granularity Guided Fusion

    Zilin Zhang1, Yan Liu1,*, Jia Liu2, Senbao Hou3, Yuping Zhang1, Chenyuan Wang1

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-14, 2026, DOI:10.32604/cmc.2025.072286 - 09 December 2025

    Abstract With the growing demand for more comprehensive and nuanced sentiment understanding, Multimodal Sentiment Analysis (MSA) has gained significant traction in recent years and continues to attract widespread attention in the academic community. Despite notable advances, existing approaches still face critical challenges in both information modeling and modality fusion. On one hand, many current methods rely heavily on encoders to extract global features from each modality, which limits their ability to capture latent fine-grained emotional cues within modalities. On the other hand, prevailing fusion strategies often lack mechanisms to model semantic discrepancies across modalities and to… More >

  • Open Access

    ARTICLE

    MultiAgent-CoT: A Multi-Agent Chain-of-Thought Reasoning Model for Robust Multimodal Dialogue Understanding

    Ans D. Alghamdi*

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-35, 2026, DOI:10.32604/cmc.2025.071210 - 09 December 2025

    Abstract Multimodal dialogue systems often fail to maintain coherent reasoning over extended conversations and suffer from hallucination due to limited context modeling capabilities. Current approaches struggle with cross-modal alignment, temporal consistency, and robust handling of noisy or incomplete inputs across multiple modalities. We propose MultiAgent-Chain of Thought (CoT), a novel multi-agent chain-of-thought reasoning framework where specialized agents for text, vision, and speech modalities collaboratively construct shared reasoning traces through inter-agent message passing and consensus voting mechanisms. Our architecture incorporates self-reflection modules, conflict resolution protocols, and dynamic rationale alignment to enhance consistency, factual accuracy, and user engagement. More >

  • Open Access

    ARTICLE

    Bearing Fault Diagnosis Based on Multimodal Fusion GRU and Swin-Transformer

    Yingyong Zou*, Yu Zhang, Long Li, Tao Liu, Xingkui Zhang

    CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-24, 2026, DOI:10.32604/cmc.2025.068246 - 10 November 2025

    Abstract Fault diagnosis of rolling bearings is crucial for ensuring the stable operation of mechanical equipment and production safety in industrial environments. However, due to the nonlinearity and non-stationarity of collected vibration signals, single-modal methods struggle to capture fault features fully. This paper proposes a rolling bearing fault diagnosis method based on multi-modal information fusion. The method first employs the Hippopotamus Optimization Algorithm (HO) to optimize the number of modes in Variational Mode Decomposition (VMD) to achieve optimal modal decomposition performance. It combines Convolutional Neural Networks (CNN) and Gated Recurrent Units (GRU) to extract temporal features… More >

  • Open Access

    ARTICLE

    CAPGen: An MLLM-Based Framework Integrated with Iterative Optimization Mechanism for Cultural Artifacts Poster Generation

    Qianqian Hu, Chuhan Li, Mohan Zhang, Fang Liu*

    CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-17, 2026, DOI:10.32604/cmc.2025.068225 - 10 November 2025

    Abstract Due to the digital transformation tendency among cultural institutions and the substantial influence of the social media platform, the demands of visual communication keep increasing for promoting traditional cultural artifacts online. As an effective medium, posters serve to attract public attention and facilitate broader engagement with cultural artifacts. However, existing poster generation methods mainly rely on fixed templates and manual design, which limits their scalability and adaptability to the diverse visual and semantic features of the artifacts. Therefore, we propose CAPGen, an automated aesthetic Cultural Artifacts Poster Generation framework built on a Multimodal Large Language More >

  • Open Access

    REVIEW

    Human Behaviour Classification in Emergency Situations Using Machine Learning with Multimodal Data: A Systematic Review (2020–2025)

    Mirza Murad Baig1, Muhammad Rehan Faheem2,*, Lal Khan3,*, Hannan Adeel2, Syed Asim Ali Shah4

    CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.3, pp. 2895-2935, 2025, DOI:10.32604/cmes.2025.073172 - 23 December 2025

    Abstract With growing urban areas, the climate continues to change as a result of growing populations, and hence, the demand for better emergency response systems has become more important than ever. Human Behaviour Classification (HBC) systems have started to play a vital role by analysing data from different sources to detect signs of emergencies. These systems are being used in many critical areas like healthcare, public safety, and disaster management to improve response time and to prepare ahead of time. But detecting human behaviour in such stressful conditions is not simple; it often comes with noisy… More > Graphic Abstract

    Human Behaviour Classification in Emergency Situations Using Machine Learning with Multimodal Data: A Systematic Review (2020–2025)

  • Open Access

    REVIEW

    A Systematic Review of Multimodal Fusion and Explainable AI Applications in Breast Cancer Diagnosis

    Deema Alzamil1,2,*, Bader Alkhamees2, Mohammad Mehedi Hassan2,3

    CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.3, pp. 2971-3027, 2025, DOI:10.32604/cmes.2025.070867 - 23 December 2025

    Abstract Breast cancer diagnosis relies heavily on many kinds of information from diverse sources—like mammogram images, ultrasound scans, patient records, and genetic tests—but most AI tools look at only one of these at a time, which limits their ability to produce accurate and comprehensive decisions. In recent years, multimodal learning has emerged, enabling the integration of heterogeneous data to improve performance and diagnostic accuracy. However, doctors cannot always see how or why these AI tools make their choices, which is a significant bottleneck in their reliability, along with adoption in clinical settings. Hence, people are adding… More >

  • Open Access

    ARTICLE

    Outcomes and Toxicity of Adult Medulloblastoma Treated with Pediatric Multimodal Protocols: A Single-Institution Experience

    Antonio Ruggiero1,2,*, Dario Talloa1, Alberto Romano1, Giorgio Attinà1, Stefano Mastrangelo1,2, Palma Maurizi1,2, Tommaso Verdolotti3, Gianpiero Tamburrini4,5, Silvia Chiesa6, Rina di Bonaventura7, Pier Paolo Mattogno7, Alessandro Olivi7,8, Alessio Albanese7,8

    Oncology Research, Vol.33, No.12, pp. 3855-3867, 2025, DOI:10.32604/or.2025.067948 - 27 November 2025

    Abstract Background: Adult medulloblastoma (MB) represents less than 1% of central nervous system malignancies, lacking standardized therapeutic approaches due to its rarity. This retrospective single-center analysis aimed to assess survival outcomes and treatment-associated toxicities in adult MB patients managed with pediatric-derived protocols. Methods: Eighteen patients (≥18 years) with MB treated at Fondazione Policlinico Universitario Agostino Gemelli Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) (January 1997–January 2024) were analyzed. All received craniospinal radiotherapy with posterior fossa boost, followed by adjuvant chemotherapy utilizing pediatric regimens (PNET3, PNET4, PNET5, or high-risk protocols incorporating high-dose chemotherapy with autologous… More >

  • Open Access

    ARTICLE

    A Multimodal Learning Framework to Reduce Misclassification in GI Tract Disease Diagnosis

    Sadia Fatima1, Fadl Dahan2,*, Jamal Hussain Shah1, Refan Almohamedh2, Mohammed Aloqaily2, Samia Riaz1

    CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.1, pp. 971-994, 2025, DOI:10.32604/cmes.2025.070272 - 30 October 2025

    Abstract The human gastrointestinal (GI) tract is influenced by numerous disorders. If not detected in the early stages, they may result in severe consequences such as organ failure or the development of cancer, and in extreme cases, become life-threatening. Endoscopy is a specialised imaging technique used to examine the GI tract. However, physicians might neglect certain irregular morphologies during the examination due to continuous monitoring of the video recording. Recent advancements in artificial intelligence have led to the development of high-performance AI-based systems, which are optimal for computer-assisted diagnosis. Due to numerous limitations in endoscopic image… More >

  • Open Access

    ARTICLE

    Towards Secure and Efficient Human Fall Detection: Sensor-Visual Fusion via Gramian Angular Field with Federated CNN

    Md Sabir Hossain1, Md Mahfuzur Rahman1,2,*, Mufti Mahmud1,3

    CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.1, pp. 1087-1116, 2025, DOI:10.32604/cmes.2025.068779 - 30 October 2025

    Abstract This article presents a human fall detection system that addresses two critical challenges: privacy preservation and detection accuracy. We propose a comprehensive framework that integrates state-of-the-art machine learning models, multimodal data fusion, federated learning (FL), and Karush-Kuhn-Tucker (KKT)-based resource optimization. The system fuses data from wearable sensors and cameras using Gramian Angular Field (GAF) encoding to capture rich spatial-temporal features. To protect sensitive data, we adopt a privacy-preserving FL setup, where model training occurs locally on client devices without transferring raw data. A custom convolutional neural network (CNN) is designed to extract robust features from More > Graphic Abstract

    Towards Secure and Efficient Human Fall Detection: Sensor-Visual Fusion via Gramian Angular Field with Federated CNN

  • Open Access

    ARTICLE

    GLAMSNet: A Gated-Linear Aspect-Aware Multimodal Sentiment Network with Alignment Supervision and External Knowledge Guidance

    Dan Wang1, Zhoubin Li1, Yuze Xia1,2,*, Zhenhua Yu1,*

    CMC-Computers, Materials & Continua, Vol.85, No.3, pp. 5823-5845, 2025, DOI:10.32604/cmc.2025.071656 - 23 October 2025

    Abstract Multimodal Aspect-Based Sentiment Analysis (MABSA) aims to detect sentiment polarity toward specific aspects by leveraging both textual and visual inputs. However, existing models suffer from weak aspect-image alignment, modality imbalance dominated by textual signals, and limited reasoning for implicit or ambiguous sentiments requiring external knowledge. To address these issues, we propose a unified framework named Gated-Linear Aspect-Aware Multimodal Sentiment Network (GLAMSNet). First of all, an input encoding module is employed to construct modality-specific and aspect-aware representations. Subsequently, we introduce an image–aspect correlation matching module to provide hierarchical supervision for visual-textual alignment. Building upon these components, More >

Displaying 1-10 on page 1 of 130. Per Page