Search Results (132)
  • Open Access

    ARTICLE

    A Novel Unified Framework for Automated Generation and Multimodal Validation of UML Diagrams

    Van-Viet Nguyen1, Huu-Khanh Nguyen2, Kim-Son Nguyen1, Thi Minh-Hue Luong1, Duc-Quang Vu1, Trung-Nghia Phung3, The-Vinh Nguyen1,*

    CMES-Computer Modeling in Engineering & Sciences, Vol.146, No.1, 2026, DOI:10.32604/cmes.2025.075442 - 29 January 2026

    Abstract It remains difficult to automate the creation and validation of Unified Modeling Language (UML) diagrams due to unstructured requirements, limited automated pipelines, and the lack of reliable evaluation methods. This study introduces a cohesive architecture that amalgamates requirement development, UML synthesis, and multimodal validation. First, LLaMA-3.2-1B-Instruct was utilized to generate user-focused requirements. Then, DeepSeek-R1-Distill-Qwen-32B applies its reasoning skills to transform these requirements into PlantUML code. Using this dual-LLM pipeline, we constructed a synthetic dataset of 11,997 UML diagrams spanning six major diagram families. Rendering analysis showed that 89.5% of the generated diagrams compile correctly, while…

  • Open Access

    ARTICLE

    A Dual-Stream Framework for Landslide Segmentation with Cross-Attention Enhancement and Gated Multimodal Fusion

    Md Minhazul Islam1,2, Yunfei Yin1,2,*, Md Tanvir Islam1,2, Zheng Yuan1,2, Argho Dey1,2

    CMC-Computers, Materials & Continua, Vol.86, No.3, 2026, DOI:10.32604/cmc.2025.072550 - 12 January 2026

    Abstract Automatic segmentation of landslides from remote sensing imagery is challenging because traditional machine learning and early CNN-based models often fail to generalize across heterogeneous landscapes, where segmentation maps contain sparse and fragmented landslide regions under diverse geographical conditions. To address these issues, we propose a lightweight dual-stream siamese deep learning framework that integrates optical and topographical data fusion with an adaptive decoder, guided multimodal fusion, and deep supervision. The framework is built upon the synergistic combination of cross-attention, gated fusion, and sub-pixel upsampling within a unified dual-stream architecture specifically optimized for landslide segmentation, enabling efficient…

  • Open Access

    ARTICLE

    A Multimodal Sentiment Analysis Method Based on Multi-Granularity Guided Fusion

    Zilin Zhang1, Yan Liu1,*, Jia Liu2, Senbao Hou3, Yuping Zhang1, Chenyuan Wang1

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-14, 2026, DOI:10.32604/cmc.2025.072286 - 09 December 2025

    Abstract With the growing demand for more comprehensive and nuanced sentiment understanding, Multimodal Sentiment Analysis (MSA) has gained significant traction in recent years and continues to attract widespread attention in the academic community. Despite notable advances, existing approaches still face critical challenges in both information modeling and modality fusion. On one hand, many current methods rely heavily on encoders to extract global features from each modality, which limits their ability to capture latent fine-grained emotional cues within modalities. On the other hand, prevailing fusion strategies often lack mechanisms to model semantic discrepancies across modalities and to…

  • Open Access

    ARTICLE

    MultiAgent-CoT: A Multi-Agent Chain-of-Thought Reasoning Model for Robust Multimodal Dialogue Understanding

    Ans D. Alghamdi*

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-35, 2026, DOI:10.32604/cmc.2025.071210 - 09 December 2025

    Abstract Multimodal dialogue systems often fail to maintain coherent reasoning over extended conversations and suffer from hallucination due to limited context modeling capabilities. Current approaches struggle with cross-modal alignment, temporal consistency, and robust handling of noisy or incomplete inputs across multiple modalities. We propose MultiAgent-Chain of Thought (CoT), a novel multi-agent chain-of-thought reasoning framework where specialized agents for text, vision, and speech modalities collaboratively construct shared reasoning traces through inter-agent message passing and consensus voting mechanisms. Our architecture incorporates self-reflection modules, conflict resolution protocols, and dynamic rationale alignment to enhance consistency, factual accuracy, and user engagement.

  • Open Access

    ARTICLE

    Bearing Fault Diagnosis Based on Multimodal Fusion GRU and Swin-Transformer

    Yingyong Zou*, Yu Zhang, Long Li, Tao Liu, Xingkui Zhang

    CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-24, 2026, DOI:10.32604/cmc.2025.068246 - 10 November 2025

    Abstract Fault diagnosis of rolling bearings is crucial for ensuring the stable operation of mechanical equipment and production safety in industrial environments. However, due to the nonlinearity and non-stationarity of collected vibration signals, single-modal methods struggle to capture fault features fully. This paper proposes a rolling bearing fault diagnosis method based on multi-modal information fusion. The method first employs the Hippopotamus Optimization Algorithm (HO) to optimize the number of modes in Variational Mode Decomposition (VMD) to achieve optimal modal decomposition performance. It combines Convolutional Neural Networks (CNN) and Gated Recurrent Units (GRU) to extract temporal features…

  • Open Access

    ARTICLE

    CAPGen: An MLLM-Based Framework Integrated with Iterative Optimization Mechanism for Cultural Artifacts Poster Generation

    Qianqian Hu, Chuhan Li, Mohan Zhang, Fang Liu*

    CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-17, 2026, DOI:10.32604/cmc.2025.068225 - 10 November 2025

    Abstract Due to the digital transformation tendency among cultural institutions and the substantial influence of the social media platform, the demands of visual communication keep increasing for promoting traditional cultural artifacts online. As an effective medium, posters serve to attract public attention and facilitate broader engagement with cultural artifacts. However, existing poster generation methods mainly rely on fixed templates and manual design, which limits their scalability and adaptability to the diverse visual and semantic features of the artifacts. Therefore, we propose CAPGen, an automated aesthetic Cultural Artifacts Poster Generation framework built on a Multimodal Large Language…

  • Open Access

    REVIEW

    Human Behaviour Classification in Emergency Situations Using Machine Learning with Multimodal Data: A Systematic Review (2020–2025)

    Mirza Murad Baig1, Muhammad Rehan Faheem2,*, Lal Khan3,*, Hannan Adeel2, Syed Asim Ali Shah4

    CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.3, pp. 2895-2935, 2025, DOI:10.32604/cmes.2025.073172 - 23 December 2025

    Abstract With growing urban areas, the climate continues to change as a result of growing populations, and hence, the demand for better emergency response systems has become more important than ever. Human Behaviour Classification (HBC) systems have started to play a vital role by analysing data from different sources to detect signs of emergencies. These systems are being used in many critical areas like healthcare, public safety, and disaster management to improve response time and to prepare ahead of time. But detecting human behaviour in such stressful conditions is not simple; it often comes with noisy…

  • Open Access

    REVIEW

    A Systematic Review of Multimodal Fusion and Explainable AI Applications in Breast Cancer Diagnosis

    Deema Alzamil1,2,*, Bader Alkhamees2, Mohammad Mehedi Hassan2,3

    CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.3, pp. 2971-3027, 2025, DOI:10.32604/cmes.2025.070867 - 23 December 2025

    Abstract Breast cancer diagnosis relies heavily on many kinds of information from diverse sources—like mammogram images, ultrasound scans, patient records, and genetic tests—but most AI tools look at only one of these at a time, which limits their ability to produce accurate and comprehensive decisions. In recent years, multimodal learning has emerged, enabling the integration of heterogeneous data to improve performance and diagnostic accuracy. However, doctors cannot always see how or why these AI tools make their choices, which is a significant bottleneck in their reliability, along with adoption in clinical settings. Hence, people are adding…

  • Open Access

    ARTICLE

    Outcomes and Toxicity of Adult Medulloblastoma Treated with Pediatric Multimodal Protocols: A Single-Institution Experience

    Antonio Ruggiero1,2,*, Dario Talloa1, Alberto Romano1, Giorgio Attinà1, Stefano Mastrangelo1,2, Palma Maurizi1,2, Tommaso Verdolotti3, Gianpiero Tamburrini4,5, Silvia Chiesa6, Rina di Bonaventura7, Pier Paolo Mattogno7, Alessandro Olivi7,8, Alessio Albanese7,8

    Oncology Research, Vol.33, No.12, pp. 3855-3867, 2025, DOI:10.32604/or.2025.067948 - 27 November 2025

    Abstract Background: Adult medulloblastoma (MB) represents less than 1% of central nervous system malignancies, lacking standardized therapeutic approaches due to its rarity. This retrospective single-center analysis aimed to assess survival outcomes and treatment-associated toxicities in adult MB patients managed with pediatric-derived protocols. Methods: Eighteen patients (≥18 years) with MB treated at Fondazione Policlinico Universitario Agostino Gemelli Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) (January 1997–January 2024) were analyzed. All received craniospinal radiotherapy with posterior fossa boost, followed by adjuvant chemotherapy utilizing pediatric regimens (PNET3, PNET4, PNET5, or high-risk protocols incorporating high-dose chemotherapy with autologous…

  • Open Access

    ARTICLE

    A Multimodal Learning Framework to Reduce Misclassification in GI Tract Disease Diagnosis

    Sadia Fatima1, Fadl Dahan2,*, Jamal Hussain Shah1, Refan Almohamedh2, Mohammed Aloqaily2, Samia Riaz1

    CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.1, pp. 971-994, 2025, DOI:10.32604/cmes.2025.070272 - 30 October 2025

    Abstract The human gastrointestinal (GI) tract is influenced by numerous disorders. If not detected in the early stages, they may result in severe consequences such as organ failure or the development of cancer, and in extreme cases, become life-threatening. Endoscopy is a specialised imaging technique used to examine the GI tract. However, physicians might neglect certain irregular morphologies during the examination due to continuous monitoring of the video recording. Recent advancements in artificial intelligence have led to the development of high-performance AI-based systems, which are optimal for computer-assisted diagnosis. Due to numerous limitations in endoscopic image…

Displaying 1-10 on page 1 of 132.