Tech Science Press - Publisher of Open Access Journals

Open Access

ARTICLE

A Novel Unified Framework for Automated Generation and Multimodal Validation of UML Diagrams

Van-Viet Nguyen¹, Huu-Khanh Nguyen², Kim-Son Nguyen¹, Thi Minh-Hue Luong¹, Duc-Quang Vu¹, Trung-Nghia Phung³, The-Vinh Nguyen^1,*

CMES-Computer Modeling in Engineering & Sciences, Vol.146, No.1, 2026, DOI:10.32604/cmes.2025.075442 - 29 January 2026

Abstract It remains difficult to automate the creation and validation of Unified Modeling Language (UML) diagrams due to unstructured requirements, limited automated pipelines, and the lack of reliable evaluation methods. This study introduces a cohesive architecture that amalgamates requirement development, UML synthesis, and multimodal validation. First, LLaMA-3.2-1B-Instruct was utilized to generate user-focused requirements. Then, DeepSeek-R1-Distill-Qwen-32B applies its reasoning skills to transform these requirements into PlantUML code. Using this dual-LLM pipeline, we constructed a synthetic dataset of 11,997 UML diagrams spanning six major diagram families. Rendering analysis showed that 89.5% of the generated diagrams compile correctly, while… More >

Open Access

ARTICLE

A Dual-Stream Framework for Landslide Segmentation with Cross-Attention Enhancement and Gated Multimodal Fusion

Md Minhazul Islam^1,2, Yunfei Yin^1,2,*, Md Tanvir Islam^1,2, Zheng Yuan^1,2, Argho Dey^1,2

CMC-Computers, Materials & Continua, Vol.86, No.3, 2026, DOI:10.32604/cmc.2025.072550 - 12 January 2026

Abstract Automatic segmentation of landslides from remote sensing imagery is challenging because traditional machine learning and early CNN-based models often fail to generalize across heterogeneous landscapes, where segmentation maps contain sparse and fragmented landslide regions under diverse geographical conditions. To address these issues, we propose a lightweight dual-stream siamese deep learning framework that integrates optical and topographical data fusion with an adaptive decoder, guided multimodal fusion, and deep supervision. The framework is built upon the synergistic combination of cross-attention, gated fusion, and sub-pixel upsampling within a unified dual-stream architecture specifically optimized for landslide segmentation, enabling efficient… More >

Open Access

ARTICLE

A Multimodal Sentiment Analysis Method Based on Multi-Granularity Guided Fusion

Zilin Zhang¹, Yan Liu^1,*, Jia Liu², Senbao Hou³, Yuping Zhang¹, Chenyuan Wang¹

CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-14, 2026, DOI:10.32604/cmc.2025.072286 - 09 December 2025

Abstract With the growing demand for more comprehensive and nuanced sentiment understanding, Multimodal Sentiment Analysis (MSA) has gained significant traction in recent years and continues to attract widespread attention in the academic community. Despite notable advances, existing approaches still face critical challenges in both information modeling and modality fusion. On one hand, many current methods rely heavily on encoders to extract global features from each modality, which limits their ability to capture latent fine-grained emotional cues within modalities. On the other hand, prevailing fusion strategies often lack mechanisms to model semantic discrepancies across modalities and to… More >

Open Access

ARTICLE

MultiAgent-CoT: A Multi-Agent Chain-of-Thought Reasoning Model for Robust Multimodal Dialogue Understanding

Ans D. Alghamdi^*

CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-35, 2026, DOI:10.32604/cmc.2025.071210 - 09 December 2025

Abstract Multimodal dialogue systems often fail to maintain coherent reasoning over extended conversations and suffer from hallucination due to limited context modeling capabilities. Current approaches struggle with cross-modal alignment, temporal consistency, and robust handling of noisy or incomplete inputs across multiple modalities. We propose MultiAgent-Chain of Thought (CoT), a novel multi-agent chain-of-thought reasoning framework where specialized agents for text, vision, and speech modalities collaboratively construct shared reasoning traces through inter-agent message passing and consensus voting mechanisms. Our architecture incorporates self-reflection modules, conflict resolution protocols, and dynamic rationale alignment to enhance consistency, factual accuracy, and user engagement. More >

Open Access

ARTICLE

Bearing Fault Diagnosis Based on Multimodal Fusion GRU and Swin-Transformer

Yingyong Zou^*, Yu Zhang, Long Li, Tao Liu, Xingkui Zhang

CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-24, 2026, DOI:10.32604/cmc.2025.068246 - 10 November 2025

Abstract Fault diagnosis of rolling bearings is crucial for ensuring the stable operation of mechanical equipment and production safety in industrial environments. However, due to the nonlinearity and non-stationarity of collected vibration signals, single-modal methods struggle to capture fault features fully. This paper proposes a rolling bearing fault diagnosis method based on multi-modal information fusion. The method first employs the Hippopotamus Optimization Algorithm (HO) to optimize the number of modes in Variational Mode Decomposition (VMD) to achieve optimal modal decomposition performance. It combines Convolutional Neural Networks (CNN) and Gated Recurrent Units (GRU) to extract temporal features… More >

Open Access

ARTICLE

CAPGen: An MLLM-Based Framework Integrated with Iterative Optimization Mechanism for Cultural Artifacts Poster Generation

Qianqian Hu, Chuhan Li, Mohan Zhang, Fang Liu^*

CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-17, 2026, DOI:10.32604/cmc.2025.068225 - 10 November 2025

Abstract Due to the digital transformation tendency among cultural institutions and the substantial influence of the social media platform, the demands of visual communication keep increasing for promoting traditional cultural artifacts online. As an effective medium, posters serve to attract public attention and facilitate broader engagement with cultural artifacts. However, existing poster generation methods mainly rely on fixed templates and manual design, which limits their scalability and adaptability to the diverse visual and semantic features of the artifacts. Therefore, we propose CAPGen, an automated aesthetic Cultural Artifacts Poster Generation framework built on a Multimodal Large Language More >

Human Behaviour Classification in Emergency Situations Using Machine Learning with Multimodal Data: A Systematic Review (2020–2025)

Mirza Murad Baig¹, Muhammad Rehan Faheem^2,*, Lal Khan^3,*, Hannan Adeel², Syed Asim Ali Shah⁴

CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.3, pp. 2895-2935, 2025, DOI:10.32604/cmes.2025.073172 - 23 December 2025

Abstract With growing urban areas, the climate continues to change as a result of growing populations, and hence, the demand for better emergency response systems has become more important than ever. Human Behaviour Classification (HBC) systems have started to play a vital role by analysing data from different sources to detect signs of emergencies. These systems are being used in many critical areas like healthcare, public safety, and disaster management to improve response time and to prepare ahead of time. But detecting human behaviour in such stressful conditions is not simple; it often comes with noisy… More > Graphic Abstract

Human Behaviour Classification in Emergency Situations Using Machine Learning with Multimodal Data: A Systematic Review (2020–2025)

A Systematic Review of Multimodal Fusion and Explainable AI Applications in Breast Cancer Diagnosis

Deema Alzamil^1,2,*, Bader Alkhamees², Mohammad Mehedi Hassan^2,3

CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.3, pp. 2971-3027, 2025, DOI:10.32604/cmes.2025.070867 - 23 December 2025

Abstract Breast cancer diagnosis relies heavily on many kinds of information from diverse sources—like mammogram images, ultrasound scans, patient records, and genetic tests—but most AI tools look at only one of these at a time, which limits their ability to produce accurate and comprehensive decisions. In recent years, multimodal learning has emerged, enabling the integration of heterogeneous data to improve performance and diagnostic accuracy. However, doctors cannot always see how or why these AI tools make their choices, which is a significant bottleneck in their reliability, along with adoption in clinical settings. Hence, people are adding… More >

Open Access

ARTICLE

Outcomes and Toxicity of Adult Medulloblastoma Treated with Pediatric Multimodal Protocols: A Single-Institution Experience

Antonio Ruggiero^1,2,*, Dario Talloa¹, Alberto Romano¹, Giorgio Attinà¹, Stefano Mastrangelo^1,2, Palma Maurizi^1,2, Tommaso Verdolotti³, Gianpiero Tamburrini^4,5, Silvia Chiesa⁶, Rina di Bonaventura⁷, Pier Paolo Mattogno⁷, Alessandro Olivi^7,8, Alessio Albanese^7,8

Oncology Research, Vol.33, No.12, pp. 3855-3867, 2025, DOI:10.32604/or.2025.067948 - 27 November 2025

Abstract Background: Adult medulloblastoma (MB) represents less than 1% of central nervous system malignancies, lacking standardized therapeutic approaches due to its rarity. This retrospective single-center analysis aimed to assess survival outcomes and treatment-associated toxicities in adult MB patients managed with pediatric-derived protocols. Methods: Eighteen patients (≥18 years) with MB treated at Fondazione Policlinico Universitario Agostino Gemelli Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) (January 1997–January 2024) were analyzed. All received craniospinal radiotherapy with posterior fossa boost, followed by adjuvant chemotherapy utilizing pediatric regimens (PNET3, PNET4, PNET5, or high-risk protocols incorporating high-dose chemotherapy with autologous… More >

Open Access

ARTICLE

A Multimodal Learning Framework to Reduce Misclassification in GI Tract Disease Diagnosis

Sadia Fatima¹, Fadl Dahan^2,*, Jamal Hussain Shah¹, Refan Almohamedh², Mohammed Aloqaily², Samia Riaz¹

CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.1, pp. 971-994, 2025, DOI:10.32604/cmes.2025.070272 - 30 October 2025

Abstract The human gastrointestinal (GI) tract is influenced by numerous disorders. If not detected in the early stages, they may result in severe consequences such as organ failure or the development of cancer, and in extreme cases, become life-threatening. Endoscopy is a specialised imaging technique used to examine the GI tract. However, physicians might neglect certain irregular morphologies during the examination due to continuous monitoring of the video recording. Recent advancements in artificial intelligence have led to the development of high-performance AI-based systems, which are optimal for computer-assisted diagnosis. Due to numerous limitations in endoscopic image… More >

Displaying 1-10 on page 1 of 132. Per Page

View

612

Download

174

View

927

Download

325

View

826

Download

337

View

713

Download

322

View

2005

Download

571

View

768

Download

241

View

1474

Download

671

View

765

Download

328

View

851

Download

284

View

826

Download

334

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: