Tech Science Press - Publisher of Open Access Journals

Open Access

ARTICLE

Enhancing Anomaly Detection with Causal Reasoning and Semantic Guidance

Weishan Gao^1,2, Ye Wang^1,2, Xiaoyin Wang^1,2, Xiaochuan Jing^1,2,*

CMC-Computers, Materials & Continua, Vol.86, No.3, 2026, DOI:10.32604/cmc.2025.073850 - 12 January 2026

Abstract In the field of intelligent surveillance, weakly supervised video anomaly detection (WSVAD) has garnered widespread attention as a key technology that identifies anomalous events using only video-level labels. Although multiple instance learning (MIL) has dominated the WSVAD for a long time, its reliance solely on video-level labels without semantic grounding hinders a fine-grained understanding of visually similar yet semantically distinct events. In addition, insufficient temporal modeling obscures causal relationships between events, making anomaly decisions reactive rather than reasoning-based. To overcome the limitations above, this paper proposes an adaptive knowledge-based guidance method that integrates external structured… More >

Open Access

ARTICLE

Deep Retraining Approach for Category-Specific 3D Reconstruction Models from a Single 2D Image

Nour El Houda Kaiber¹, Tahar Mekhaznia¹, Akram Bennour^1,*, Mohammed Al-Sarem^2,3,*, Zakaria Lakhdara⁴, Fahad Ghaban², Mohammad Nassef^5,6

CMC-Computers, Materials & Continua, Vol.86, No.3, 2026, DOI:10.32604/cmc.2025.070337 - 12 January 2026

Abstract The generation of high-quality 3D models from single 2D images remains challenging in terms of accuracy and completeness. Deep learning has emerged as a promising solution, offering new avenues for improvements. However, building models from scratch is computationally expensive and requires large datasets. This paper presents a transfer-learning-based approach for category-specific 3D reconstruction from a single 2D image. The core idea is to fine-tune a pre-trained model on specific object categories using new, unseen data, resulting in specialized versions of the model that are better adapted to reconstruct particular objects. The proposed approach utilizes a… More >

Open Access

ARTICLE

Improving Person Recognition for Single-Person-in-Photos: Intimacy in Photo Collections

Xiaoyi Duan, Tianqi Zou, Chenyang Wang, Yu Gu, Xiuying Li^*

CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-24, 2026, DOI:10.32604/cmc.2025.070683 - 09 December 2025

Abstract Person recognition in photo collections is a critical yet challenging task in computer vision. Previous studies have used social relationships within photo collections to address this issue. However, these methods often fail when performing single-person-in-photos recognition in photo collections, as they cannot rely on social connections for recognition. In this work, we discard social relationships and instead measure the relationships between photos to solve this problem. We designed a new model that includes a multi-parameter attention network for adaptively fusing visual features and a unified formula for measuring photo intimacy. This model effectively recognizes individuals More >

Open Access

ARTICLE

CAFE-GAN: CLIP-Projected GAN with Attention-Aware Generation and Multi-Scale Discrimination

Xuanhong Wang¹, Hongyu Guo¹, Jiazhen Li¹, Mingchen Wang¹, Xian Wang¹, Yijun Zhang^2,*

CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-19, 2026, DOI:10.32604/cmc.2025.069482 - 10 November 2025

Abstract Over the past decade, large-scale pre-trained autoregressive and diffusion models rejuvenated the field of text-guided image generation. However, these models require enormous datasets and parameters, and their multi-step generation processes are often inefficient and difficult to control. To address these challenges, we propose CAFE-GAN, a CLIP-Projected GAN with Attention-Aware Generation and Multi-Scale Discrimination, which incorporates a pre-trained CLIP model along with several key architectural innovations. First, we embed a coordinate attention mechanism into the generator to capture long-range dependencies and enhance feature representation. Second, we introduce a trainable linear projection layer after the CLIP text… More >

Open Access

ARTICLE

Privacy-Preserving Gender-Based Customer Behavior Analytics in Retail Spaces Using Computer Vision

Ginanjar Suwasono Adi¹, Samsul Huda^2,*, Griffani Megiyanto Rahmatullah³, Dodit Suprianto¹, Dinda Qurrota Aini Al-Sefy³, Ivon Sandya Sari Putri⁴, Lalu Tri Wijaya Nata Kusuma⁵

CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-23, 2026, DOI:10.32604/cmc.2025.068619 - 10 November 2025

Abstract In the competitive retail industry of the digital era, data-driven insights into gender-specific customer behavior are essential. They support the optimization of store performance, layout design, product placement, and targeted marketing. However, existing computer vision solutions often rely on facial recognition to gather such insights, raising significant privacy and ethical concerns. To address these issues, this paper presents a privacy-preserving customer analytics system through two key strategies. First, we deploy a deep learning framework using YOLOv9s, trained on the RCA-TVGender dataset. Cameras are positioned perpendicular to observation areas to reduce facial visibility while maintaining accurate More >

Open Access

ARTICLE

An Embedded Computer Vision Approach to Environment Modeling and Local Path Planning in Autonomous Mobile Robots

Rıdvan Yayla, Hakan Üçgün^*, Onur Ali Korkmaz

CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.3, pp. 4055-4087, 2025, DOI:10.32604/cmes.2025.072703 - 23 December 2025

Abstract Recent advancements in autonomous vehicle technologies are transforming intelligent transportation systems. Artificial intelligence enables real-time sensing, decision-making, and control on embedded platforms with improved efficiency. This study presents the design and implementation of an autonomous radio-controlled (RC) vehicle prototype capable of lane line detection, obstacle avoidance, and navigation through dynamic path planning. The system integrates image processing and ultrasonic sensing, utilizing Raspberry Pi for vision-based tasks and Arduino Nano for real-time control. Lane line detection is achieved through conventional image processing techniques, providing the basis for local path generation, while traffic sign classification employs a… More > Graphic Abstract

An Embedded Computer Vision Approach to Environment Modeling and Local Path Planning in Autonomous Mobile Robots

Open Access

ARTICLE

Novel Quantum-Integrated CNN Model for Improved Human Activity Recognition in Smart Surveillance

Tanvir Fatima Naik Bukht^1,2, Yanfeng Wu¹, Nouf Abdullah Almujally³, Shuoa S. AItarbi⁴, Hameedur Rahman², Ahmad Jalal^2,5,*, Hui Liu^1,6,7,*

CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.3, pp. 4013-4036, 2025, DOI:10.32604/cmes.2025.071850 - 23 December 2025

Abstract Human activity recognition (HAR) is crucial in fields like robotics, surveillance, and healthcare, enabling systems to understand and respond to human actions. Current models often struggle with complex datasets, making accurate recognition challenging. This study proposes a quantum-integrated Convolutional Neural Network (QI-CNN) to enhance HAR performance. The traditional models demonstrate weak performance in transferring learned knowledge between diverse complex data collections, including D3D-HOI and Sysu 3D HOI. HAR requires better extraction models and techniques that must address current challenges to achieve improved accuracy and scalability. The model aims to enhance HAR task performance by combining… More >

Open Access

ARTICLE

STPEIC: A Swin Transformer-Based Framework for Interpretable Post-Earthquake Structural Classification

Xinrui Ma, Shizhi Chen^*

Structural Durability & Health Monitoring, Vol.19, No.6, pp. 1745-1767, 2025, DOI:10.32604/sdhm.2025.071148 - 17 November 2025

Abstract The rapid and accurate assessment of structural damage following an earthquake is crucial for effective emergency response and post-disaster recovery. Traditional manual inspection methods are often slow, labor-intensive, and prone to human error. To address these challenges, this study proposes STPEIC (Swin Transformer-based Framework for Interpretable Post-Earthquake Structural Classification), an automated deep learning framework designed for analyzing post-earthquake images. STPEIC performs two key tasks: structural components classification and damage level classification. By leveraging the hierarchical attention mechanisms of the Swin Transformer (Shifted Window Transformer), the model achieves 85.4% accuracy in structural component classification and 85.1% More >

Open Access

ARTICLE

Leveraging Federated Learning for Efficient Privacy-Enhancing Violent Activity Recognition from Videos

Moshiur Rahman Tonmoy¹, Md. Mithun Hossain¹, Mejdl Safran^2,*, Sultan Alfarhood², Dunren Che³, M. F. Mridha⁴

CMC-Computers, Materials & Continua, Vol.85, No.3, pp. 5747-5763, 2025, DOI:10.32604/cmc.2025.067589 - 23 October 2025

Abstract Automated recognition of violent activities from videos is vital for public safety, but often raises significant privacy concerns due to the sensitive nature of the footage. Moreover, resource constraints often hinder the deployment of deep learning-based complex video classification models on edge devices. With this motivation, this study aims to investigate an effective violent activity classifier while minimizing computational complexity, attaining competitive performance, and mitigating user data privacy concerns. We present a lightweight deep learning architecture with fewer parameters for efficient violent activity recognition. We utilize a two-stream formation of 3D depthwise separable convolution coupled More >

Open Access

ARTICLE

Enhanced Fire Detection System for Blind and Visually Challenged People Using Artificial Intelligence with Deep Convolutional Neural Networks

Fahd N. Al-Wesabi^1,*, Hamad Almansour², Huda G. Iskandar^3,4, Ishfaq Yaseen⁵

CMC-Computers, Materials & Continua, Vol.85, No.3, pp. 5765-5787, 2025, DOI:10.32604/cmc.2025.067571 - 23 October 2025

Abstract Earlier notification and fire detection methods provide safety information and fire prevention to blind and visually impaired (BVI) individuals in a limited timeframe in the event of emergencies, particularly in enclosed areas. Fire detection becomes crucial as it directly impacts human safety and the environment. While modern technology requires precise techniques for early detection to prevent damage and loss, few research has focused on artificial intelligence (AI)-based early fire alert systems for BVI individuals in indoor settings. To prevent such fire incidents, it is crucial to identify fires accurately and promptly, and alert BVI personnel… More >

Displaying 1-10 on page 1 of 142. Per Page

View

949

Download

246

View

761

Download

281

View

516

Download

222

View

1035

Download

377

View

1153

Download

388

View

998

Download

433

View

835

Download

265

View

1033

Download

411

View

654

Download

204

View

791

Download

229

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: