Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (2)
  • Open Access

    ARTICLE

    Three-Dimensional Model Classification Based on VIT-GE and Voting Mechanism

    Fang Yuan, Xueyao Gao*, Chunxiang Zhang

    CMC-Computers, Materials & Continua, Vol.85, No.3, pp. 5037-5055, 2025, DOI:10.32604/cmc.2025.067760 - 23 October 2025

    Abstract 3D model classification has emerged as a significant research focus in computer vision. However, traditional convolutional neural networks (CNNs) often struggle to capture global dependencies across both height and width dimensions simultaneously, leading to limited feature representation capabilities when handling complex visual tasks. To address this challenge, we propose a novel 3D model classification network named ViT-GE (Vision Transformer with Global and Efficient Attention), which integrates Global Grouped Coordinate Attention (GGCA) and Efficient Channel Attention (ECA) mechanisms. Specifically, the Vision Transformer (ViT) is employed to extract comprehensive global features from multi-view inputs using its self-attention More >

  • Open Access

    ARTICLE

    A Real-Time Semantic Segmentation Method Based on Transformer for Autonomous Driving

    Weiyu Hao1, Jingyi Wang2, Huimin Lu3,*

    CMC-Computers, Materials & Continua, Vol.81, No.3, pp. 4419-4433, 2024, DOI:10.32604/cmc.2024.055478 - 19 December 2024

    Abstract While traditional Convolutional Neural Network (CNN)-based semantic segmentation methods have proven effective, they often encounter significant computational challenges due to the requirement for dense pixel-level predictions, which complicates real-time implementation. To address this, we introduce an advanced real-time semantic segmentation strategy specifically designed for autonomous driving, utilizing the capabilities of Visual Transformers. By leveraging the self-attention mechanism inherent in Visual Transformers, our method enhances global contextual awareness, refining the representation of each pixel in relation to the overall scene. This enhancement is critical for quickly and accurately interpreting the complex elements within driving scenarios—a fundamental… More >

Displaying 1-10 on page 1 of 2. Per Page