Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (8)
  • Open Access


    Improved Blending Attention Mechanism in Visual Question Answering

    Siyu Lu1, Yueming Ding1, Zhengtong Yin2, Mingzhe Liu3,*, Xuan Liu4, Wenfeng Zheng1,*, Lirong Yin5

    Computer Systems Science and Engineering, Vol.47, No.1, pp. 1149-1161, 2023, DOI:10.32604/csse.2023.038598

    Abstract Visual question answering (VQA) has attracted more and more attention in computer vision and natural language processing. Scholars are committed to studying how to better integrate image features and text features to achieve better results in VQA tasks. Analysis of all features may cause information redundancy and heavy computational burden. Attention mechanism is a wise way to solve this problem. However, using single attention mechanism may cause incomplete concern of features. This paper improves the attention mechanism method and proposes a hybrid attention mechanism that combines the spatial attention mechanism method and the channel attention mechanism method. In the case… More >

  • Open Access


    Crack Segmentation Based on Fusing Multi-Scale Wavelet and Spatial-Channel Attention

    Peng Geng*, Ji Lu, Hongtao Ma, Guiyi Yang

    Structural Durability & Health Monitoring, Vol.17, No.1, pp. 1-22, 2023, DOI:10.32604/sdhm.2023.018632

    Abstract Accurate and reliable crack segmentation is a challenge and meaningful task. In this article, aiming at the characteristics of cracks on the concrete images, the intensity frequency information of source images which is obtained by Discrete Wavelet Transform (DWT) is fed into deep learning-based networks to enhance the ability of network on crack segmentation. To well integrate frequency information into network an effective and novel DWTA module based on the DWT and scSE attention mechanism is proposed. The semantic information of cracks is enhanced and the irrelevant information is suppressed by DWTA module. And the gap between frequency information and… More >

  • Open Access


    Facial Expression Recognition Based on Multi-Channel Attention Residual Network

    Tongping Shen1,2,*, Huanqing Xu1

    CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.1, pp. 539-560, 2023, DOI:10.32604/cmes.2022.022312

    Abstract For the problems of complex model structure and too many training parameters in facial expression recognition algorithms, we proposed a residual network structure with a multi-headed channel attention (MCA) module. The migration learning algorithm is used to pre-train the convolutional layer parameters and mitigate the overfitting caused by the insufficient number of training samples. The designed MCA module is integrated into the ResNet18 backbone network. The attention mechanism highlights important information and suppresses irrelevant information by assigning different coefficients or weights, and the multi-head structure focuses more on the local features of the pictures, which improves the efficiency of facial… More >

  • Open Access


    Disease Recognition of Apple Leaf Using Lightweight Multi-Scale Network with ECANet

    Helong Yu, Xianhe Cheng, Ziqing Li, Qi Cai, Chunguang Bi*

    CMES-Computer Modeling in Engineering & Sciences, Vol.132, No.3, pp. 711-738, 2022, DOI:10.32604/cmes.2022.020263

    Abstract To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks, a lightweight ResNet (LW-ResNet) model for apple disease recognition is proposed. Based on the deep residual network (ResNet18), the multi-scale feature extraction layer is constructed by group convolution to realize the compression model and improve the extraction ability of different sizes of lesion features. By improving the identity mapping structure to reduce information loss. By introducing the efficient channel attention module (ECANet) to suppress noise from a complex background. The experimental results show that the average… More >

  • Open Access


    Image Inpainting Detection Based on High-Pass Filter Attention Network

    Can Xiao1,2, Feng Li1,2,*, Dengyong Zhang1,2, Pu Huang1,2, Xiangling Ding3, Victor S. Sheng4

    Computer Systems Science and Engineering, Vol.43, No.3, pp. 1145-1154, 2022, DOI:10.32604/csse.2022.027249

    Abstract Image inpainting based on deep learning has been greatly improved. The original purpose of image inpainting was to repair some broken photos, such as inpainting artifacts. However, it may also be used for malicious operations, such as destroying evidence. Therefore, detection and localization of image inpainting operations are essential. Recent research shows that high-pass filtering full convolutional network (HPFCN) is applied to image inpainting detection and achieves good results. However, those methods did not consider the spatial location and channel information of the feature map. To solve these shortcomings, we introduce the squeezed excitation blocks (SE) and propose a high-pass… More >

  • Open Access


    Mixed Attention Densely Residual Network for Single Image Super-Resolution

    Jingjun Zhou1,2, Jing Liu3, Jingbing Li1,2,*, Mengxing Huang1,2, Jieren Cheng4, Yen-Wei Chen5, Yingying Xu3,6, Saqib Ali Nawaz1

    Computer Systems Science and Engineering, Vol.39, No.1, pp. 133-146, 2021, DOI:10.32604/csse.2021.016633

    Abstract Recent applications of convolutional neural networks (CNNs) in single image super-resolution (SISR) have achieved unprecedented performance. However, existing CNN-based SISR network structure design consider mostly only channel or spatial information, and cannot make full use of both channel and spatial information to improve SISR performance further. The present work addresses this problem by proposing a mixed attention densely residual network architecture that can make full and simultaneous use of both channel and spatial information. Specifically, we propose a residual in dense network structure composed of dense connections between multiple dense residual groups to form a very deep network. This structure… More >

  • Open Access


    PGCA-Net: Progressively Aggregating Hierarchical Features with the Pyramid Guided Channel Attention for Saliency Detection

    Jiajie Mai1, Xuemiao Xu2,*, Guorong Xiao3, Zijun Deng2, Jiaxing Chen2

    Intelligent Automation & Soft Computing, Vol.26, No.4, pp. 847-855, 2020, DOI:10.32604/iasc.2020.010119

    Abstract The Salient object detection aims to segment out the most visually distinctive objects in an image, which is a challenging task in computer vision. In this paper, we present the PGCA-Net equipped with the pyramid guided channel attention fusion block (PGCAFB) for the saliency detection task. Given an input image, the hierarchical features are extracted using a deep convolutional neural network (DCNN), then starting from the highest-level semantic features, we stage-by-stage restore the spatial saliency details by aggregating the lowerlevel detailed features. Since for the weak discriminative ability of the shallow detailed features, directly introducing them to the semantic features… More >

  • Open Access


    A Multi-View Gait Recognition Method Using Deep Convolutional Neural Network and Channel Attention Mechanism

    Jiabin Wang*, Kai Peng

    CMES-Computer Modeling in Engineering & Sciences, Vol.125, No.1, pp. 345-363, 2020, DOI:10.32604/cmes.2020.011046

    Abstract In many existing multi-view gait recognition methods based on images or video sequences, gait sequences are usually used to superimpose and synthesize images and construct energy-like template. However, information may be lost during the process of compositing image and capture EMG signals. Errors and the recognition accuracy may be introduced and affected respectively by some factors such as period detection. To better solve the problems, a multi-view gait recognition method using deep convolutional neural network and channel attention mechanism is proposed. Firstly, the sliding time window method is used to capture EMG signals. Then, the back-propagation learning algorithm is used… More >

Displaying 1-10 on page 1 of 8. Per Page  

Share Link