Feng Yan, Wushouer Silamu, Yanbing Li*
CMC-Computers, Materials & Continua, Vol. 76, No. 1, pp. 65-80, 2023, DOI: 10.32604/cmc.2023.038177
Published: 08 June 2023
Abstract: Visual question answering (VQA) requires a deep understanding of images and their corresponding textual questions in order to answer questions about images accurately. However, existing models tend to focus only on the visual information in an image and ignore its implicit knowledge, which limits how deeply the image content is understood. Images contain more than visual objects: some include textual information about the scene, and more complex images encode relationships between individual visual objects. Firstly, this paper proposes a model that uses image descriptions for feature enhancement. This model encodes…
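As a rough illustration of the caption-based feature-enhancement idea described in the abstract, the following is a minimal sketch, assuming generated image descriptions are encoded with a text encoder and fused with region-level visual features before answer classification. All module names, dimensions, and the fusion scheme are illustrative assumptions, not the authors' implementation.

```python
# A minimal sketch (not the paper's implementation) of caption-based feature
# enhancement for VQA: the image description text is encoded and fused with
# visual region features before predicting an answer.
import torch
import torch.nn as nn

class CaptionEnhancedVQA(nn.Module):
    def __init__(self, vocab_size=10000, hidden=512, num_answers=3129):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        # shared GRU encoder for the question and the generated image description
        self.text_enc = nn.GRU(hidden, hidden, batch_first=True)
        self.vis_proj = nn.Linear(2048, hidden)           # e.g. region features from a detector
        # the question attends over visual regions enriched with caption tokens
        self.fuse = nn.MultiheadAttention(hidden, num_heads=8, batch_first=True)
        self.classifier = nn.Linear(hidden, num_answers)

    def forward(self, question_ids, caption_ids, region_feats):
        _, q = self.text_enc(self.embed(question_ids))     # (1, B, H) question summary
        c, _ = self.text_enc(self.embed(caption_ids))      # (B, Tc, H) caption tokens
        v = self.vis_proj(region_feats)                    # (B, R, H) visual regions
        ctx = torch.cat([v, c], dim=1)                     # visual + textual context
        fused, _ = self.fuse(q.transpose(0, 1), ctx, ctx)  # (B, 1, H) attended features
        return self.classifier(fused.squeeze(1))           # (B, num_answers) answer logits
```

In this sketch the image description would come from an off-the-shelf captioning model, so the textual branch supplies scene-level context (text appearing in the scene, relationships between objects) that raw region features alone may miss.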