
Search Results (3)
  • Open Access

    ARTICLE

    Improving VQA via Dual-Level Feature Embedding Network

    Yaru Song*, Huahu Xu, Dikai Fang

    Intelligent Automation & Soft Computing, Vol.39, No.3, pp. 397-416, 2024, DOI:10.32604/iasc.2023.040521 - 11 July 2024

    Abstract Visual Question Answering (VQA) has sparked widespread interest as a crucial task in integrating vision and language. VQA primarily uses attention mechanisms to associate relevant visual regions with input questions and thereby answer them effectively. Detection-based features extracted by an object detection network capture the visual attention distribution over predetermined detection boxes and provide object-level insight, so questions about foreground objects are answered more effectively. However, they cannot answer questions about the background, which has no detection boxes, because they lack fine-grained detail; this is the advantage of grid-based features. In…

  • Open Access

    ARTICLE

    MVCE-Net: Multi-View Region Feature and Caption Enhancement Co-Attention Network for Visual Question Answering

    Feng Yan, Wushouer Silamu, Yanbing Li*

    CMC-Computers, Materials & Continua, Vol.76, No.1, pp. 65-80, 2023, DOI:10.32604/cmc.2023.038177 - 08 June 2023

    Abstract Visual question answering (VQA) requires a deep understanding of images and their corresponding textual questions in order to answer questions about images more accurately. However, existing models tend to ignore the implicit knowledge in images and focus only on their visual information, which limits how deeply the image content is understood. Images contain more than just visual objects: some contain textual information about the scene, and slightly more complex images contain relationships between individual visual objects. Firstly, this paper proposes a model that uses image descriptions for feature enhancement. This model encodes…

  • Open Access

    ARTICLE

    Improved Blending Attention Mechanism in Visual Question Answering

    Siyu Lu, Yueming Ding, Zhengtong Yin, Mingzhe Liu*, Xuan Liu, Wenfeng Zheng*, Lirong Yin

    Computer Systems Science and Engineering, Vol.47, No.1, pp. 1149-1161, 2023, DOI:10.32604/csse.2023.038598 - 26 May 2023

    Abstract Visual question answering (VQA) has attracted more and more attention in computer vision and natural language processing. Scholars are committed to studying how to better integrate image features and text features to achieve better results in VQA tasks. Analyzing all features may cause information redundancy and a heavy computational burden, and an attention mechanism is an effective way to address this. However, a single attention mechanism may attend to features incompletely. This paper improves on existing attention methods and proposes a hybrid attention mechanism that combines the spatial attention mechanism and the channel attention …
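    The hybrid spatial/channel attention idea in this abstract can be illustrated with a simplified sketch. The code below is not the paper's implementation; it is a minimal CBAM-style example in NumPy, assuming channel attention via global average pooling plus a small bottleneck MLP (hypothetical weights `w1`, `w2`) and spatial attention via channel-wise average and max pooling, each producing sigmoid gates that rescale the feature map.

    ```python
    import numpy as np

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    def channel_attention(feat, w1, w2):
        # feat: (C, H, W). Global average pooling over spatial dims -> (C,)
        pooled = feat.mean(axis=(1, 2))
        # Two-layer bottleneck MLP yields one gate per channel in (0, 1)
        gate = sigmoid(w2 @ np.maximum(w1 @ pooled, 0.0))
        return feat * gate[:, None, None]

    def spatial_attention(feat):
        # Pool across channels to an (H, W) saliency map, then gate each location
        avg_map = feat.mean(axis=0)
        max_map = feat.max(axis=0)
        gate = sigmoid(avg_map + max_map)
        return feat * gate[None, :, :]

    def hybrid_attention(feat, w1, w2):
        # Channel attention first, then spatial attention (CBAM-style ordering)
        return spatial_attention(channel_attention(feat, w1, w2))

    rng = np.random.default_rng(0)
    C, H, W, R = 8, 4, 4, 2            # channels, height, width, bottleneck size
    feat = rng.standard_normal((C, H, W))
    w1 = rng.standard_normal((R, C))   # hypothetical bottleneck weights
    w2 = rng.standard_normal((C, R))
    out = hybrid_attention(feat, w1, w2)
    print(out.shape)  # (8, 4, 4) -- attention preserves the feature map shape
    ```

    Because both gates lie in (0, 1), the module only rescales activations and never changes the feature map's shape, which is what lets such attention blocks be dropped between existing layers of a VQA encoder.
    
    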
