Search Results (2)
  • Open Access

    ARTICLE

    Enhancing Cross-Lingual Image Description: A Multimodal Approach for Semantic Relevance and Stylistic Alignment

    Emran Al-Buraihy, Dan Wang*

    CMC-Computers, Materials & Continua, Vol.79, No.3, pp. 3913-3938, 2024, DOI:10.32604/cmc.2024.048104

    Abstract: Cross-lingual image description, the task of generating image captions in a target language from images and descriptions in a source language, is addressed in this study through a novel approach that combines neural network models and semantic matching techniques. Experiments conducted on the Flickr8k and AraImg2k benchmark datasets, featuring images and descriptions in English and Arabic, showcase remarkable performance improvements over state-of-the-art methods. Our model, equipped with the Image & Cross-Language Semantic Matching module and the Target Language Domain Evaluation module, significantly enhances the semantic relevance of generated image descriptions. For English-to-Arabic and Arabic-to-English cross-language…

  • Open Access

    ARTICLE

    Enhancing Image Description Generation through Deep Reinforcement Learning: Fusing Multiple Visual Features and Reward Mechanisms

    Yan Li, Qiyuan Wang*, Kaidi Jia

    CMC-Computers, Materials & Continua, Vol.78, No.2, pp. 2469-2489, 2024, DOI:10.32604/cmc.2024.047822

    Abstract: The image description task lies at the intersection of computer vision and natural language processing and has important applications, including helping computers understand images and making visual information accessible to the visually impaired. This study presents an innovative approach employing deep reinforcement learning to enhance the accuracy of natural language descriptions of images. Our method focuses on refining the reward function in deep reinforcement learning, facilitating the generation of precise descriptions by aligning visual and textual features more closely. Our approach comprises three key architectures. Firstly, it utilizes Residual Network 101 (ResNet-101) and Faster Region-based Convolutional Neural Network…
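
The sketch below is an illustrative reading of the approach described in the second abstract, not the authors' code: it extracts a global visual feature with ResNet-101 and applies a reward-weighted (REINFORCE-style, baseline-subtracted) caption loss, the general mechanism behind reward refinement in RL-based captioning. The module names, reward values, and the use of PyTorch/torchvision are assumptions made for illustration only.

# Illustrative sketch only -- not the authors' implementation. All names
# and values are hypothetical.
import torch
import torch.nn as nn
import torchvision.models as models

class VisualEncoder(nn.Module):
    """ResNet-101 backbone with the classification head removed."""
    def __init__(self):
        super().__init__()
        # weights=None keeps the sketch self-contained; in practice one would
        # load pretrained weights (models.ResNet101_Weights.DEFAULT).
        backbone = models.resnet101(weights=None)
        self.features = nn.Sequential(*list(backbone.children())[:-1])

    def forward(self, images):            # images: (B, 3, 224, 224)
        feats = self.features(images)     # (B, 2048, 1, 1) after avg-pool
        return feats.flatten(1)           # (B, 2048) global visual feature

def reward_weighted_loss(log_probs, sampled_rewards, baseline_rewards):
    """REINFORCE-style loss with a baseline: sampled captions whose reward
    (e.g. CIDEr) beats the baseline are reinforced, others discouraged."""
    advantage = sampled_rewards - baseline_rewards
    return -(advantage.detach() * log_probs).mean()

if __name__ == "__main__":
    encoder = VisualEncoder().eval()
    with torch.no_grad():
        feats = encoder(torch.randn(2, 3, 224, 224))
    print(feats.shape)                    # torch.Size([2, 2048])

    # Placeholder log-probabilities and rewards for two sampled captions.
    loss = reward_weighted_loss(torch.randn(2, requires_grad=True),
                                torch.tensor([0.8, 0.5]),
                                torch.tensor([0.6, 0.6]))
    print(loss.item())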
