Open Access iconOpen Access

ARTICLE

crossmark

Improving Low-Resource Machine Translation Using Reinforcement Learning from Human Feedback

by Liqing Wang*, Yiheng Xiao

School of Information Science and Engineering, Yunnan University, Kunming, 650500, China

* Corresponding Author: Liqing Wang. Email: email

Intelligent Automation & Soft Computing 2024, 39(4), 619-631. https://doi.org/10.32604/iasc.2024.052971

Abstract

Neural Machine Translation is one of the key research directions in Natural Language Processing. However, limited by the scale and quality of parallel corpus, the translation quality of low-resource Neural Machine Translation has always been unsatisfactory. When Reinforcement Learning from Human Feedback (RLHF) is applied to low-resource machine translation, commonly encountered issues of substandard preference data quality and the higher cost associated with manual feedback data. Therefore, a more cost-effective method for obtaining feedback data is proposed. At first, optimizing the quality of preference data through the prompt engineering of the Large Language Model (LLM), then combining human feedback to complete the evaluation. In this way, the reward model could acquire more semantic information and human preferences during the training phase, thereby enhancing feedback efficiency and the result’s quality. Experimental results demonstrate that compared with the traditional RLHF method, our method has been proven effective on multiple datasets and exhibits a notable improvement of 1.07 in BLUE. Meanwhile, it is also more favorably received in the assessments conducted by human evaluators and GPT-4o.

Keywords


Cite This Article

APA Style
Wang, L., Xiao, Y. (2024). Improving low-resource machine translation using reinforcement learning from human feedback. Intelligent Automation & Soft Computing, 39(4), 619–631. https://doi.org/10.32604/iasc.2024.052971
Vancouver Style
Wang L, Xiao Y. Improving low-resource machine translation using reinforcement learning from human feedback. Intell Automat Soft Comput. 2024;39(4):619–631. https://doi.org/10.32604/iasc.2024.052971
IEEE Style
L. Wang and Y. Xiao, “Improving Low-Resource Machine Translation Using Reinforcement Learning from Human Feedback,” Intell. Automat. Soft Comput., vol. 39, no. 4, pp. 619–631, 2024. https://doi.org/10.32604/iasc.2024.052971



cc Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 545

    View

  • 239

    Download

  • 0

    Like

Share Link