Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition

Zhijie Wang; Yue Zhao; Licheng Wu; Xiaojun Bi; Zhuoma Dawa; Qiang Ji

doi:10.32604/cmc.2022.027092

Open Access icon Open Access

ARTICLE

Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition

Zhijie Wang¹, Yue Zhao^1,*, Licheng Wu¹, Xiaojun Bi¹, Zhuoma Dawa², Qiang Ji³

1 School of Information Engineering, Minzu University of China, Beijing, 100081, China
2 School of Chinese Ethnic Minority Languages and Literatures, Minzu University of China, Beijing, 100081, China
3 Department of Electrical, Computer, and Systems Engineering, Rensselaer Polytechnic Institute, Troy, NY 12180-3590, USA

* Corresponding Author: Yue Zhao. Email: email

Computers, Materials & Continua 2022, 73(1), 629-639. https://doi.org/10.32604/cmc.2022.027092

Received 10 January 2022; Accepted 30 March 2022; Issue published 18 May 2022

Abstract

As one of Chinese minority languages, Tibetan speech recognition technology was not researched upon as extensively as Chinese and English were until recently. This, along with the relatively small Tibetan corpus, has resulted in an unsatisfying performance of Tibetan speech recognition based on an end-to-end model. This paper aims to achieve an accurate Tibetan speech recognition using a small amount of Tibetan training data. We demonstrate effective methods of Tibetan end-to-end speech recognition via cross-language transfer learning from three aspects: modeling unit selection, transfer learning method, and source language selection. Experimental results show that the Chinese-Tibetan multi-language learning method using multi-language character set as the modeling unit yields the best performance on Tibetan Character Error Rate (CER) at 27.3%, which is reduced by 26.1% compared to the language-specific model. And our method also achieves the 2.2% higher accuracy using less amount of data compared with the method using Tibetan multi-dialect transfer learning under the same model structure and data set.

Keywords

Cross-language transfer learning; low-resource language; modeling unit; Tibetan speech recognition

Cite This Article

APA Style

Wang, Z., Zhao, Y., Wu, L., Bi, X., Dawa, Z. et al. (2022). Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition. Computers, Materials & Continua, 73(1), 629–639. https://doi.org/10.32604/cmc.2022.027092

Vancouver Style

Wang Z, Zhao Y, Wu L, Bi X, Dawa Z, Ji Q. Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition. Comput Mater Contin. 2022;73(1):629–639. https://doi.org/10.32604/cmc.2022.027092

IEEE Style

Z. Wang, Y. Zhao, L. Wu, X. Bi, Z. Dawa, and Q. Ji, “Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition,” Comput. Mater. Contin., vol. 73, no. 1, pp. 629–639, 2022. https://doi.org/10.32604/cmc.2022.027092

BibTex EndNote RIS

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition

Abstract

Keywords

Cite This Article

2169

1713

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link