Research on Tibetan Speech Recognition Based on the Am-do Dialect

Kuntharrgyal Khysru; Jianguo Wei; Jianwu Dang

doi:10.32604/cmc.2022.027591

Open Access icon Open Access

ARTICLE

Research on Tibetan Speech Recognition Based on the Am-do Dialect

Kuntharrgyal Khysru^1,*, Jianguo Wei^1,2, Jianwu Dang³

1 Key Laboratory of Artificial Intelligence Application Technology State Ethnic Affairs Commission, Qinghai Minzu University, Xining, 810007, China
2 Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin University, Tianjin, 300072, China
3 Japan Advanced Institute of Science and Technology, Ishikawa, Japan

* Corresponding Author: Kuntharrgyal Khysru. Email: email

Computers, Materials & Continua 2022, 73(3), 4897-4907. https://doi.org/10.32604/cmc.2022.027591

Received 21 January 2022; Accepted 30 March 2022; Issue published 28 July 2022

Abstract

In China, Tibetan is usually divided into three major dialects: the Am-do, Khams and Lhasa dialects. The Am-do dialect evolved from ancient Tibetan and is a local variant of modern Tibetan. Although this dialect has its own specific historical and social conditions and development, there have been different degrees of communication with other ethnic groups, but all the abovementioned dialects developed from the same language: Tibetan. This paper uses the particularity of Tibetan suffixes in pronunciation and proposes a lexicon for the Am-do language, which optimizes the problems existing in previous research. Audio data of the Am-do dialect are expanded by data augmentation technology combining noise and reverberation, and the morphological characteristics and characteristics of the Tibetan language are further considered. According to the particularity of Tibetan grammar, grammatical features are used to optimize grammatical relationships and are combined with a language model, and the Am-do dialect is scored and rescored. Experimental results show that compared with the baseline, our proposed new lexicon and data augmentation technology yields a relative increase of approximately 3% in character error rates (CERs) and a relative increase of 3%–19% in the recognition rate of acoustic models and language models.

Keywords

Am-do dialect; acoustic model; language model; rescoring

Cite This Article

APA Style

Khysru, K., Wei, J., Dang, J. (2022). Research on Tibetan Speech Recognition Based on the Am-do Dialect. Computers, Materials & Continua, 73(3), 4897–4907. https://doi.org/10.32604/cmc.2022.027591

Vancouver Style

Khysru K, Wei J, Dang J. Research on Tibetan Speech Recognition Based on the Am-do Dialect. Comput Mater Contin. 2022;73(3):4897–4907. https://doi.org/10.32604/cmc.2022.027591

IEEE Style

K. Khysru, J. Wei, and J. Dang, “Research on Tibetan Speech Recognition Based on the Am-do Dialect,” Comput. Mater. Contin., vol. 73, no. 3, pp. 4897–4907, 2022. https://doi.org/10.32604/cmc.2022.027591

BibTex EndNote RIS

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Research on Tibetan Speech Recognition Based on the Am-do Dialect

Abstract

Keywords

Cite This Article

1964

1277

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link