Tibetan Multi-Dialect Speech and Dialect Identity Recognition

Yue Zhao; Jianjian Yue; Wei Song; Xiaona Xu; Xiali Li; Licheng Wu; Qiang Ji

doi:10.32604/cmc.2019.05636

ARTICLE

Yue Zhao¹, Jianjian Yue¹, Wei Song¹,*, Xiaona Xu¹, Xiali Li¹, Licheng Wu¹, Qiang Ji²

1 School of Information and Engineering, Minzu University of China, Beijing, 100081, China.
2 Rensselaer Polytechnic Institute, JEC 7004, Troy NY 12180-3590, USA.

* Corresponding Author: Wei Song. Email: email .

Computers, Materials & Continua 2019, 60(3), 1223-1235. https://doi.org/10.32604/cmc.2019.05636

Abstract

Tibetan remains a low-resource language for conventional automatic speech recognition: some dialects lack sufficient data, sub-word units, lexicons, and word inventories. Moreover, most prior work has treated speech content recognition and dialect classification as two independent tasks and modeled them separately, even though the two tasks are highly correlated. In this paper, we present a multi-task WaveNet model that performs Tibetan multi-dialect speech recognition and dialect identification simultaneously. The model avoids building a pronunciation dictionary and performing word segmentation for new dialects, and it allows speech recognition and dialect identification to be trained in a single model. Experimental results show that our method can recognize speech content across different Tibetan dialects and identify the dialect with high accuracy using a unified model. Including dialect information in the output during training improves multi-dialect speech recognition accuracy, and low-resource dialects achieve a higher speech content recognition rate and dialect classification accuracy with the multi-dialect, multi-task model than with task-specific models.
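As a rough illustration of how a single output sequence could carry both speech content and dialect identity, the sketch below appends a dialect tag to the transcript units used as the training target, then splits a decoded sequence back into the two parts. The abstract does not specify how the dialect information is encoded, so the tag format, its position at the end of the sequence, and the names `DIALECT_TAGS`, `build_target`, and `split_prediction` are all assumptions made for illustration, not the authors' implementation.

```python
# Hypothetical dialect tags: the abstract does not name the dialects
# or describe the tag format, so these are placeholders.
DIALECT_TAGS = {"dialect_a": "<A>", "dialect_b": "<B>"}


def build_target(transcript_units, dialect):
    """Return the training target: transcript units followed by a dialect tag.

    Placing the tag at the end is an assumption made for this sketch.
    """
    return list(transcript_units) + [DIALECT_TAGS[dialect]]


def split_prediction(predicted_units):
    """Split a decoded sequence into (content units, dialect name or None)."""
    tag_to_dialect = {tag: name for name, tag in DIALECT_TAGS.items()}
    content = [u for u in predicted_units if u not in tag_to_dialect]
    found = [tag_to_dialect[u] for u in predicted_units if u in tag_to_dialect]
    # If several tags were decoded, keep the last one; None if no tag appears.
    dialect = found[-1] if found else None
    return content, dialect


if __name__ == "__main__":
    target = build_target(["ka", "kha"], "dialect_a")
    print(target)
    print(split_prediction(target))
```

With a target built this way, one sequence model can be trained on both tasks at once, and a new dialect would only require a new tag rather than a separate classifier.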

Keywords

Tibetan multi-dialect speech recognition, dialect identification, multi-task learning, WaveNet model.

Cite This Article

APA Style

Zhao, Y., Yue, J., Song, W., Xu, X., Li, X. et al. (2019). Tibetan Multi-Dialect Speech and Dialect Identity Recognition. Computers, Materials & Continua, 60(3), 1223–1235. https://doi.org/10.32604/cmc.2019.05636

Vancouver Style

Zhao Y, Yue J, Song W, Xu X, Li X, Wu L, et al. Tibetan Multi-Dialect Speech and Dialect Identity Recognition. Comput Mater Contin. 2019;60(3):1223–1235. https://doi.org/10.32604/cmc.2019.05636

IEEE Style

Y. Zhao et al., “Tibetan Multi-Dialect Speech and Dialect Identity Recognition,” Comput. Mater. Contin., vol. 60, no. 3, pp. 1223–1235, 2019. https://doi.org/10.32604/cmc.2019.05636


Copyright © 2019 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
