Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode

Yue Zhao; Jianjian Yue; Wei Song; Xiaona Xu; Xiali Li; Licheng Wu; Qiang Ji

doi:10.32604/jiot.2019.05866

Open Access icon Open Access

ARTICLE

Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode

Yue Zhao¹, Jianjian Yue¹, Wei Song^1,*, Xiaona Xu¹, Xiali Li¹, Licheng Wu¹, Qiang Ji²

1 School of Information and Engineering, Minzu University of China , Beijing, 100081, China.
2 Rensselaer Polytechnic Institute, 110 Eighth Street, Troy NY 12180-3590, USA.

*Corresponding Author: Wei Song. Email: email .

Journal on Internet of Things 2019, 1(1), 17-23. https://doi.org/10.32604/jiot.2019.05866

Download PDF

Abstract

We proposed a method using latent regression Bayesian network (LRBN) to extract the shared speech feature for the input of end-to-end speech recognition model. The structure of LRBN is compact and its parameter learning is fast. Compared with Convolutional Neural Network, it has a simpler and understood structure and less parameters to learn. Experimental results show that the advantage of hybrid LRBN/Bidirectional Long Short-Term Memory-Connectionist Temporal Classification architecture for Tibetan multi-dialect speech recognition, and demonstrate the LRBN is helpful to differentiate among multiple language speech sets.

Keywords

Multi-dialect speech recognition, Tibetan language, latent regression bayesian network, end-to-end model

Cite This Article

APA Style

Zhao, Y., Yue, J., Song, W., Xu, X., Li, X. et al. (2019). Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode. Journal on Internet of Things, 1(1), 17–23. https://doi.org/10.32604/jiot.2019.05866

Vancouver Style

Zhao Y, Yue J, Song W, Xu X, Li X, Wu L, et al. Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode. J Internet Things. 2019;1(1):17–23. https://doi.org/10.32604/jiot.2019.05866

IEEE Style

Y. Zhao et al., “Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode,” J. Internet Things, vol. 1, no. 1, pp. 17–23, 2019. https://doi.org/10.32604/jiot.2019.05866

BibTex EndNote RIS

Copyright © 2019 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode

Abstract

Keywords

Cite This Article

3826

1948

0

Further Information

Guidelines

Follow Us

Join Us

Share Link