Hao Wu1,*, Arun Kumar Sangaiah2
Intelligent Automation & Soft Computing, Vol.28, No.1, pp. 121-132, 2021, DOI:10.32604/iasc.2021.016457
- 17 March 2021
Abstract In oral English teaching in China, teachers usually improve students’ pronunciation by their subjective judgment. Even to the same student, the teacher gives different suggestions at different times. Students’ oral pronunciation features can be obtained from the reconstructed acoustic and natural language features of speech audio, but the task is complicated due to the embedding of multimodal sentences. To solve this problem, this paper proposes an English speech recognition based on enhanced temporal convolution network. Firstly, a suitable UNet network model is designed to extract the noise of speech signal and achieve the purpose of… More >