Open AccessOpen Access

ARTICLE

An Optimal Method for Speech Recognition Based on Neural Network

Mohamad Khairi Ishak1, Dag Øivind Madsen2,*, Fahad Ahmed Al-Zahrani3

1 School of Electrical and Electronic Engineering, Universiti Sains Malaysia, Nibong Tebal, 14300, Malaysia
2 University of South-Eastern Norway, Bredalsveien 14, 3511, Hønefoss, Norway
3 Computer Engineering Department, Umm Al-Qura University, Mecca, 24381, Saudi Arabia

* Corresponding Author: Dag Øivind Madsen. Email:

Intelligent Automation & Soft Computing 2023, 36(2), 1951-1961. https://doi.org/10.32604/iasc.2023.033971

Abstract

Natural language processing technologies have become more widely available in recent years, making them more useful in everyday situations. Machine learning systems that employ accessible datasets and corporate work to serve the whole spectrum of problems addressed in computational linguistics have lately yielded a number of promising breakthroughs. These methods were particularly advantageous for regional languages, as they were provided with cutting-edge language processing tools as soon as the requisite corporate information was generated. The bulk of modern people are unconcerned about the importance of reading. Reading aloud, on the other hand, is an effective technique for nourishing feelings as well as a necessary skill in the learning process. This paper proposed a novel approach for speech recognition based on neural networks. The attention mechanism is first utilized to determine the speech accuracy and fluency assessments, with the spectrum map as the feature extraction input. To increase phoneme identification accuracy, reading precision, for example, employs a new type of deep speech. It makes use of the exportchapter tool, which provides a corpus, as well as the TensorFlow framework in the experimental setting. The experimental findings reveal that the suggested model can more effectively assess spoken speech accuracy and reading fluency than the old model, and its evaluation model’s score outcomes are more accurate.

Keywords


Cite This Article

M. K. Ishak, D. . Madsen and F. A. Al-Zahrani, "An optimal method for speech recognition based on neural network," Intelligent Automation & Soft Computing, vol. 36, no.2, pp. 1951–1961, 2023.



This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 387

    View

  • 199

    Download

  • 0

    Like

Share Link