Open Access iconOpen Access

ARTICLE

Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network

by S. Girirajan, A. Pandian*

Department of Computer Science and Engineering, School of Computing, SRM Institute of Science and Engineering, Kattankulathur, Tamil Nadu, India

* Corresponding Author: A. Pandian. Email: email

Intelligent Automation & Soft Computing 2023, 35(2), 1987-2001. https://doi.org/10.32604/iasc.2023.028090

Abstract

Speech enhancement is the task of taking a noisy speech input and producing an enhanced speech output. In recent years, the need for speech enhancement has been increased due to challenges that occurred in various applications such as hearing aids, Automatic Speech Recognition (ASR), and mobile speech communication systems. Most of the Speech Enhancement research work has been carried out for English, Chinese, and other European languages. Only a few research works involve speech enhancement in Indian regional Languages. In this paper, we propose a two-fold architecture to perform speech enhancement for Tamil speech signal based on convolutional recurrent neural network (CRN) that addresses the speech enhancement in a real-time single channel or track of sound created by the speaker. In the first stage mask based long short-term memory (LSTM) is used for noise suppression along with loss function and in the second stage, Convolutional Encoder-Decoder (CED) is used for speech restoration. The proposed model is evaluated on various speaker and noisy environments like Babble noise, car noise, and white Gaussian noise. The proposed CRN model improves speech quality by 0.1 points when compared with the LSTM base model and also CRN requires fewer parameters for training. The performance of the proposed model is outstanding even in low Signal to Noise Ratio (SNR).

Keywords


Cite This Article

APA Style
Girirajan, S., Pandian, A. (2023). Real-time speech enhancement based on convolutional recurrent neural network. Intelligent Automation & Soft Computing, 35(2), 1987-2001. https://doi.org/10.32604/iasc.2023.028090
Vancouver Style
Girirajan S, Pandian A. Real-time speech enhancement based on convolutional recurrent neural network. Intell Automat Soft Comput . 2023;35(2):1987-2001 https://doi.org/10.32604/iasc.2023.028090
IEEE Style
S. Girirajan and A. Pandian, “Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network,” Intell. Automat. Soft Comput. , vol. 35, no. 2, pp. 1987-2001, 2023. https://doi.org/10.32604/iasc.2023.028090



cc Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1241

    View

  • 615

    Download

  • 0

    Like

Share Link