Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network

S. Girirajan; A. Pandian

doi:10.32604/iasc.2023.028090

Open Access icon Open Access

ARTICLE

Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network

S. Girirajan, A. Pandian^*

Department of Computer Science and Engineering, School of Computing, SRM Institute of Science and Engineering, Kattankulathur, Tamil Nadu, India

* Corresponding Author: A. Pandian. Email: email

Intelligent Automation & Soft Computing 2023, 35(2), 1987-2001. https://doi.org/10.32604/iasc.2023.028090

Received 02 February 2022; Accepted 01 April 2022; Issue published 19 July 2022

Abstract

Speech enhancement is the task of taking a noisy speech input and producing an enhanced speech output. In recent years, the need for speech enhancement has been increased due to challenges that occurred in various applications such as hearing aids, Automatic Speech Recognition (ASR), and mobile speech communication systems. Most of the Speech Enhancement research work has been carried out for English, Chinese, and other European languages. Only a few research works involve speech enhancement in Indian regional Languages. In this paper, we propose a two-fold architecture to perform speech enhancement for Tamil speech signal based on convolutional recurrent neural network (CRN) that addresses the speech enhancement in a real-time single channel or track of sound created by the speaker. In the first stage mask based long short-term memory (LSTM) is used for noise suppression along with loss function and in the second stage, Convolutional Encoder-Decoder (CED) is used for speech restoration. The proposed model is evaluated on various speaker and noisy environments like Babble noise, car noise, and white Gaussian noise. The proposed CRN model improves speech quality by 0.1 points when compared with the LSTM base model and also CRN requires fewer parameters for training. The performance of the proposed model is outstanding even in low Signal to Noise Ratio (SNR).

Keywords

Speech enhancement; convolutional encoder-decoder; long short-term memory; noise suppression; speech restoration

Cite This Article

APA Style

Girirajan, S., Pandian, A. (2023). Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network. Intelligent Automation & Soft Computing, 35(2), 1987–2001. https://doi.org/10.32604/iasc.2023.028090

Vancouver Style

Girirajan S, Pandian A. Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network. Intell Automat Soft Comput. 2023;35(2):1987–2001. https://doi.org/10.32604/iasc.2023.028090

IEEE Style

S. Girirajan and A. Pandian, “Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network,” Intell. Automat. Soft Comput., vol. 35, no. 2, pp. 1987–2001, 2023. https://doi.org/10.32604/iasc.2023.028090

BibTex EndNote RIS

Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network

Abstract

Keywords

Cite This Article

2077

1013

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link