Microphone Array-Based Sound Source Localization Using Convolutional Residual Network

Ziyi Wang; Xiaoyan Zhao; Hongjun Rong; Ying Tong; Jingang Shi

doi:10.32604/jnm.2022.030178

Ziyi Wang¹, Xiaoyan Zhao¹,*, Hongjun Rong¹, Ying Tong¹, Jingang Shi²

¹ School of Information and Communication Engineering, Nanjing Institute of Technology, Nanjing, 211167, China
² University of Oulu, Oulu, 90014, FI, Finland

* Corresponding Author: Xiaoyan Zhao. Email: email

Journal of New Media 2022, 4(3), 145-153. https://doi.org/10.32604/jnm.2022.030178

Received 20 March 2022; Accepted 15 April 2022; Issue published 13 June 2022

Abstract

Microphone array-based sound source localization (SSL) is widely used in applications such as video conferencing, robotic hearing, speech enhancement, and speech recognition. Traditional SSL methods cannot achieve satisfactory performance in adverse noisy and reverberant environments. To improve localization performance, this paper proposes a novel SSL algorithm using a convolutional residual network (CRN). Spatial features, including the time differences of arrival (TDOAs) between microphone pairs and the steered response power-phase transform (SRP-PHAT) spatial spectrum, are extracted in each Gammatone sub-band. The spatial features of the different sub-bands within a frame are combined into a feature matrix that serves as the input of the CRN. The proposed algorithm employs the CRN to fuse the spatial features. Because the CRN introduces a residual structure on top of the convolutional network, it reduces the difficulty of training and accelerates the convergence of the model. A CRN model is learned from training data in various reverberation and noise environments to establish the mapping between the input features and the sound azimuth. Simulation results show that, compared with methods using a traditional deep neural network, the proposed algorithm achieves better localization performance in the SSL task and generalizes better to untrained noise and reverberation conditions.
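
To make the TDOA feature concrete, the following is a minimal sketch of estimating the delay between one microphone pair. It assumes a GCC-PHAT estimator, which is a common choice but is not confirmed by the abstract, and it works on the full band rather than per Gammatone sub-band. The function names `dft` and `gcc_phat_tdoa` and the synthetic test signal are hypothetical, introduced only for illustration.

```python
import cmath
import random


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform, adequate for short frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def gcc_phat_tdoa(x1, x2, max_lag):
    """Return the lag in samples by which x2 is delayed relative to x1.

    Assumption: GCC-PHAT is used here as an illustrative TDOA estimator.
    """
    n = len(x1) + len(x2)  # zero-pad so circular correlation does not wrap
    X1 = dft(list(x1) + [0.0] * (n - len(x1)))
    X2 = dft(list(x2) + [0.0] * (n - len(x2)))
    weighted = []
    for a, b in zip(X1, X2):
        c = b * a.conjugate()          # cross-power spectrum
        mag = abs(c)
        # PHAT weighting: keep only the phase of each frequency bin.
        weighted.append(c / mag if mag > 1e-12 else 0j)
    cc = [v.real for v in dft(weighted, inverse=True)]
    # Negative lags live at the end of the circular correlation.
    return max(range(-max_lag, max_lag + 1), key=lambda lag: cc[lag % n])


# Synthetic example: the same noise burst reaches microphone 2 three samples late.
random.seed(0)
delay = 3
source = [random.gauss(0.0, 1.0) for _ in range(32)]
x1 = source + [0.0] * delay
x2 = [0.0] * delay + source
estimated = gcc_phat_tdoa(x1, x2, max_lag=8)
print("estimated delay in samples:", estimated)
```

Dividing the estimated lag by the sampling rate gives the TDOA in seconds. A feature extractor along the lines described in the abstract would repeat such an estimate for every microphone pair and sub-band before stacking the results into the feature matrix.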

Keywords

Convolutional residual network; microphone array; spatial features; sound source localization

Cite This Article

APA Style

Wang, Z., Zhao, X., Rong, H., Tong, Y., Shi, J. (2022). Microphone Array-Based Sound Source Localization Using Convolutional Residual Network. Journal of New Media, 4(3), 145–153. https://doi.org/10.32604/jnm.2022.030178

Vancouver Style

Wang Z, Zhao X, Rong H, Tong Y, Shi J. Microphone Array-Based Sound Source Localization Using Convolutional Residual Network. J New Media. 2022;4(3):145–153. https://doi.org/10.32604/jnm.2022.030178

IEEE Style

Z. Wang, X. Zhao, H. Rong, Y. Tong, and J. Shi, “Microphone Array-Based Sound Source Localization Using Convolutional Residual Network,” J. New Media, vol. 4, no. 3, pp. 145–153, 2022. https://doi.org/10.32604/jnm.2022.030178

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
