Ziyi Wang¹, Xiaoyan Zhao¹,*, Hongjun Rong¹, Ying Tong¹, Jingang Shi²
Journal of New Media, Vol.4, No.3, pp. 145-153, 2022, DOI:10.32604/jnm.2022.030178
- 13 June 2022
Abstract Microphone array-based sound source localization (SSL) is widely used in applications such as video conferencing, robotic hearing, speech enhancement, and speech recognition. Traditional SSL methods cannot achieve satisfactory performance in noisy and reverberant environments. To improve localization performance, this paper proposes a novel SSL algorithm using a convolutional residual network (CRN). Spatial features, including the time differences of arrival (TDOAs) between microphone pairs and the steered response power-phase transform (SRP-PHAT) spatial spectrum, are extracted in each Gammatone sub-band. The spatial features of different sub-bands with a…
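
To make the TDOA feature mentioned in the abstract concrete, the following is a minimal sketch of a delay estimate between one microphone pair using generalized cross-correlation with PHAT weighting. It is not the authors' implementation: it works on the full band rather than on Gammatone sub-bands, and the function name `gcc_phat_tdoa` and its parameters `fs` and `max_tau` are hypothetical names chosen for illustration.

```python
import numpy as np


def gcc_phat_tdoa(x1, x2, fs, max_tau=None):
    """Estimate the delay of x2 relative to x1 in seconds.

    A positive value means x2 lags x1. Full-band sketch only; the
    sub-band (Gammatone) processing described in the abstract is
    not reproduced here.
    """
    # Zero-pad so the circular correlation behaves like a linear one.
    n = len(x1) + len(x2)
    X1 = np.fft.rfft(x1, n=n)
    X2 = np.fft.rfft(x2, n=n)

    # Cross-power spectrum with PHAT weighting (keep phase, drop magnitude).
    cross = X2 * np.conj(X1)
    cross /= np.abs(cross) + 1e-12
    cc = np.fft.irfft(cross, n=n)

    # Restrict the search to physically plausible lags if max_tau is given.
    max_shift = (n - 1) // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = int(np.argmax(cc)) - max_shift
    return lag / fs


# Usage: a white-noise signal delayed by 5 samples (assumed 16 kHz sampling).
rng = np.random.default_rng(0)
fs = 16000
delay_samples = 5
x1 = rng.standard_normal(1024)
x2 = np.concatenate((np.zeros(delay_samples), x1[:-delay_samples]))
tau = gcc_phat_tdoa(x1, x2, fs)
```

In a multi-microphone array, one such estimate per microphone pair would form a TDOA feature vector. How those features are combined with the SRP-PHAT spatial spectrum and fed to the CRN is not specified in the visible part of the abstract.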