Tech Science Press - Publisher of Open Access Journals

News & Announcements

10 October 2024
Tsinghua University Press and Tech Science Press Collaborate to Bring 11 Journals to SciOpen Platform
20 September 2024
Tech Science Press Journals Now Available on PubScholar Platform
14 August 2024
30th ICCES2024 Concludes Successfully in Singapore
13 August 2024
ResearchGate and Tech Science Press announce new Journal Home partnership
03 July 2024
Congenital Heart Disease Journal Strengthens Collaboration at the 4th AAPCHS Annual Meeting, Seoul, South Korea
28 June 2024
SDHM Selected for the Jiangsu Science and Technology Journal Excellence Action Plan

Show export options

Articles
Online

Search Results (5)

Open Access

ARTICLE

Speech Separation Algorithm Using Gated Recurrent Network Based on Microphone Array

Xiaoyan Zhao^1,*, Lin Zhou², Yue Xie¹, Ying Tong¹, Jingang Shi³

Intelligent Automation & Soft Computing, Vol.36, No.3, pp. 3087-3100, 2023, DOI:10.32604/iasc.2023.030180 - 15 March 2023

Abstract Speech separation is an active research topic that plays an important role in numerous applications, such as speaker recognition, hearing prosthesis, and autonomous robots. Many algorithms have been put forward to improve separation performance. However, speech separation in reverberant noisy environment is still a challenging task. To address this, a novel speech separation algorithm using gate recurrent unit (GRU) network based on microphone array has been proposed in this paper. The main aim of the proposed algorithm is to improve the separation performance and reduce the computational cost. The proposed algorithm extracts the sub-band steered… More >

View
966

Download
653
Open Access

ARTICLE

Microphone Array-Based Sound Source Localization Using Convolutional Residual Network

Ziyi Wang¹, Xiaoyan Zhao^1,*, Hongjun Rong¹, Ying Tong¹, Jingang Shi²

Journal of New Media, Vol.4, No.3, pp. 145-153, 2022, DOI:10.32604/jnm.2022.030178 - 13 June 2022

Abstract Microphone array-based sound source localization (SSL) is widely used in a variety of occasions such as video conferencing, robotic hearing, speech enhancement, speech recognition and so on. The traditional SSL methods cannot achieve satisfactory performance in adverse noisy and reverberant environments. In order to improve localization performance, a novel SSL algorithm using convolutional residual network (CRN) is proposed in this paper. The spatial features including time difference of arrivals (TDOAs) between microphone pairs and steered response power-phase transform (SRP-PHAT) spatial spectrum are extracted in each Gammatone sub-band. The spatial features of different sub-bands with a… More >

View
1328

Download
871
Open Access

ARTICLE

Robust Sound Source Localization Using Convolutional Neural Network Based on Microphone Array

Xiaoyan Zhao^1,*, Lin Zhou², Ying Tong¹, Yuxiao Qi¹, Jingang Shi³

Intelligent Automation & Soft Computing, Vol.30, No.1, pp. 361-371, 2021, DOI:10.32604/iasc.2021.018823 - 26 July 2021

Abstract In order to improve the performance of microphone array-based sound source localization (SSL), a robust SSL algorithm using convolutional neural network (CNN) is proposed in this paper. The Gammatone sub-band steered response power-phase transform (SRP-PHAT) spatial spectrum is adopted as the localization cue due to its feature correlation of consecutive sub-bands. Since CNN has the “weight sharing” characteristics and the advantage of processing tensor data, it is adopted to extract spatial location information from the localization cues. The Gammatone sub-band SRP-PHAT spatial spectrum are calculated through the microphone signals decomposed in frequency domain by Gammatone… More >

View
1736

Download
1161
Open Access

ARTICLE

Microphone Array Speech Separation Algorithm Based on TC-ResNet

Lin Zhou^1,*, Yue Xu¹, Tianyi Wang¹, Kun Feng¹, Jingang Shi²

CMC-Computers, Materials & Continua, Vol.69, No.2, pp. 2705-2716, 2021, DOI:10.32604/cmc.2021.017080 - 21 July 2021

Abstract Traditional separation methods have limited ability to handle the speech separation problem in high reverberant and low signal-to-noise ratio (SNR) environments, and thus achieve unsatisfactory results. In this study, a convolutional neural network with temporal convolution and residual network (TC-ResNet) is proposed to realize speech separation in a complex acoustic environment. A simplified steered-response power phase transform, denoted as GSRP-PHAT, is employed to reduce the computational cost. The extracted features are reshaped to a special tensor as the system inputs and implements temporal convolution, which not only enlarges the receptive field of the convolution layer More >

View
2077

Download
1364

Like
1
Open Access

ARTICLE

Sound Source Localization Based on SRP-PHAT Spatial Spectrum and Deep Neural Network

Xiaoyan Zhao^{1, *}, Shuwen Chen², Lin Zhou³, Ying Chen^{3, 4}

CMC-Computers, Materials & Continua, Vol.64, No.1, pp. 253-271, 2020, DOI:10.32604/cmc.2020.09848 - 20 May 2020

Abstract Microphone array-based sound source localization (SSL) is a challenging task in adverse acoustic scenarios. To address this, a novel SSL algorithm based on deep neural network (DNN) using steered response power-phase transform (SRP-PHAT) spatial spectrum as input feature is presented in this paper. Since the SRP-PHAT spatial power spectrum contains spatial location information, it is adopted as the input feature for sound source localization. DNN is exploited to extract the efficient location information from SRP-PHAT spatial power spectrum due to its advantage on extracting high-level features. SRP-PHAT at each steering position within a frame is More >

View
3110

Download
1524

Cited by
4

Displaying 1-10 on page 1 of 5. Per Page

Speech Separation Algorithm Using Gated Recurrent Network Based on Microphone Array

View

966

Download

653

Microphone Array-Based Sound Source Localization Using Convolutional Residual Network

View

1328

Download

871

Robust Sound Source Localization Using Convolutional Neural Network Based on Microphone Array

View

1736

Download

1161

Microphone Array Speech Separation Algorithm Based on TC-ResNet

View

2077

Download

1364

Like

1

Sound Source Localization Based on SRP-PHAT Spatial Spectrum and Deep Neural Network

View

3110

Download

1524

Cited by

4

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: