Multi-source Location of Speech Signal Based on Improved DSB Method

doi:10.3778/j.issn.1002-8331.1910-0123

Abstract

Abstract:

Delay and Sum Beamforming（DSB） is widely used in angle of arrival estimation of microphone array signals. However, this method is not ideal for azimuth estimation of multiple speech signal sources due to the problem of grid lobe in speech signal sources. In addition, in the actual complex environment, it is affected by noise and reverberation, which makes azimuth recognition more difficult. In order to solve these problems, an improved DSB method is proposed, which combines signal frequency and microphone array spacing to select the frequency points in the sub-segment, and then weighs the data covariance matrix. At the same time, experiments are carried out in the simulation and actual environment, and the results show that, compared with the unimproved DSB method, the computational complexity of this method is reduced to 18.37%, and the amount of computation is effectively reduced. In the simulation experiment, the average angle positioning deviation is reduced by 27.3%, 21.4% and 36%, respectively, under different reflection coefficients of 0.2, 0.4 and 0.6, respectively. In the actual environmental experiment, the maximum azimuth angle estimation deviation is 9° and the minimum azimuth angle estimation deviation is 1.35°, which is lower than 12.1° and 3° of the unimproved algorithm.

Key words: delay and sum, beamforming, microphone array, covariance matrix, multi-source localization

摘要：

延迟求和波束形成（DSB）在麦克风阵列信号到达角估计上有着广泛应用，然而在语音信号源下由于栅瓣等问题使得该方法对多个语音信号源方位估计不理想，此外，在实际复杂环境下，该方法受噪声混响影响，方位识别更加困难。针对这些问题，提出一种改进的DSB方法，联合信号频率及麦克风阵列间距对子段内的频点进行选择，之后对数据协方差矩阵加权处理。同时在仿真及实际环境下进行实验，结果表明，与未改进DSB方法相比，该方法计算量降低为原来的18.37%，有效地降低了运算量；仿真实验中在不同反射系数0.2、0.4、0.6下，平均角度定位偏差分别降低了27.3%、21.4%、36%；实际环境实验方位角度估计偏差最大值为9°、最低为1.35°，要低于未改进算法的12.1°和3°。

关键词: 延迟求和, 波束形成, 麦克风阵列, 协方差矩阵, 多声源定位

WANG Jie, HUANG Lixia, ZHANG Xueying. Multi-source Location of Speech Signal Based on Improved DSB Method[J]. Computer Engineering and Applications, 2021, 57(1): 173-180.

王杰，黄丽霞，张雪英. 改进DSB方法的语音信号多声源定位[J]. 计算机工程与应用, 2021, 57(1): 173-180.

[1]	ZOU Jie, LI Jun. Multi-strategy Covariance Matrix Learning Differential Evolution Algorithm [J]. Computer Engineering and Applications, 2021, 57(7): 78-87.
[2]	ZHANG Suisui, HUANG Lixia, WANG Jie, ZHANG Xueying. Localization of Sound Source with Classification of Cross-Correlation Function Within Microphone Array [J]. Computer Engineering and Applications, 2020, 56(4): 128-133.
[3]	CHI Zongzheng, DONG Shaozheng, GUO Tong, REN Zhilei, ZHOU Kuanjiu, GUO He. Research on Wind Farm Layout Based on Hyper-Heuristic [J]. Computer Engineering and Applications, 2019, 55(7): 220-225.
[4]	DU Tingting, WEN Guoqiu, WU Lin, TONG Tao, TAN Malong. Spectral Clustering Algorithm Based on Local Covariance Matrix [J]. Computer Engineering and Applications, 2019, 55(14): 148-154.
[5]	WANG Haiyan1, TONG Qi1, LIAN Zhipeng2, JI Qingbo1. K-L transform optimization algorithm for measurement matrix [J]. Computer Engineering and Applications, 2018, 54(19): 186-190.
[6]	MENG Deming1，2，3, CHEN Xin1，2, HE Xiaonian1，2, CHEN Siping1，2. Eigenspace-based beamforming combined with spatio-temporally coherence factor for ultrasound imaging [J]. Computer Engineering and Applications, 2018, 54(1): 60-63.
[7]	GUI Yajun1, WU Xiaopei1, ZHANG Chao1, LV Zhao1, WAN Mengshi1, WANG Yingguan2. Indoor intelligent monitoring system with fusion of audio and video [J]. Computer Engineering and Applications, 2017, 53(1): 220-226.
[8]	JIANG Xiangang, ZHANG Panpan, SHENG Meibo. Flame recognition method based on temporal-spatial block covariance matrix blending feature [J]. Computer Engineering and Applications, 2016, 52(17): 208-214.
[9]	ZHANG Yong, YUE Jinwang, WANG Jing, JIN Yong. Joint probabilistic constrained robust beamforming and antenna selection [J]. Computer Engineering and Applications, 2016, 52(12): 127-130.
[10]	ZHANG Yi, MENG Shujie. Research of sound source localization algorithm for head-worn microphone array [J]. Computer Engineering and Applications, 2015, 51(24): 266-270.
[11]	HUANG Xiaoyan, FENG Xi’an, GAO Tiande. Space time adaptive reverberation suppressing method in shallow water active sonar system [J]. Computer Engineering and Applications, 2015, 51(11): 187-189.
[12]	WANG Yang1, ZHAO Zhijin1，2, LIU Xiaoli1，2. Robust full array beamforming algorithm [J]. Computer Engineering and Applications, 2014, 50(6): 205-209.
[13]	WEN Xiaojun, JI Jianhua, ZHONG Linbo, WU Shouhao, WANG Yanfen. Experimental research of microphone array acoustic source location algorithm based on time delay estimation [J]. Computer Engineering and Applications, 2014, 50(23): 212-214.
[14]	NI Zhilian, CAI Weiping, ZHANG Yidian. Method for multiple speech source localization based on sub-band steered response power [J]. Computer Engineering and Applications, 2013, 49(24): 205-209.
[15]	CHENG Shasha, SU Guoshao, YAN Liubin. Structural optimization design method with frequency forbidden zone using evolution strategy [J]. Computer Engineering and Applications, 2013, 49(19): 250-253.

Multi-source Location of Speech Signal Based on Improved DSB Method

改进DSB方法的语音信号多声源定位

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics