Method for multiple speech source localization based on sub-band steered response power

Abstract

Abstract: To improve localization performance of microphone array in the case of multiple speakers, a method for multiple speech source localization based on sub-band steered response power is presented. In this method, speech signal is divided into seven sub-bands in frequency domain, and the steered response power-phase transform functions are computed in each sub-band. Then initial estimations of source location are generated by searching the maximum value for each function in the source space. According to the frequency sparsity characteristic for speech signal, these initial estimations include multiple source locations. The final source location estimations are produced from them using agglomerative clustering. Simulation and experiment results show that the proposed algorithm facilitates about 4% increase in localization correct rate and about 7% reduction in localization extra rate compared with the conventional algorithm under the conditions of two speakers, 10 dB signal-to-noise ratio and moderate reverberation.

Key words: microphone array, multiple speech source localization, sub-band steered response power, clustering

摘要： 为了提高多个说话人情况下麦克风阵列的定位性能，提出基于子带可控响应功率的多声源定位算法。该算法将语音信号频域分为7个子带，在每个子带计算相位变换加权的可控响应功率函数，在声源空间搜索其最大值得到声源位置的初始估计。根据语音信号频率的稀疏性，这些初始估计包含多个声源的位置，运用会聚聚类算法得到最终的声源位置估计。仿真和实验表明，在有2个说话人，10 dB信噪比，较强混响的条件下，该算法比传统算法的定位正确率提高了约4%，额外率降低了约7%。

关键词: 麦克风阵列, 多声源定位, 子带可控响应功率, 聚类

NI Zhilian, CAI Weiping, ZHANG Yidian. Method for multiple speech source localization based on sub-band steered response power[J]. Computer Engineering and Applications, 2013, 49(24): 205-209.

倪志莲，蔡卫平，张怡典. 基于子带可控响应功率的多声源定位方法[J]. 计算机工程与应用, 2013, 49(24): 205-209.

[1]	LAN Hong, HUANG Min. Fusion of KNN Optimized Density Peaks and FCM Clustering Algorithm [J]. Computer Engineering and Applications, 2021, 57(9): 81-88.
[2]	GUO Xiaojing, SUI Haoda. Application of Improved YOLOv3 in Foreign Object Debris Target Detection on Airfield Pavement [J]. Computer Engineering and Applications, 2021, 57(8): 249-255.
[3]	LI Li, JI Xinyuan, SONG Song. Prediction Model for Number of Software Defects in Loop [J]. Computer Engineering and Applications, 2021, 57(7): 158-163.
[4]	HUO Guangyu, ZHANG Yong, SUN Yanfeng, YIN Baocai. Research on Archive Data Intelligent Classification Based on Semantic [J]. Computer Engineering and Applications, 2021, 57(6): 247-253.
[5]	YANG Fang, YIN Xi, SI Jianhui, LIU Hongyuan, WANG Xue. Mathematical Expression Similarity Calculation Method Based on Focus Clustering [J]. Computer Engineering and Applications, 2021, 57(6): 88-93.
[6]	ZHAO Fan, ZHANG Lin, WEN Zhiquan, YANG Linlin, LIN Guangfeng. Direct and Efficient Natural Scene Chinese Character Approaching Spotting Method [J]. Computer Engineering and Applications, 2021, 57(6): 159-167.
[7]	PENG Qihui, XUAN Shibin, GAO Qing. Distribution Automatic Threshold Density Peak Clustering Algorithm [J]. Computer Engineering and Applications, 2021, 57(5): 71-78.
[8]	LI Yongzhen, LIAO Husheng. Multi-view Clustering via Graph Convolutional Neural Network [J]. Computer Engineering and Applications, 2021, 57(5): 115-122.
[9]	WANG Changlong, ZHANG Yuandong, MIAO Hong, YANG Yuheng. Application of Double Channel Convolutional Neural Network in Pumpkin Diseases Identification [J]. Computer Engineering and Applications, 2021, 57(5): 183-189.
[10]	HU Xiaomin, WANG Mingfeng, ZHANG Shourong, LI Min. New Differential Evolution with Particle Swarm Optimization Algorithm for Text Clustering [J]. Computer Engineering and Applications, 2021, 57(4): 61-67.
[11]	WANG Junling, LU Xinming. Video Key Frame Extraction Algorithm Based on Semantic Correlation [J]. Computer Engineering and Applications, 2021, 57(4): 192-198.
[12]	WANG Fuyin, ZHANG Desheng, ZHANG Xiao. Adaptive Density Peaks Clustering Algorithm Combining with Whale Optimization Algorithm [J]. Computer Engineering and Applications, 2021, 57(3): 94-102.
[13]	CHEN Junfeng, ZHENG Zhongtuan. Over-Sampling Method on Imbalanced Data Based on WKMeans and SMOTE [J]. Computer Engineering and Applications, 2021, 57(23): 106-112.
[14]	ZHANG Zhonglin, ZHAO Yu, YAN Guanghui. Natural Neighbor Density Extremum Clustering Algorithm [J]. Computer Engineering and Applications, 2021, 57(23): 200-210.
[15]	MEI Jie, WEI Yuanyuan, XU Taosheng. Fusion Clustering Algorithm Based on Multi-Prototypes Using Density Peaks [J]. Computer Engineering and Applications, 2021, 57(22): 78-85.

Method for multiple speech source localization based on sub-band steered response power

基于子带可控响应功率的多声源定位方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics