计算机工程与应用 ›› 2012, Vol. 48 ›› Issue (23): 125-127.

• 数据库、信号与信息处理 • 上一篇    下一篇

基于样本熵的语音/音乐识别

杨  松,于凤芹   

  1. 江南大学 物联网工程学院,江苏 无锡 214122
  • 出版日期:2012-08-11 发布日期:2012-08-21

Speech/music discriminator based on sample entropy

YANG Song, YU Fengqin   

  1. School of Internet of Things Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China
  • Online:2012-08-11 Published:2012-08-21

摘要: 传统的MFCC及短时能量特征只反映了信号序列的静态特征,目前基于这些特征的语音/音乐识别率为79%~86%。样本熵可以反映信号序列中的新信息量的大小以及新信息量的变化程度。以样本熵作为特征对语音/音乐进行分类识别,提取混合信号的样本熵,计算每段信号样本熵的均值和方差,采用[k]均值聚类进行识别。仿真实验结果表明,基于样本熵的语音/音乐识别的识别率可提高到88.073%。

关键词: 语音/音乐识别, 样本熵, k均值聚类

Abstract: Mel frequency cepstral coefficients and short time energy only reflect the static characteristics in signal sequence and the recognition rate of speech/music discrimination is 79%~86%. Sample entropy reflects the size and variational extent of new information in signal sequence. This paper conducts speech/music discrimination using sample entropy. The mean and variance of the sample entropy are calculated after extracting the sample entropy of mixed signals, then each signal is recognized by k-means cluster. Simulation experimental results show that the recognition rate of speech/music discrimination reaches 88.073% when using sample entropy.

Key words: speech/music discrimination, sample entropy, k-means