Computer Engineering and Applications, 2010, Vol. 46, Issue (28): 138-140. DOI: 10.3778/j.issn.1002-8331.2010.28.039

• Database, Signal and Information Processing •

Application research of confusion network in spoken document retrieval system

SUN Cheng-li

  1. School of Information Engineering, Nanchang Hangkong University, Nanchang 330063, China
  • Received: 2009-03-03 Revised: 2009-05-06 Online: 2010-10-01 Published: 2010-10-01
  • Contact: SUN Cheng-li

Abstract: A spoken document content retrieval system based on syllable confusion networks is presented, and an automatic query expansion method based on two-stage decoding is proposed. First, a modified Viterbi decoder computes the likelihood scores of confusable syllables on the confusable-syllable lattice. Then the A* search algorithm generates the most easily confused expansion terms from the syllable lattice; each candidate is accepted automatically by comparing its confidence score with a threshold. Experimental results show that the proposed method effectively improves the term detection rate of the system.

Key words: keyword recognition, confusion network, query expansion, syllable similarity
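
The two-stage expansion described in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: the confusion table `CONFUSION`, its probability values, the function name `expand_query`, and the use of a product of per-syllable similarities as the "confidence score" are all assumptions made for illustration. The first stage is reduced to a Viterbi-style backward pass that records the best achievable score from each position; the second stage is an A* search that uses those scores as its heuristic and stops once candidates fall below the threshold.

```python
import heapq
import math

# Hypothetical syllable confusion table (illustrative values, not from the paper):
# for each query syllable, the syllables it may be confused with and an assumed
# similarity probability.
CONFUSION = {
    "nan": {"nan": 0.7, "lan": 0.2, "nang": 0.1},
    "chang": {"chang": 0.8, "chan": 0.15, "cang": 0.05},
}


def expand_query(syllables, confusion, threshold, max_terms=10):
    """Return (term, score) pairs with score >= threshold, best first.

    A term is a tuple of syllables; its score is the product of the assumed
    per-syllable similarities (a stand-in for the paper's confidence score).
    """
    # Build one slot per query syllable: (log score, alternative), best first.
    slots = []
    for s in syllables:
        alts = confusion.get(s, {s: 1.0})  # unknown syllables map to themselves
        slots.append(sorted(((math.log(p), a) for a, p in alts.items()),
                            reverse=True))

    # Stage 1 (Viterbi-style backward pass): best log score obtainable from
    # slot i to the end. This is an exact, hence admissible, A* heuristic.
    best_rest = [0.0] * (len(slots) + 1)
    for i in range(len(slots) - 1, -1, -1):
        best_rest[i] = slots[i][0][0] + best_rest[i + 1]

    # Stage 2 (A* search): heap entries are (-(g + h), g, partial path).
    heap = [(-best_rest[0], 0.0, ())]
    results = []
    while heap and len(results) < max_terms:
        _, g, path = heapq.heappop(heap)
        if len(path) == len(slots):
            score = math.exp(g)
            if score < threshold:
                break  # complete paths pop in descending order; the rest are worse
            results.append((path, score))
            continue
        i = len(path)
        for logp, alt in slots[i]:
            g2 = g + logp
            heapq.heappush(heap, (-(g2 + best_rest[i + 1]), g2, path + (alt,)))
    return results
```

Calling `expand_query(["nan", "chang"], CONFUSION, threshold=0.1)` under these assumed values keeps the original term plus the two most confusable variants and rejects the rest. Raising the threshold trades detection rate for fewer false expansions, which is the trade-off the thresholding step in the abstract controls.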
