Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (25): 173-175. DOI: 10.3778/j.issn.1002-8331.2009.25.053

• Graphics, Images, and Pattern Recognition •

News video monologue shot detection based on multimodality information

JI Zhong, SU Yu-ting, YANG Yi-zheng

  1. School of Electronic and Information Engineering, Tianjin University, Tianjin 300072, China
  • Received: 2008-05-27 Revised: 2008-08-18 Online: 2009-09-01 Published: 2009-09-01
  • Contact: JI Zhong

Abstract: Monologue shots in news video carry a large amount of information and are valuable for video retrieval and mining. This paper proposes a method that detects them by fusing multimodal features: audio, visual, temporal, and contextual information. Commercial and "other" shots are first removed using rules, anchorperson shots are then detected by clustering, and finally a conditional random fields (CRFs) model labels monologue and reporter shots. The method requires no external knowledge, which makes it broadly applicable, and it achieved good performance in experiments.

Key words: monologue shot detection, news video, video retrieval, multimodality, conditional random fields
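
The three-stage cascade described in the abstract (rule-based removal, clustering for anchorperson shots, then sequence labeling) might be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the `Shot` fields, every threshold, the greedy leader clustering, and the hand-set scores are all hypothetical, and the final stage runs only Viterbi decoding over a linear chain as a stand-in for inference in a trained CRF.

```python
from dataclasses import dataclass
from typing import List, Set

@dataclass
class Shot:
    # All four features are hypothetical placeholders for the paper's
    # audio, visual, temporal, and contextual features.
    duration: float      # seconds
    speech_ratio: float  # fraction of the shot containing speech, 0..1
    face_size: float     # largest face area / frame area, 0..1
    appearance: float    # 1-D summary of visual appearance, 0..1

LABELS = ("monologue", "reporter")

def rule_filter(shots: List[Shot]) -> List[int]:
    """Stage 1: keep indices of shots that pass simple (assumed) rules."""
    return [i for i, s in enumerate(shots)
            if s.duration >= 2.0 and s.speech_ratio >= 0.5 and s.face_size > 0.0]

def find_anchor_shots(shots: List[Shot], candidates: List[int],
                      tol: float = 0.05, min_size: int = 3) -> Set[int]:
    """Stage 2: greedy leader clustering; large clusters are taken as anchor shots."""
    clusters = []  # each entry: (leader appearance, member indices)
    for i in candidates:
        for leader, members in clusters:
            if abs(shots[i].appearance - leader) <= tol:
                members.append(i)
                break
        else:
            clusters.append((shots[i].appearance, [i]))
    return {i for _, members in clusters if len(members) >= min_size for i in members}

def unary(shot: Shot, label: str) -> float:
    # Hand-set score (assumption); a real CRF would learn these weights.
    score = 4.0 * (shot.face_size - 0.25)
    return score if label == "monologue" else -score

def transition(prev: str, cur: str) -> float:
    # Small hand-set bonus for keeping the same label (assumption).
    return 0.5 if prev == cur else 0.0

def viterbi(seq: List[Shot]) -> List[str]:
    """Stage 3: highest-scoring label sequence for a linear chain."""
    if not seq:
        return []
    best = {l: unary(seq[0], l) for l in LABELS}
    back = []
    for s in seq[1:]:
        prev, best, ptr = best, {}, {}
        for l in LABELS:
            p = max(LABELS, key=lambda q: prev[q] + transition(q, l))
            best[l] = prev[p] + transition(p, l) + unary(s, l)
            ptr[l] = p
        back.append(ptr)
    path = [max(LABELS, key=lambda l: best[l])]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return path[::-1]

def detect(shots: List[Shot]) -> List[str]:
    """Run the cascade; shots rejected in stage 1 stay labeled 'other'."""
    labels = ["other"] * len(shots)
    kept = rule_filter(shots)
    anchors = find_anchor_shots(shots, kept)
    for i in anchors:
        labels[i] = "anchor"
    rest = [i for i in kept if i not in anchors]
    for i, lab in zip(rest, viterbi([shots[i] for i in rest])):
        labels[i] = lab
    return labels
```

Note that this sketch chains the remaining shots together even when they were not adjacent in the broadcast, which is a simplification; how the paper defines the sequence and its features is not stated in the abstract.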
