Computer Engineering and Applications, 2011, Vol. 47, Issue (7): 189-192.

• Graphics, Image, and Pattern Recognition •

Improved algorithm for text region localization in video frames

ZHANG Haiyun, ZHANG Fengli, LIU Wei, WAN Mingcheng

  1. College of Computer Science & Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
  • Online: 2011-03-01 Published: 2011-03-01

Abstract: Video frame images have complex backgrounds and large variations in font size. To address these characteristics, this paper proposes an improved algorithm for text region localization in video frames. The algorithm first designs and implements a cross point detection algorithm. It then uses edge features such as cross point density to remove most non-text edges, which reduces the influence of background edges on text regions, and dilates the remaining edges to form candidate text regions. Finally, text regions are distinguished from non-text regions by combining text region features with a support vector machine (SVM) classifier. Experiments show that the algorithm extracts more than 90% of the text regions in video frames and that the localization precision reaches 92.0%.

Key words: edge detection, cross point detection, image dilation, support vector machine
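
The candidate-generation stages named in the abstract (cross point detection, density-based removal of non-text edges, dilation, candidate regions) can be illustrated with a minimal sketch. It is not the authors' implementation, since the abstract gives no definitions or parameters. The sketch assumes that a binary edge map is already available, that a cross point is an edge pixel with at least three 8-connected edge neighbours, and that an edge pixel is kept when its local window contains enough cross points. All function names, window sizes, and thresholds are hypothetical, and the final SVM classification step is omitted.

```python
from collections import deque


def count_neighbors(edges, r, c):
    """Count the 8-connected edge neighbours of pixel (r, c)."""
    h, w = len(edges), len(edges[0])
    total = 0
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            if (dr or dc) and 0 <= r + dr < h and 0 <= c + dc < w:
                total += edges[r + dr][c + dc]
    return total


def cross_points(edges):
    """Assumed definition: an edge pixel with at least 3 edge neighbours."""
    h, w = len(edges), len(edges[0])
    return [[1 if edges[r][c] and count_neighbors(edges, r, c) >= 3 else 0
             for c in range(w)] for r in range(h)]


def filter_by_density(edges, cross, half=2, min_count=2):
    """Keep edge pixels whose (2*half+1)^2 window holds >= min_count cross points."""
    h, w = len(edges), len(edges[0])
    kept = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            if not edges[r][c]:
                continue
            count = sum(cross[rr][cc]
                        for rr in range(max(0, r - half), min(h, r + half + 1))
                        for cc in range(max(0, c - half), min(w, c + half + 1)))
            if count >= min_count:
                kept[r][c] = 1
    return kept


def dilate(mask, half_h=1, half_w=2):
    """Binary dilation with a (2*half_h+1) x (2*half_w+1) rectangle."""
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            if mask[r][c]:
                for rr in range(max(0, r - half_h), min(h, r + half_h + 1)):
                    for cc in range(max(0, c - half_w), min(w, c + half_w + 1)):
                        out[rr][cc] = 1
    return out


def candidate_boxes(mask):
    """Bounding boxes (top, left, bottom, right) of 4-connected components."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for r in range(h):
        for c in range(w):
            if not mask[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < h and 0 <= nx < w
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def locate_candidates(edges):
    """Edge map -> candidate text boxes (classification step not included)."""
    cross = cross_points(edges)
    kept = filter_by_density(edges, cross)
    return candidate_boxes(dilate(kept))
```

Under these assumptions, a long straight edge produces no cross points and is discarded, while a small cluster of intersecting strokes survives and is merged by the dilation into a single candidate box. In the pipeline the abstract describes, each such box would then be passed, together with its region features, to the SVM classifier.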