Computer Engineering and Applications ›› 2014, Vol. 50 ›› Issue (18): 142-146.

• Graphics and Image Processing •

Text line segmentation of handwritten Uyghur based on connected component features

易晓芳,卡米力·木依丁,艾斯卡尔·艾木都拉   

  1. School of Information Science and Engineering, Xinjiang University, Urumqi 830046
  • Publication date: 2014-09-15 Release date: 2014-09-12

Connected component feature analysis based handwritten Uyghur text line detection and separation algorithm

YI Xiaofang, Kamil MOYDIN, Askar HAMDULLA   

  1. Institute of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
  • Online: 2014-09-15 Published: 2014-09-12

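Abstract: To address line segmentation in handwritten Uyghur text, the characters in the image are divided into three classes according to connected component size, and an adaptive smearing and thinning algorithm is proposed to locate the main text lines. Characters in the third class of connected components that touch across two adjacent text lines are then cut apart. In addition, a neighborhood search within the range of the center of gravity resolves which text line each remaining stroke belongs to. Experimental results show that the method segments better than the common horizontal projection, piecewise projection, and smearing methods.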

Key words: Uyghur script, handwritten text, text line segmentation, center of gravity, neighborhood

Abstract: To address text line segmentation in handwritten Uyghur documents, this paper divides the connected components of the text image into three categories according to their size. It proposes an adaptive smearing and thinning algorithm to locate the main text lines. It then separates characters that touch across adjacent lines and assigns them to their text lines. In addition, a neighborhood search algorithm based on the center of gravity resolves which line the remaining small strokes belong to. Experimental results show that this method segments text lines better than the horizontal projection, piecewise projection, and smearing based methods.

Key words: Uyghur, handwritten documents, text line segmentation, center of gravity, neighborhood
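
The abstract's two bookkeeping steps, grouping connected components by size and attaching leftover small strokes to a line by center of gravity, can be illustrated with a minimal sketch. It is not the paper's algorithm. The height thresholds (0.5 and 2.0 times the median height), the function names, and the rule "attach to the line whose center row is nearest" are assumptions made for illustration, and the nearest-row rule is a simplification of the neighborhood search the abstract mentions. The adaptive smearing and thinning step and the cutting of touching characters are not shown; line center rows are assumed to be given.

```python
from collections import deque
from statistics import median


def connected_components(img):
    """Find 8-connected groups of foreground (1) pixels in a 2D list."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for r in range(h):
        for c in range(w):
            if img[r][c] != 1 or seen[r][c]:
                continue
            seen[r][c] = True
            queue, pixels = deque([(r, c)]), []
            while queue:  # breadth-first flood fill
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            ys = [p[0] for p in pixels]
            xs = [p[1] for p in pixels]
            comps.append({
                "height": max(ys) - min(ys) + 1,
                # center of gravity as (row, column)
                "centroid": (sum(ys) / len(ys), sum(xs) / len(xs)),
            })
    return comps


def classify_by_height(comps, small=0.5, large=2.0):
    """Tag each component 'small', 'main' or 'large' (thresholds are assumed)."""
    ref = median(c["height"] for c in comps)
    for c in comps:
        ratio = c["height"] / ref
        if ratio < small:
            c["kind"] = "small"
        elif ratio > large:
            c["kind"] = "large"
        else:
            c["kind"] = "main"
    return comps


def assign_small_to_lines(comps, line_centers):
    """Attach each small component to the line with the nearest center row."""
    for c in comps:
        if c["kind"] == "small":
            cy = c["centroid"][0]
            c["line"] = min(range(len(line_centers)),
                            key=lambda i: abs(line_centers[i] - cy))
    return comps
```

For example, on a 12 by 5 binary image with one vertical stroke on rows 0 to 3, another on rows 7 to 10, and a single dot at row 4, the two strokes are tagged "main" and the dot "small". With line center rows 1.5 and 8.5, the dot is attached to the first line.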