计算机工程与应用 ›› 2009, Vol. 45 ›› Issue (24): 165-169.DOI: 10.3778/j.issn.1002-8331.2009.24.049

• 图形、图像、模式识别 • 上一篇    下一篇

印刷体藏文文字识别技术研究

欧 珠1,普次仁2,大罗桑朗杰2,赵栋才2,刘 芳2,边巴旺堆2   

  1. 1.西藏大学 工学院,拉萨 850000
    2.西藏大学 工学院 计算机科学系,拉萨 850000
  • 收稿日期:2009-01-08 修回日期:2009-03-18 出版日期:2009-08-21 发布日期:2009-08-21
  • 通讯作者: 欧 珠

Study on printed Tibetan character recognition

Ngodrup1,Putseren2,Daluosanglangjie2,ZHAO Dong-cai2,LIU Fang2,Bianbawangdui2   

  1. 1.School of Engineering,Tibet University,Lhasa 850000,China
    2.Department of Computer Science,School of Engineering,Tibet University,Lhasa 850000,China
  • Received:2009-01-08 Revised:2009-03-18 Online:2009-08-21 Published:2009-08-21
  • Contact: Ngodrup

摘要: 藏文字因其结构的特殊性,在应用传统文字识别方法进行识别时正确识别率较低,识别效果较差。在深入分析以印刷体藏文文字特征的基础上,提出了一系列可以在干扰情况下提高识别率的方法,包括局部自适应二值化算法、基于连通域的切分、基于网格的模糊笔划特征提取等。实验结果说明,这些方法可提高印刷体藏文文字识别系统的正确识别率和抗干扰能力。

关键词: 印刷体藏文字符, 切分, 藏文文字识别, 光学字符识别

Abstract: Owing to the special structure of Tibetan characters,the recognition of traditional Tibetan characters encounters the problems of low recognition rates and poor recognition effects.Through an in-depth study on features of the printed Tibetan characters,this paper develops a series of methods to increase recognition rate and improve the recognition effects of Tibetan characters even in the case of jamming.These methods include local self-adaptive binary algorithm,segmentation based on the connected domain,grid-based fuzzy stroke feature extraction and so on.The results of the experiments indicate that the methods can definitely increase the recognition rates of the printed Tibetan character recognition system and improve the ability to prevent jamming.

Key words: printed Tibetan character, segmentation, Tibetan character recognition, Optical Character Recognition(OCR)

中图分类号: