Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (6): 206-209.

• 工程与应用 • Previous Articles     Next Articles

Type identification of single character based on manifold learning

HE Xiu-ling1,2,YANG Yang2,CHEN Zeng-zhao1,2,YU Ying1,2,DONG Cai-lin1   

  1. 1.The Center for Optimal Control & Discrete Mathematics,Institute of Mathematics and Statistics,Central China Normal University,Wuhan 430079,China
    2.Institute of Information Engineering,University of Science & Technology Beijing,Beijing 100083,China
  • Received:2007-06-26 Revised:2007-08-29 Online:2008-02-21 Published:2008-02-21
  • Contact: HE Xiu-ling

基于流形学习的单字符字体辨别

何秀玲1,2,杨 扬2,陈增照1,2,喻 莹1,2,董才林1   

  1. 1.华中师范大学 数学与统计学学院 最优控制与离散数学重点实验室,武汉 430079
    2.北京科技大学 信息工程学院,北京 100083
  • 通讯作者: 何秀玲

Abstract: The identification of language and character type has been an active area of research after recognition of machine printed text.Research on identification of handwritten text and printed text is seldom conducted.But it is common used in recognition of form.For character type identification,manifold learning algorithm Locally Linear Embedding (LLE) is imported.A generalizing method and a parameters estimation method are proposed.Experiments in identification printed/handwritten Chinese characters and digits show that its performance is higher than Support Vector Machine (SVM) classification.The combination of dimensionality reduction of LLE and Linear Discriminant Analysis(LDA) classification achieves a similar accurate rate as or higher than the combination of LLE and SVM classification but runs much faster than it.

摘要: 文字种类识别及字体辨别已成为继印刷体文字识别以后新的国内外研究的热点,关于单字的手写体和印刷体辨别的研究不多,但在表单中却极为常用。对于字体辨别问题,引入流形学习算法局部线性嵌套(LLE),假定数据为存在于嵌入高维空间的一个低维流形。提出了用于单字字体辨别的LLE泛化方法及邻域和内在维数的参数估计方法,基于印刷体/手写体汉字字符及数字的辨别实验表明,其性能优于直接支持向量机(SVM)分类,且经过LLE降维后的数据直接用线性判别分析方法(LDA)分类可以获得与LLE计算后SVM分类相近甚至更高的正确率和更快的分类速度。