Uyghur Text Regions Localization Using Channel-Enhanced MSER and CNN

doi:10.3778/j.issn.1002-8331.1906-0027

Computer Engineering and Applications, 2020, Vol. 56, Issue (16): 132-138. DOI: 10.3778/j.issn.1002-8331.1906-0027

Ahmatjan Mattohti, Askar Hamdulla, Abdusalam Dawut

1. College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
2. School of Software, Xinjiang University, Urumqi 830046, China

Online: 2020-08-15 Published: 2020-08-11

Abstract

To locate Uyghur text regions in images accurately and effectively, this paper proposes an image text region localization method based on channel-enhanced Maximally Stable Extremal Regions (MSER) and a Convolutional Neural Network (CNN). First, channel-enhanced MSER extracts candidate regions. Non-text and repeated regions are then removed using heuristic rules based on text features together with CNN classification results, and a region fusion algorithm merges the remaining regions into word-level text regions. Next, missing text regions are recalled according to their color similarity and spatial relationship to the located regions. Finally, the recalled regions are classified by the CNN and fused to give the final text regions. Experimental results show that the proposed method locates text regions accurately and effectively, and is robust and applicable.

Key words: image text, Uyghur text regions localization, channel-enhanced MSER, Convolutional Neural Network (CNN), region fusion algorithm
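
The abstract names the pipeline stages but gives no rules or thresholds, so the following is only a minimal sketch of two of them: heuristic filtering of candidate boxes (including removal of repeated regions) and fusion of neighboring boxes into word-level regions. Everything in it is an assumption made for illustration, not the authors' method: the `Box` type, the function names `filter_candidates` and `fuse_regions`, the specific rules (minimum area, aspect ratio, IoU-based duplicate removal, horizontal-gap merging), and all threshold values. MSER extraction, CNN classification, and the color-based recall step are omitted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box (hypothetical type for this sketch)."""
    x1: int
    y1: int
    x2: int
    y2: int

    @property
    def width(self) -> int:
        return self.x2 - self.x1

    @property
    def height(self) -> int:
        return self.y2 - self.y1

    @property
    def area(self) -> int:
        return self.width * self.height


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes."""
    ix = max(0, min(a.x2, b.x2) - max(a.x1, b.x1))
    iy = max(0, min(a.y2, b.y2) - max(a.y1, b.y1))
    inter = ix * iy
    union = a.area + b.area - inter
    return inter / union if union else 0.0


def filter_candidates(boxes, min_area=20, max_aspect=8.0, dup_iou=0.8):
    """Drop tiny, extremely elongated, and near-duplicate boxes.

    The rules and thresholds are illustrative assumptions only.
    """
    kept = []
    # Visit larger boxes first so the larger of two duplicates is kept.
    for b in sorted(boxes, key=lambda b: b.area, reverse=True):
        if b.width <= 0 or b.height <= 0 or b.area < min_area:
            continue
        aspect = max(b.width / b.height, b.height / b.width)
        if aspect > max_aspect:
            continue
        if any(iou(b, k) >= dup_iou for k in kept):
            continue
        kept.append(b)
    return kept


def fuse_regions(boxes, max_gap=5, min_v_overlap=0.5):
    """Greedily merge boxes that are horizontally close and vertically aligned."""
    merged = []
    for b in sorted(boxes, key=lambda b: b.x1):
        for i, m in enumerate(merged):
            gap = b.x1 - m.x2  # negative when the boxes overlap horizontally
            v_overlap = min(m.y2, b.y2) - max(m.y1, b.y1)
            if gap <= max_gap and v_overlap >= min_v_overlap * min(m.height, b.height):
                merged[i] = Box(min(m.x1, b.x1), min(m.y1, b.y1),
                                max(m.x2, b.x2), max(m.y2, b.y2))
                break
        else:
            merged.append(b)
    return merged
```

In a fuller pipeline, a text/non-text classifier would sit between `filter_candidates` and `fuse_regions`, and the fused boxes would seed the recall step. The single-pass greedy merge is the simplest possible choice here; a real fusion algorithm would likely iterate until no further merges occur.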

Ahmatjan Mattohti, Askar Hamdulla, Abdusalam Dawut. Uyghur Text Regions Localization Using Channel-Enhanced MSER and CNN[J]. Computer Engineering and Applications, 2020, 56(16): 132-138.
