Chinese character CAPTCHA recognition based on convolution neural network

doi:10.3778/j.issn.1002-8331.1706-0304

Abstract

Abstract: CAPTCHAs（Completed Automated Public Turing test to tell Computers and Humans Apart） have already been widely applied in various fields of social life. Automatic recognition of CAPTCHAs consisting of English letters and Arabic numerals has already reached an advanced level. While with general methods identifing the CAPTCHAs?consisting of Chinese characters seems too difficult and the accuracy needs to be promoted. This paper mainly proposes a method of automatic identification CAPTCHAs which is based on convolutional neural network to improve the accuracy of characters recognition. In order to improve the generalization performance of the model by which adopting the framework of Keras convolution neural network and designing of multilayer convolution to extract deep-layer image information of which identifing Chinese characters CAPTCHAs and alphanumeric CAPTCHAs respectively. The experimental results indicate that the accuracy of identification has been promoted remarkably. The identification rate of Chinese characters is up to 99.4%. Meanwhile, the maximum of the identification rate of alphanumeric four-character CAPTCHAs is as high as 99.3%. These findings show that the Deep Neural Network possesses an excellent perceptivity against complex structures. It can be seen from the comparative experiments that the framework of Keras convolution neural network has better performance than other frameworks in CAPTCHAs recognition.

Key words: CAPTCHAs（Completed Automated Public Turing test to tell Computers and Humans Apart）, Chinese character CAPTCHAs, CNN, Keras framework

摘要： 验证码今已广泛应用在各个领域，常见的英文字母与数字组合的验证码自动识别准确率已达到较高的水准，而汉字因其字符复杂，用传统方法进行自动识别难度很大。提出一种基于卷积神经网络的验证码自动识别方法来提高字符的识别准确率。采用Keras卷积神经网络框架，设计多层卷积来提取深层次图像信息，分别对汉字验证码和字母数字验证码进行识别，以提高模型的泛化性。实验结果表明用该方法汉字验证码的单字识别率已达到99.4%；传统四字符字母数字验证码的识别率最高达到99.3%。这一结果表明深度神经网络对验证码复杂结构的感知能力很强大，通过对比实验发现Keras框架在验证码识别领域有较好效果。

关键词: 验证码, 汉字验证码, CNN, Keras框架

FAN Wang, HAN Jungang, GOU Fan, LI Shuai. Chinese character CAPTCHA recognition based on convolution neural network[J]. Computer Engineering and Applications, 2018, 54(3): 160-165.

范望，韩俊刚，苟凡，李帅. 卷积神经网络识别汉字验证码[J]. 计算机工程与应用, 2018, 54(3): 160-165.

[1]	LIANG Fangxuan, YANG Feng, LU Liyun, YIN Mengxiao. Review of Brain Tumor Segmentation Methods Based on Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(7): 34-43.
[2]	LI Xianguo, FENG Xinxin, LI Jianxiong. Sigle Image Super-Resolution Reconstruction Based on Multi-scale Residual Network [J]. Computer Engineering and Applications, 2021, 57(7): 215-221.
[3]	LI Songjiang, WU Ning, WANG Peng, LI Hailan. Vehicle Target Detection Method Based on Improved Cascade RCNN [J]. Computer Engineering and Applications, 2021, 57(5): 123-130.
[4]	LYU Hao, ZHANG Shengbing, WANG Jia, LIU Shuo, JING Desheng. Implementation of Convolutional Neural Network SIP Microsystem [J]. Computer Engineering and Applications, 2021, 57(5): 216-221.
[5]	HAN Wenjing, LUO Xiaoshu, YANG Rixing. Research on Compound Gesture Recognition Method [J]. Computer Engineering and Applications, 2021, 57(4): 108-113.
[6]	WANG Yutan, ZHU Chaowei, ZHAO Chen, LI Lekai, LI Ping, FENG Zhaoxu, XUE Junrui, LI Jiajing, ZHANG Jiaxin. Image Detection Method of Lingwu Long Jujube Based on Faster R-CNN [J]. Computer Engineering and Applications, 2021, 57(4): 216-224.
[7]	WAN Yaling, ZHONG Xiwu, LIU Hui, QIAN Yurong. Survey of Application of Convolutional Neural Network in Classification of Hyperspectral Images [J]. Computer Engineering and Applications, 2021, 57(4): 1-10.
[8]	ZHAO Hongrui, XUE Lei. Research on Stock Forecasting Based on LSTM-CNN-CBAM Model [J]. Computer Engineering and Applications, 2021, 57(3): 203-207.
[9]	LI Junxia, ZHANG Qin, ZHENG Guimei. Overview of Human Posture Recognition by Ultra-wideband Radar [J]. Computer Engineering and Applications, 2021, 57(3): 14-23.
[10]	HE Wenliang, ZHU Minling. Research Status and Future Analysis of Capsule Neural Network [J]. Computer Engineering and Applications, 2021, 57(3): 33-43.
[11]	CAO Yudong, LIU Haiyan, JIA Xu, LI Xiaohui. Overview of Image Quality Assessment Method Based on Deep Learning [J]. Computer Engineering and Applications, 2021, 57(23): 27-36.
[12]	TENG Jinbao, KONG Weiwei, TIAN Qiaoxin, WANG Zhaoqian, LI Long. Multi-channel Attention Mechanism Text Classification Model Based on CNN and LSTM [J]. Computer Engineering and Applications, 2021, 57(23): 154-162.
[13]	ZHANG De, LIN Qingyu, GUO Maozu. Review of Single Image Super-Resolution Based on Deep Learning [J]. Computer Engineering and Applications, 2021, 57(22): 28-41.
[14]	GU Shanghang, ZHANG Lijun, GUO Yuechao, XU Yong. Neural Network Optimization Method Based on Invalid Filters Weight Regression [J]. Computer Engineering and Applications, 2021, 57(22): 86-91.
[15]	CHEN Xiaohan, WEI Shuning, QIN Zhengze. Malware Family Classification Based on Deep Learning Visualization [J]. Computer Engineering and Applications, 2021, 57(22): 131-138.

Chinese character CAPTCHA recognition based on convolution neural network

卷积神经网络识别汉字验证码

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics