Research of Chinese Resume Analysis Based on Feature Fusion

doi:10.3778/j.issn.1002-8331.1803-0142

Computer Engineering and Applications ›› 2019, Vol. 55 ›› Issue (10): 244-249.DOI: 10.3778/j.issn.1002-8331.1803-0142

Previous Articles Next Articles

Research of Chinese Resume Analysis Based on Feature Fusion

CHEN Yi1，3，4, FU Lei2，3，4, DAI Yunxia1, ZHANG Jian3，4

1.Key Laboratory of Optical Communication and Networks, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
2.Key Laboratory of Intelligent Computing and Signal Processing, Ministry of Education, Anhui University, Hefei 230601, China
3.Peking University Shenzhen Institute, Shenzhen, Guangdong 518057, China
4.IMSL Shenzhen Key Lab, PKU-HKUST Shenzhen Hong Kong Institution, Shenzhen, Guangdong 518057, China

Online:2019-05-15 Published:2019-05-13

基于特征融合的中文简历解析方法研究

陈毅1，3，4，符磊2，3，4，代云霞1，张剑3，4

1.重庆邮电大学光通信与网络重点实验室，重庆 400065
2.安徽大学计算机智能与信号处理教育部重点实验室，合肥 230601
3.北京大学深圳研究院，广东深圳 518057
4.深港产学研基地深圳市智能媒体和语音重点实验室，广东深圳 518057

Abstract

Abstract: It’s typical for the Chinese resume analysis to apply the rule-based and statistical-based methods, suffering from the low efficiency, high cost and poor generalization ability. This paper proposes a Chinese resume analysis method based on feature fusion model. The concatenation of the word vectors generated by Word2Vec and the word representation is generated from BLSTM neural network, then the text resume is analyzed by intergrating the BLSTM and CRF model（BLSTM-CRF）. In order to improve the efficiency of Chinese resume resolution, the two vectors are concatenated into a new word representation. Furthermore, the BLSTM layer is used to fuse the contextual information of the words to be marked, and then the values of all possible tag sequences are exported to the CRF layer. Finally, according to the constraints of the front and rear labels, the CRF is utilized to obtain the optimal labeling sequence. All of the neural networks are trained by the gradient descent algorithm and are optimized by the pretrained word embeddings and Dropout. The experimental results show that the feature fusion method is superior to the traditional resume analysis schemes.

Key words: Chinese resume, resume analysis, feature fusion, word vectors, neural network

摘要： 针对基于规则和统计的传统中文简历解析方法效率低、成本高、泛化能力差的缺点，提出一种基于特征融合的中文简历解析方法，即级联Word2Vec生成的词向量和用BLSTM（Bidirectional Long Short-Term Memory）建模字序列生成的词向量，然后再结合BLSTM和CRF（Conditional Random Fields）对中文简历进行解析（BLSTM-CRF）。为了提高中文简历解析的效率，级联包含字序列信息的词向量和用Word2Vec生成的词向量，融合成一个新的词向量表示；再由BLSTM强大的学习能力融合词的上下文信息，输出所有可能标签序列的分值给CRF层；再由CRF引入标签之间约束关系求解最优序列。利用梯度下降算法训练神经网络，使用预先训练的词向量和Dropout优化神经网络，最终完成对中文简历的解析工作。实验结果表明，所提的特征融合方法优于传统的简历解析方法。

关键词: 中文简历, 简历解析, 特征融合, 词向量, 神经网络

CHEN Yi1，3，4, FU Lei2，3，4, DAI Yunxia1, ZHANG Jian3，4. Research of Chinese Resume Analysis Based on Feature Fusion[J]. Computer Engineering and Applications, 2019, 55(10): 244-249.

陈毅1，3，4，符磊2，3，4，代云霞1，张剑3，4. 基于特征融合的中文简历解析方法研究[J]. 计算机工程与应用, 2019, 55(10): 244-249.

[1]	MOU Qingping, ZHANG Ying, ZHANG Dongbo, WANG Xinjie, YANG Zhiqiao. Research on Visual Tracking Algorithm and Application of Target Loss Discrimination Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 140-147.
[2]	BAO Zhiqiang, XING Yu, LYU Shaoqing, HUANG Qiongdan. Improved YOLO V2 6D Object Pose Estimation Algorithm [J]. Computer Engineering and Applications, 2021, 57(9): 148-153.
[3]	WANG Lin, CHAI Jiangyun. Research on Deep Neural Network in Multi-scene Vehicle Attribute Recognition [J]. Computer Engineering and Applications, 2021, 57(9): 162-167.
[4]	HUANG Dongyi, YANG Bing, WU Zihao, KUANG Jiayi, YAN Zeming. Spatio-Temporal Fully Connected Convolutional Neural Networks for Citywide Cellular Prediction [J]. Computer Engineering and Applications, 2021, 57(9): 168-175.
[5]	ZHAO Zhiyan, YANG Hua, HU Zhiwei, YU Haiping. Identification Model of Pests on Yuluxiang Pear Leaves Based on TACNN [J]. Computer Engineering and Applications, 2021, 57(9): 176-181.
[6]	ZHOU Lungang, SUN Yifeng, WANG Kun, WU Jiang, HUANG Weigui, LI Binglong. End to End Object Recognition Algorithm for Multi-attributes of Multi-values [J]. Computer Engineering and Applications, 2021, 57(9): 182-190.
[7]	ZHANG Cheng, DAI Junfeng, XIONG Wenxin. Improved Handwritten Date Recognition in Scanned Documents Combined with LeNet-5 [J]. Computer Engineering and Applications, 2021, 57(9): 207-211.
[8]	LU Lixia, ZOU Junzhong, GUO Yucheng, ZHANG Jian, WANG Bei. Prediction of Knee Injury Based on Multimodal Fusion [J]. Computer Engineering and Applications, 2021, 57(9): 225-232.
[9]	MA Zhexu, YANG Feng, QIAO Xu. Intelligent Detection Method of Railway Subgrade Defect [J]. Computer Engineering and Applications, 2021, 57(9): 272-278.
[10]	XU Hao, ZHANG Kai, TIAN Yingjie, CHONG Faguang, WANG Zichao. Review of Deep Neural Network-Based Image Caption [J]. Computer Engineering and Applications, 2021, 57(9): 9-22.
[11]	RAN Rong, XU Xinghua, QIU Shaohua, CUI Xiaopeng, OUYANG Bin. Review of Crack Detection Methods Based on Deep Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(9): 23-35.
[12]	LI Mingshan, HAN Qingpeng, ZHANG Tianyu, WANG Daolei. Safety Helmet Detection Method of Improved SSD [J]. Computer Engineering and Applications, 2021, 57(8): 192-197.
[13]	GUO Xiaojing, SUI Haoda. Application of Improved YOLOv3 in Foreign Object Debris Target Detection on Airfield Pavement [J]. Computer Engineering and Applications, 2021, 57(8): 249-255.
[14]	JIANG Bin, ZHONG Rui, ZHANG Qiuwen, ZHANG Huanlong. Survey of Non-frontal Facial Expression Recognition by Using Deep Learning Methods [J]. Computer Engineering and Applications, 2021, 57(8): 48-61.
[15]	LI Zhenxiao, SUN Wei, LIU Mingming, ZHENG Lili, CHEN Shaoying. Research on Vehicle Detection and Tracking Algorithms in Traffic Monitoring Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 103-111.

Research of Chinese Resume Analysis Based on Feature Fusion

基于特征融合的中文简历解析方法研究

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics