Chinese Named Entity Recognition Based on XLnet Language Model

doi:10.3778/j.issn.1002-8331.2005-0355

Abstract

Abstract:

The establishment of linguistic model has a direct impact on exploring the semantic information in sentences. To improve the recognition rate of Chinese named entities, the semantic representation of Chinese characters is the pointed. Aiming at the traditional Chinese named entity recognition algorithm has not fully tapped the hidden information inside the sentence, this article puts forward CAW-XLnet-BiGRU-CRF network framework by word-vector features generated by large-scale corpus pretraining with LSTM extract and uses CNN to extract spatial information between words, then integrates the extracted spatial information with the word vector features obtained and imports it into the language model XLnet （Generalized autoregressive pretraining for language understanding）, finally outputs the optimal tag sequence by BiGRU-CRF. The experiment result shows that the F1 value of the framework in the January 1998 data set of People’s Daily reachs 95.73% and solves the problem of hidden information inner, which can be well applied to Chinese named entity recognition task.

Key words: named entity recognition, word vector, XLnet, language model

摘要：

语言模型的建立对挖掘句子内部语义信息有着直接的影响，为了提高中文命名实体识别率，字的语义表示是关键所在。针对传统的中文命名实体识别算法没有充分挖掘到句子内部的隐藏信息问题，该文利用LSTM提取经过大规模语料预训练生成的字向量特征，同时将词向量预测矩阵传入到字向量特征提取阶段，通过矩阵运算融合为词向量特征，并进一步利用CNN提取词语之间的空间信息，将其与得到的词向量特征整合到一起输入语言模型XLnet（Generalized autoregressive pretraining for language understanding）中，然后经过BiGRU-CRF输出最优标签序列，提出了CAW-XLnet-BiGRU-CRF网络框架。并与其他的语言模型作了对比分析，实验结果表明，该框架解决了挖掘内部隐藏信息不充分问题，在《人民日报》1998年1月份数据集上的F1值达到了95.73%，能够较好地应用于中文命名实体识别任务。

关键词: 命名实体识别, 词向量, XLnet, 语言模型

YAO Guibin, ZHANG Qigui. Chinese Named Entity Recognition Based on XLnet Language Model[J]. Computer Engineering and Applications, 2021, 57(18): 156-162.

姚贵斌，张起贵. 基于XLnet语言模型的中文命名实体识别[J]. 计算机工程与应用, 2021, 57(18): 156-162.

[1]	YANG Qian, GU Lei. Chinese Named Entity Recognition Based on Denoising Joint Character-Word Model [J]. Computer Engineering and Applications, 2021, 57(7): 151-157.
[2]	WEI Hao, ZHOU Ai, ZHANG Yijia, CHEN Fei, QU Wen, LU Mingyu. Review of Deep Learning-Based Biomedical Entity Relation Extraction Research [J]. Computer Engineering and Applications, 2021, 57(21): 14-23.
[3]	CHENG Yuhang, ZHANG Jianqin, LI Jiangchuan, ZHANG An. Visual Mining and Analysis Method of Text Data in Traffic Accident [J]. Computer Engineering and Applications, 2021, 57(21): 116-122.
[4]	HUANG Meigen, LIU Jiale, LIU Chuan. Research on Improved BERT’s Chinese Multi-relation Extraction Method [J]. Computer Engineering and Applications, 2021, 57(21): 234-240.
[5]	JIAO Kainan, LI Xin, ZHU Rongchen. Overview of Chinese Domain Named Entity Recognition [J]. Computer Engineering and Applications, 2021, 57(16): 1-15.
[6]	HE Yujie, DU Fang, SHI Yingjie, SONG Lijuan. Survey of Named Entity Recognition Based on Deep Learning [J]. Computer Engineering and Applications, 2021, 57(11): 21-36.
[7]	SUN Linghao. Cross-Lingual Chinese Named Entity Recognition Based on Translation Model [J]. Computer Engineering and Applications, 2021, 57(10): 94-100.
[8]	CAO Junbo，YE Xia，XU Feixiang，YIN Liedong. Improved CBOW Emotional Information Acquisition Research [J]. Computer Engineering and Applications, 2020, 56(9): 142-147.
[9]	LI Bo, KANG Xiaodong, ZHANG Huali, WANG Yage, CHEN Yayuan, BAI Fang. Named Entity Recognition in Chinese Electronic Medical Records Using Transformer-CRF [J]. Computer Engineering and Applications, 2020, 56(5): 153-159.
[10]	LIU Xiaoan, PENG Tao. Research on Chinese Scenic Spot Named Entity Recognition Based on Convolutional Neural Network [J]. Computer Engineering and Applications, 2020, 56(4): 140-145.
[11]	YU Tongrui, JIN Ran, HAN Xiaozhen, LI Jiahui, YU Ting. Review of Pre-training Models for Natural Language Processing [J]. Computer Engineering and Applications, 2020, 56(23): 12-22.
[12]	TU Wenbo, YUAN Zhenming, YU Kai. Convolutional Neural Networks Without Pooling Layer for Chinese Word Segmentation [J]. Computer Engineering and Applications, 2020, 56(2): 120-126.
[13]	CHEN Zeyu, HUANG Bo. Research on User Portrait of Improved Word Vector Model [J]. Computer Engineering and Applications, 2020, 56(1): 180-184.
[14]	YU Tao, LUO Ke. Commentary Sentiment Classification Model Combining Product Features [J]. Computer Engineering and Applications, 2019, 55(16): 108-114.
[15]	JI Mingyu, WANG Chenlong, AN Xiang, MU Weiye. Method of Sentence Similarity Calculation for Intelligent Customer Service [J]. Computer Engineering and Applications, 2019, 55(13): 123-128.

Chinese Named Entity Recognition Based on XLnet Language Model

基于XLnet语言模型的中文命名实体识别

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics