Identification of Chinese names based on maximum entropy model and rules

Computer Engineering and Applications ›› 2007, Vol. 43 ›› Issue (35): 1-4.

• 博士论坛 • Previous Articles Next Articles

Identification of Chinese names based on maximum entropy model and rules

JIA Ning^1,2,ZHANG Quan²

1.Graduate School of Chinese Academy of Sciences，Beijing 100039，China
2.Institute of Acoustics，Chinese Academy of Sciences，Beijing 100080，China

Received:1900-01-01 Revised:1900-01-01 Online:2007-12-11 Published:2007-12-11
Contact: JIA Ning

基于最大熵模型和规则的中文姓名识别

贾宁^1,2,张全²

1.中国科学院研究生院，北京 100039
2.中国科学院声学研究所，北京 100080

通讯作者: 贾宁

Abstract

Abstract: Identification of Chinese names is one of the important fields for the Chinese language automatic processing.The recall rate of identification will affect other processing deeply.But most methods can’t get a good recall rate which is up to 90%.This paper presents a method based on maximum entropy model and rules.The open test on real corpus shows that the recall rate of the system reaches 94%，with a precision more than 84%.The method is practicable，and benefits from its recall rate.

Key words: Chinese name recognition, maximum entropy, rule

摘要： 中文姓名识别是中文信息处理的一项重要技术，识别的召回率对其它需要以姓名识别为基础的中文信息处理技术有至关重要的影响。提出了一种统计模型和处理规则相结合的中文姓名识别方法：首先以最大熵模型识别潜在姓氏，而后再通过判定规则作进一步处理。真实语料的开放测试表明，该方法在召回率方面有明显的优势，可以达到94%以上的召回率，同时能保证较高的准确率。

关键词: 中文姓名识别, 最大熵, 规则

JIA Ning^1,2,ZHANG Quan². Identification of Chinese names based on maximum entropy model and rules[J]. Computer Engineering and Applications, 2007, 43(35): 1-4.

贾宁^1,2,张全²

. 基于最大熵模型和规则的中文姓名识别[J]. 计算机工程与应用, 2007, 43(35): 1-4.

[1]	LIU Teng, CHEN Heng, LI Guanyu. Knowledge Graph Representation Learning Method Jointing FOL Rules [J]. Computer Engineering and Applications, 2021, 57(4): 100-107.
[2]	SONG Haonan, ZHAO Gang, WANG Xingfen. Knowledge Reasoning Method Combining Knowledge Representation with Deep Reinforcement Learning [J]. Computer Engineering and Applications, 2021, 57(19): 189-197.
[3]	TONG Wenlin, CHEN Dewang, HUANG Yunhu, LYU Yisheng. Fuzzy System Optimization Method Based on Simulated Annealing and Rule Reduction [J]. Computer Engineering and Applications, 2021, 57(16): 142-150.
[4]	MENG Xiaojuan, ZHANG Yueqin, HAO Xiaoli, LYU Jinlai. Multi-class Deep Convolutional Generative Adversarial Networks for Belt Tear Detection [J]. Computer Engineering and Applications, 2021, 57(16): 269-275.
[5]	ZHANG Zhenhai，ZHANG Xiangting. Context-Aware Information Service Recommendation Method for High-Speed Rail [J]. Computer Engineering and Applications, 2021, 57(12): 231-236.
[6]	YANG Geying, SHEN Xiajiong, SHI Xianjin, ZHANG Lei. Visualization of Association Rules in Context of Concept Lattices [J]. Computer Engineering and Applications, 2021, 57(1): 84-91.
[7]	CHEN Guang, JIANG Tonghai, WANG Meng, TANG Xinyu, JI Wenfei. Custom SWRL Knowledge Graph Completion Reasoning Built-ins Implementation Method [J]. Computer Engineering and Applications, 2021, 57(1): 261-270.
[8]	YUAN Shunjie, CHENG Hui, YE Zhencheng, CHENG Peixin. Application of SOM-T2 FLS in Stock Market Forecasting [J]. Computer Engineering and Applications, 2020, 56(7): 130-136.
[9]	DU Yufei, WU Baoguo, CHEN Dong. Study of Trees and Shrubs Recognition Inference Algorithm Based on Production Rules [J]. Computer Engineering and Applications, 2020, 56(5): 242-250.
[10]	LI Jian, XI Wenfeng. Detection Algorithm for Rule Base of Drilling Fluid Design Expert System [J]. Computer Engineering and Applications, 2020, 56(4): 256-261.
[11]	JI Wenlu, WANG Hailong, SU Guibin, LIU Lin. Review of Recommendation Methods Based on Association Rules Algorithm [J]. Computer Engineering and Applications, 2020, 56(22): 33-41.
[12]	GU Junhua, SU Ming, ZHANG Yajuan, ZHANG Danhong. Research on Fast Frequent Pattern Mining Algorithm Based on Bitmap-Code List [J]. Computer Engineering and Applications, 2020, 56(19): 86-93.
[13]	ZHANG Xiao, SUN Yiming, WU Xufeng. Research on Query-Aware Relation-Graph Database Adaptive Storage Technology [J]. Computer Engineering and Applications, 2020, 56(17): 100-108.
[14]	YANG Ying, WANG Jun, WANG Gang. Customer Complaints Classification Method Based on Improved Random Subspace [J]. Computer Engineering and Applications, 2020, 56(13): 230-235.
[15]	ZHOU Wanying, MA Yingcang, XU Qiuxia, ZHENG Yi. Unsupervised Feature Selection Algorithm Based on Maximum Entropy and [l2,0] Norm Constraints [J]. Computer Engineering and Applications, 2020, 56(11): 51-59.

Identification of Chinese names based on maximum entropy model and rules

基于最大熵模型和规则的中文姓名识别

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics