Hybrid model for overlapping ambiguities resolution

doi:10.3778/j.issn.1002-8331.2008.21.002

Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (21): 5-8.DOI: 10.3778/j.issn.1002-8331.2008.21.002

• 博士论坛 • Previous Articles Next Articles

Hybrid model for overlapping ambiguities resolution

LI Tian-xia,DAI Xin-yu,CHEN Jia-jun

National Laboratory of Novel Software Technology，Nanjing University，Nanjing 210093，China
Department of Computer Science and Technology，Nanjing University，Nanjing 210093，China

Received:2008-04-30 Revised:2008-06-02 Online:2008-07-21 Published:2008-07-21
Contact: LI Tian-xia

基于混合模型的交集型歧义消歧策略

李天侠,戴新宇,陈家骏

南京大学计算机软件新技术国家重点实验室，南京 210093
南京大学计算机科学与技术系，南京 210093

通讯作者: 李天侠

Abstract

Abstract: Overlapping ambiguity is one of the key problems in Chinese words segmentation.In this paper，a new hybrid strategy which integrates rule-based method and statistical-based method is presented for solving the overlapping ambiguity.Firstly，rule-set is constructed automatically through error-driven learning which will be used for some ambiguities detection and resolution.Secondly，a score function based on N-Gram language model is constructed.Lastly，a rule-based module and a statistical-based module will be combined for solving all ambiguities detected by FMM&BMM and the rule-set.The experiments show that this hybrid method is more suitable for ambiguities detection and possesses the advantages of both rule-based and statistical-based methods for overlapping ambiguities resolution in Chinese words segmentation.

Key words: overlapping ambiguity, disambiguation rules, statistical language model, score function, full segmentation

摘要： 针对交集型歧义这一汉语分词中的难点问题，提出了一种规则和统计相结合的交集型歧义消歧模型。首先，根据标注语料库，通过基于错误驱动的学习思想，获取交集型歧义消歧规则库，同时，利用统计工具，构建N-Gram统计语言模型；然后，采用正向/逆向最大匹配方法和消歧规则库探测发现交集型歧义字段；最后，通过消歧规则库和评分函数进行交集型歧义的消歧处理。这种基于混合模型的方法可以探测到更多的交集型歧义字段，并且结合了规则方法和统计方法在处理交集型歧义上的优势。实验表明，这种方法提高了交集型歧义处理的精度，为解决交集型歧义提供了一种新的思路。

关键词: 交集型歧义, 消歧规则, 统计语言模型, 评分函数, 全切分

LI Tian-xia,DAI Xin-yu,CHEN Jia-jun. Hybrid model for overlapping ambiguities resolution[J]. Computer Engineering and Applications, 2008, 44(21): 5-8.

李天侠,戴新宇,陈家骏. 基于混合模型的交集型歧义消歧策略[J]. 计算机工程与应用, 2008, 44(21): 5-8.

[1]	CAI Qingsong, CHEN Xihou. Bayesian Network Structure Merging Algorithm Based on Scoring Function [J]. Computer Engineering and Applications, 2019, 55(11): 147-152.
[2]	WANG Bin1, WANG Zhechen2, ZHOU Wei1, HAO Tianpeng1. Intuitionistic fuzzy decision-making method based on entropy and improved co-correlation degree [J]. Computer Engineering and Applications, 2018, 54(6): 247-251.
[3]	YU Qian1, HOU Fujun2. Application of hesitant trapezoid fuzzy aggregation operators on multiple attribute decision making [J]. Computer Engineering and Applications, 2018, 54(22): 252-257.
[4]	WANG Cuicui1, LI Baoping1，2, MAO Junjun2. Multiple attributes decision-making method based on interval type-2 fuzzy entropy [J]. Computer Engineering and Applications, 2017, 53(18): 132-136.
[5]	FANG Gang1, ZHANG Shemin2. 3-gram statistical language model optimization to expression vector design [J]. Computer Engineering and Applications, 2016, 52(15): 60-64.
[6]	REN Jian. Methods of interval-valued fuzzy-stochastic multiple-criterion decision-making problem [J]. Computer Engineering and Applications, 2015, 51(8): 27-31.
[7]	MA Qinggong. Hesitant fuzzy multi-attribute group decision-making method based on prospect theory [J]. Computer Engineering and Applications, 2015, 51(24): 249-253.
[8]	LI Wei1，2, YANG Huizhong1. Blind separation of over-determined mixtures with conjugate gradient and kernel estimation [J]. Computer Engineering and Applications, 2014, 50(22): 22-27.
[9]	XU Danqing1, MAO Junjun1，2, FU Yanan1. Multiple attribute group decision making method of improved IITFN based on aggregation operator [J]. Computer Engineering and Applications, 2013, 49(12): 53-56.
[10]	MAO Junjun1，2, WANG Cuicui1, YAO Dengbao1, SUN Li1. Multi-attribute decision-making method of normal distribution interval number based on cross-entropy [J]. Computer Engineering and Applications, 2012, 48(33): 44-48.
[11]	CUI Chunsheng1, SU Baiyun2. Study of non-personalized recommender systems based on vague value [J]. Computer Engineering and Applications, 2012, 48(13): 63-66.
[12]	DONG Jiu-ying. Application of Vague set in multi-sensor information fusion [J]. Computer Engineering and Applications, 2010, 46(16): 147-149.
[13]	HUANG Ying-yi,CAI Guang-cheng,LIU Wen-qi. Multicriteria decision-making based on interval-value Vague sets [J]. Computer Engineering and Applications, 2009, 45(7): 65-67.
[14]	LI Peng,WEI Cui-ping. New method based on intuitionistic fuzzy sets in multiple attribute decision making [J]. Computer Engineering and Applications, 2009, 45(1): 59-60.
[15]	ZHANG Guo-bing^1,2,LI Miao¹. Rapid word segmentation algorithm based on local ambiguity word grid [J]. Computer Engineering and Applications, 2008, 44(12): 175-177.

Hybrid model for overlapping ambiguities resolution

基于混合模型的交集型歧义消歧策略

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics