Parallel processing of contemporary Chinese “V+N” sequence relations

doi:10.3778/j.issn.1002-8331.2010.30.003

Computer Engineering and Applications ›› 2010, Vol. 46 ›› Issue (30): 8-10.DOI: 10.3778/j.issn.1002-8331.2010.30.003

• 博士论坛 • Previous Articles Next Articles

Parallel processing of contemporary Chinese “V+N” sequence relations

FENG Min-xuan

School of Chinese Language and Literature，Nanjing Normal University，Nanjing 210097，China

Received:2010-05-10 Revised:2010-09-06 Online:2010-10-21 Published:2010-10-21
Contact: FENG Min-xuan

现代汉语“V+N”序列关系的平行处理

冯敏萱

南京师范大学文学院，南京　210097

通讯作者: 冯敏萱

Abstract

Abstract: At present，the Chinese text processing in English-Chinese parallel corpus，more confined to only use monolingual analysis results，without sufficient use bilingual resources.Structural relation of contemporary Chinese v+n sequence is regarded as the research object，and the parallel processing algorithm is designed for recognizing v+n structural relation in English-Chinese parallel corpus.At first，this paper utilizes various form single language resources to extract the restriction rules of verb and noun that having different structural relations.And then judges v+n structural relation type separately according to translation of Chinese noun and verb，and context template in parallel English text.The experiment proves，in PCCE1000 which having been word-segmented and POS-tagged，F value that using single language resources to process v+n is 72.14%，and further utilizing the Chinese-English dictionary and English translation information，F value has reached 88.81%，having improved by 16.67 percentage points.

Key words: parallel corpus, collocation, phrase analysis, automatic recognition, Chinese information processing

摘要： 目前，在英汉平行语料中，对汉语文本的深加工多局限于只利用单语分析的成果，没有充分利用双语资源。以现代汉语v+n序列的结构关系为研究对象，设计出在英汉平行语料中识别v+n结构关系的平行处理算法：首先利用各种单语资源，提取出构成不同结构关系的动词和名词相互间的制约规则，再分别依据v+n中汉语名词、动词的语义在英语译文中的具体形式及上下文模板来判断v+n的结构关系类型。实验证明，在自动分词和词性标注的PCCE1000文本中，v+n单语处理的F值为72.14%，而进一步利用汉英词典和英语译文信息，F值到达了88.81%，提高了16.67个百分点。

关键词: 平行语料, 词语搭配, 短语分析, 自动识别, 中文信息处理

CLC Number:

TP391.1

FENG Min-xuan. Parallel processing of contemporary Chinese “V+N” sequence relations[J]. Computer Engineering and Applications, 2010, 46(30): 8-10.

冯敏萱. 现代汉语“V+N”序列关系的平行处理[J]. 计算机工程与应用, 2010, 46(30): 8-10.

[1]	SHAO Wenting, ZHENG Shuoyu. Width Estimation of Singular Perturbed Interior Layer Problem and Its Numerical Solution [J]. Computer Engineering and Applications, 2020, 56(4): 44-49.
[2]	ZHAO Yan, ZUO Baoqi. Analysis on Application of Machine Vision in Fabric Defect Detection [J]. Computer Engineering and Applications, 2020, 56(2): 11-17.
[3]	LIU Libin1，2, LONG Guangqing1，2, SHANGGUAN Zhenping1. Differential evolution and rational spectral methods for singularly perturbed problems [J]. Computer Engineering and Applications, 2018, 54(17): 225-230.
[4]	SUN Jinggao1，2, ZHAN Zihe1，2. Multi-stage nonlinear model polymerization reaction predictive control based on direct radau configuration [J]. Computer Engineering and Applications, 2018, 54(12): 244-250.
[5]	LU Zhiying1, DIAO Changying1, LU Huanzhen2, JIA Huizhen2. Automatic recognition and drawing of low-level jet based on wind field information of MICAPS [J]. Computer Engineering and Applications, 2017, 53(8): 230-234.
[6]	LI Caiyan, WANG Huiqin, WU Meng, PAN Sicheng. Automatic recognition and virtual restoration of mud spot disease of Tang dynasty tomb murals image [J]. Computer Engineering and Applications, 2016, 52(15): 233-236.
[7]	HU Jinzhu1, SHU Jiangbo2, HU Quan3, LI Yuan1, YANG Jincai1, XIE Fang4. Research on expression method of rules in auto-identifying relational word of Chinese compound sentences [J]. Computer Engineering and Applications, 2016, 52(1): 127-132.
[8]	GULIZADA·Haisa1, GULILA·Altenbek2，3. Research on automatic identification of base verb phrases in Kazakh [J]. Computer Engineering and Applications, 2015, 51(2): 218-223.
[9]	Mairehaba Aili1，2, Aziguli Xialifu3, Tuergen Yibulayin1，2. Research on extracting methods of multi word expression in Uyghur texts [J]. Computer Engineering and Applications, 2014, 50(8): 26-30.
[10]	TONG Xiaohong, QIN Xinqiang. Domain decomposition method for Helmholtz equations based on ridge basis function [J]. Computer Engineering and Applications, 2013, 49(13): 40-42.
[11]	CHENG Nanchang1，2, HOU Min3. Parallel corpus retrieval technology research [J]. Computer Engineering and Applications, 2012, 48(31): 134-139.
[12]	YAN Rong, GAO Guanglai. Word sense disambiguation based on word semantic relevancy computation [J]. Computer Engineering and Applications, 2012, 48(27): 109-113.
[13]	FAN Xinghua, WANG Peng, ZHOU Peng. Two-step text orientation identification based on feature extension [J]. Computer Engineering and Applications, 2012, 48(1): 162-165.
[14]	XU Runhua1，FENG Minxuan2，CHEN Xiaohe3. Automatic acquisition?and?recognition of two word-collocation in treebank [J]. Computer Engineering and Applications, 2011, 47(28): 17-20.
[15]	HU Xiaodong¹，LUO Jiancheng¹，WU Wei¹，CHENG Xi¹，SHEN Zhanfeng¹，JIA Yinfang². Automatic recognition method of chalk handwritten numerals [J]. Computer Engineering and Applications, 2011, 47(2): 182-184.

Parallel processing of contemporary Chinese “V+N” sequence relations

现代汉语“V+N”序列关系的平行处理

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics