Two-step text orientation identification based on feature extension

Computer Engineering and Applications ›› 2012, Vol. 48 ›› Issue (1): 162-165.

• Database, Signal and Information Processing •

FAN Xinghua, WANG Peng, ZHOU Peng

College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China

Online: 2012-01-01 Published: 2012-01-01

Abstract

Abstract: This paper presents a two-step text orientation analysis method based on feature extension. The method uses sentiment words, drawn from an orientation word list, a negation word list, and a degree adverb list, to extend the features of the training texts. It then constructs two classifiers, CF1 and CF2, which differ in whether sentiment words and content words are treated equally. At classification time, the features of each test text are extended in the same way as for the training texts, and the text is classified with CF1. If the CF1 result is reliable, it is accepted directly; if not, the text is classified a second time with CF2 and that result is used. Experimental results verify the effectiveness of the method.

Key words: Chinese information processing, feature extension, orientation identification, classifier construction
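
The two-step procedure described in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the toy English word lists, the `__POS__`/`__NEG__` pseudo-features, the naive Bayes model, the choice to let CF1 treat sentiment and content features equally while CF2 upweights sentiment features, and the log-score margin used as the reliability test are all hypothetical choices made for this example. The abstract does not specify any of them.

```python
import math
from collections import Counter

# Hypothetical toy lexicons; the paper's actual word lists are not given here.
POSITIVE = {"good", "great", "excellent"}
NEGATIVE = {"bad", "poor", "terrible"}
NEGATIONS = {"not", "never"}
DEGREES = {"very": 2, "extremely": 3}


def extend_features(tokens):
    """Append polarity pseudo-features derived from the three word lists."""
    extra = []
    negate, weight = False, 1
    for tok in tokens:
        if tok in NEGATIONS:
            negate = True
        elif tok in DEGREES:
            weight = DEGREES[tok]
        elif tok in POSITIVE or tok in NEGATIVE:
            # A pending negation flips polarity; a degree adverb repeats the feature.
            positive = (tok in POSITIVE) != negate
            extra += ["__POS__" if positive else "__NEG__"] * weight
            negate, weight = False, 1
    return list(tokens) + extra


class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing and a sentiment weight."""

    def __init__(self, sentiment_weight=1.0):
        self.sentiment_weight = sentiment_weight
        self.counts = {}       # label -> Counter of weighted feature counts
        self.docs = Counter()  # label -> number of training documents
        self.vocab = set()

    def _weighted(self, features):
        bag = Counter()
        for f in features:
            bag[f] += self.sentiment_weight if f.startswith("__") else 1.0
        return bag

    def fit(self, docs, labels):
        for feats, label in zip(docs, labels):
            bag = self._weighted(feats)
            self.counts.setdefault(label, Counter()).update(bag)
            self.docs[label] += 1
            self.vocab.update(bag)
        return self

    def log_scores(self, features):
        bag = self._weighted(features)
        total_docs = sum(self.docs.values())
        scores = {}
        for label, counts in self.counts.items():
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(self.docs[label] / total_docs)
            for f, n in bag.items():
                score += n * math.log((counts[f] + 1) / denom)
            scores[label] = score
        return scores


def classify_two_step(tokens, cf1, cf2, margin=1.0):
    """Accept CF1's answer if its score margin is large enough, else ask CF2."""
    feats = extend_features(tokens)
    s1 = cf1.log_scores(feats)
    best, runner_up = sorted(s1.values(), reverse=True)[:2]
    if best - runner_up >= margin:
        return max(s1, key=s1.get), "CF1"
    s2 = cf2.log_scores(feats)
    return max(s2, key=s2.get), "CF2"


# Tiny made-up training set, for illustration only.
train = [
    ("the screen is very good".split(), "pos"),
    ("battery life is excellent".split(), "pos"),
    ("the screen is bad".split(), "neg"),
    ("battery life is not good".split(), "neg"),
]
docs = [extend_features(tokens) for tokens, _ in train]
labels = [label for _, label in train]
cf1 = NaiveBayes(sentiment_weight=1.0).fit(docs, labels)  # sentiment = content
cf2 = NaiveBayes(sentiment_weight=3.0).fit(docs, labels)  # sentiment upweighted
label, stage = classify_two_step("the battery is not bad".split(), cf1, cf2)
print(label, stage)
```

Setting `margin` to 0 makes every CF1 decision count as reliable, while a very large `margin` sends every text to CF2; in practice the threshold would have to be tuned on held-out data.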

FAN Xinghua, WANG Peng, ZHOU Peng. Two-step text orientation identification based on feature extension[J]. Computer Engineering and Applications, 2012, 48(1): 162-165.
