计算机工程与应用 ›› 2014, Vol. 50 ›› Issue (1): 127-129.

• 数据库、数据挖掘、机器学习 • 上一篇    下一篇

使用基于模式的Bootstrapping方法抽取情感词

王昌厚1,王  菲2   

  1. 1.晋中学院 计算机学院,山西 晋中 030600
    2.北京大学 计算语言所,北京 100871
  • 出版日期:2014-01-01 发布日期:2013-12-30

Extracting sentiment words using pattern based Bootstrapping method

WANG Changhou1, WANG Fei2   

  1. 1.School of Computer Science and Technology, Jinzhong University, Jinzhong, Shanxi 030600, China
    2.Institute of Computational Linguistics, Peking University, Beijing 100871, China
  • Online:2014-01-01 Published:2013-12-30

摘要: 情感评价词典在情感分析中具有非常重要的作用,在新词频发的网络环境中,识别新的情感评价词,完善现有的情感词典是非常有必要的。使用基于模式的Bootstrapping方法,在微博语料中抽取情感评价词。实验证明,在保持了较理想的精确率的情况下,上述方法抽取了数量可观的传统情感词典未收录的情感评价词。

关键词: 情感评价词, 模式, Bootstrapping方法

Abstract: Sentiment(or opinionated) lexicons play an important role in sentiment analysis. With the blooming of net neologisms, it is quite necessary to identify new sentiment words and improve current sentiment lexicons. This paper proposes a pattern based Bootstrapping method which extracts sentiment words from micro blogs. The experimental results validate the effectiveness of the method and large quantity of un-recorded sentiment words are extracted with reasonable precisions.

Key words: sentiment(or opinionated) word, pattern, Bootstrapping