Sentiment analysis method based on pre-training Convolutional Neural Networks by distant supervision

doi:10.3778/j.issn.1002-8331.1804-0208

Abstract

Abstract: Traditional researches of sentiment analysis are mostly based on machine learning algorithm, which rely on a huge number of artificially extracted features and domain knowledge. Convolution neural network is used to automatically learn the characteristics of texts and then identify the sentiment polarity of them. In order to solve the problem of insufficient supervision training dataset in sentiment analysis, the large-scale distant supervision data the used to train convolution neural network. At the same time, the “pre-train-fine-tune” strategy is used to overcome the noises in the distant supervision data, by pre-training convolution neural network on the distant supervision data and then fine-tuning it on the supervision dataset. Experimental results on the SemEval-2013 Twitter sentiment analysis dataset show that the ability of convolutional neural network to learn emotion semantics is enhanced effectively by using distant supervision data to participate in the training.

Key words: sentiment analysis, distant supervision, pre-train-fine-tune, Convolutional Neural Networks（CNN）

摘要： 传统的情感分析研究大多基于机器学习算法，此类方法依赖大量人工抽取的特征与领域知识。使用卷积神经网络自动学习文本的特征表示，进而判别文本的情感极性。为了解决情感分析中监督训练样本不足的问题，利用大规模弱监督数据来训练卷积神经网络。同时引入“预训练-微调”策略，先在弱监督数据集上对卷积神经网络进行预训练，然后使用监督数据集进行微调训练来克服弱监督数据中的噪声问题。在SemEval-2013 Twitter情感分析数据集上进行实验验证，结果表明由于引入了弱监督数据参与训练，有效增强了卷积神经网络学习情感语义的能力，从而提升了模型的准确性。

关键词: 情感分析, 弱监督, 预训练-微调, 卷积神经网络

ZHANG Yue1，2， XIA Hongbin1，2. Sentiment analysis method based on pre-training Convolutional Neural Networks by distant supervision[J]. Computer Engineering and Applications, 2018, 54(13): 27-33.

张越1，2，夏鸿斌1，2. 基于弱监督预训练CNN模型的情感分析方法[J]. 计算机工程与应用, 2018, 54(13): 27-33.

[1]	LI Junxia, ZHANG Qin, ZHENG Guimei. Overview of Human Posture Recognition by Ultra-wideband Radar [J]. Computer Engineering and Applications, 2021, 57(3): 14-23.
[2]	YANG Shanliang, CHANG Zheng. Chinese Implicit Sentiment Analysis Based on Graph Attention Neural Network [J]. Computer Engineering and Applications, 2021, 57(24): 161-167.
[3]	YUAN Xun, LIU Rong, LIU Ming. Aspect-Level Sentiment Analysis Model Incorporating Multi-layer Attention [J]. Computer Engineering and Applications, 2021, 57(22): 147-152.
[4]	ZHAO Lihua, WANG Chunli, CHU Yufeng. Attention-Based Double BiReGU Model for Aspect Term Extraction [J]. Computer Engineering and Applications, 2021, 57(22): 160-165.
[5]	LI Wenliang, YANG Qiuxiang, QIN Quan. Multi-feature Mixed Model Text Sentiment Analysis Method [J]. Computer Engineering and Applications, 2021, 57(19): 205-213.
[6]	HU Renyuan, LIU Jianhua, BU Guannan, ZHANG Dongyang, LUO Yixuan. Research on Sentiment Analysis of Multi-level Semantic Collaboration Model Fused with BERT [J]. Computer Engineering and Applications, 2021, 57(13): 176-184.
[7]	WANG Ting, YANG Wenzhong. Review of Text Sentiment Analysis Methods [J]. Computer Engineering and Applications, 2021, 57(12): 11-24.
[8]	KANG Yue, XUE Huizhen, HUA Bin. Analysis of Fine-Grained Commodity Evaluation for Deep Learning Network [J]. Computer Engineering and Applications, 2021, 57(11): 140-147.
[9]	ZHANG Ren, HE Ning. A Survey of Micro-Expression Recognition Methods [J]. Computer Engineering and Applications, 2021, 57(1): 38-47.
[10]	HU Can, CUI Xiaohui. Research on SNS Users’ Posting Behavior and Interest Prediction [J]. Computer Engineering and Applications, 2020, 56(9): 99-105.
[11]	CAO Junbo，YE Xia，XU Feixiang，YIN Liedong. Improved CBOW Emotional Information Acquisition Research [J]. Computer Engineering and Applications, 2020, 56(9): 142-147.
[12]	LI Jinyuan, KANG Yan, YANG Qiyue, WANG Peiyao, CUI Guorong. Aspect-Based Memory Network for Fine-Grained Product Sentiment Analysis [J]. Computer Engineering and Applications, 2020, 56(3): 159-164.
[13]	TU Wenbo, YUAN Zhenming, YU Kai. Convolutional Neural Networks Without Pooling Layer for Chinese Word Segmentation [J]. Computer Engineering and Applications, 2020, 56(2): 120-126.
[14]	HAN Hu, LIU Guoli. Interactive Attention Networks for Target-Based Sentiment Analysis [J]. Computer Engineering and Applications, 2020, 56(18): 104-110.
[15]	CHEN Yuting, LIU Xuhong, LIU Xiulei. Research on Entity Relation Extraction Based on Distant Supervision in Bidding Field [J]. Computer Engineering and Applications, 2020, 56(17): 243-250.

Sentiment analysis method based on pre-training Convolutional Neural Networks by distant supervision

基于弱监督预训练CNN模型的情感分析方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles 0

Metrics