GCN-PU: PU Text Classification Algorithm Based on Graph Convolutional Network

doi:10.3778/j.issn.1002-8331.2003-0195

Abstract

For PU (Positive and Unlabeled) text classification, a PU text classification algorithm based on a graph convolutional network, GCN-PU, is proposed. The basic idea is to assign a different loss weight to each unlabeled example. First, all unlabeled examples are treated as negative examples to train a text classifier based on a convolutional neural network. Then, the penultimate-layer vector of the convolutional neural network is taken as the feature vector of each text and, together with the corresponding class probability, is used as input to the graph convolutional network. Finally, the loss weight of each unlabeled example is calculated from the class probability produced by the graph convolutional network, and the text classifier is retrained. These three steps are repeated until the algorithm parameters are stable. Experimental results on the public dataset 20newsgroup show that the proposed algorithm outperforms existing methods, especially when there are few positive examples.

Key words: convolutional neural network, graph convolutional network, loss weight, PU text classification
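
The loss-weighting idea in the abstract can be illustrated with a small sketch. The abstract does not state how the weight is computed from the class probability, so the rule used here (weight = 1 minus the positive-class probability from the graph convolutional network) is an assumption for illustration only, and the function names `unlabeled_loss_weights` and `weighted_pu_loss` are hypothetical. The sketch shows a weighted binary cross-entropy in which labeled positives contribute with full weight and each unlabeled example, treated as a negative, contributes in proportion to its weight.

```python
import math


def unlabeled_loss_weights(gcn_pos_prob):
    """Map GCN positive-class probabilities to loss weights.

    ASSUMPTION: w_i = 1 - p_i. The abstract only says the weight is
    calculated from the class probability; this rule is illustrative.
    """
    return [1.0 - p for p in gcn_pos_prob]


def weighted_pu_loss(clf_prob_positive, clf_prob_unlabeled,
                     unlabeled_weights, eps=1e-12):
    """Weighted binary cross-entropy for a PU training set.

    clf_prob_positive:  classifier P(positive) for labeled positive examples
    clf_prob_unlabeled: classifier P(positive) for unlabeled examples
    unlabeled_weights:  one loss weight per unlabeled example
    """
    # Labeled positives: ordinary cross-entropy against label 1.
    pos_terms = [-math.log(p + eps) for p in clf_prob_positive]
    # Unlabeled examples: cross-entropy against label 0, scaled by weight.
    unl_terms = [-w * math.log(1.0 - p + eps)
                 for p, w in zip(clf_prob_unlabeled, unlabeled_weights)]
    n = len(pos_terms) + len(unl_terms)
    return (sum(pos_terms) + sum(unl_terms)) / n


# Hypothetical values: one labeled positive, two unlabeled examples.
gcn_probs = [0.9, 0.1]  # GCN output for the two unlabeled examples
weights = unlabeled_loss_weights(gcn_probs)
loss = weighted_pu_loss([0.8], [0.7, 0.2], weights)
print(weights, loss)
```

Under this assumed rule, an unlabeled example that the graph convolutional network considers likely positive is penalized less for being predicted positive, which softens the effect of treating every unlabeled example as a negative.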

YAO Jiaqi, XU Zhengguo, YAN Jikun, WANG Keren. GCN-PU: PU Text Classification Algorithm Based on Graph Convolutional Network[J]. Computer Engineering and Applications, 2021, 57(11): 162-167.
