Computer Engineering and Applications ›› 204, Vol. 60 ›› Issue (17): 148-157.DOI: 10.3778/j.issn.1002-8331.2306-0254

• Pattern Recognition and Artificial Intelligence • Previous Articles     Next Articles

Self-Supervised Graph Representation Learning Method Based on Data and Feature Augmentation

XU Yunfeng, FAN Hexun   

  1. School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang 050000, China
  • Online:2024-09-01 Published:2024-08-30

基于数据与特征增强的自监督图表示学习方法

许云峰,范贺荀   

  1. 河北科技大学 信息科学与工程学院,石家庄 050000

Abstract: Graph representation learning plays a crucial role in handling graph data structures, but it faces a significant challenge of heavy reliance on labeled information. To overcome this challenge, a novel self-supervised graph representation learning framework is proposed. By leveraging contrastive learning methods, it integrates the structural and attribute information of the original graph, as well as the high- and low-frequency information in the spectral domain, enhancing the preserved node information. Additionally, residual fusion and unbiased feature augmentation are employed to ensure feature effectiveness while further reducing bias in augmented samples. Moreover, in the contrastive part, the probability of negating the samples as true is estimated, and weights are used to measure the hardness and similarity of negations. Experiments on three public datasets prove that the performance in the downstream tasks of node classification is not only better than the current state-of-the-art unsupervised methods but also surpasses previous supervised methods in most tasks.

Key words: self-supervised learning, graph contrastive learning, feature augmentation, node classification, graph representation learning

摘要: 图表示学习在处理图数据结构中起着非常重要的作用,但它面临着严重依赖于标记信息的挑战。为了克服这一挑战,提出了一种新的自监督图表示学习框架,通过使用对比学习方法,融合原始图的结构与属性以及频谱的高低频信息,在保留节点信息的基础上进行增强。同时,利用残差融合机制和无偏特征增强方法,在保证特征有效性的同时进一步减少增强样本的偏差。此外,在对比部分估计负样本为真的概率,并使用权重来度量负样本的硬度和相似度。通过在3个公开数据集上实验证明,在节点分类的下游任务中表现不仅优于当前最先进的无监督方法,而且还在多数任务中超过了以往的有监督方法。

关键词: 自监督学习, 图对比学习, 特征增强, 节点分类, 图表示学习