具有数据效率学习的位置感知子图神经网络

论文标题

具有数据效率学习的位置感知子图神经网络

Position-Aware Subgraph Neural Networks with Data-Efficient Learning

论文作者

Liu, Chang, Yang, Yuwen, Xie, Zhe, Lu, Hongtao, Ding, Yue

论文摘要

在现实世界应用中，对图（GEL）的数据有效学习至关重要。现有的凝胶方法着重于学习有用的节点，边缘或具有``小型''标记数据的整个图表的有用表示。但是，尚未探讨用于子图预测的数据有效学习的问题。这个问题的挑战在于以下方面：1）学习位置特征以在其存在的基本图中获取结构信息至关重要。尽管现有的子图神经网络方法能够学习分离的位置编码，但总体计算复杂性非常高。 2）凝胶的流行图扩大方法，包括基于规则的，基于样本的，自适应和自动化方法，不适合增强子图，因为子刻度包含较少的节点，但更丰富的信息，例如位置，邻居和结构。亚图扩大更容易受到不良扰动的影响。 3）基础图中仅包含少数节点，这导致了潜在的``偏差''问题，即子图表示学习以这些``热''节点主导。相比之下，其余的节点无法充分学习，从而降低了子图表示学习的概括能力。在本文中，我们旨在应对上述挑战，并为称为PADEL的子图神经网络提出了一个能力的数据有效学习框架。具体而言，我们提出了一种不含锚定的新节点位置编码方法，并根据扩散的变性子图自动编码器设计了一种新的生成子图扩展方法，我们提出了子图对比相对于偏见学习的探索和可利用的视图。三个现实世界数据集的广泛实验结果表明，我们提出的方法优于最先进的基准。

Data-efficient learning on graphs (GEL) is essential in real-world applications. Existing GEL methods focus on learning useful representations for nodes, edges, or entire graphs with ``small'' labeled data. But the problem of data-efficient learning for subgraph prediction has not been explored. The challenges of this problem lie in the following aspects: 1) It is crucial for subgraphs to learn positional features to acquire structural information in the base graph in which they exist. Although the existing subgraph neural network method is capable of learning disentangled position encodings, the overall computational complexity is very high. 2) Prevailing graph augmentation methods for GEL, including rule-based, sample-based, adaptive, and automated methods, are not suitable for augmenting subgraphs because a subgraph contains fewer nodes but richer information such as position, neighbor, and structure. Subgraph augmentation is more susceptible to undesirable perturbations. 3) Only a small number of nodes in the base graph are contained in subgraphs, which leads to a potential ``bias'' problem that the subgraph representation learning is dominated by these ``hot'' nodes. By contrast, the remaining nodes fail to be fully learned, which reduces the generalization ability of subgraph representation learning. In this paper, we aim to address the challenges above and propose a Position-Aware Data-Efficient Learning framework for subgraph neural networks called PADEL. Specifically, we propose a novel node position encoding method that is anchor-free, and design a new generative subgraph augmentation method based on a diffused variational subgraph autoencoder, and we propose exploratory and exploitable views for subgraph contrastive learning. Extensive experiment results on three real-world datasets show the superiority of our proposed method over state-of-the-art baselines.

下载PDF全文

下载文献需遵守相关版权规定

论文标题