用于图神经建议的空间自回旋编码

论文标题

用于图神经建议的空间自回旋编码

Spatial Autoregressive Coding for Graph Neural Recommendation

论文作者

Zheng, Jiayi, Yang, Ling, Wang, Heyuan, Yang, Cheng, Li, Yinghong, Hu, Xiaowei, Hong, Shenda

论文摘要

图形嵌入方法在内，包括传统浅层模型和深图神经网络（GNN）已导致有希望的应用。然而，由于其优化范式，浅层模型尤其是基于随机步行的算法无法充分利用采样子图或序列中的邻居接近度。基于GNN的算法在堆叠过多的层时很容易利用高阶信息，并且很容易引起过度平滑的问题，这可能会恶化低度（长尾）项目的建议，从而限制了表现力和可伸缩性。在本文中，我们提出了一个新颖的框架SAC，即空间自回归编码，以统一的方式解决上述问题。为了充分利用邻居接近和高级信息，我们设计了一种新型的空间自回旋范式。具体而言，我们首先随机掩盖了多跳邻居，并通过以明确的多跳高关注整合所有其他周围邻居来嵌入目标节点。然后，我们通过对比编码和蒙面邻居的嵌入来加强模型，以学习目标节点的邻居预测性编码，并配备了新的硬性负面采样策略。为了了解目标到邻居的预测任务的足够足够的表示并删除邻居的冗余，我们通过最大程度地提高目标预测性编码和蒙面邻居的嵌入，同时约束编码和周围邻居的胚胎之间的相互信息来设计邻居信息瓶颈。公共建议数据集和实际场景网络规模数据集Douyin-Friend-Recormendation的实验结果证明了SAC的优势与最先进的方法相比。

Graph embedding methods including traditional shallow models and deep Graph Neural Networks (GNNs) have led to promising applications in recommendation. Nevertheless, shallow models especially random-walk-based algorithms fail to adequately exploit neighbor proximity in sampled subgraphs or sequences due to their optimization paradigm. GNN-based algorithms suffer from the insufficient utilization of high-order information and easily cause over-smoothing problems when stacking too much layers, which may deteriorate the recommendations of low-degree (long-tail) items, limiting the expressiveness and scalability. In this paper, we propose a novel framework SAC, namely Spatial Autoregressive Coding, to solve the above problems in a unified way. To adequately leverage neighbor proximity and high-order information, we design a novel spatial autoregressive paradigm. Specifically, we first randomly mask multi-hop neighbors and embed the target node by integrating all other surrounding neighbors with an explicit multi-hop attention. Then we reinforce the model to learn a neighbor-predictive coding for the target node by contrasting the coding and the masked neighbors' embedding, equipped with a new hard negative sampling strategy. To learn the minimal sufficient representation for the target-to-neighbor prediction task and remove the redundancy of neighbors, we devise Neighbor Information Bottleneck by maximizing the mutual information between target predictive coding and the masked neighbors' embedding, and simultaneously constraining those between the coding and surrounding neighbors' embedding. Experimental results on both public recommendation datasets and a real scenario web-scale dataset Douyin-Friend-Recommendation demonstrate the superiority of SAC compared with state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题