Paper Title
How Expressive are Transformers in Spectral Domain for Graphs?
Paper Authors
Paper Abstract
Recent works proposing transformer-based models for graphs have demonstrated the inadequacy of the vanilla Transformer for graph representation learning. To understand this inadequacy, there is a need to investigate whether spectral analysis of the transformer reveals insights into its expressive power. Similar studies have already established that spectral analysis of graph neural networks (GNNs) provides extra perspectives on their expressiveness. In this work, we systematically study and establish the link between the spatial and spectral domains in the realm of the transformer. We further provide a theoretical analysis and prove that the spatial attention mechanism in the transformer cannot effectively capture the desired frequency response, thus inherently limiting its expressiveness in spectral space. Therefore, we propose FeTA, a framework that performs attention over the entire graph spectrum (i.e., the actual frequency components of the graph), analogous to attention in the spatial domain. Empirical results suggest that FeTA provides consistent performance gains over the vanilla Transformer across all tasks on standard benchmarks and can easily be extended to GNN-based models with low-pass characteristics (e.g., GAT).
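To make the core idea concrete, below is a minimal sketch in Python/NumPy of what "attention over the graph spectrum" could look like: node features are mapped to the frequency domain via the eigenvectors of the normalized graph Laplacian (the graph Fourier transform), attention reweights the frequency components, and the result is mapped back to the spatial domain. This is not the authors' implementation of FeTA; the function names (`graph_fourier_basis`, `spectral_attention`) and the exact single-head attention form are illustrative assumptions.

```python
import numpy as np

def graph_fourier_basis(A):
    """Eigendecomposition of the normalized graph Laplacian
    L = I - D^{-1/2} A D^{-1/2}. Returns the eigenvalues (graph
    frequencies) and eigenvectors (graph Fourier basis)."""
    d = A.sum(axis=1)
    d_inv_sqrt = np.zeros_like(d)
    nz = d > 0
    d_inv_sqrt[nz] = 1.0 / np.sqrt(d[nz])
    L = np.eye(A.shape[0]) - d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]
    freqs, U = np.linalg.eigh(L)  # L = U diag(freqs) U^T
    return freqs, U

def spectral_attention(A, X, Wq, Wk):
    """Single-head attention over frequency components (illustrative).
    X_hat = U^T X is the signal per frequency; attention mixes
    frequencies before the inverse transform back to node space."""
    _, U = graph_fourier_basis(A)
    X_hat = U.T @ X                          # graph Fourier transform, (n, f)
    q, k = X_hat @ Wq, X_hat @ Wk            # queries/keys per frequency
    scores = q @ k.T / np.sqrt(q.shape[1])   # (n, n) frequency-to-frequency
    attn = np.exp(scores - scores.max(axis=1, keepdims=True))
    attn /= attn.sum(axis=1, keepdims=True)  # row-wise softmax
    X_hat_new = attn @ X_hat                 # reweighted frequency components
    return U @ X_hat_new                     # inverse transform to nodes

# Usage on a 3-node path graph with 4-dimensional features:
A = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
X = np.random.randn(3, 4)
Wq, Wk = np.random.randn(4, 4), np.random.randn(4, 4)
out = spectral_attention(A, X, Wq, Wk)       # (3, 4) updated node features
```

In contrast to spatial attention, which mixes node representations and (per the paper's analysis) cannot realize arbitrary frequency responses, this construction lets the model emphasize or suppress individual frequency bands of the graph signal directly.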