实体对齐方式的知识图嵌入方法：实验综述

论文标题

实体对齐方式的知识图嵌入方法：实验综述

Knowledge Graph Embedding Methods for Entity Alignment: An Experimental Review

论文作者

Fanourakis, Nikolaos, Efthymiou, Vasilis, Kotzinos, Dimitris, Christophides, Vassilis

论文摘要

近年来，我们目睹了知识图的扩散（kg）在各个领域中的旨在支持诸如问答，建议等的应用程序等。在整合不同kg的知识时，经常执行任务是找到哪些子图指的是同一现实世界实体。最近，嵌入方法已用于实体对齐任务，该任务学习了实体的矢量空间表示，该实体保留了它们在原始kg中的相似性。已经提出了各种各样的监督，无监督和半监督的方法，这些方法既利用了KGS中实体的事实（基于属性）和基于属性的实体（基于属性）和结构信息（关系）。尽管如此，根据文献中缺少根据不同性能指标和KG特征对现实世界中的优势和缺点进行定量评估。在这项工作中，我们基于统计上合理的方法，对实体对齐方式流行的嵌入方法进行了首次元级分析。我们的分析揭示了不同嵌入方法的统计学意义与KGS提取的各种荟萃功能的统计学意义相关性，并根据我们测试床的所有现实世界中的有效性以统计学意义的方式对它们进行排名。最后，我们从方法的有效性和效率方面研究了有趣的权衡。

In recent years, we have witnessed the proliferation of knowledge graphs (KG) in various domains, aiming to support applications like question answering, recommendations, etc. A frequent task when integrating knowledge from different KGs is to find which subgraphs refer to the same real-world entity. Recently, embedding methods have been used for entity alignment tasks, that learn a vector-space representation of entities which preserves their similarity in the original KGs. A wide variety of supervised, unsupervised, and semi-supervised methods have been proposed that exploit both factual (attribute based) and structural information (relation based) of entities in the KGs. Still, a quantitative assessment of their strengths and weaknesses in real-world KGs according to different performance metrics and KG characteristics is missing from the literature. In this work, we conduct the first meta-level analysis of popular embedding methods for entity alignment, based on a statistically sound methodology. Our analysis reveals statistically significant correlations of different embedding methods with various meta-features extracted by KGs and rank them in a statistically significant way according to their effectiveness across all real-world KGs of our testbed. Finally, we study interesting trade-offs in terms of methods' effectiveness and efficiency.

下载PDF全文

下载文献需遵守相关版权规定

论文标题