论文标题
链接预测的知识图嵌入:比较分析
Knowledge Graph Embedding for Link Prediction: A Comparative Analysis
论文作者
论文摘要
知识图(KGS)在行业和学术环境中发现了许多应用,这些应用程序又激发了大量研究工作,以从各种来源中提取大规模的信息。尽管做出了这样的努力,但众所周知,即使是最先进的公斤也不完整。链接预测(LP)是预测已经成为公斤的实体之间缺失事实的任务,是一个有前途且经过广泛研究的任务,旨在解决KG不完整。在最近的LP技术中,基于KG嵌入的技术在某些基准测试中取得了非常有希望的表演。尽管该主题具有快速成长的文献,但对这些方法中各种设计选择的影响的关注不足。此外,该领域的标准做法是通过汇总大量的测试事实来报告准确性,其中某些实体代表过多。这允许LP方法仅仅参与包括此类实体在内的结构特性,同时忽略其余大部分KG,从而表现出良好的性能。该分析提供了基于嵌入的LP方法的全面比较,从而将分析的维度扩展到了文献中常见的范围之外。我们通过实验比较16种最先进方法的有效性和效率,考虑基于规则的基准,并报告对文献中最流行的基准测试的详细分析。
Knowledge Graphs (KGs) have found many applications in industry and academic settings, which in turn, have motivated considerable research efforts towards large-scale information extraction from a variety of sources. Despite such efforts, it is well known that even state-of-the-art KGs suffer from incompleteness. Link Prediction (LP), the task of predicting missing facts among entities already a KG, is a promising and widely studied task aimed at addressing KG incompleteness. Among the recent LP techniques, those based on KG embeddings have achieved very promising performances in some benchmarks. Despite the fast growing literature in the subject, insufficient attention has been paid to the effect of the various design choices in those methods. Moreover, the standard practice in this area is to report accuracy by aggregating over a large number of test facts in which some entities are over-represented; this allows LP methods to exhibit good performance by just attending to structural properties that include such entities, while ignoring the remaining majority of the KG. This analysis provides a comprehensive comparison of embedding-based LP methods, extending the dimensions of analysis beyond what is commonly available in the literature. We experimentally compare effectiveness and efficiency of 16 state-of-the-art methods, consider a rule-based baseline, and report detailed analysis over the most popular benchmarks in the literature.