论文标题
2-HOP标签方案的有效可及性比率计算
Efficient Reachability Ratio Computation for 2-hop Labeling Scheme
论文作者
论文摘要
作为基本的图形操作之一,在过去几十年中,已对可及性查询处理进行了广泛的研究。许多方法遵循了设计2-HOP标签以加速加速的线路。考虑到使用所有节点构建2-HOP标签时的索引大小不能受到界限,研究人员提议使用重要节点的一部分来构建2-Hop标签(部分2-Hop标签),以覆盖尽可能多的可及性信息。然后,我们可能会在有限的索引大小和索引构造时间内实现更好的查询性能。但是,部分2-HOP标签并不总是在不同的图表上表现良好。 在本文中,我们重点介绍了如何有效计算可及性比率的问题,以帮助用户确定是否应使用部分2-HOP标签来回答给定图形的可及性查询。直观地,可及性比表示可以通过局部2-Hop标签回答的可及查询数量的比率,而不是给定图中涉及的可及查询总数。我们讨论了可达比率计算的困难,并提出了一种增量分区算法,以实现可达性比率计算。我们通过丰富的实验结果表明,我们的算法可以有效地获得可及性比的结果,并显示整个查询性能如何受到不同部分2-HOP标签的影响。基于实验结果,我们提供了有关是否应将部分2-HOP标签用于给定图表以进行可及性查询处理的发现。
As one of the fundamental graph operations, reachability queries processing has been extensively studied during the past decades. Many approaches followed the line of designing 2-hop labels to make acceleration. Considering that the index size cannot be bounded when using all nodes to construct 2-hop labels, researchers proposed to use a part of important nodes to construct 2-hop labels (partial 2-hop labels) to cover as much reachability information as possible. Then, we may achieve better query performance with limited index size and index construction time. However, partial 2-hop labels do not always perform well on different graphs. In this paper, we focus on the problem of how to efficiently compute reachability ratio, such that to help users determine whether partial 2-hop labels should be used to answer reachability queries for the given graph. Intuitively, reachability ratio denotes the ratio of the number of reachable queries that can be answered by partial 2-hop labels over the total number of reachable queries involved in the given graph. We discuss the difficulties of reachability ratio computation, and propose an incremental-partition algorithm for reachability ratio computation. We show by rich experimental results that our algorithm can efficiently get the result of reachability ratio, and show how the overall query performance is affected by different partial 2-hop labels. Based on the experimental results, we give out our findings on whether partial 2-hop labels should be used to the given graph for reachability queries processing.