论文标题

线性运行时的核心边缘分区

Out-of-Core Edge Partitioning at Linear Run-Time

论文作者

Mayer, Ruben, Orujzade, Kamil, Jacobsen, Hans-Arno

论文摘要

图边缘分区是在图形结构数据上优化分布式计算作业的重要预处理步骤。给定图的边缘集分为$ k $均等的分区,以便将跨分区的顶点复制最小化。核心边缘分区算法能够通过低内存开销来解决问题。 Exsistist Out Out Out算法主要以流方式工作,可以分为两种类型。尽管\ emph {无状态}流缘边缘分区速度很快,并且可以产生低分配的质量,但状态流缘分配的质量更高,但价格昂贵,因为它需要对每个分区的每个边缘进行评估功能,从而导致$ \ natercal {o}(o}(o}(o}(e e |*k))的时间复杂性。在本文中,我们提出了2PS-L,这是一种基于状态流模型的新颖核心边缘分区算法,但是实现了线性运行时(即$ \ MATHCAL {O}(O}(| e |)$)。 2PS-L由两个阶段组成。在第一阶段,将顶点通过轻巧的流聚类算法分为簇。在第二阶段,将图形重新流程,并从第一阶段进行顶点聚类,以将图形分区的搜索空间减少到每个边缘的两个目标分区。我们的评估表明,与现有的状态流式边缘分区者相比,2PS-L可以实现更好的分配质量,而运行时间要低得多。结果,可以大大减少分区和随后的分布图处理的总运行时间。

Graph edge partitioning is an important preprocessing step to optimize distributed computing jobs on graph-structured data. The edge set of a given graph is split into $k$ equally-sized partitions, such that the replication of vertices across partitions is minimized. Out-of-core edge partitioning algorithms are able to tackle the problem with low memory overhead. Exsisting out-of-core algorithms mainly work in a streaming manner and can be grouped into two types. While \emph{stateless} streaming edge partitioning is fast and yields low partitioning quality, stateful streaming edge partitioning yields better quality, but is expensive, as it requires a scoring function to be evaluated for every edge on every partition, leading to a time complexity of $\mathcal{O}(|E|*k)$. In this paper, we propose 2PS-L, a novel out-of-core edge partitioning algorithm that builds upon the stateful streaming model, but achieves linear run-time (i.e., $\mathcal{O}(|E|)$). 2PS-L consists of two phases. In the first phase, vertices are separated into clusters by a lightweight streaming clustering algorithm. In the second phase, the graph is re-streamed and vertex clustering from the first phase is exploited to reduce the search space of graph partitioning to only two target partitions for every edge. Our evaluations show that 2PS-L can achieve better partitioning quality than existing stateful streaming edge partitioners while having a much lower run-time. As a consequence, the total run-time of partitioning and subsequent distributed graph processing can be significantly reduced.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源