论文标题
简化基于订单的核心维护的算法
Simplified Algorithms for Order-Based Core Maintenance
论文作者
论文摘要
图分析吸引了研究和行业社区的广泛关注。由于线性时间的复杂性,$ k $核的分解被广泛用于许多现实世界中的应用,例如生物学,社交网络,社区发现,生态学和信息传播。在许多这样的应用程序中,数据图随着时间的推移不断变化。这些变化对应于边缘插入和去除。我们没有重新计算耗时耗时的$ k $ core,而是研究如何有效地维护$ k $ core。也就是说,在插入或删除边缘时,我们需要通过搜索更多顶点来识别受影响的顶点。基于最新的订单方法在所有顶点中维护订单,即所谓的$ k $ order,这可以大大降低搜索空间。但是,这种基于订单的方法对于理解和实施很复杂,并且没有正式讨论其正确性。在这项工作中,我们通过引入经典的订单数据结构来维护$ K $ - 订单,从而提出了一种简化的基于订单的方法,从而显着提高了边缘插入和去除算法的最差时间复杂性。另外,我们简化的方法是直观的理解和实施。正式地争论正确性很容易。此外,我们讨论了一种简化的批处理插入方法。该实验评估了我们的12个真实和合成图的简化方法,并具有数十亿个顶点。与现有方法相比,我们的简化方法分别实现了高达7.7倍和9.7倍的高速加速,分别用于边缘插入和去除。
Graph analytics attract much attention from both research and industry communities. Due to the linear time complexity, the $k$-core decomposition is widely used in many real-world applications such as biology, social networks, community detection, ecology, and information spreading. In many such applications, the data graphs continuously change over time. The changes correspond to edge insertion and removal. Instead of recomputing the $k$-core, which is time-consuming, we study how to maintain the $k$-core efficiently. That is, when inserting or deleting an edge, we need to identify the affected vertices by searching for more vertices. The state-of-the-art order-based method maintains an order, the so-called $k$-order, among all vertices, which can significantly reduce the searching space. However, this order-based method is complicated for understanding and implementation, and its correctness is not formally discussed. In this work, we propose a simplified order-based approach by introducing the classical Order Data Structure to maintain the $k$-order, which significantly improves the worst-case time complexity for both edge insertion and removal algorithms. Also, our simplified method is intuitive to understand and implement; it is easy to argue the correctness formally. Additionally, we discuss a simplified batch insertion approach. The experiments evaluate our simplified method over 12 real and synthetic graphs with billions of vertices. Compared with the existing method, our simplified approach achieves high speedups up to 7.7x and 9.7x for edge insertion and removal, respectively.