论文标题

缩放强烈一致的复制

Scaling Strongly Consistent Replication

论文作者

Charapko, Aleksey, Ailijiang, Ailidani, Demirbas, Murat

论文摘要

强大的一致性复制有助于使应用程序逻辑保持简单,并为正确性和可管理性提供了重大好处。不幸的是,由于其可扩展性和性能有限,因此采用强烈的复制协议已被遏制。为了减轻在强烈的复制方案中的领导者瓶颈,我们引入了Pig,这是一种协议内的沟通聚合和背包技术。 Pig使用从追随者子组中随机选择的节点将领导者的消息传达给子组中的其余关注者,并从这些关注者中执行确认的网络内聚合。通过将继电器节点随机交替跨复制操作,Pig屏蔽了继电器节点以及领导者成为热点并改善吞吐量可伸缩性。 我们在许多云计算服务和数据库中使用的经典Paxos协议的背景下展示了猪。与Paxos和Epaxos协议相比,我们实施和评估Pigpaxos在5至25个节点的群集上的各种工作负载下。我们表明,继电器的聚集几乎没有延迟的开销,Pigpaxos可以提供3倍以上的吞吐量,而吞吐量几乎没有延迟,而paxos和Epaxos几乎没有延迟。我们通过瓶颈的分析建模来支持我们的实验观察结果,并表明继电器节点的旋转为减少瓶颈提供了最大的好处,并且在仅使用1个随机旋转中继节点时,吞吐量是最大化的。

Strong consistency replication helps keep application logic simple and provides significant benefits for correctness and manageability. Unfortunately, the adoption of strongly-consistent replication protocols has been curbed due to their limited scalability and performance. To alleviate the leader bottleneck in strongly-consistent replication protocols, we introduce Pig, an in-protocol communication aggregation and piggybacking technique. Pig employs randomly selected nodes from follower subgroups to relay the leader's message to the rest of the followers in the subgroup, and to perform in-network aggregation of acknowledgments back from these followers. By randomly alternating the relay nodes across replication operations, Pig shields the relay nodes as well as the leader from becoming hotspots and improves throughput scalability. We showcase Pig in the context of classical Paxos protocols employed for strongly consistent replication by many cloud computing services and databases. We implement and evaluate PigPaxos, in comparison to Paxos and EPaxos protocols under various workloads over clusters of size 5 to 25 nodes. We show that the aggregation at the relay has little latency overhead, and PigPaxos can provide more than 3 folds improved throughput over Paxos and EPaxos with little latency deterioration. We support our experimental observations with the analytical modeling of the bottlenecks and show that the rotating of the relay nodes provides the most benefit for reducing the bottlenecks and that the throughput is maximized when employing only 1 randomly rotating relay node.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源