Paper Title

Hermes: a Fast, Fault-Tolerant and Linearizable Replication Protocol

Authors

Katsarakis, A., Gavrielatos, V., Katebzadeh, M., Joshi, A., Dragojevic, A., Grot, B., Nagarajan, V.

Abstract

Today's datacenter applications are underpinned by datastores that are responsible for providing availability, consistency, and performance. For high availability in the presence of failures, these datastores replicate data across several nodes. This is accomplished with the help of a reliable replication protocol that is responsible for maintaining the replicas strongly-consistent even when faults occur. Strong consistency is preferred to weaker consistency models that cannot guarantee an intuitive behavior for the clients. Furthermore, to accommodate high demand at real-time latencies, datastores must deliver high throughput and low latency. This work introduces Hermes, a broadcast-based reliable replication protocol for in-memory datastores that provides both high throughput and low latency by enabling local reads and fully-concurrent fast writes at all replicas. Hermes couples logical timestamps with cache-coherence-inspired invalidations to guarantee linearizability, avoid write serialization at a centralized ordering point, resolve write conflicts locally at each replica (hence ensuring that writes never abort) and provide fault-tolerance via replayable writes. Our implementation of Hermes over an RDMA-enabled reliable datastore with five replicas shows that Hermes consistently achieves higher throughput than state-of-the-art RDMA-based reliable protocols (ZAB and CRAQ) across all write ratios while also significantly reducing tail latency. At 5% writes, the tail latency of Hermes is 3.6X lower than that of CRAQ and ZAB.
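
The mechanism the abstract describes (logical timestamps coupled with invalidations, local reads, and conflict resolution at each replica) can be illustrated with a small single-process sketch. This is not the authors' implementation: the class `Replica`, the helper `make_cluster`, the method names, and the two-state model (`VALID` / `INVALID`) are assumptions made for illustration, and the sketch omits networking, RDMA, failures, and write replay entirely. It assumes a timestamp is a `(version, node_id)` pair compared lexicographically, so concurrent writes are totally ordered without a central ordering point.

```python
from enum import Enum


class State(Enum):
    VALID = "valid"      # value may be read locally
    INVALID = "invalid"  # a write is in flight; local reads must wait


class Replica:
    """One replica holding a single key (hypothetical, simplified model)."""

    def __init__(self, node_id):
        self.node_id = node_id
        self.value = None
        self.ts = (0, 0)  # (version, node_id); tuples compare lexicographically
        self.state = State.VALID
        self.peers = []

    def read(self):
        # Reads are served locally, but only while the key is valid.
        if self.state is not State.VALID:
            raise RuntimeError("key is invalidated; read must wait")
        return self.value

    def on_invalidate(self, value, ts):
        # Apply the write only if its timestamp is newer than the local one.
        # Older (losing) concurrent writes are ignored but still acknowledged,
        # so no write ever has to abort.
        if ts > self.ts:
            self.value, self.ts, self.state = value, ts, State.INVALID
        return True  # acknowledgement

    def on_validate(self, ts):
        # Re-enable local reads only for the write that is currently applied.
        if ts == self.ts:
            self.state = State.VALID

    def write(self, value):
        # Any replica may coordinate a write: pick a fresh timestamp,
        # broadcast invalidations, then validate once every peer has acked.
        ts = (self.ts[0] + 1, self.node_id)
        self.on_invalidate(value, ts)
        acks = [peer.on_invalidate(value, ts) for peer in self.peers]
        if all(acks):
            self.on_validate(ts)
            for peer in self.peers:
                peer.on_validate(ts)
        return ts


def make_cluster(n):
    """Build n fully connected replicas (hypothetical helper)."""
    replicas = [Replica(i) for i in range(n)]
    for r in replicas:
        r.peers = [p for p in replicas if p is not r]
    return replicas
```

In this sketch, two concurrent writes carrying timestamps `(2, 0)` and `(2, 1)` leave every replica with the value of `(2, 1)` regardless of the order in which the invalidations arrive, because each replica compares timestamps locally. A validation for the losing write is simply ignored.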
