论文标题

有效的主机和远程共享内存的编排,用于内存密集的工作负载

Efficient Orchestration of Host and Remote Shared Memory for Memory Intensive Workloads

论文作者

Bae, Juhyun, Su, Gong, Iyengar, Arun, Wu, Yanzhao, Liu, Ling

论文摘要

由于已经做出了有效管理主机和远程空闲内存的统一内存编排框架的贡献,因此我们提出了代客,这是一种有效的主机编排和远程共享内存的方法,以提高内存密集型工作负载的性能。该论文做出了三个原始贡献。首先,我们通过引入一个主机协调的内存池来重新设计数据流,该存储池可作为本地缓存,以减少主机和远程存储器编排的关键路径中的延迟。其次,代客通过VICHET主机协调的内存池来管理本地存储器,利用跨容器的本地内存,该内存允许容器根据工作负载需求动态扩展和收缩内存分配。第三,代客基于两个优化提供了有效的远程内存回收技术:(1)基于活动的受害者选择方案,允许选择最小活跃的数据来选择驱逐请求,以及(2)将数据量最小的数据移动到较小的模式压力压力压力较小的远程远程发射方案。结果,代客可以有效地减少对本地节点的绩效影响和迁移开销。我们对NOSQL系统和机器学习(ML)工作负载的广泛实验表明,代表具有高达226倍吞吐量的现有代表性远程分页系统,并且在常规的OS交换设施中,对于大数据和ML工作负载,高达98%的延迟降低了,并且最多可高达5.5 x逐步浏览量,并且最大高达5.5 x越过的速度超过了78%。代客在https://github.com/git-disl/valet上开放。

Since very few contributions to the development of an unified memory orchestration framework for efficient management of both host and remote idle memory have been made, we present Valet, an efficient approach to orchestration of host and remote shared memory for improving performance of memory intensive workloads. The paper makes three original contributions. First, we redesign the data flow in the critical path by introducing a host-coordinated memory pool that works as a local cache to reduce the latency in the critical path of the host and remote memory orchestration. Second, Valet utilizes unused local memory across containers by managing local memory via Valet host-coordinated memory pool, which allows containers to dynamically expand and shrink their memory allocations according to the workload demands. Third, Valet provides an efficient remote memory reclaiming technique on remote peers, based on two optimizations: (1) an activity-based victim selection scheme to allow the least-active-chunk of data to be selected for serving the eviction requests and (2) a migration protocol to move the least-active-chunk of data to less-memory-pressured remote node. As a result, Valet can effectively reduce the performance impact and migration overhead on local nodes. Our extensive experiments on both NoSQL systems and Machine Learning (ML) workloads show that Valet outperforms existing representative remote paging systems with up to 226X throughput improvement and up to 98% latency decrease over conventional OS swap facility for big data and ML workloads, and by up to 5.5X throughput improvement and up to 78.4% latency decrease over the state-of-the-art remote paging systems. Valet is open sourced at https://github.com/git-disl/Valet.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源