论文标题

Trireme:探索域特定硬件加速的分层多级并行性

Trireme: Exploring Hierarchical Multi-Level Parallelism for Domain Specific Hardware Acceleration

论文作者

Zacharopoulos, Georgios, Ejjeh, Adel, Jing, Ying, Yang, En-Yu, Jia, Tianyu, Brumar, Iulian, Intan, Jeremy, Huzaifa, Muhammad, Adve, Sarita, Adve, Vikram, Wei, Gu-Yeon, Brooks, David

论文摘要

包括特定域的加速器在内的异质系统的设计是一个具有挑战性且耗时的过程。在考虑区域限制的同时,设计人员必须确定应用程序中的哪些部分在硬件中加速以及在软件中留下的部分。此外,诸如扩展现实(XR)之类的域中的应用程序为各种并行执行(包括循环级别,任务级别和管道并行性)提供了机会。为了协助设计过程并揭示所有可能的并行级别,我们提出了Trireme,这是一种全自动的工具链,探索了多个并行级别的平行性,并产生特定域的加速器设计和配置,从而使性能最大化,并且给定面积预算。与纯软件实现相比,从XR域进行苛刻的基准实验显示,较小应用的加速度最高为20倍,高达37倍的加速度。

The design of heterogeneous systems that include domain specific accelerators is a challenging and time-consuming process. While taking into account area constraints, designers must decide which parts of an application to accelerate in hardware and which to leave in software. Moreover, applications in domains such as Extended Reality (XR) offer opportunities for various forms of parallel execution, including loop level, task level and pipeline parallelism. To assist the design process and expose every possible level of parallelism, we present Trireme, a fully automated tool-chain that explores multiple levels of parallelism and produces domain specific accelerator designs and configurations that maximize performance, given an area budget. Experiments on demanding benchmarks from the XR domain revealed a speedup of up to 20x, as well as a speedup of up to 37x for smaller applications, compared to software-only implementations.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源