Gem5-AcceSys: Enabling System-Level Exploration of Standard Interconnects for Novel Accelerators

Bibliographic Details
Title: Gem5-AcceSys: Enabling System-Level Exploration of Standard Interconnects for Novel Accelerators
Authors: Liu, Qunyou, Zapater, Marina, Atienza, David
Publication Year: 2025
Collection: Computer Science
Subject Terms: Computer Science - Hardware Architecture, Computer Science - Performance
More Details: The growing demand for efficient, high-performance processing in machine learning (ML) and image processing has made hardware accelerators, such as GPUs and Data Streaming Accelerators (DSAs), increasingly essential. These accelerators enhance ML and image processing tasks by offloading computation from the CPU to dedicated hardware. These accelerators rely on interconnects for efficient data transfer, making interconnect design crucial for system-level performance. This paper introduces Gem5-AcceSys, an innovative framework for system-level exploration of standard interconnects and configurable memory hierarchies. Using a matrix multiplication accelerator tailored for transformer workloads as a case study, we evaluate PCIe performance across diverse memory types (DDR4, DDR5, GDDR6, HBM2) and configurations, including host-side and device-side memory. Our findings demonstrate that optimized interconnects can achieve up to 80% of device-side memory performance and, in some scenarios, even surpass it. These results offer actionable insights for system architects, enabling a balanced approach to performance and cost in next-generation accelerator design.
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2502.12273
Accession Number: edsarx.2502.12273
Database: arXiv
More Details
Description not available.