Increasing FPGA Accelerators Memory Bandwidth with a Burst-Friendly Memory Layout

Bibliographic Details
Title: Increasing FPGA Accelerators Memory Bandwidth with a Burst-Friendly Memory Layout
Authors: Ferry, Corentin, Yuki, Tomofumi, Derrien, Steven, Rajopadhye, Sanjay
Publication Year: 2022
Collection: Computer Science
Subject Terms: Computer Science - Hardware Architecture, B.4.4
More Details: Offloading compute-intensive kernels to hardware accelerators relies on the large degree of parallelism offered by these platforms. However, the effective bandwidth of the memory interface often causes a bottleneck, hindering the accelerator's effective performance. Techniques enabling data reuse, such as tiling, lower the pressure on memory traffic but still often leave the accelerators I/O-bound. A further increase in effective bandwidth is possible by using burst rather than element-wise accesses, provided the data is contiguous in memory. In this paper, we propose a memory allocation technique, and provide a proof-of-concept source-to-source compiler pass, that enables such burst transfers by modifying the data layout in external memory. We assess how this technique pushes up the memory throughput, leaving room for exploiting additional parallelism, for a minimal logic overhead.
Comment: 16 pages; 17 figures
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2202.05933
Accession Number: edsarx.2202.05933
Database: arXiv
FullText Text:
  Availability: 0
CustomLinks:
  – Url: http://arxiv.org/abs/2202.05933
    Name: EDS - Arxiv
    Category: fullText
    Text: View this record from Arxiv
    MouseOverText: View this record from Arxiv
  – Url: https://resolver.ebsco.com/c/xy5jbn/result?sid=EBSCO:edsarx&genre=article&issn=&ISBN=&volume=&issue=&date=20220211&spage=&pages=&title=Increasing FPGA Accelerators Memory Bandwidth with a Burst-Friendly Memory Layout&atitle=Increasing%20FPGA%20Accelerators%20Memory%20Bandwidth%20with%20a%20Burst-Friendly%20Memory%20Layout&aulast=Ferry%2C%20Corentin&id=DOI:
    Name: Full Text Finder (for New FTF UI) (s8985755)
    Category: fullText
    Text: Find It @ SCU Libraries
    MouseOverText: Find It @ SCU Libraries
Header DbId: edsarx
DbLabel: arXiv
An: edsarx.2202.05933
RelevancyScore: 1023
AccessLevel: 3
PubType: Report
PubTypeId: report
PreciseRelevancyScore: 1022.88940429688
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Increasing FPGA Accelerators Memory Bandwidth with a Burst-Friendly Memory Layout
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Ferry%2C+Corentin%22">Ferry, Corentin</searchLink><br /><searchLink fieldCode="AR" term="%22Yuki%2C+Tomofumi%22">Yuki, Tomofumi</searchLink><br /><searchLink fieldCode="AR" term="%22Derrien%2C+Steven%22">Derrien, Steven</searchLink><br /><searchLink fieldCode="AR" term="%22Rajopadhye%2C+Sanjay%22">Rajopadhye, Sanjay</searchLink>
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2022
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: Computer Science
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Computer+Science+-+Hardware+Architecture%22">Computer Science - Hardware Architecture</searchLink><br /><searchLink fieldCode="DE" term="%22B%2E4%2E4%22">B.4.4</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: Offloading compute-intensive kernels to hardware accelerators relies on the large degree of parallelism offered by these platforms. However, the effective bandwidth of the memory interface often causes a bottleneck, hindering the accelerator's effective performance. Techniques enabling data reuse, such as tiling, lower the pressure on memory traffic but still often leave the accelerators I/O-bound. A further increase in effective bandwidth is possible by using burst rather than element-wise accesses, provided the data is contiguous in memory. In this paper, we propose a memory allocation technique, and provide a proof-of-concept source-to-source compiler pass, that enables such burst transfers by modifying the data layout in external memory. We assess how this technique pushes up the memory throughput, leaving room for exploiting additional parallelism, for a minimal logic overhead.<br />Comment: 16 pages; 17 figures
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: Working Paper
– Name: URL
  Label: Access URL
  Group: URL
  Data: <link linkTarget="URL" linkTerm="http://arxiv.org/abs/2202.05933" linkWindow="_blank">http://arxiv.org/abs/2202.05933</link>
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsarx.2202.05933
PLink https://login.libproxy.scu.edu/login?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2202.05933
RecordInfo BibRecord:
  BibEntity:
    Subjects:
      – SubjectFull: Computer Science - Hardware Architecture
        Type: general
      – SubjectFull: B.4.4
        Type: general
    Titles:
      – TitleFull: Increasing FPGA Accelerators Memory Bandwidth with a Burst-Friendly Memory Layout
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Ferry, Corentin
      – PersonEntity:
          Name:
            NameFull: Yuki, Tomofumi
      – PersonEntity:
          Name:
            NameFull: Derrien, Steven
      – PersonEntity:
          Name:
            NameFull: Rajopadhye, Sanjay
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 11
              M: 02
              Type: published
              Y: 2022
ResultId 1