Thu 16 Jun 2022 04:30 - 04:50 at Toucan - Tensors
Techniques to evaluate a program’s cache performance fall into two camps:
1. Traditional trace-based cache simulators precisely account for sophisticated real-world cache models and support arbitrary workloads, but their runtime is proportional to the number of memory accesses performed by the program under analysis.
2. Relying on implicit workload characterizations such as the polyhedral model, analytical approaches often achieve problem-size-independent runtimes, but so far have been limited to idealized cache models.
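To make the first camp concrete, here is a minimal sketch of a trace-based simulator for a set-associative LRU cache. It is an illustration only and does not come from the talk: the cache geometry (`num_sets`, `ways`, `line_size`), the element size, and the three-point stencil used as a workload are all assumptions. The main loop runs once per memory access, which is the cost the abstract refers to.

```python
from collections import OrderedDict


def simulate_lru(trace, num_sets=4, ways=2, line_size=64):
    """Replay a byte-address trace through a set-associative LRU cache.

    Returns (accesses, misses). All parameters are illustrative defaults.
    """
    # One OrderedDict per cache set, ordered from least to most recently used.
    sets = [OrderedDict() for _ in range(num_sets)]
    accesses = misses = 0
    for addr in trace:                    # one step per memory access
        accesses += 1
        block = addr // line_size         # cache line holding this address
        cache_set = sets[block % num_sets]
        if block in cache_set:
            cache_set.move_to_end(block)  # hit: mark as most recently used
        else:
            misses += 1
            if len(cache_set) >= ways:
                cache_set.popitem(last=False)  # evict least recently used
            cache_set[block] = True
    return accesses, misses


def stencil_trace(n, elem_size=8, base=0):
    """Addresses read by: for i in 1..n-2: A[i-1], A[i], A[i+1]."""
    for i in range(1, n - 1):
        for off in (-1, 0, 1):
            yield base + (i + off) * elem_size


if __name__ == "__main__":
    print(simulate_lru(stencil_trace(1024)))
```

Doubling `n` doubles the length of the trace and therefore the work done by `simulate_lru`, even though the access pattern itself is perfectly regular.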
We introduce a hybrid approach, warping cache simulation, that aims to achieve both applicability to real-world cache models and problem-size-independent runtimes. Like prior analytical approaches, we focus on programs in the polyhedral model, which allows us to reason about the sequence of memory accesses analytically. Combining this analytical reasoning with information about the cache behavior obtained from explicit cache simulation allows us to soundly fast-forward the simulation. By this process of "warping", we accelerate the simulation so that its cost is often independent of the number of memory accesses.
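The fast-forwarding idea can be illustrated with a toy example. The sketch below is not the algorithm from the talk; it is a heavily simplified stand-in that assumes a single loop, a fully associative LRU cache, and a fixed three-point stencil, with `LINE`, `ELEM`, `WAYS`, and all function names chosen for illustration. It simulates explicitly until the cache state, expressed relative to the current iteration, repeats. Because the accesses are affine in the loop counter and LRU behavior does not depend on how blocks are named, everything between the two matching states repeats, so the miss count can be extrapolated instead of simulated.

```python
from collections import OrderedDict

LINE = 64  # bytes per cache line (assumed)
ELEM = 8   # bytes per array element (assumed)
WAYS = 4   # capacity of the fully associative LRU cache in lines (assumed)


def access(cache, block):
    """Touch one block in the LRU cache; return 1 on a miss, 0 on a hit."""
    if block in cache:
        cache.move_to_end(block)
        return 0
    if len(cache) >= WAYS:
        cache.popitem(last=False)  # evict the least recently used block
    cache[block] = True
    return 1


def run_iteration(cache, i):
    """One iteration of: for i: read A[i-1], A[i], A[i+1]."""
    return sum(access(cache, (i + off) * ELEM // LINE) for off in (-1, 0, 1))


def misses_explicit(n):
    """Reference result: simulate every iteration."""
    cache, misses = OrderedDict(), 0
    for i in range(1, n - 1):
        misses += run_iteration(cache, i)
    return misses


def misses_warped(n):
    """Simulate until the relative cache state repeats, then fast-forward."""
    cache, misses, seen = OrderedDict(), 0, {}
    i, end, warped = 1, n - 1, False
    while i < end:
        if not warped:
            base = i * ELEM // LINE
            # State relative to the current iteration: the offset within the
            # line plus the cached blocks (in LRU order) shifted by `base`.
            key = (i * ELEM % LINE, tuple(b - base for b in cache))
            if key in seen:
                j, misses_j = seen[key]
                period = i - j                   # iterations per repetition
                reps = (end - i) // period       # whole periods to skip
                misses += reps * (misses - misses_j)
                shift = reps * period * ELEM // LINE
                cache = OrderedDict((b + shift, True) for b in cache)
                i += reps * period
                warped = True
                continue
            seen[key] = (i, misses)
        misses += run_iteration(cache, i)
        i += 1
    return misses


if __name__ == "__main__":
    print(misses_explicit(100_000), misses_warped(100_000))
```

In this toy setting the warped version performs a bounded number of explicit iterations regardless of `n`. Handling set-associative caches, other replacement policies, and loop nests requires the more careful reasoning described in the abstract.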
Wed 15 Jun. Displayed time zone: Pacific Time (US & Canada)
15:30 - 16:50
15:30 | 20m | Talk | Autoscheduling for Sparse Tensor Algebra with an Asymptotic Cost Model | PLDI
15:50 | 20m | Talk | DISTAL: The Distributed Tensor Algebra Compiler | PLDI | Rohan Yadav (Stanford University), Alex Aiken (Stanford University), Fredrik Kjolstad (Stanford University)
16:10 | 20m | Talk | All you need is Superword-Level Parallelism: Systematic Control-Flow Vectorization with SLP | PLDI | Yishen Chen (Massachusetts Institute of Technology), Charith Mendis (University of Illinois at Urbana-Champaign), Saman Amarasinghe (Massachusetts Institute of Technology)
16:30 | 20m | Talk | Warping Cache Simulation of Polyhedral Programs | PLDI
Thu 16 Jun. Displayed time zone: Pacific Time (US & Canada)
03:30 - 04:50
03:30 | 20m | Talk | Autoscheduling for Sparse Tensor Algebra with an Asymptotic Cost Model | PLDI
03:50 | 20m | Talk | DISTAL: The Distributed Tensor Algebra Compiler | PLDI | Rohan Yadav (Stanford University), Alex Aiken (Stanford University), Fredrik Kjolstad (Stanford University)
04:10 | 20m | Talk | All you need is Superword-Level Parallelism: Systematic Control-Flow Vectorization with SLP | PLDI | Yishen Chen (Massachusetts Institute of Technology), Charith Mendis (University of Illinois at Urbana-Champaign), Saman Amarasinghe (Massachusetts Institute of Technology)
04:30 | 20m | Talk | Warping Cache Simulation of Polyhedral Programs | PLDI