Thu 16 Jun 2022 03:50 - 04:10 at Toucan - Tensors
We introduce DISTAL, a compiler for dense tensor algebra that targets modern distributed and heterogeneous systems. DISTAL lets users independently describe how tensors and computation map onto target machines through separate format and scheduling languages. The combination of choices for data and computation distribution creates a large design space that includes many algorithms from both the past (e.g., Cannon’s algorithm) and present (e.g., COSMA). DISTAL compiles a tensor algebra domain specific language to a distributed task-based runtime system and supports both nodes with multi-core CPUs and multiple GPUs. Code generated by DISTAL is competitive with optimized codes for matrix multiply on 256 nodes of the Lassen supercomputer and outperforms existing systems by between 1.8x to 3.7x (with a 45.7x outlier) on higher order tensor operations.
Wed 15 JunDisplayed time zone: Pacific Time (US & Canada) change
15:30 - 16:50 | |||
15:30 20mTalk | Autoscheduling for Sparse Tensor Algebra with an Asymptotic Cost Model PLDI DOI | ||
15:50 20mTalk | DISTAL: The Distributed Tensor Algebra Compiler PLDI Rohan Yadav Stanford University, Alex Aiken Stanford Univeristy, Fredrik Kjolstad Stanford University DOI | ||
16:10 20mTalk | All you need is Superword-Level Parallelism: Systematic Control-Flow Vectorization with SLP PLDI Yishen Chen Massachusetts Institute of Technology, Charith Mendis University of Illinois at Urbana-Champaign, Saman Amarasinghe Massachusetts Institute of Technology DOI | ||
16:30 20mTalk | Warping Cache Simulation of Polyhedral Programs PLDI DOI |
Thu 16 JunDisplayed time zone: Pacific Time (US & Canada) change
03:30 - 04:50 | |||
03:30 20mTalk | Autoscheduling for Sparse Tensor Algebra with an Asymptotic Cost Model PLDI DOI | ||
03:50 20mTalk | DISTAL: The Distributed Tensor Algebra Compiler PLDI Rohan Yadav Stanford University, Alex Aiken Stanford Univeristy, Fredrik Kjolstad Stanford University DOI | ||
04:10 20mTalk | All you need is Superword-Level Parallelism: Systematic Control-Flow Vectorization with SLP PLDI Yishen Chen Massachusetts Institute of Technology, Charith Mendis University of Illinois at Urbana-Champaign, Saman Amarasinghe Massachusetts Institute of Technology DOI | ||
04:30 20mTalk | Warping Cache Simulation of Polyhedral Programs PLDI DOI |