Wed 15 Jun 2022 15:50 - 16:10 at Toucan - Tensors Chair(s): Sreepathi Pai
Thu 16 Jun 2022 03:50 - 04:10 at Toucan - Tensors

We introduce DISTAL, a compiler for dense tensor algebra that targets modern distributed and heterogeneous systems. DISTAL lets users independently describe how tensors and computation map onto target machines through separate format and scheduling languages. The combination of choices for data and computation distribution creates a large design space that includes many algorithms from both the past (e.g., Cannon’s algorithm) and present (e.g., COSMA). DISTAL compiles a tensor algebra domain specific language to a distributed task-based runtime system and supports both nodes with multi-core CPUs and multiple GPUs. Code generated by DISTAL is competitive with optimized codes for matrix multiply on 256 nodes of the Lassen supercomputer and outperforms existing systems by between 1.8x to 3.7x (with a 45.7x outlier) on higher order tensor operations.

Wed 15 Jun

Displayed time zone: Pacific Time (US & Canada) change

15:30 - 16:50
TensorsPLDI at Toucan +12h
Chair(s): Sreepathi Pai University of Rochester
15:30
20m
Talk
Autoscheduling for Sparse Tensor Algebra with an Asymptotic Cost Model
PLDI
Peter Ahrens MIT CSAIL, Fredrik Kjolstad Stanford University, Saman Amarasinghe MIT CSAIL
DOI
15:50
20m
Talk
DISTAL: The Distributed Tensor Algebra Compiler
PLDI
Rohan Yadav Stanford University, Alex Aiken Stanford Univeristy, Fredrik Kjolstad Stanford University
DOI
16:10
20m
Talk
All you need is Superword-Level Parallelism: Systematic Control-Flow Vectorization with SLP
PLDI
Yishen Chen Massachusetts Institute of Technology, Charith Mendis University of Illinois at Urbana-Champaign, Saman Amarasinghe Massachusetts Institute of Technology
DOI
16:30
20m
Talk
Warping Cache Simulation of Polyhedral Programs
PLDI
Canberk Morelli Saarland University, Jan Reineke Saarland University
DOI

Thu 16 Jun

Displayed time zone: Pacific Time (US & Canada) change

03:30 - 04:50
TensorsPLDI at Toucan
03:30
20m
Talk
Autoscheduling for Sparse Tensor Algebra with an Asymptotic Cost Model
PLDI
Peter Ahrens MIT CSAIL, Fredrik Kjolstad Stanford University, Saman Amarasinghe MIT CSAIL
DOI
03:50
20m
Talk
DISTAL: The Distributed Tensor Algebra Compiler
PLDI
Rohan Yadav Stanford University, Alex Aiken Stanford Univeristy, Fredrik Kjolstad Stanford University
DOI
04:10
20m
Talk
All you need is Superword-Level Parallelism: Systematic Control-Flow Vectorization with SLP
PLDI
Yishen Chen Massachusetts Institute of Technology, Charith Mendis University of Illinois at Urbana-Champaign, Saman Amarasinghe Massachusetts Institute of Technology
DOI
04:30
20m
Talk
Warping Cache Simulation of Polyhedral Programs
PLDI
Canberk Morelli Saarland University, Jan Reineke Saarland University
DOI