Efficient Approximations for Cache-conscious Data Placement (PLDI 2022 - PLDI Research Papers)

Who

Ali Ahmadi, Majid Daliri, Amir Kafshdar Goharshady, Andreas Pavlogiannis

Track

PLDI 2022

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Fri 17 Jun 2022 13:30 - 13:50 at Toucan - Verification & Optimization Chair(s): Charith Mendis
Sat 18 Jun 2022 01:30 - 01:50 at Toucan - Verification & Optimization

Abstract

There is a huge and growing gap between the speed of accesses to data stored in main memory vs cache. Thus, cache misses account for a significant portion of runtime overhead in virtually every program and minimizing them has been an active research topic for decades. The primary and most classical formal model for this problem is that of Cache-conscious Data Placement (CDP): given a commutative cache with constant capacity $k$ and a sequence $\Sigma$ of accesses to data elements, the goal is to map each data element to a cache line such that the total number of cache misses over $\Sigma$ is minimized. CDP has been widely studied since the 1990s. In POPL 2002, Petrank and Rawitz proved a notoriously strong hardness result: They showed that for every $k \geq 3,$ CDP is not only NP-hard but also hard-to-approximate within any non-trivial factor unless $\text{P}=\text{NP}$. As such, all subsequent works gave up on theoretical improvements and instead focused on heuristic algorithms with no theoretical guarantees.

In this work, we present the first-ever positive theoretical result for CDP. The fundamental idea behind our approach is that real-world instances of the problem have specific structural properties that can be exploited to obtain efficient algorithms with strong approximation guarantees. Specifically, the access graphs corresponding to many real-world access sequences are sparse and tree-like. This was already well-known in the community but has only been used to design heuristics without guarantees. In contrast, we provide efficient algorithms that provably approximate the optimal number of cache misses within any factor $1 + \epsilon,$ assuming that the access graph of a specific degree $d_\epsilon$ is sparse, i.e. sparser real-world instances lead to tighter approximations. We also provide experimental results showing that our approach frequently outperforms previous methods.

DOI

https://doi.org/10.1145/3519939.3523436

Ali Ahmadi

Sharif University of Technology

Majid Daliri

University of Tehran

Iran

Amir Kafshdar Goharshady

Hong Kong University of Science and Technology

Andreas Pavlogiannis

Aarhus University

Denmark