[RUNTIME] Simplify caching API #3433

andrewjcg · 2024-03-21T18:38:42Z

This simplifies the caching API to require callers pass the entire list
of files to cache in a "group" in the put_group method. This allows
caching backends to e.g. serialize these files into a single cache blob,
which can avoid issues with caching atomicity.

This simplifies the caching API to require callers pass the entire list of files to cache in a "group" in the `put_group` method. This allows caching backends to e.g. serialize these files into a single cache blob, which can avoid issues with caching atomicity.

Summary: Pull cache API changes from triton-lang/triton#3433. Among other simplifications, this allows us the cache all files in a "group" atomically, in a single memcache blob, and avoid needing to use other approaches to handle these files coming from different runs. Context: https://fb.workplace.com/groups/420659799592399/posts/778155640509478/ Scuba query for these changes: https://fburl.com/scuba/triton_remote_cache/lb3t1cw4 Test Plan: With D55206078: ``` $ TORCHBENCH_TOL='1e-3' TORCHINDUCTOR_PERMUTE_FUSION='1' TORCHINDUCTOR_SHAPE_PADDING='1' buck2 run mode/opt //pytorch/benchmark:run -- cmf_10x -d cuda -t train --torchdynamo inductor ``` Reviewed By: bertmaher Differential Revision: D55206000

Summary: Pull cache API changes from triton-lang/triton#3433. Among other simplifications, this allows us the cache all files in a "group" atomically, in a single memcache blob, and avoid needing to use other approaches to handle these files coming from different runs. Reviewed By: bertmaher Differential Revision: D55206000 Pull Request resolved: #122470 Approved by: https://github.com/bertmaher

Summary: Pull cache API changes from triton-lang/triton#3433. Among other simplifications, this allows us the cache all files in a "group" atomically, in a single memcache blob, and avoid needing to use other approaches to handle these files coming from different runs. Reviewed By: bertmaher Differential Revision: D55206000 Pull Request resolved: pytorch#122470 Approved by: https://github.com/bertmaher

jlebar · 2024-06-03T21:49:20Z

I assume this PR is dead at this point?

andrewjcg requested a review from ptillet as a code owner March 21, 2024 18:38

andrewjcg force-pushed the cache_api branch 7 times, most recently from deb8d4a to e6bf836 Compare March 22, 2024 01:37

andrewjcg mentioned this pull request Mar 22, 2024

[triton] Backport https://github.com/openai/triton/pull/3433 pytorch/pytorch#122470

Closed

andrewjcg force-pushed the cache_api branch from e6bf836 to 1cd5a85 Compare March 22, 2024 02:39

jlebar closed this Jun 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RUNTIME] Simplify caching API #3433

[RUNTIME] Simplify caching API #3433

andrewjcg commented Mar 21, 2024

jlebar commented Jun 3, 2024

[RUNTIME] Simplify caching API #3433

[RUNTIME] Simplify caching API #3433

Conversation

andrewjcg commented Mar 21, 2024

jlebar commented Jun 3, 2024