feat: backfill lower cache tiers on read-through by worstell · Pull Request #172 · block/cachew

worstell · 2026-03-11T01:43:48Z

Problem

After a pod restart, disk cache is empty but S3 still has snapshots. Tiered.Open hits S3 and streams to the client, but never writes to disk. Since the cache hit path doesn't trigger mirror creation or periodic job scheduling, disk stays empty permanently — every request keeps hitting S3 (4 min for large repos vs 30s from disk).

Solution

When Tiered.Open finds data in a higher tier (S3) but the lowest tier (disk) missed, the returned reader now transparently tees writes to disk as the caller reads. After the full stream is consumed and closed, the disk entry becomes available for future reads.

On write failure or partial read, the backfill is safely abandoned via context cancellation per the Cache contract — reads are never affected.

When a higher tier (e.g., S3) has data but a lower tier (e.g., disk) does not, the returned reader now transparently writes to the lowest tier as the caller reads. This ensures disk cache is populated on the first S3 hit after a pod restart, avoiding repeated slow S3 reads. On write failure or partial read, the backfill is safely abandoned via context cancellation per the Cache contract. Amp-Thread-ID: https://ampcode.com/threads/T-019cda52-ee36-738c-86cd-1fd410c47d7f Co-authored-by: Amp <amp@ampcode.com>

Co-authored-by: Amp <amp@ampcode.com> Amp-Thread-ID: https://ampcode.com/threads/T-019cda52-ee36-738c-86cd-1fd410c47d7f

alecthomas

This is awesome, should have been this way from the start 🤦‍♂️

worstell requested a review from a team as a code owner March 11, 2026 01:43

worstell requested review from alecthomas and removed request for a team March 11, 2026 01:43

stuartwdouglas approved these changes Mar 11, 2026

View reviewed changes

fix: suppress wrapcheck lint for io.Reader contract

fdb369c

Co-authored-by: Amp <amp@ampcode.com> Amp-Thread-ID: https://ampcode.com/threads/T-019cda52-ee36-738c-86cd-1fd410c47d7f

worstell force-pushed the tiered-cache-backfill branch from 2e66222 to fdb369c Compare March 11, 2026 01:56

worstell enabled auto-merge (squash) March 11, 2026 01:57

worstell merged commit 0dca77f into main Mar 11, 2026
5 checks passed

worstell deleted the tiered-cache-backfill branch March 11, 2026 01:58

alecthomas reviewed Mar 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: backfill lower cache tiers on read-through#172

feat: backfill lower cache tiers on read-through#172
worstell merged 2 commits intomainfrom
tiered-cache-backfill

worstell commented Mar 11, 2026

Uh oh!

Uh oh!

alecthomas left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

worstell commented Mar 11, 2026

Problem

Solution

Uh oh!

Uh oh!

alecthomas left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants