Skip to content

Hotfix#542

Merged
moebiusband73 merged 12 commits intomasterfrom
hotfix
Mar 31, 2026
Merged

Hotfix#542
moebiusband73 merged 12 commits intomasterfrom
hotfix

Conversation

@moebiusband73
Copy link
Copy Markdown
Member

No description provided.

moebiusband73 and others added 12 commits March 28, 2026 06:26
Entire-Checkpoint: ea34ae75e21a
Entire-Checkpoint: 0fc6e5388e08
…mall channel

RotateWALFiles used a non-blocking send (select/default) on rotation
channels buffered at 64. With thousands of nodes and few shards, the
channel fills instantly and nearly all hosts are skipped, leaving WAL
files unrotated indefinitely.

Replace with a blocking send using a shared 2-minute deadline so the
checkpoint goroutine waits for the staging goroutine to drain the
channel instead of immediately giving up.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Entire-Checkpoint: a1ec897216fa
WAL writes during checkpoint are redundant since the binary snapshot
captures all in-memory data. Pausing eliminates channel saturation
(1.4M+ dropped messages) caused by disk I/O contention between
checkpoint writes and WAL staging. Also removes direct WAL file
deletion in checkpoint workers that raced with the staging goroutine.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Entire-Checkpoint: 34d698f40bac
Entire-Checkpoint: 99f20c1edd90
Entire-Checkpoint: 2fe286e23a4a
Add configuration and document usage in README

Entire-Checkpoint: 53425877e242
Entire-Checkpoint: 1a908bd95cfa
Entire-Checkpoint: b9544ef2c54f
Entire-Checkpoint: 9bb66d18af6d
@moebiusband73 moebiusband73 merged commit a101f21 into master Mar 31, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant