Add CI build caching and improve benchmark workflow#1148

Open
sbryngelson wants to merge 13 commits into MFlowCode:master from sbryngelson:caching

Conversation

@sbryngelson
Member

@sbryngelson sbryngelson commented Feb 14, 2026

User description

Fixes #1145

Summary

Adds build caching to CI for both GitHub-hosted and self-hosted HPC runners, and improves the benchmark workflow's PR detection and approval logic.

Build caching

GitHub-hosted runners: actions/cache@v4 caches build/ keyed by matrix config + source file hashes, with prefix-based fallback for partial cache hits.

Self-hosted HPC runners (Phoenix, Frontier, Frontier AMD): A shared helper script (setup-build-cache.sh) symlinks build/ to a persistent per-runner cache directory at $HOME/scratch/.mfc-ci-cache/<key>/build/. Each runner gets its own cache keyed by (cluster, device, interface, RUNNER_NAME), so CMake's absolute paths are always correct — no cross-runner path fixups or locking needed.
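A minimal sketch of that per-runner setup (illustrative shell, not the script's exact code; CLUSTER, DEVICE, and INTERFACE stand in for the workflow's inputs):

cache_key="${CLUSTER}-${DEVICE}-${INTERFACE}-${RUNNER_NAME}"
cache_dir="$HOME/scratch/.mfc-ci-cache/${cache_key}/build"
mkdir -p "$cache_dir"

# Drop a stale symlink without touching the cache it points to; a
# pre-existing real build/ directory is left alone in this sketch.
[ -L build ] && unlink build
[ -e build ] || ln -s "$cache_dir" build

Because the key includes RUNNER_NAME, each runner always sees the same workspace path inside its own cache, which is what makes CMake's absolute paths safe without locking.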

Key details:

  • actions/checkout uses clean: false on self-hosted runners to prevent git clean -ffdx from following the build symlink and destroying cached artifacts
  • Retry logic clears only build/staging and build/lock.yaml (not the full cache)
  • Benchmarks skip the persistent cache entirely to avoid PR/master cache collisions

Benchmark workflow improvements

  • Cross-repo PRs (forks) now correctly detected via head SHA fallback when pull_requests[] is empty
  • PR approval check verifies the approver has write/maintain/admin permission, ignoring AI bot approvals (a sketch of this check follows this list)
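A hedged sketch of the approval check using the gh CLI (REPO and PR are placeholders; the workflow's actual implementation may differ):

# List approving reviewers, then keep only those with push rights.
approvers=$(gh api "repos/$REPO/pulls/$PR/reviews" \
    --jq '.[] | select(.state == "APPROVED") | .user.login' | sort -u)

approved=false
for user in $approvers; do
    # The collaborator-permission endpoint reports admin/write/read/none.
    perm=$(gh api "repos/$REPO/collaborators/$user/permission" --jq '.permission')
    case "$perm" in
        admin|maintain|write) approved=true ;;
    esac
done
echo "maintainer-approved: $approved"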

Files changed

  • .github/scripts/setup-build-cache.sh: New; per-runner build cache setup for self-hosted runners
  • .github/workflows/test.yml: Add actions/cache for GH-hosted jobs; clean: false for self-hosted
  • .github/workflows/coverage.yml: Add actions/cache for coverage job
  • .github/workflows/phoenix/test.sh: Source setup-build-cache.sh before build
  • .github/workflows/frontier/build.sh: Source setup-build-cache.sh (skipped for benchmarks)
  • .github/workflows/frontier_amd/build.sh: Source setup-build-cache.sh (skipped for benchmarks)
  • .github/workflows/bench.yml: Cross-repo PR detection + maintainer-only approval check

Test plan

  • GH-hosted jobs restore cache (actions/cache hit)
  • Self-hosted Phoenix jobs use per-runner cache and pass all tests
  • Frontier jobs build and pass with cache
  • Benchmark workflow triggers correctly for approved PRs

CodeAnt-AI Description

Add persistent CI build cache for self-hosted runners and make benchmarks robust to transient failures

What Changed

  • Self-hosted CI jobs now create a persistent per-runner build cache (./build -> $HOME/scratch/.mfc-ci-cache/<key>/build) so build artifacts persist across runs; Git checkout no longer forcibly cleans on those runners, to avoid deleting the cache.
  • GitHub-hosted CI jobs restore a build cache for faster incremental builds in test and coverage workflows.
  • Build retry behavior changed: failed builds no longer run a full clean; only staging and lock files are cleared before retry to avoid disrupting concurrent jobs that consume installed artifacts.
  • Benchmark runner and monitoring scripts now tolerate transient failures: individual benchmark runs are retried once with a short delay, unexpected runtime errors are retried, monitoring script non-zero exits are reported (not fatal), and parallel benchmark orchestrator warns on partial failures while continuing to check for partial results.
  • Benchmark command now always writes result YAML (even if some cases failed) before reporting failures so diffs can compare partial outputs.
  • Workflow PR detection and approval checks improved: cross-repo PRs are located by head SHA when needed, and only approvals from users with write/maintain/admin permission count (bot approvals ignored); the head-SHA fallback is sketched after this list.
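A sketch of that fallback, assuming the workflow has jq and the gh CLI available (REPO is a placeholder):

# workflow_run events for cross-repo (fork) PRs leave pull_requests[] empty,
# so fall back to looking the PR up by its head commit SHA.
pr_number=$(jq -r '.workflow_run.pull_requests[0].number // empty' "$GITHUB_EVENT_PATH")
if [ -z "$pr_number" ]; then
    head_sha=$(jq -r '.workflow_run.head_sha' "$GITHUB_EVENT_PATH")
    pr_number=$(gh api "repos/$REPO/commits/$head_sha/pulls" --jq '.[0].number // empty')
fi
echo "Resolved PR: ${pr_number:-none}"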

Impact

✅ Faster CI builds on self-hosted runners
✅ Fewer flaky benchmark failures due to transient errors
✅ Clearer benchmark failure reports and partial-result availability

💡 Usage Guide

Checking Your Pull Request

Every time you make a pull request, our system automatically looks through it. We check for security issues, mistakes in how you're setting up your infrastructure, and common code problems. We do this to make sure your changes are solid and won't cause any trouble later.

Talking to CodeAnt AI

Got a question or need a hand with something in your pull request? You can easily get in touch with CodeAnt AI right here. Just type the following in a comment on your pull request, and replace "Your question here" with whatever you want to ask:

@codeant-ai ask: Your question here

This lets you have a chat with CodeAnt AI about your pull request, making it easier to understand and improve your code.

Example

@codeant-ai ask: Can you suggest a safer alternative to storing this secret?

Preserve Org Learnings with CodeAnt

You can record team preferences so CodeAnt AI applies them in future reviews. Reply directly to the specific CodeAnt AI suggestion (in the same thread) and replace "Your feedback here" with your input:

@codeant-ai: Your feedback here

This helps CodeAnt AI learn and adapt to your team's coding style and standards.

Example

@codeant-ai: Do not flag unused imports.

Retrigger review

Ask CodeAnt AI to review the PR again, by typing:

@codeant-ai: review

Check Your Repository Health

To analyze the health of your code repository, visit our dashboard at https://app.codeant.ai. This tool helps you identify potential issues and areas for improvement in your codebase, ensuring your repository maintains high standards of code health.

GitHub-hosted runners: Add actions/cache@v4 to test.yml and coverage.yml,
caching the build/ directory keyed by matrix config and source file hashes.
Partial cache hits via restore-keys enable incremental builds.

Self-hosted HPC runners (Phoenix, Frontier, Frontier AMD): Add a persistent
build cache that symlinks build/ to $HOME/scratch/.mfc-ci-cache/<config>/build.
This ensures cached artifacts persist across CI runs regardless of which
runner instance picks up the job. Key details:

- Cross-runner workspace path fixup via sed on CMake files
- flock-based locking prevents concurrent builds from corrupting the cache
- Retry logic uses targeted rm (staging/install only) instead of mfc.sh clean
- Phoenix releases the lock after build, before tests

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings on February 14, 2026 15:33
@codeant-ai
Contributor

codeant-ai bot commented Feb 14, 2026

CodeAnt AI is reviewing your PR.


Thanks for using CodeAnt! 🎉

We're free for open-source projects. If you're enjoying it, help us grow by sharing.


@coderabbitai
Contributor

coderabbitai bot commented Feb 14, 2026

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

  • @coderabbitai resume to resume automatic reviews.
  • @coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

  • ▶️ Resume reviews
  • 🔍 Trigger review

📝 Walkthrough

Adds persistent per-config build caching for GitHub and self-hosted CI: new cache-setup script with locking and workspace-path fixes, workflow cache restore steps for GitHub runners, and HPC build scripts updated to source the script and perform targeted cleanup on failures.

Changes

  • Build Cache Script (.github/scripts/setup-build-cache.sh):
    New Bash script that computes a per-config cache key, creates/resolves a real cache dir under $HOME/.mfc-ci-cache, acquires an exclusive flock (1h timeout), symlinks build → cache, and patches cached CMake-related files when workspace paths change.
  • Workflow Cache Steps (.github/workflows/coverage.yml, .github/workflows/test.yml):
    Add Restore Build Cache steps using actions/cache@v4 to restore build/ with keys based on matrix variants and a hash of build-related source/toolchain files; include restore-key fallbacks.
  • HPC Build Script Integrations (.github/workflows/frontier/build.sh, .github/workflows/frontier_amd/build.sh, .github/workflows/phoenix/test.sh):
    Source the new setup script before builds (skipping bench runs); replace full ./mfc.sh clean with targeted removals (build/staging/*, build/install/*, build/lock.yaml); phoenix/test.sh also releases cache lock before tests.
  • Checkout behavior tweak (.github/workflows/test.yml, Self job):
    Set with: clean: false on checkout to avoid deleting the symlinked build directory during checkout.
  • Bench workflow approval logic (.github/workflows/bench.yml):
    PR number resolution fallback by head SHA; stricter approval checks verifying approver permissions and tightening allowed PR authors for self job.

Sequence Diagram

sequenceDiagram
    participant Workflow as Workflow
    participant Runner as Runner (job script)
    participant CacheSetup as setup-build-cache.sh
    participant LockFile as Lock File
    participant CacheDir as Cache Directory
    participant Build as Build Process

    Workflow->>Runner: start job
    Runner->>CacheSetup: source(cluster, device, interface)
    CacheSetup->>CacheSetup: compute cache key & path
    CacheSetup->>LockFile: acquire exclusive flock (1h timeout)
    alt lock acquired
        LockFile-->>CacheSetup: granted
        CacheSetup->>CacheDir: ensure dir, remove stale build symlink
        CacheSetup->>Runner: create symlink `build` -> cache path
        CacheSetup->>CacheDir: read/write workspace marker
        CacheSetup->>CacheDir: patch CMake-related paths if workspace changed
        Runner->>Build: run build using cached `build` dir
        Build-->>Runner: success / failure
        alt success
            Runner->>LockFile: release lock
        else failure
            Runner->>CacheDir: rm -rf build/staging/* build/lock.yaml
            Runner->>LockFile: release lock
        end
    else timeout
        LockFile-->>CacheSetup: timeout
        CacheSetup-->>Runner: fallback to local build (no cache)
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Suggested labels

Review effort 4/5, size:XL

Poem

🐇 I dug a cosy cache below the home,

Locks in place so builds can freely roam,
Symlinks point where nightly artifacts sleep,
Paths get mended when the runners leap,
Hop — next test wakes faster from its loam.

🚥 Pre-merge checks | ✅ 5 | ❌ 1

❌ Failed checks (1 warning)

  • Merge Conflict Detection ⚠️ Warning: Merge conflicts detected (112 files):

⚔️ .github/workflows/bench.yml (content)
⚔️ .github/workflows/coverage.yml (content)
⚔️ .github/workflows/frontier/build.sh (content)
⚔️ .github/workflows/frontier_amd/build.sh (content)
⚔️ .github/workflows/lint-toolchain.yml (content)
⚔️ .github/workflows/phoenix/test.sh (content)
⚔️ .github/workflows/test.yml (content)
⚔️ .gitignore (content)
⚔️ CMakeLists.txt (content)
⚔️ docs/custom.css (content)
⚔️ docs/documentation/case.md (content)
⚔️ docs/documentation/contributing.md (content)
⚔️ docs/documentation/equations.md (content)
⚔️ docs/documentation/readme.md (content)
⚔️ docs/documentation/running.md (content)
⚔️ docs/gen_constraints.sh (content)
⚔️ examples/2D_bubbly_steady_shock/case.py (content)
⚔️ examples/2D_forward_facing_step/case.py (content)
⚔️ examples/scaling/README.md (content)
⚔️ src/common/m_derived_types.fpp (content)
⚔️ src/common/m_model.fpp (content)
⚔️ src/common/m_mpi_common.fpp (content)
⚔️ src/pre_process/m_data_output.fpp (content)
⚔️ src/pre_process/m_global_parameters.fpp (content)
⚔️ src/pre_process/m_icpp_patches.fpp (content)
⚔️ src/pre_process/m_initial_condition.fpp (content)
⚔️ src/pre_process/m_start_up.fpp (content)
⚔️ src/simulation/m_data_output.fpp (content)
⚔️ src/simulation/m_global_parameters.fpp (content)
⚔️ src/simulation/m_ibm.fpp (content)
⚔️ src/simulation/m_start_up.fpp (content)
⚔️ src/simulation/m_time_steppers.fpp (content)
⚔️ tests/0387594F/golden.txt (content)
⚔️ tests/07D9EBD1/golden.txt (content)
⚔️ tests/0A362971/golden.txt (content)
⚔️ tests/0B07F3A1/golden.txt (content)
⚔️ tests/0BD42F35/golden.txt (content)
⚔️ tests/127A967A/golden.txt (content)
⚔️ tests/16574715/golden-metadata.txt (content)
⚔️ tests/16574715/golden.txt (content)
⚔️ tests/18B832DD/golden.txt (content)
⚔️ tests/1B218CF1/golden.txt (content)
⚔️ tests/1E62093F/golden.txt (content)
⚔️ tests/221C9980/golden.txt (content)
⚔️ tests/28D037A7/golden.txt (content)
⚔️ tests/32DD0363/golden.txt (content)
⚔️ tests/34626101/golden.txt (content)
⚔️ tests/34A93376/golden.txt (content)
⚔️ tests/408BF558/golden.txt (content)
⚔️ tests/431554F7/golden.txt (content)
⚔️ tests/43A1D3C6/golden.txt (content)
⚔️ tests/4B8105EA/golden.txt (content)
⚔️ tests/4C3164AE/golden.txt (content)
⚔️ tests/4C64244B/golden.txt (content)
⚔️ tests/4E0FBE72/golden.txt (content)
⚔️ tests/4F5A5E32/golden-metadata.txt (content)
⚔️ tests/4F5A5E32/golden.txt (content)
⚔️ tests/5173E637/golden.txt (content)
⚔️ tests/5600D63B/golden.txt (content)
⚔️ tests/6076815B/golden.txt (content)
⚔️ tests/6171E9D4/golden-metadata.txt (content)
⚔️ tests/6171E9D4/golden.txt (content)
⚔️ tests/647715EB/golden-metadata.txt (content)
⚔️ tests/647715EB/golden.txt (content)
⚔️ tests/65CC7EB7/golden-metadata.txt (content)
⚔️ tests/65CC7EB7/golden.txt (content)
⚔️ tests/6D1C9C2D/golden.txt (content)
⚔️ tests/6ED50013/golden.txt (content)
⚔️ tests/7CC8F995/golden.txt (content)
⚔️ tests/7DCA9229/golden.txt (content)
⚔️ tests/7DCE34B4/golden-metadata.txt (content)
⚔️ tests/7DCE34B4/golden.txt (content)
⚔️ tests/7F70E665/golden-metadata.txt (content)
⚔️ tests/7F70E665/golden.txt (content)
⚔️ tests/7FA04E95/golden.txt (content)
⚔️ tests/8339878C/golden.txt (content)
⚔️ tests/87C3F801/golden.txt (content)
⚔️ tests/8D8F6424/golden-metadata.txt (content)
⚔️ tests/8D8F6424/golden.txt (content)
⚔️ tests/99147C82/golden.txt (content)
⚔️ tests/A1B2B963/golden.txt (content)
⚔️ tests/A2036630/golden.txt (content)
⚔️ tests/A69D2D28/golden.txt (content)
⚔️ tests/B0CE19C5/golden-metadata.txt (content)
⚔️ tests/B0CE19C5/golden.txt (content)
⚔️ tests/B7A6CC79/golden-metadata.txt (content)
⚔️ tests/B7A6CC79/golden.txt (content)
⚔️ tests/BA4372C5/golden.txt (content)
⚔️ tests/BBF7C467/golden.txt (content)
⚔️ tests/BFAA7587/golden.txt (content)
⚔️ tests/C5D96D9A/golden-metadata.txt (content)
⚔️ tests/C5D96D9A/golden.txt (content)
⚔️ tests/C7BE8FEA/golden.txt (content)
⚔️ tests/C87A466A/golden.txt (content)
⚔️ tests/D1C97CD1/golden.txt (content)
⚔️ tests/E1199F96/golden.txt (content)
⚔️ tests/EA8FA07E/golden.txt (content)
⚔️ tests/F60D6594/golden-metadata.txt (content)
⚔️ tests/F60D6594/golden.txt (content)
⚔️ tests/FA8643EF/golden.txt (content)
⚔️ toolchain/mfc/case_validator.py (content)
⚔️ toolchain/mfc/cli/schema.py (content)
⚔️ toolchain/mfc/gen_case_constraints_docs.py (content)
⚔️ toolchain/mfc/generate.py (content)
⚔️ toolchain/mfc/lint_docs.py (content)
⚔️ toolchain/mfc/params/ast_analyzer.py (content)
⚔️ toolchain/mfc/params/definitions.py (content)
⚔️ toolchain/mfc/params/generators/docs_gen.py (content)
⚔️ toolchain/mfc/run/case_dicts.py (content)
⚔️ toolchain/mfc/run/input.py (content)
⚔️ toolchain/mfc/test/cases.py (content)
⚔️ toolchain/mfc/validate.py (content)

These conflicts must be resolved before merging into master.
Resolve conflicts locally and push changes to this branch.
✅ Passed checks (5 passed)

  • Linked Issues check ✅ Passed: The PR successfully implements all requirements from issue #1145: actions/cache for GitHub-hosted runners, persistent cache symlinks for self-hosted HPC runners, workspace path fixup via sed, flock-based locking, and safe retry logic.
  • Out of Scope Changes check ✅ Passed: All changes are scoped to CI workflows, helper scripts, and HPC build scripts as specified in issue #1145; no modifications to MFC source code or Python toolchain.
  • Docstring Coverage ✅ Passed: No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
  • Title check ✅ Passed: The title 'Add CI build caching and improve benchmark workflow' clearly summarizes the main changes: adding build caching to CI and improving the benchmark workflow.
  • Description check ✅ Passed: The pull request description is comprehensive and well-structured, covering all required sections including a clear summary, motivation, testing approach, and detailed technical changes.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.


Comment @coderabbitai help to get the list of available commands and usage tips.

@qodo-code-review
Contributor

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 3 🔵🔵🔵⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Recommended focus areas for review

Possible Issue

The script acquires a long-lived flock on fd 9 and keeps it held until the caller explicitly releases it. This relies on every consuming workflow script to reliably unlock/close fd 9 on all exit paths (including failures and set -e-style exits). If any caller forgets to unlock, other jobs for the same cache key can block for up to an hour and then silently fall back to local builds, reducing CI reliability and making failures harder to diagnose.

_cache_locked=false
_lock_file="$_cache_dir/.cache.lock"
exec 9>"$_lock_file"
echo "  Acquiring cache lock..."
if flock --timeout 3600 9; then
    _cache_locked=true
    echo "  Cache lock acquired"
else
    echo "  WARNING: Cache lock timeout (1h), building locally without cache"
    exec 9>&-
    mkdir -p "build"
    echo "========================="
    return 0 2>/dev/null || true
fi
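One mitigation, sketched here as a suggestion rather than code from the PR, is an EXIT trap in the sourcing script so the lock is dropped on every exit path:

# Hedged sketch: release the flock however the caller exits, so a failing
# job cannot leave other jobs blocked on the same cache key.
release_cache_lock() {
    if [ "${_cache_locked:-false}" = true ]; then
        exec 9>&-        # closing the fd releases the flock
        _cache_locked=false
    fi
}
trap release_cache_lock EXIT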
Correctness Risk

The workspace-path rewrite uses sed to replace the previous absolute workspace path with the current one across multiple generated files. This can unintentionally rewrite unrelated content if the old path appears in other contexts, and it may also miss relevant path encodings/quoting differences. It should be validated that this does not corrupt cached build system state (CMake/Ninja/Make) and that incremental builds remain correct across runner instances.

# Handle cross-runner workspace path changes.
# CMakeCache.txt stores absolute paths from whichever runner instance
# originally configured the build. If we're on a different runner, sed-replace
# the old workspace path with the current one so CMake can do incremental builds.
_workspace_marker="$_cache_dir/.workspace_path"
if [ -f "$_workspace_marker" ]; then
    _old_workspace=$(cat "$_workspace_marker")
    if [ "$_old_workspace" != "$(pwd)" ]; then
        echo "  Workspace path changed: $_old_workspace -> $(pwd)"
        echo "  Updating cached CMake paths..."
        find "$_cache_dir/staging" -type f \
            \( -name "CMakeCache.txt" -o -name "*.cmake" \
               -o -name "*.make" -o -name "Makefile" \
               -o -name "build.ninja" \) \
            -exec sed -i "s|${_old_workspace}|$(pwd)|g" {} + 2>/dev/null || true
    fi
fi
Cache Key

The GitHub-hosted cache key includes source and toolchain file hashes, but does not obviously include compiler/CMake versions, runner image changes, or environment/module versions. This can lead to restoring incompatible build/ artifacts across runs, producing flaky or non-reproducible CI behavior. Consider ensuring the key incorporates the effective compiler/CMake identity (or otherwise guarantees invalidation when the toolchain changes).

- name: Restore Build Cache
  uses: actions/cache@v4
  with:
    path: build
    key: mfc-build-${{ matrix.os }}-${{ matrix.mpi }}-${{ matrix.debug }}-${{ matrix.precision }}-${{ matrix.intel }}-${{ hashFiles('CMakeLists.txt', 'toolchain/dependencies/**', 'toolchain/cmake/**', 'src/**/*.fpp', 'src/**/*.f90') }}
    restore-keys: |
      mfc-build-${{ matrix.os }}-${{ matrix.mpi }}-${{ matrix.debug }}-${{ matrix.precision }}-${{ matrix.intel }}-

@codeant-ai codeant-ai bot added the size:L This PR changes 100-499 lines, ignoring generated files label Feb 14, 2026
@codeant-ai
Contributor

codeant-ai bot commented Feb 14, 2026

Nitpicks 🔍

🔒 No security issues identified
⚡ Recommended areas for review

  • Unsafe cleanup on retry
    The retry logic force-removes staging/install and lock metadata unconditionally. If another process holds the cache lock or is simultaneously accessing the cache, this can cause data races or remove in-use files. Consider checking lock ownership or using an atomic/safer cleanup flow.

  • Lock handling
    The script acquires an exclusive flock on fd 9 and keeps it open for the
    lifetime of the shell that sourced the script, but no explicit API to
    release the lock is provided. Because the setup script is sourced from
    the build scripts, that means the lock will remain held until the build
    script exits. The PR description mentions releasing the lock before
    lengthy tests, but the code does not release it — this can block other
    jobs waiting on the same cache or keep the shared cache unusable during
    test phases.

  • CMake path rewrite robustness
    The sed-based workspace-path rewrite uses unescaped expansions of
    _old_workspace and $(pwd). If either path contains characters that are
    meaningful to sed (e.g. '|', newlines) the replacement may fail or
    corrupt cached files. Also the find target is limited to
    "$_cache_dir/staging" only — other cached locations may still contain
    absolute paths that need update.

  • Missing source guard
    The script unconditionally sources .github/scripts/setup-build-cache.sh. If that file is missing or fails, the build may continue in an unexpected state or variables (like _cache_locked) may not be set. This can lead to surprising fallback behavior or attempts to use undefined file descriptors / locks; a guarded source is sketched below.
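A guarded source along the lines of this nitpick might look like the following sketch (the fallback message is illustrative):

if [ -f .github/scripts/setup-build-cache.sh ]; then
    . .github/scripts/setup-build-cache.sh
else
    echo "WARNING: setup-build-cache.sh missing; building without persistent cache"
    mkdir -p build
fi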

@codeant-ai
Contributor

codeant-ai bot commented Feb 14, 2026

CodeAnt AI finished reviewing your PR.

Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 2

🤖 Fix all issues with AI agents
In @.github/scripts/setup-build-cache.sh:
- Around line 74-78: The sed substitution uses the raw variable _old_workspace
as a regex which breaks on metacharacters; modify the script around the find ...
-exec sed -i call to first escape regex/sed-special characters in _old_workspace
(e.g. implement a small helper like sed_escape that uses printf '%s'
"$_old_workspace" piped to sed to backslash-escape characters such as / \ | & [
] * . + ? ^ $), then use the escaped value in the sed "s|ESCAPED_OLD|$(pwd)|g"
invocation so replacements work reliably for any workspace path (such a helper is sketched after this list).
- Around line 48-54: The fallback branch currently runs mkdir -p "build" which
is a no-op if build is a symlink to the shared cache; before creating the local
directory remove any existing symlink named build so we don't accidentally write
into the shared cache without the lock. In the else block (around the existing
exec 9>&-, mkdir -p "build", and return 0) add a check for a symlink named
"build" (e.g., test -L "build") and unlink/rm that symlink before running mkdir
-p "build" so the local directory is created safely while leaving real
directories untouched.

Contributor

@cubic-dev-ai cubic-dev-ai bot left a comment

3 issues found across 6 files

Confidence score: 3/5

  • There’s some risk here because the most severe issue is a race condition in .github/workflows/frontier/build.sh where deleting build/install during retries can corrupt concurrent jobs still reading binaries.
  • .github/scripts/setup-build-cache.sh can leave a shared cache symlink intact after a lock timeout, which undermines the fallback and can cause concurrent cache use without a lock.
  • Severity is mid-to-high (5–7/10) and impacts build reliability rather than runtime behavior, so it’s likely safe but could cause CI flakiness.
  • Pay close attention to .github/workflows/frontier/build.sh and .github/scripts/setup-build-cache.sh - cache lock and retry/fallback handling.
Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name=".github/scripts/setup-build-cache.sh">

<violation number="1" location=".github/scripts/setup-build-cache.sh:51">
P2: On cache-lock timeout, an existing build/ symlink is left intact. This means the job can keep using the shared cache without a lock, contradicting the "build locally without cache" fallback and risking concurrent cache corruption. Remove build/ (symlink or dir) before creating the local build directory in this path.</violation>

<violation number="2" location=".github/scripts/setup-build-cache.sh:53">
P2: The fallback path after lock timeout doesn't exit cleanly when the script is executed directly (not sourced). `return 0` fails in that context, and `|| true` masks the error, causing the script to incorrectly continue into cache setup logic even though it decided to build locally.</violation>
</file>

<file name=".github/workflows/frontier/build.sh">

<violation number="1" location=".github/workflows/frontier/build.sh:52">
P1: Race condition: deleting `build/install` during retry can corrupt test runs from other concurrent jobs that have released the cache lock but are still reading installed binaries. Only `build/staging` and `build/lock.yaml` should be cleared on retry - the install directory needs to remain intact for concurrent readers.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.

- Only remove build/staging (not build/install) on retry, so concurrent
  test jobs reading installed binaries are not disrupted
- Remove stale symlink in lock-timeout fallback path to prevent writing
  into the shared cache without holding the lock
- Remove redundant flock --unlock (closing fd is sufficient)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 1

🤖 Fix all issues with AI agents
In @.github/workflows/frontier_amd/build.sh:
- Around line 51-52: Update the echo message to accurately describe what is
being removed: change the existing log that says "Clearing staging/install" to
mention only the actual targets being deleted (e.g., "Clearing build/staging and
build/lock.yaml") so CI logs match the rm -rf command; keep the rm command (rm
-rf build/staging build/lock.yaml) unchanged to preserve the intentional
decision not to remove build/install.

Contributor

@cubic-dev-ai cubic-dev-ai bot left a comment

2 issues found across 4 files (changes from recent commits).

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name=".github/workflows/frontier/build.sh">

<violation number="1" location=".github/workflows/frontier/build.sh:52">
P2: Retry cleanup no longer removes `build/install` despite the log claiming staging/install are cleared; a failed install can persist and poison the retry build.</violation>
</file>

<file name=".github/workflows/frontier_amd/build.sh">

<violation number="1" location=".github/workflows/frontier_amd/build.sh:52">
P2: Retry cleanup no longer removes build/install even though the retry message says it does. Leaving a partial install can contaminate the next build attempt. Either clean build/install or update the log to reflect the new behavior.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.

Contributor

Copilot AI left a comment

Pull request overview

This PR implements CI build caching to speed up GitHub Actions workflows and reduce build times on self-hosted HPC runners. The implementation uses GitHub's native actions/cache@v4 for hosted runners and a custom persistent caching solution for self-hosted systems (Phoenix, Frontier, Frontier AMD).

Changes:

  • GitHub-hosted runners cache build/ directory keyed by matrix config + source hashes
  • Self-hosted HPC runners symlink build/ to persistent cache in $HOME/scratch/.mfc-ci-cache/
  • Cross-runner workspace path fixup via sed enables incremental builds when jobs land on different runner instances
  • Retry logic uses targeted removal (build/staging, build/install, build/lock.yaml) instead of full clean to preserve cache

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 8 comments.

Show a summary per file

  • .github/workflows/test.yml: Adds build cache restoration for GitHub-hosted runners with matrix-specific keys
  • .github/workflows/coverage.yml: Adds build cache restoration for coverage workflow with simpler key (no matrix)
  • .github/scripts/setup-build-cache.sh: New script implementing persistent cache with flock-based locking and path fixup
  • .github/workflows/phoenix/test.sh: Integrates cache setup and releases lock before long-running tests
  • .github/workflows/frontier/build.sh: Integrates cache setup and updates retry logic
  • .github/workflows/frontier_amd/build.sh: Integrates cache setup and updates retry logic

The echo said "Clearing staging/install" but build/install is
intentionally preserved to avoid disrupting concurrent test jobs.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
actions/checkout@v4 defaults to clean: true, which runs git clean -ffdx.
This follows the build/ symlink into the shared cache directory and
deletes all cached artifacts (staging, install, venv), defeating the
purpose of the persistent cache and causing SIGILL errors from partially
destroyed build artifacts.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@codecov

codecov bot commented Feb 14, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 44.07%. Comparing base (21347c1) to head (1adc121).
⚠️ Report is 4 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1148      +/-   ##
==========================================
+ Coverage   44.02%   44.07%   +0.04%     
==========================================
  Files          70       70              
  Lines       20659    20431     -228     
  Branches     2059     1974      -85     
==========================================
- Hits         9096     9004      -92     
+ Misses      10373    10291      -82     
+ Partials     1190     1136      -54     

☔ View full report in Codecov by Sentry.

Benchmarks build PR and master in parallel — sharing a cache key causes
collisions. Skip cache setup when run_bench=="bench" so each benchmark
builds from scratch.

Also fix two issues in the benchmark workflow trigger:
- Cross-repo PRs don't populate pull_requests[]; fall back to searching
  by head SHA so the PR author is correctly detected.
- Only count approvals from users with write/maintain/admin permission,
  filtering out AI bot approvals (Copilot, Qodo).
- Remove wilfonba auto-run; only sbryngelson auto-runs benchmarks.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@codeant-ai
Contributor

codeant-ai bot commented Feb 15, 2026

CodeAnt AI is running Incremental review



@codeant-ai codeant-ai bot added size:L This PR changes 100-499 lines, ignoring generated files and removed size:L This PR changes 100-499 lines, ignoring generated files labels Feb 15, 2026
When the cache moves between runner instances (e.g. actions-runner-6 to
actions-runner-1), the sed path replacement only updated staging/ CMake
files. Config files in install/ (.pc, .cmake) still had the old runner
path, causing silo/HDF5 to link against nonexistent paths and h5dump to
fail on all tests.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Updating .pc and .cmake config files with sed is insufficient — the MFC
executables (simulation, pre_process, post_process) and static libraries
have the old runner workspace path baked in at compile time. When the
cache moves between runner instances, these binaries fail at runtime.

Replace the install/ sed fix with rm -rf install/ so CMake re-links and
re-installs all binaries with correct paths. The staging/ object files
remain valid, so this is a re-link, not a full rebuild.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Replace the shared cache (with flock, sed path fixups, and workspace
tracking) with per-runner caches keyed by RUNNER_NAME. Each runner
always uses the same workspace path, so CMake's absolute paths are
always correct — no cross-runner path issues, no locking needed.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@codeant-ai
Contributor

codeant-ai bot commented Feb 15, 2026

CodeAnt AI is running Incremental review



@codeant-ai codeant-ai bot added size:M This PR changes 30-99 lines, ignoring generated files and removed size:L This PR changes 100-499 lines, ignoring generated files labels Feb 15, 2026
@codeant-ai
Contributor

codeant-ai bot commented Feb 15, 2026

CodeAnt AI Incremental review completed.

@sbryngelson sbryngelson changed the title from "Add CI build caching for GH-hosted and self-hosted HPC runners" to "Add CI build caching and improve benchmark workflow" on Feb 15, 2026
The prefix fallback can restore a cache built on a runner with AVX-512
onto a runner without it, causing SIGILL in Chemistry tests. Without
restore-keys, only exact key matches are used — source changes trigger
a full rebuild but binaries are always compatible with the runner.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@sbryngelson
Member Author

/config

@qodo-code-review
Contributor

ⓘ You are approaching your monthly quota for Qodo. Upgrade your plan
🛠️ Wiki configuration file settings:

🛠️ Local configuration file settings:
 # .pr_agent.toml
[github_app]
pr_commands = [
  "/review",
  "/improve",
]

handle_push_trigger = true
push_commands = ["/improve"]

[pr_reviewer]                # (all fields optional)
num_max_findings        = 10  # how many items to surface
require_tests_review    = true
extra_instructions = """
Project context and review priorities: .github/copilot-instructions.md
Coding standards and common pitfalls: docs/documentation/contributing.md
GPU macro API: docs/documentation/gpuParallelization.md

Prioritize correctness over style (formatting is enforced by pre-commit hooks).
Key areas: logic bugs, numerical issues,
array bounds (non-unity lower bounds with ghost cells), MPI halo exchange
correctness (pack/unpack offsets, GPU data coherence), precision mixing
(stp vs wp), ALLOCATE/DEALLOCATE pairing (GPU memory leaks), physics model
consistency (pressure formula must match model_eqns), missing case_validator.py
constraints for new parameters, and compiler portability across all four
supported compilers.
Python toolchain requires Python 3.10+; do not suggest __future__ imports
or other backwards-compatibility shims.
"""

[pr_code_suggestions]
commitable_code_suggestions = true
apply_suggestions_checkbox  = true

🛠️ Global configuration file settings:

🛠️ PR-Agent final configurations:
==================== CONFIG ====================
config.pr_compliance = {'ENABLE_RULES_PLATFORM': True}  
config.model_reasoning = 'vertex_ai/gemini-2.5-pro'  
config.model = 'gpt-5.2-2025-12-11'  
config.model_turbo = 'anthropic/claude-haiku-4-5-20251001'  
config.fallback_models = ['anthropic/claude-sonnet-4-5-20250929', 'bedrock/us.anthropic.claude-sonnet-4-5-20250929-v1:0']  
config.second_model_for_exhaustive_mode = 'o4-mini'  
config.git_provider = 'github'  
config.publish_output = True  
config.publish_output_no_suggestions = True  
config.publish_output_progress = True  
config.verbosity_level = 0  
config.publish_logs = False  
config.debug_mode = False  
config.use_wiki_settings_file = True  
config.use_repo_settings_file = True  
config.use_global_settings_file = True  
config.use_global_wiki_settings_file = False  
config.disable_auto_feedback = False  
config.ai_timeout = 150  
config.response_language = 'en-US'  
config.clone_repo_instead_of_fetch = True  
config.always_clone = False  
config.add_repo_metadata = True  
config.clone_repo_time_limit = 300  
config.publish_inline_comments_fallback_batch_size = 5  
config.publish_inline_comments_fallback_sleep_time = 2  
config.max_model_tokens = 32000  
config.custom_model_max_tokens = -1  
config.patch_extension_skip_types = ['.md', '.txt']  
config.extra_allowed_extensions = []  
config.allow_dynamic_context = True  
config.allow_forward_dynamic_context = True  
config.max_extra_lines_before_dynamic_context = 12  
config.patch_extra_lines_before = 5  
config.patch_extra_lines_after = 1  
config.ai_handler = 'litellm'  
config.cli_mode = False  
config.trial_git_org_max_invokes_per_month = 30  
config.trial_ratio_close_to_limit = 0.8  
config.invite_only_mode = False  
config.enable_request_access_msg_on_new_pr = False  
config.check_also_invites_field = False  
config.allowed_users = []  
config.calculate_context = True  
config.disable_checkboxes = False  
config.output_relevant_configurations = False  
config.large_patch_policy = 'clip'  
config.seed = -1  
config.temperature = 0.2  
config.allow_dynamic_context_ab_testing = False  
config.choose_dynamic_context_ab_testing_ratio = 0.5  
config.ignore_pr_title = ['^\\[Auto\\]', '^Auto']  
config.ignore_pr_target_branches = []  
config.ignore_pr_source_branches = []  
config.ignore_pr_labels = []  
config.ignore_ticket_labels = []  
config.allow_only_specific_folders = []  
config.ignore_pr_authors = 'REDACTED'  
config.ignore_repositories = []  
config.ignore_language_framework = []  
config.is_auto_command = False  
config.is_new_pr = False  
config.enable_ai_metadata = True  
config.present_reasoning = True  
config.max_tickets = 10  
config.max_tickets_chars = 8000  
config.prevent_any_approval = False  
config.enable_comment_approval = False  
config.enable_auto_approval = False  
config.auto_approve_for_low_review_effort = -1  
config.auto_approve_for_no_suggestions = False  
config.ensure_ticket_compliance = False  
config.new_diff_format = True  
config.new_diff_format_add_external_references = True  
config.tasks_queue_ttl_from_dequeue_in_seconds = 900  
config.enable_custom_labels = False  

==================== PR_REVIEWER ====================
pr_reviewer.require_score_review = False  
pr_reviewer.require_tests_review = True  
pr_reviewer.require_estimate_effort_to_review = True  
pr_reviewer.require_can_be_split_review = False  
pr_reviewer.require_security_review = True  
pr_reviewer.require_todo_scan = False  
pr_reviewer.require_ticket_analysis_review = True  
pr_reviewer.require_ticket_labels = False  
pr_reviewer.require_no_ticket_labels = False  
pr_reviewer.check_pr_additional_content = False  
pr_reviewer.persistent_comment = True  
pr_reviewer.extra_instructions = 'Project context and review priorities: .github/copilot-instructions.md\nCoding standards and common pitfalls: docs/documentation/contributing.md\nGPU macro API: docs/documentation/gpuParallelization.md\n\nPrioritize correctness over style (formatting is enforced by pre-commit hooks).\nKey areas: logic bugs, numerical issues,\narray bounds (non-unity lower bounds with ghost cells), MPI halo exchange\ncorrectness (pack/unpack offsets, GPU data coherence), precision mixing\n(stp vs wp), ALLOCATE/DEALLOCATE pairing (GPU memory leaks), physics model\nconsistency (pressure formula must match model_eqns), missing case_validator.py\nconstraints for new parameters, and compiler portability across all four\nsupported compilers.\nPython toolchain requires Python 3.10+; do not suggest __future__ imports\nor other backwards-compatibility shims.\n'  
pr_reviewer.final_update_message = True  
pr_reviewer.enable_review_labels_security = True  
pr_reviewer.enable_review_labels_effort = True  
pr_reviewer.enable_help_text = False  
pr_reviewer.num_max_findings = 10  

==================== PR_COMPLIANCE ====================
pr_compliance.enabled = True  
pr_compliance.enable_rules_platform = False  
pr_compliance.rule_providers = []  
pr_compliance.enable_security_section = True  
pr_compliance.enable_ticket_section = True  
pr_compliance.enable_codebase_duplication_section = True  
pr_compliance.enable_custom_compliance_section = True  
pr_compliance.require_ticket_analysis_review = True  
pr_compliance.allow_repo_pr_compliance = True  
pr_compliance.enable_global_pr_compliance = True  
pr_compliance.max_lines_allowed = 2000  
pr_compliance.local_wiki_compliance_str = ''  
pr_compliance.global_wiki_pr_compliance = ''  
pr_compliance.local_repo_compliance_str = ''  
pr_compliance.global_repo_pr_compliance_str = ''  
pr_compliance.global_compliance_str = ''  
pr_compliance.enable_generic_custom_compliance_checklist = True  
pr_compliance.persist_generic_custom_compliance_checklist = False  
pr_compliance.display_no_compliance_only = False  
pr_compliance.enable_security_compliance = True  
pr_compliance.enable_update_pr_compliance_checkbox = True  
pr_compliance.enable_todo_scan = False  
pr_compliance.enable_ticket_labels = False  
pr_compliance.enable_no_ticket_labels = False  
pr_compliance.check_pr_additional_content = False  
pr_compliance.enable_compliance_labels_security = True  
pr_compliance.enable_user_defined_compliance_labels = True  
pr_compliance.enable_estimate_effort_to_review = True  
pr_compliance.max_rag_components_to_analyze = 5  
pr_compliance.min_component_size = 5  
pr_compliance.persistent_comment = True  
pr_compliance.enable_help_text = False  
pr_compliance.extra_instructions = ''  

==================== PR_DESCRIPTION ====================
pr_description.publish_labels = False  
pr_description.add_original_user_description = True  
pr_description.generate_ai_title = False  
pr_description.extra_instructions = ''  
pr_description.enable_pr_type = True  
pr_description.final_update_message = True  
pr_description.enable_help_text = False  
pr_description.enable_help_comment = False  
pr_description.bring_latest_tag = False  
pr_description.enable_pr_diagram = True  
pr_description.publish_description_as_comment = False  
pr_description.publish_description_as_comment_persistent = True  
pr_description.enable_semantic_files_types = True  
pr_description.collapsible_file_list = 'adaptive'  
pr_description.collapsible_file_list_threshold = 8  
pr_description.inline_file_summary = False  
pr_description.use_description_markers = False  
pr_description.include_generated_by_header = True  
pr_description.enable_large_pr_handling = True  
pr_description.max_ai_calls = 4  
pr_description.auto_create_ticket = False  

==================== PR_QUESTIONS ====================
pr_questions.aware_ai_handler = False  
pr_questions.enable_help_text = False  

==================== PR_CODE_SUGGESTIONS ====================
pr_code_suggestions.suggestions_depth = 'exhaustive'  
pr_code_suggestions.commitable_code_suggestions = True  
pr_code_suggestions.decouple_hunks = False  
pr_code_suggestions.dual_publishing_score_threshold = -1  
pr_code_suggestions.focus_only_on_problems = True  
pr_code_suggestions.allow_thumbs_up_down = False  
pr_code_suggestions.enable_suggestion_type_reuse = False  
pr_code_suggestions.enable_more_suggestions_checkbox = True  
pr_code_suggestions.high_level_suggestions_enabled = True  
pr_code_suggestions.extra_instructions = ''  
pr_code_suggestions.enable_help_text = False  
pr_code_suggestions.show_extra_context = False  
pr_code_suggestions.persistent_comment = True  
pr_code_suggestions.max_history_len = 5  
pr_code_suggestions.apply_suggestions_checkbox = True  
pr_code_suggestions.enable_chat_in_code_suggestions = True  
pr_code_suggestions.apply_limit_scope = True  
pr_code_suggestions.suggestions_score_threshold = 0  
pr_code_suggestions.new_score_mechanism = True  
pr_code_suggestions.new_score_mechanism_th_high = 9  
pr_code_suggestions.new_score_mechanism_th_medium = 7  
pr_code_suggestions.discard_unappliable_suggestions = False  
pr_code_suggestions.num_code_suggestions_per_chunk = 3  
pr_code_suggestions.num_best_practice_suggestions = 2  
pr_code_suggestions.max_number_of_calls = 3  
pr_code_suggestions.final_clip_factor = 0.8  
pr_code_suggestions.demand_code_suggestions_self_review = False  
pr_code_suggestions.code_suggestions_self_review_text = '**Author self-review**: I have reviewed the PR code suggestions, and addressed the relevant ones.'  
pr_code_suggestions.approve_pr_on_self_review = False  
pr_code_suggestions.fold_suggestions_on_self_review = True  
pr_code_suggestions.publish_post_process_suggestion_impact = True  
pr_code_suggestions.wiki_page_accepted_suggestions = True  
pr_code_suggestions.enable_local_self_reflect_in_large_prs = False  
pr_code_suggestions.simplify_response = True  

==================== PR_CUSTOM_PROMPT ====================
pr_custom_prompt.prompt = 'The code suggestions should focus only on the following:\n- ...\n- ...\n...\n'  
pr_custom_prompt.suggestions_score_threshold = 0  
pr_custom_prompt.num_code_suggestions_per_chunk = 4  
pr_custom_prompt.self_reflect_on_custom_suggestions = True  
pr_custom_prompt.enable_help_text = False  

==================== PR_ADD_DOCS ====================
pr_add_docs.extra_instructions = ''  
pr_add_docs.docs_style = 'Sphinx'  
pr_add_docs.file = ''  
pr_add_docs.class_name = ''  

==================== PR_UPDATE_CHANGELOG ====================
pr_update_changelog.push_changelog_changes = False  
pr_update_changelog.extra_instructions = ''  
pr_update_changelog.add_pr_link = True  
pr_update_changelog.skip_ci_on_push = True  

==================== PR_ANALYZE ====================
pr_analyze.enable_help_text = False  

==================== PR_TEST ====================
pr_test.enable = True  
pr_test.extra_instructions = ''  
pr_test.testing_framework = ''  
pr_test.num_tests = 3  
pr_test.avoid_mocks = True  
pr_test.file = ''  
pr_test.class_name = ''  
pr_test.enable_help_text = False  

==================== PR_IMPROVE_COMPONENT ====================
pr_improve_component.num_code_suggestions = 4  
pr_improve_component.extra_instructions = ''  
pr_improve_component.file = ''  
pr_improve_component.class_name = ''  

==================== PR_IMPLEMENT ====================
pr_implement.allow_agent_implementation = False  

==================== REVIEW_AGENT ====================
review_agent.enabled = True  
review_agent.publish_output = True  
review_agent.enable_context_collector = False  
review_agent.enable_history_context_collector = False  
review_agent.enable_issues_agent = True  
review_agent.enable_compliance_agent = True  
review_agent.enable_deduplication = True  
review_agent.persistent_comment = True  
review_agent.enable_database_persistence = False  
review_agent.llm_call_timeout = 180  
review_agent.context_collector_llm_model = 'turbo'  
review_agent.llm_model = 'openai/gpt-5.2_thinking'  
review_agent.feedback_tool_llm_model = 'turbo'  
review_agent.conversion_llm_model = 'openai/gpt-5.2'  
review_agent.conversion_batching_mode = 'batch'  
review_agent.conversion_batch_size = 10  
review_agent.langsmith_project_name = 'review-agent'  
review_agent.max_tokens_for_file = 'REDACTED'  
review_agent.single_unified_diff_tokens_limit = 'REDACTED'  
review_agent.max_llm_calls = 100  
review_agent.context_collector_max_llm_calls = 6  
review_agent.compliance_batch_size = 0  
review_agent.deduplication_llm_max_tokens = 'REDACTED'  
review_agent.publishing_action_level_rank_threshold = 0  
review_agent.comments_location_policy = 'both'  
review_agent.inline_comments_severity_threshold = 3  
review_agent.issues_user_guidelines = ''  
review_agent.compliance_user_guidelines = ''  

==================== PR_HELP ====================
pr_help.force_local_db = False  
pr_help.num_retrieved_snippets = 5  

==================== PR_NEW_ISSUE ====================
pr_new_issue.label_to_prompt_part = {'general': 'general question', 'feature': 'feature request (may already be addressed in the documentation)', 'bug': 'possible bug report (may be a by design behavior)'}  
pr_new_issue.supported_repos = ['qodo-ai/pr-agent']  

==================== PR_HELP_DOCS ====================
pr_help_docs.repo_url = ''  
pr_help_docs.repo_default_branch = 'main'  
pr_help_docs.docs_path = 'docs'  
pr_help_docs.exclude_root_readme = False  
pr_help_docs.supported_doc_exts = ['.md', '.mdx', '.rst']  
pr_help_docs.enable_help_text = False  

==================== PR_SIMILAR_ISSUE ====================
pr_similar_issue.skip_comments = False  
pr_similar_issue.force_update_dataset = False  
pr_similar_issue.max_issues_to_scan = 500  
pr_similar_issue.vectordb = 'pinecone'  

==================== PR_FIND_SIMILAR_COMPONENT ====================
pr_find_similar_component.class_name = ''  
pr_find_similar_component.file = ''  
pr_find_similar_component.search_from_org = False  
pr_find_similar_component.allow_fallback_less_words = True  
pr_find_similar_component.number_of_keywords = 5  
pr_find_similar_component.number_of_results = 5  

==================== BEST_PRACTICES ====================
best_practices.auto_best_practices_str = ''  
best_practices.wiki_best_practices_str = ''  
best_practices.global_wiki_best_practices = ''  
best_practices.local_repo_best_practices_str = ''  
best_practices.global_repo_best_practices_str = ''  
best_practices.global_best_practices_str = ''  
best_practices.organization_name = ''  
best_practices.max_lines_allowed = 2000  
best_practices.enable_global_best_practices = True  
best_practices.allow_repo_best_practices = True  
best_practices.enabled = True  

==================== AUTO_BEST_PRACTICES ====================
auto_best_practices.enable_auto_best_practices = True  
auto_best_practices.utilize_auto_best_practices = True  
auto_best_practices.extra_instructions = ''  
auto_best_practices.min_suggestions_to_auto_best_practices = 10  
auto_best_practices.number_of_days_to_update = 30  
auto_best_practices.max_patterns = 5  
auto_best_practices.minimal_date_to_update = '2025-01-26'  

==================== JIRA ====================
jira.jira_client_id = 'REDACTED'  
jira.jira_app_secret = 'REDACTED'  

==================== LINEAR ====================
linear.linear_client_id = 'REDACTED'  

==================== PR_TO_TICKET ====================
pr_to_ticket.default_base_url = ''  
pr_to_ticket.default_project_key = 'REDACTED'  
pr_to_ticket.fallback_to_git_provider_issues = True  
pr_to_ticket.direct_update_compliance = False  

==================== github_app ====================
github_app.bot_user = 'github-actions[bot]'  
github_app.override_deployment_type = True  
github_app.handle_pr_actions = ['opened', 'reopened', 'ready_for_review']  
github_app.pr_commands = ['/review', '/improve']  
github_app.feedback_on_draft_pr = False  
github_app.handle_push_trigger = True  
github_app.push_commands = ['/improve']  
github_app.ignore_pr_title = []  
github_app.ignore_bot_pr = True  

Add three layers of defense against transient failures (e.g. ROCm
HSA_STATUS_ERROR_INVALID_ARGUMENT) tanking the entire benchmark:

1. Retry failed cases once (5s delay) before marking as failed
2. Always write partial results YAML before raising on failure
3. CI scripts warn on non-zero exit instead of aborting, and
   bench.yml runs diff() via `if: always()` so partial results
   are still compared

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
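The retry itself lives in the Python benchmark runner; the pattern, sketched here in shell with a hypothetical run_case wrapper (layer 1 of the list above):

# run_case is a hypothetical stand-in for one benchmark invocation.
run_case() {
    "$@"
}

if ! run_case "$bench_cmd" "$case"; then
    echo "WARNING: $case failed; retrying once in 5s..."
    sleep 5
    if ! run_case "$bench_cmd" "$case"; then
        echo "ERROR: $case failed after retry"
        failed=1
    fi
fi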
@codeant-ai
Contributor

codeant-ai bot commented Feb 16, 2026

CodeAnt AI is running Incremental review



@codeant-ai codeant-ai bot added size:L This PR changes 100-499 lines, ignoring generated files and removed size:M This PR changes 30-99 lines, ignoring generated files labels Feb 16, 2026
@codeant-ai
Contributor

codeant-ai bot commented Feb 16, 2026

CodeAnt AI Incremental review completed.

Contributor

@cubic-dev-ai cubic-dev-ai bot left a comment

1 issue found across 4 files (changes from recent commits).

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name=".github/scripts/run_parallel_benchmarks.sh">

<violation number="1" location=".github/scripts/run_parallel_benchmarks.sh:57">
P2: The new warning/continue path is unreachable on job failure because `set -e` causes the script to exit on the `wait` command before this block executes. If the intent is to allow partial results, the `wait` calls need to be guarded (e.g., `if wait ...; then ...; else ...; fi`) so failures don’t abort the script.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.
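A sketch of the guarded wait the reviewer suggests, assuming two background jobs whose PIDs are stored in pr_pid and master_pid:

set -e
pr_exit=0; master_exit=0
# The `|| var=$?` guards keep `set -e` from aborting when a job fails,
# so the warning/continue path below remains reachable.
wait "$pr_pid"     || pr_exit=$?
wait "$master_pid" || master_exit=$?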

The retry loop adds nesting depth beyond pylint's default limit.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
rm -rf on a symlink follows it and deletes the target's contents,
which fails when another runner is using the shared cache directory.
Use unlink for symlinks, rm -rf only for real directories.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
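A sketch of that distinction (not the script's exact code):

if [ -L build ]; then
    unlink build        # removes only the link; the shared cache stays intact
elif [ -d build ]; then
    rm -rf build        # a real directory is local scratch; safe to delete
fi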
Phoenix compute nodes may have different CPU architectures, causing
SIGILL when running binaries cached from a different node. After a
successful build, smoke-test syscheck to detect stale installs and
trigger a full rebuild.

Also include build/install in retry cleanup for all clusters. With
per-runner caching there are no concurrent readers sharing the same
cache directory, so clearing install is safe.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Comment on lines +106 to +129
# Load summary
summary = file_load_yaml(summary_filepath)

# Validate all targets have required data
validation_failed = False
for target in targets:
    if target.name not in summary:
        cons.print(f"[bold red]ERROR[/bold red]: Target {target.name} missing from summary for {case.slug}")
        validation_failed = True
        break

    if "exec" not in summary[target.name]:
        cons.print(f"[bold red]ERROR[/bold red]: 'exec' time missing for {target.name} in {case.slug}")
        validation_failed = True
        break

    if target.name == "simulation" and "grind" not in summary[target.name]:
        cons.print(f"[bold red]ERROR[/bold red]: 'grind' time missing for simulation in {case.slug}")
        validation_failed = True
        break

if validation_failed:
    failed_cases.append(case.slug)
    break
Contributor

Suggestion: Add a check to handle invalid or empty summary YAML files and integrate it into the existing retry logic to prevent crashes and improve robustness. [possible issue, importance: 8]

Suggested change

Before:

# Load summary
summary = file_load_yaml(summary_filepath)

# Validate all targets have required data
validation_failed = False
for target in targets:
    if target.name not in summary:
        cons.print(f"[bold red]ERROR[/bold red]: Target {target.name} missing from summary for {case.slug}")
        validation_failed = True
        break
    if "exec" not in summary[target.name]:
        cons.print(f"[bold red]ERROR[/bold red]: 'exec' time missing for {target.name} in {case.slug}")
        validation_failed = True
        break
    if target.name == "simulation" and "grind" not in summary[target.name]:
        cons.print(f"[bold red]ERROR[/bold red]: 'grind' time missing for simulation in {case.slug}")
        validation_failed = True
        break
if validation_failed:
    failed_cases.append(case.slug)
    break

After:

# Load summary
summary = file_load_yaml(summary_filepath)
if not isinstance(summary, dict):
    if attempt < max_attempts:
        cons.print(f"[bold yellow]WARNING[/bold yellow]: Invalid/empty summary for {case.slug} (attempt {attempt}/{max_attempts})")
        cons.print("Retrying in 5s...")
        time.sleep(5)
        continue
    cons.print(f"[bold red]ERROR[/bold red]: Invalid/empty summary for {case.slug}")
    failed_cases.append(case.slug)
    break

# Validate all targets have required data
validation_failed = False
for target in targets:
    if target.name not in summary or not isinstance(summary.get(target.name), dict):
        cons.print(f"[bold red]ERROR[/bold red]: Target {target.name} missing/invalid in summary for {case.slug}")
        validation_failed = True
        break
    if "exec" not in summary[target.name]:
        cons.print(f"[bold red]ERROR[/bold red]: 'exec' time missing for {target.name} in {case.slug}")
        validation_failed = True
        break
    if target.name == "simulation" and "grind" not in summary[target.name]:
        cons.print(f"[bold red]ERROR[/bold red]: 'grind' time missing for simulation in {case.slug}")
        validation_failed = True
        break
if validation_failed:
    if attempt < max_attempts:
        cons.print(f"[bold yellow]WARNING[/bold yellow]: Validation failed for {case.slug} (attempt {attempt}/{max_attempts})")
        cons.print("Retrying in 5s...")
        time.sleep(5)
        continue
    failed_cases.append(case.slug)
    break

Comment on lines +152 to +154
# Always write results (even partial) so diff() can compare the intersection
file_dump_yaml(ARG("output"), results)
cons.print(f"Wrote results to [bold magenta]{os.path.relpath(ARG('output'))}[/bold magenta].")
Contributor

Suggestion: Ensure the parent directory for the output file exists before writing to it by calling create_directory. [possible issue, importance: 7]

Suggested change

Before:

# Always write results (even partial) so diff() can compare the intersection
file_dump_yaml(ARG("output"), results)
cons.print(f"Wrote results to [bold magenta]{os.path.relpath(ARG('output'))}[/bold magenta].")

After:

# Always write results (even partial) so diff() can compare the intersection
create_directory(os.path.dirname(os.path.abspath(ARG("output"))))
file_dump_yaml(ARG("output"), results)
cons.print(f"Wrote results to [bold magenta]{os.path.relpath(ARG('output'))}[/bold magenta].")

Comment on lines +55 to 63
  # Warn if either job failed (partial results may still be usable)
  if [ "${pr_exit}" -ne 0 ] || [ "${master_exit}" -ne 0 ]; then
-     echo "ERROR: One or both benchmark jobs failed: pr_exit=${pr_exit}, master_exit=${master_exit}"
-     exit 1
+     echo "WARNING: Benchmark jobs had failures: pr=${pr_exit}, master=${master_exit}"
+     echo "Checking for partial results..."
  else
      echo "=========================================="
      echo "Both benchmark jobs completed successfully!"
      echo "=========================================="
  fi
Contributor

Suggestion: Use a flag to track benchmark job failures and exit with a non-zero status at the end of the script if any job failed, ensuring the workflow status is accurate. [possible issue, importance: 9]

Suggested change

Before:

# Warn if either job failed (partial results may still be usable)
if [ "${pr_exit}" -ne 0 ] || [ "${master_exit}" -ne 0 ]; then
    echo "WARNING: Benchmark jobs had failures: pr=${pr_exit}, master=${master_exit}"
    echo "Checking for partial results..."
else
    echo "=========================================="
    echo "Both benchmark jobs completed successfully!"
    echo "=========================================="
fi

After:

bench_failed=0
# Warn if either job failed (partial results may still be usable)
if [ "${pr_exit}" -ne 0 ] || [ "${master_exit}" -ne 0 ]; then
    echo "WARNING: Benchmark jobs had failures: pr=${pr_exit}, master=${master_exit}"
    echo "Checking for partial results..."
    bench_failed=1
else
    echo "=========================================="
    echo "Both benchmark jobs completed successfully!"
    echo "=========================================="
fi


Labels

Review effort 3/5 · size:L (This PR changes 100-499 lines, ignoring generated files)

Development

Successfully merging this pull request may close these issues.

Add build caching to self-hosted runners

1 participant