Code repo for the paper ProSec: Fortifying Code LLMs with Proactive Security Alignment.
The pipeline follows these stages:
```
Synthesize CWE-Inducing Instructions
                |
                v
     Generate Vulnerable Code          Generate Benign Code
                |                               |
                v                               |
Detect Vulnerabilities (Purple Llama)           |
                |                               |
                v                               |
     Generate Fixes & Re-detect                 |
                |                               |
                v                               v
          Mix Fixed Code with Benign Code
                |
                v
          Final Training Dataset
```
Clone the repository with its submodule:
```sh
git clone --recurse-submodules https://github.com/PurCL/ProSec.git
```

If you have already cloned without `--recurse-submodules`, fetch the submodule separately:

```sh
git submodule update --init --recursive
```

Requirements:

- Python 3
- The tested model must be hosted via vLLM behind an OpenAI-compatible API endpoint.
- PurCL's Purple Llama is included as a git submodule under `PurpleLlama/`.
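Since every generation step talks to the tested model through an OpenAI-compatible endpoint, a request is just a standard chat-completion JSON body. The sketch below only builds that body; the endpoint URL and model name are placeholders, and the real addresses live in `src/gen_inferences.py`.

```python
import json

# Hypothetical endpoint; the repo's scripts hold the real vLLM addresses.
VLLM_URL = "http://localhost:8000/v1/chat/completions"

def build_request(model: str, instruction: str, temperature: float = 0.6) -> str:
    """Serialize one inference request as the JSON body an
    OpenAI-compatible server (such as vLLM) expects."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": instruction}],
        "temperature": temperature,
    }
    return json.dumps(body)

# The resulting payload could be POSTed to VLLM_URL with any HTTP client.
payload = build_request("tested-model", "Write a C function that parses a URL.")
```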
Synthesize instructions for a single CWE-language pair:
```sh
./synth_claude.sh <CWE_ID> <LANG>
```

This generates instructions and clusters them to select 2000 per pair.
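The cluster-then-select step keeps the 2000 retained instructions diverse rather than taking the first 2000 generated. As a rough sketch of that idea, the function below round-robins across crude "clusters" (grouping by first word is only a stand-in; the actual pipeline clusters with embeddings):

```python
import itertools
from collections import defaultdict

def select_diverse(instructions, target, key=lambda s: s.split()[0].lower()):
    """Pick up to `target` instructions, round-robin across clusters.

    `key` is a toy clustering function for illustration; the real
    pipeline groups semantically similar instructions."""
    clusters = defaultdict(list)
    for inst in instructions:
        clusters[key(inst)].append(inst)
    selected = []
    # zip_longest interleaves clusters so no single cluster dominates
    for batch in itertools.zip_longest(*clusters.values()):
        for inst in batch:
            if inst is not None and len(selected) < target:
                selected.append(inst)
    return selected
```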
To synthesize for all CWE-language pairs at once:
```sh
./synth_all.sh
```

Note: Set the `HF_USER` environment variable to your HuggingFace username before running any scripts (e.g., `export HF_USER=your-hf-username`). Make sure to `mkdir` the output directory before running the script.
Generate vulnerable code for all CWE-language pairs using the tested model:
```sh
./infer_all_claude.sh
```

Note: Modify `src/gen_inferences.py` to specify the addresses of the hosted vLLM model.
Generate normal (non-vulnerable) code with the original instructions:
```sh
./infer_all_claude_ori_task.sh
```

Note: Host the tested model via vLLM and modify `src/gen_inferences.py` accordingly.
This step detects vulnerabilities, generates fixes, and pairs them up. It uses scripts from both this repo and the PurpleLlama/ submodule.
Create a symlink from the output directory of `infer_all_claude` to the `PurpleLlama/` directory, then merge the inference results:

```sh
python3 PurpleLlama/prosec_scripts/merge_multiple_infer_rets.py
```

Also merge the benign inference results to produce `infer-ret-original.jsonl`.

Note: You need to manually modify the merge script before running it.
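The merge amounts to concatenating per-(CWE, language) jsonl shards into one file. A minimal sketch of that operation (the `source_shard` field is an illustrative addition, not the merge script's actual schema):

```python
import json
from pathlib import Path

def merge_jsonl(shard_paths, out_path):
    """Concatenate jsonl shards into one file, tagging each record
    with the shard it came from."""
    with open(out_path, "w") as fout:
        for path in shard_paths:
            with open(path) as fin:
                for line in fin:
                    rec = json.loads(line)
                    rec["source_shard"] = Path(path).stem
                    fout.write(json.dumps(rec) + "\n")
```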
```sh
python3 PurpleLlama/prosec_scripts/detect_all.py
```

This produces `detection-ret.jsonl`.
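Downstream steps only need fix prompts for the generations that were actually flagged, so the detection output has to be partitioned. A minimal sketch (the `is_vulnerable` field name is an assumption; check `detection-ret.jsonl` for the schema the detector actually emits):

```python
def split_by_detection(records, flag_key="is_vulnerable"):
    """Partition detection records into flagged and clean groups."""
    flagged, clean = [], []
    for rec in records:
        (flagged if rec.get(flag_key) else clean).append(rec)
    return flagged, clean
```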
```sh
python3 src/gen_fix_inference_prompts.py \
    --fin detection-ret.jsonl \
    --fout-stats detection-ret.stats.json \
    --fout detection-ret.fix-prompt.jsonl
python3 src/gen_fix_inference.py \
    --prompts_in detection-ret.fix-prompt.jsonl \
    --fout detection-ret.fixed.jsonl
```

Note: Host the tested model and modify `src/gen_fix_inference.py`.
```sh
python3 PurpleLlama/prosec_scripts/detect_all_from_fixed.py
```

This produces `detection-ret-fixed.jsonl`.
```sh
python3 src/collect_and_upload_fixed_batch.py \
    --detection_ret detection-ret.jsonl \
    --fixed_detected_ret detection-ret-fixed.jsonl \
    --ds_name <name-of-the-dataset> \
    --fout <intermediate-results>
```

This produces a fix-pair dataset (e.g., `purcl/fix-dataset`).
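Conceptually, building the fix-pair dataset is a join: a snippet that was flagged in the first detection pass is paired with its fix only if the re-detection pass comes back clean. The field names (`id`, `code`, `is_vulnerable`) and the chosen/rejected framing below are illustrative assumptions; see `src/collect_and_upload_fixed_batch.py` for the real schema:

```python
import json

def build_fix_pairs(detection_path, fixed_detection_path):
    """Pair each flagged snippet with its fix that passed re-detection."""
    def load(path):
        recs = {}
        with open(path) as f:
            for line in f:
                rec = json.loads(line)
                recs[rec["id"]] = rec
        return recs

    originals = load(detection_path)
    fixed = load(fixed_detection_path)
    pairs = []
    for rid, orig in originals.items():
        fix = fixed.get(rid)
        # keep a pair only when the original was flagged and the fix came back clean
        if orig.get("is_vulnerable") and fix and not fix.get("is_vulnerable"):
            pairs.append({"rejected": orig["code"], "chosen": fix["code"]})
    return pairs
```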
Concatenate multiple CWE-inducing instruction datasets:
```sh
python3 src/concat_dataset.py
```

Note: You will need to manually modify this file. Suppose the output is `purcl/concat-dataset`.
Clean the benign data and mix with the fixed code:
```sh
python3 src/clean_benign_data.py --fin infer-ret-original.jsonl
python3 src/mix_and_upload_original_w_fixed_batch.py \
    --inst_ds_name purcl/concat-dataset \
    --fix_pair_ds_name purcl/fix-dataset \
    --infer_ori_in infer-ret-original-filtered.jsonl \
    --out_ds_name <output-dataset-name>
```

The `influence_score/` module provides tools for computing training dynamics and influence scores over synthesized datasets. These scores measure how individual training samples contribute to security alignment, enabling better data selection strategies. More detailed instructions will be published soon.
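To give a feel for what a training-dynamics signal looks like, the sketch below summarizes how a single sample's log-probability evolves across checkpoints: curves that rise steadily indicate the sample is being learned, while flat or falling curves suggest low influence. This is only the intuition behind the module, not the repo's actual metric:

```python
def training_dynamics(logprobs_per_checkpoint):
    """Summarize one sample's log-probability trajectory over checkpoints.

    Illustrative only; influence_score/ computes richer statistics."""
    deltas = [b - a for a, b in zip(logprobs_per_checkpoint,
                                    logprobs_per_checkpoint[1:])]
    return {
        "total_gain": logprobs_per_checkpoint[-1] - logprobs_per_checkpoint[0],
        "mean_delta": sum(deltas) / len(deltas),
    }
```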
Key components:
| Module | Description |
|---|---|
| `data_utils_refactored.py` | Entry point: prepares selection datasets from instruction, fix-pair, and benign data |
| `training_dynamics_refactored.py` | Collects log-probabilities and accuracy across training checkpoints |
| `sample_refactored.py` | Data selection strategies based on training-dynamics correlations |
| `scores.py` | Computes sequence-level log-probabilities and normalized scores |
| `collator.py` | Data collator for pairwise training data |
| `collect_grad_reps.py` | Gradient representation collection using TRAK for influence estimation |
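As a small illustration of the kind of quantity `scores.py` computes (not its exact formula), a sequence-level log-probability is the sum of per-token log-probabilities, and length-normalizing it makes long and short samples comparable:

```python
import math

def sequence_logprob(token_logprobs, length_normalize=True):
    """Sum per-token log-probabilities; optionally divide by length."""
    total = sum(token_logprobs)
    if length_normalize and token_logprobs:
        return total / len(token_logprobs)
    return total

def perplexity(token_logprobs):
    """Perplexity follows directly from the length-normalized score."""
    return math.exp(-sequence_logprob(token_logprobs))
```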