Skill Scanner Test

An automated security scanning pipeline for AI agent skills and plugins in public skill directories.

AI agent skills -- installable packages that extend what coding assistants and AI agents can do -- are a growing attack surface. Skills can contain prompt injection, data exfiltration, command injection, and other vulnerabilities, whether introduced intentionally or by accident. This project systematically scans public skills using Cisco's open-source skill-scanner and publishes the results.

Status

This is early-stage and evolving. The current implementation:

Scanner: Cisco AI Defense skill-scanner (static + behavioral + trigger + LLM + meta-analysis)
Skill registries: skills.sh and clawhub.ai
Coverage: 614 skills scanned so far
Test suite: 77 tests passing

This could be expanded along both axes -- adding more scanners and targeting more skill directories.

Scan Results

Metric	Count
Total Skills Scanned	616
Safe Skills	485 (79%)
Skills with Issues	131 (21%)
Total Findings	1588

Severity	Count
CRITICAL	356
HIGH	188
MEDIUM	625
LOW	419

Category	Count
policy_violation	394
data_exfiltration	335
command_injection	324
transitive_trust_abuse	169
unauthorized_tool_use	91
tool_chaining_abuse	85
hardcoded_secrets	71
social_engineering	35
skill_discovery_abuse	35
prompt_injection	24
resource_abuse	13
autonomy_abuse	7
obfuscation	5

These results are visualized at skillscan.appsechq.com. See summary-report.md for detailed findings by skill, severity breakdowns, and top risks. Per-skill scan results (JSON + Markdown) are in the results/ directory.

Getting Started

Requires Python 3.10+ and Git.

git clone https://github.com/AppSecHQ/skill-scanner-test.git
cd skill-scanner-test
python -m venv .venv
source .venv/bin/activate
pip install -r scripts/requirements.txt

If you're in a container or environment where you don't need a venv, you can skip the venv steps and install directly with pip install -r scripts/requirements.txt --break-system-packages.

Scan top skills from a registry

# Scan top 25 skills from skills.sh (default)
python scripts/scan-skills.py -n 25

# Scan top 10 from clawhub.ai
python scripts/scan-skills.py --source clawhub -n 10

# Scan skills 11-25 (pagination)
python scripts/scan-skills.py -n 15 --offset 10

# Enable LLM semantic analysis (requires SKILL_SCANNER_LLM_API_KEY)
python scripts/scan-skills.py -n 25 --use-llm

# Enable LLM + meta-analysis for false positive filtering
python scripts/scan-skills.py -n 25 --use-llm --enable-meta

Scan a specific repo

# Scan a single GitHub repo
python scripts/scan-skills.py --repo owner/repo --repo-only

# Scan a repo alongside top skills
python scripts/scan-skills.py -n 10 --repo owner/repo

Other options

# List skills without scanning
python scripts/scan-skills.py --list-only -n 25

# Generate report from existing results (skip fetch + scan)
python scripts/scan-skills.py --skip-scan -o results

# Custom report name
python scripts/scan-skills.py -n 25 --report-name top-25

# Verbose logging
python scripts/scan-skills.py -n 10 -v

Run python scripts/scan-skills.py --help for the full set of options.

Running tests

python -m pytest tests/ -v

Project Structure

scripts/
  scan-skills.py          # Main orchestrator and CLI
  fetch_skills.py         # API fetching from skills.sh and clawhub.ai
  run_scans.py            # Clone, download, and scan logic
  generate_report.py      # Aggregate results into markdown/JSON reports
  pipeline_utils.py       # Shared utilities: logging, HTTP session management
  requirements.txt        # Dependencies

tests/                    # 77 tests across 4 modules
  conftest.py             # Shared fixtures
  test_run_scans.py       # Clone, download, scan, ZIP security tests
  test_generate_report.py # Report aggregation tests
  test_fetch_skills.py    # API fetching tests
  test_pipeline_utils.py  # Session management and retry tests

results/
  summary-report.md       # Consolidated findings across all skills
  <skill-name>-scan.json  # Raw scan output per skill
  <skill-name>-scan.md    # Readable scan report per skill

Cloned skill repositories are kept in skills/ locally but excluded from version control via .gitignore.

Known Limitations

No runtime verification -- analysis is static and semantic only
LLM and meta-analysis require an API key (SKILL_SCANNER_LLM_API_KEY); without one, scans fall back to static/behavioral/trigger only
Limited to skills with public source repos
Scanner coverage depends on the rule sets of the underlying tools

Links

skillscan.appsechq.com -- interactive dashboard of scan results
Cisco AI Defense skill-scanner
skills.sh
clawhub.ai
Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale -- large-scale security analysis of 31k+ skills from skills.rest and skillsmp.com

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 81 Commits
.claude		.claude
docs		docs
results		results
scripts		scripts
tests		tests
.gitignore		.gitignore
AUTOMATION-PLAN.md		AUTOMATION-PLAN.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
PROJECT-PLAN.md		PROJECT-PLAN.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Skill Scanner Test

Status

Scan Results

Getting Started

Scan top skills from a registry

Scan a specific repo

Other options

Running tests

Project Structure

Known Limitations

Links

License

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

AppSecHQ/skill-scanner-test

Folders and files

Latest commit

History

Repository files navigation

Skill Scanner Test

Status

Scan Results

Getting Started

Scan top skills from a registry

Scan a specific repo

Other options

Running tests

Project Structure

Known Limitations

Links

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages