Audit any Claude skill or MCP server in 60 seconds.

Paste the GitHub URL below. Get a graded report card across security, permissions, credential handling, maintenance, client compatibility, and docs — before you claude plugin install it into your agent.

If we've already scanned it, we'll send you straight to the live report. Otherwise we'll email the report card within 24h. Free for public repos.

We ran the scanner against 101 real MCP servers — 50% shipped SSRF, 38% leaked credentials, only 19% earned an A. Every report is public.

Live corpus — updated each scan batch

We didn't just cite a scan. We ran one — and published all 101 reports.

101 real MCP servers audited

50% shipped an SSRF-prone fetch(url) with no allowlist

38% flagged for credential handling — env-var echoes or logged tokens

19% earned an A grade — clean tool surface across all six axes

Board covers vendor-official MCPs from AWS, Azure, Google Cloud, Google (mcp-toolbox), MongoDB, Elastic, Stripe, PayPal, Cloudflare, Heroku, Redis, Twilio, Neon, Qdrant, LangChain, Box, ClickHouse, Apollo GraphQL, Sentry, HubSpot, Algolia, Grafana, JetBrains, PostHog, Perplexity, Notion, Snowflake, dbt, Prisma, Confluent, Fastly, Honeycomb, Zilliz/Milvus, Xero, Couchbase, Pinecone, Axiom, Tavily, Microsoft (Playwright), Appwrite, Resend, Auth0, Weights & Biases, Pydantic (Logfire), Brave, Vectara, Meilisearch, JFrog, Pipedream, and Linear, plus Anthropic itself (across nine official language SDKs — TypeScript, Python, Ruby, Kotlin, Java, C#, Swift, Rust, Go) and popular indie frameworks (FastMCP, mcp-use, mcp-agent, Klavis, Figma-context, DuckDuckGo, korotovsky/slack, Neo4j). Read the full methodology →

The problem

The Claude skill ecosystem is exploding. The trust signal isn't.

There are now 8,000+ MCP servers across a dozen registries and Claude skills land on Anthropic's official directory daily. Anthropic itself requires a security review before listing — but there is no neutral, fast, reproducible audit a skill author or a team buyer can run today. Authors guess what reviewers want; buyers install community skills blind, then discover the credential-stealing prompt-injection in production.

Authors get rejected from listings — and have no idea what the reviewer flagged. Re-submitting is a guessing game across SSRF, secret handling, prompt-injection, and license compliance.
Buyers install blind — a green GitHub star count is not a security review. In our own scan of 101 real MCP servers — the kind of list you'd find on day one Googling "MCP server <vendor>" — 50% shipped an SSRF-prone fetch(url) with no allowlist and 38% had credential-handling findings. Every one of those is now a package someone's agent has already installed.
Security teams have no policy gate — there is no min-grade rule a tech lead can apply in CI to block a skill from being installed by their team. Approval becomes "I trust this username".

How it works

From URL to graded report card in three steps.

01
Paste a URL

GitHub repo, npm package, or upload a ZIP. Public scan free; private repo via single-repo OAuth scope, no org-wide access required.
02
Get graded

Static parse plus an LLM-assisted prompt-injection probe runs in about 60 seconds. The six-axis report card streams in as each check completes.
03
Earn the badge

Embed a public trust badge on your README so directory reviewers and buyers see your grade at a glance — or wire the CI Action to gate every install on a minimum grade.

Sample report excerpt — what the static scan returns for a real MCP server:

$ skillaudit scan github.com/user/mcp-weather
✓ Security         0 critical, 0 high, 1 low (informational)
✓ Permissions      requests only network:fetch, justified
✓ Credentials      no env-var echoes, no token logging
! Maintenance      last commit 87 days ago — flag for staleness
✓ Compatibility    Claude Code, Cursor, Windsurf, Codex
✓ Docs             README, runnable example, semver

Grade: A  · embed badge: [skillaudit.dev/badge/user/mcp-weather]

What you get

Six-axis scan. Public badge. CI gate. Done.

Six-axis security scan

Static SSRF, command-exec, secret-handling, plus an LLM-assisted prompt-injection red-team that probes your tool definitions for escape paths a simple grep can't find.

Public trust badge

Drop a Markdown badge on your README. Directory reviewers and buyers see your green grade before they read a line of code. Re-scans run on every push so the badge stays honest.

CI gate via GitHub Action

One line in your workflow blocks any PR from merging if the skill grade drops below your team policy. SBOM and audit log included for every scan, so compliance reviews stop being a manual scramble.

Cross-client compatibility

Per-client checks for Claude Code, Cursor, Windsurf, and Codex CLI. A Cursor-only quirk doesn't get reported as a Claude bug; a Claude-only feature doesn't fail your Codex install.

Pricing

Free for public repos. $19/mo when you ship for real.

Free

$0/mo

For authors trying out a public scan.

3 audits/month on public repos
Public trust badge
Basic six-axis report

Audit my repo

Pro

$19/mo

For indie authors and small teams shipping skills weekly.

Unlimited public + private repo audits
CI webhook + GitHub Action
Full report with remediation hints
Scan history + diff between versions

Audit my repo

Team

$99/mo

For 10-100 person orgs adopting community skills.

Everything in Pro for up to 10 seats
SSO + role-based access
Policy export (min-grade gate)
SBOM + audit log per scan

Questions

Frequently asked

Why not just use Snyk or Dependabot?

Snyk and Dependabot scan dependencies — they have no idea what your skill's prompt surface, MCP tool definitions, or credential handling actually do at runtime. SkillAudit's six-axis scan is purpose-built for the LLM-tooling stack: SSRF in tool calls, prompt-injection escape paths, env-var leakage in logs, and the prompt-surface checks that generic SAST tools cannot perform.

What clients does it work with?

Static analysis runs on any Claude skill or MCP server regardless of client. Our compatibility matrix flags client-specific issues for Claude Code, Cursor, Windsurf, and Codex CLI, so a Cursor-only quirk does not get reported as a Claude bug or vice versa.

Will Anthropic ship this themselves?

Possibly — Anthropic's official directory already requires a security review for listing. We are moving fast on the parts a first-party listing service is unlikely to ship: deeper LLM-assisted prompt-injection red-teaming, a CI gate for private-repo workflows, a public badge any author can embed regardless of where they publish, and cross-client compatibility testing.

What does "in 60 seconds" actually mean?

Static parse plus an LLM-assisted prompt-injection probe runs in roughly 60 seconds for typical skills under 2 MB. Larger MCP servers can take longer; we stream the report card section by section as each axis completes so you never stare at a spinner.

Do you store my source code?

We pull the repo into an ephemeral sandbox for analysis and discard the source as soon as the report is generated. Private-repo scans require an OAuth token scoped to single-repo access, never org-wide; we never request write permissions and we never train models on your code.

Get the green badge before you publish.

Queue an audit for your repo — free for public code. The first 100 authors who request a scan get Pro free for 6 months.

Audit my repo

More tools from the Startup Factory

runguard.dev — The circuit breaker your AI agent needed yesterday.
keeptier.com — Keep your tier. Lose the Apple tax.
chairhold.com — A $9 link that holds the chair — take a deposit before the appointment.
vialfile.com — The tracker the post-RFK peptide era needs.
clinicalingo.com — Spanish for the shift you're working tomorrow.
catalogscan.com — Is your store invisible to ChatGPT?
hourtab.com — Stop emailing clients "how many hours do I have left?"
rentceiling.com — Know your legal max. Serve the notice. Keep the receipts.
mcpreplay.com — Record. Replay. Catch every MCP regression.
foothold.community — Your paid Slack community has a first-week problem. We fix it.
glosscap.com — Captions that know your jargon.
alivemcp.com — Is your MCP server alive? We ping it every 60 seconds so you know before your users do.
keybrake.com — Put the brakes on your agent's keys.
claimhour.com — Claim every hour you bill.
whychose.com — The log of how you decided — auto-written from your AI chats.
therapydraft.com — HIPAA by architecture, not by contract.
glyphward.com — See what text-only scanners miss.

Audit any Claude skill or MCP server in 60 seconds.

We didn't just cite a scan. We ran one — and published all 101 reports.

The Claude skill ecosystem is exploding. The trust signal isn't.

From URL to graded report card in three steps.

Paste a URL

Get graded

Earn the badge

Six-axis scan. Public badge. CI gate. Done.

Six-axis security scan

Public trust badge

CI gate via GitHub Action

Cross-client compatibility

Free for public repos. $19/mo when you ship for real.

Free

Pro

Team

Frequently asked

Get the green badge before you publish.

More tools from the Startup Factory