Topic: mcp server security scan

How to run an MCP server security scan

If you're about to claude plugin install a community Model Context Protocol server, or you maintain one and want a green badge before publishing, here's the actual workflow — what to submit, what comes back, and what to do with it.

TL;DR

Paste a GitHub URL into the SkillAudit hero form, wait ~60 seconds, get a report card with a single A–F grade plus pass/warn/fail across six axes (security, permissions, credentials, maintenance, client compatibility, documentation). For buyers: A or B = install with confidence, C = install pinned to a reviewed commit, D or F = block or fix-then-revisit. For authors: re-scan after each release; the grade rebuilds from the latest commit. Across our 101-server corpus, only 19% earned an A; the median was a C. Don't install blind.

Why scan, and why now

2026 is the year MCP went mainstream and the year MCP-specific exploits stopped being theoretical. The public scan results are blunt: 50% of community MCP servers ship SSRF in tool handlers, 38% have credential-handling findings, 10% have command-exec sinks. None of these leave a CVE — they're first-party code, written this week, by indie authors who haven't shipped to a public marketplace before. Conventional dependency scanners pass them clean. The trust signal a buyer or marketplace reviewer needs has to come from somewhere else.

For three buyer profiles, scanning is a clear win:

Solo developer about to install one server. A 60-second public scan is cheaper than installing a credential-stealer and unwinding the blast radius after the fact. The scan output is a stable URL; bookmark it and re-check on each upgrade.
Author about to publish a Claude skill or MCP server. The Anthropic Skills Directory and most marketplaces now require a security review before listing. A public A or B grade short-circuits the back-and-forth — reviewers can read the report card themselves. The five patterns that get you an A are documented; most are mechanical.
Team lead approving a server for fleet-wide use. "I want one defensible reason this doesn't live in our agents" is the question. The grade plus the per-axis breakdown answers it on a single page. The Team plan adds a CI gate; the GitHub Action page covers the workflow.

Step-by-step: how to run a scan

Find the canonical source. The thing your client will actually install — usually a public GitHub repo, sometimes an npm package whose source has drifted from the README's GitHub link. If they disagree, scan the npm tarball; that's what installs. SkillAudit accepts all three: GitHub URL, npm package name, ZIP upload.
Submit it. Paste the URL into the hero form on the homepage. The scan starts immediately on a worker; no signup is required for public-repo audits on the free tier.
Wait ~60 seconds. The static layer (tree-sitter pattern matching tuned to MCP idioms) finishes first, in under 10 seconds. The LLM-assisted layer (prompt-injection probing of extracted tool handlers via Claude Haiku 4.5) takes longer — proportional to tool count. Each axis renders into the report card as it lands; you don't have to wait for everything.
Read the grade with the per-axis breakdown. The A–F grade is a single-glance signal. The six pass/warn/fail axes are how you'd defend the grade in a code review. Findings have file paths and line numbers; you can verify each one against the source.
Cite the report URL. If the scan helps an install decision or a publishing decision, link the stable /audits/owner-repo/ URL in your PR description, your commit message, your README, or your fleet-policy doc. The scan is reproducible from public source; readers can click through.

Run a scan now

What comes back: reading the six-axis report card

The report card has the same shape for every server. The score on each axis is independently produced and shown alongside the overall letter grade so you can read where the grade came from:

Security. SSRF, command-exec, secret-handling static checks plus the LLM-assisted prompt-injection probe. The single highest-weight axis. A finding here usually drives the overall grade down by at least one letter.
Permissions hygiene. Does the server ask for more scope than its tools demonstrate? OAuth scopes, env vars, file paths, network egress. Common pattern: a "read-only" tool that requests repo instead of public_repo. Findings here often cost a letter on their own.
Credential exposure. Are env-var reads traced into tool-response paths and logger calls? A walkthrough of how leaks land — the most embarrassing class of finding because authors almost always added the offending log line for debugging and forgot to remove it.
Maintenance. Last commit, open-issue ratio, archive flag. Nine of 101 servers we scanned were archived. Installing one of those means no future security patch — a non-trivial silent risk.
Client compatibility. Targets Claude Code, Cursor, Windsurf, Codex, JetBrains plugin? Protocol-version drift quietly breaks installs.
Documentation completeness. Runnable example, semver, changelog, README that matches the registered tools. Lower-weight axis but a high-correlation signal: poorly-documented servers cluster with the F-grade group.

What to do with each grade

Grade	Meaning	Recommended action (buyer)	Recommended action (author)
A	Clean across all axes; LLM probe found no exploitable injection vectors	Install with confidence. Re-check on major-version bumps.	Embed the badge. You're done.
B	Minor warnings, no high-severity findings	Install. Read the warnings; some are documentation-only.	Address the warnings if you have a 30-minute budget; A is reachable.
C	One mid-severity finding or multiple warnings; security axis is borderline	Install pinned to a specific commit you've reviewed. Don't auto-update.	Fix the one finding; you'll move to a B or A. The remediation hint will name the file.
D	One high-severity finding (e.g. SSRF in a registered tool) or multiple mid-severity	Block in fleet policy. If you must install, fork and patch.	Fix before publishing. The marketplace will reject this; the listing review uses the same axes.
F	Multiple high-severity findings, archived, or LLM probe successfully extracted credentials	Do not install. Note in your team's deny-list with the report URL as the citation.	The fixes are usually mechanical — re-read the A-grade patterns; most are absent.

When to re-scan

Before each install. The scan is free for public repos. A repo that was an A six months ago might be an F today if maintenance lapsed and the LLM-probe coverage caught up to a class of bug it didn't model before. The grade is timestamped.
On every major-version bump. A new tool registered in 1.0 → 1.1 is a new attack surface. Re-scan and re-read the per-axis breakdown.
After any maintainer change. Author handoffs are correlated with the introduction of new findings — particularly credential-echo, where the new maintainer added a debug log and didn't remove it. Trust handoff = re-scan.
On engine version bumps. When SkillAudit ships a new engine release (currently v0.3, with a calibration writeup at engine-v03-calibration-delta), the grade rebuilds. Some servers that were A on v0.2 dropped to B or C on v0.3 because the new probe caught additional patterns. We're explicit about each engine release's delta.