Install AgentGuardian and red-team a live testbench in three minutes. No keys to provision, no target to host — justDocumentation Index
Fetch the complete documentation index at: https://docs.agentguardian.io/llms.txt
Use this file to discover all available pages before exploring further.
pip install, paste the
command, watch the swarm work.
When to use this
- First time trying AgentGuardian and you want a real score on a real agent in the next three minutes.
- Showing a teammate what the tool actually does.
- Sanity-checking your install before pointing the scanner at your own agent (the first-scan tutorial covers that path).
Run the scan
Install
Expected output
The first two lines are emitted immediately — the auto-served dashboard URL goes live before the swarm starts so you can watch the agents work:--serve-grace-seconds 300), giving you time to open it and drill into
any finding.
How to interpret
AIVSS = 41
A 0-100 deterministic score where higher is safer. 41 falls in
the
POOR band (40-59). This testbench is intentionally
permissive — your own agent should score higher.band = not_evaluated
The scan covered 54% of the planned probes, below the
--mode full
authoritative threshold of 95%. The number is real; the grade is
withheld until coverage clears the bar. Raise --budget-usd to
finish the planned 156 turns.tier = T4
Tier auto-detected from the target’s surface. T4 = prompt-only.
Endpoints with tools or memory get classified T1-T3 and exercised
with more probes.
findings = 15
Twelve
high and three low, spanning prompt injection (ASI01),
PII leakage (ASI03), detection evasion (ASI08), denial-of-wallet
(ASI09), and drift (ASI10). Every finding has a reproducible PoV
in the report.AIVSS bands map score to grade:
EXCELLENT 90-100, GOOD 80-89,
WARNING 60-79, POOR 40-59, CRITICAL 0-39. A band of
not_evaluated means the scan is non-authoritative — either coverage
was below the mode floor, or the evaluator was a stub.You saw
AIVSS=41 and 15 findings stream into the terminal. The
dashboard at http://127.0.0.1:7474/scans/cli-... opened with the
swarm board and finding feed.Next step
Your first scan
Run the swarm against your own LangGraph / OpenAI-Agents / Strands
agent. Read every line of the resulting
scan.json.How the swarm works
The six phases — Recon, Decompose, Parallel launch, Checkpoint,
Budget donate, Finalise — and the 13 specialist agents that drive
them.
Open the report
Five report formats (JSON, SARIF, JUnit, Markdown, PDF) plus
Ed25519-signed evidence bundles.
Gate a PR on AIVSS
Wire AgentGuardian into GitHub Actions with SARIF upload and
--fail-under to block regressions.