Back to directory
Testing & Reviewgastonche/promptopus
promptopus
A config-driven LLM evaluation harness — CLI + dashboard. Define an eval in YAML, run it against many models, score every output with deterministic, LLM-as-judge, and cost/latency graders, and compare side by side.
Suggested install command
npx skills add gastonche/promptopus/promptopusAlways inspect the linked repository and skill instructions before running commands. Skills are instructions; permissions and execution still matter.
Compatibility
Agent support matrix
3 supported
| Agent | Status |
|---|---|
| Claude Code | Supported |
| OpenCode | Not listed |
| Cursor | Supported |
| MCP | Not listed |
| GitHub Copilot | Not listed |
| Windsurf |