Testing & Review | gastonche/promptopus

promptopus

A config-driven LLM evaluation harness with a CLI and a dashboard. Define an eval in YAML, run it against many models, score every output with deterministic, LLM-as-judge, and cost/latency graders, and compare the results side by side.

Claude Code, Codex, Cursor

Suggested install command

npx skills add gastonche/promptopus/promptopus

Always inspect the linked repository and skill instructions before running commands. Skills are instructions; permissions and execution still matter.

Compatibility

Agent support matrix

3 supported

Agent	Status
Claude Code	Supported
OpenCode	Not listed
Cursor	Supported
MCP	Not listed
GitHub Copilot	Not listed
Windsurf