Docs & WritingIgorGanapolsky/rlhf-feedback-loop

rlhf-feedback-loop

Feedback-Driven Development (FDD) for AI agents — capture preference signals, steer behavior via Thompson Sampling, and export KTO/DPO training pairs for downstream fine-tuning.

Claude Code Codex Cursor

Suggested install command

npx skills add IgorGanapolsky/rlhf-feedback-loop/rlhf-feedback-loop

Always inspect the linked repository and skill instructions before running commands. Skills are instructions; permissions and execution still matter.

Instala en 1 click

Submit a related skill

Compatibility

Agent support matrix

3 supported

Agent	Status
Claude Code	Supported
OpenCode	Not listed
Cursor	Supported
MCP	Not listed
GitHub Copilot	Not listed
Windsurf