Multi-turn Claude session driver
Eval pipeline orchestrator for Claude Code
Grade LLM outputs against checks files using an LLM judge