Test your LLM-powered apps with a TypeScript-native, Vitest-based eval runner. No API key required.
Test your LLM-powered apps with a TypeScript-native, Vitest-based eval runner. No API key required.
Test your LLM-powered apps with a TypeScript-native, Vitest-based eval runner. No API key required.
Test your LLM-powered apps with a TypeScript-native, Vitest-based eval runner. No API key required.
Evaluation tools for AI components, functions, workflows, and agents
Metrics for Open Evals
Evaluation suite for swarm-tools multi-agent coordination
eval-genius enables evals of arbitrary async code. It is generally intended for making multiple assertions on outputs which are generated nondeterministically. These assertions can be used to score algorithms on their effectiveness.
[Docs](https://docs.tscircuit.com) · [Website](https://tscircuit.com) · [Twitter](https://x.com/tscircuit) · [Discord](https://tscircuit.com/community/join-redirect) · [Quickstart](https://docs.tscircuit.com/quickstart) · [Online Playground](https://tscir
Test your LLM-powered healthcare and life sciences apps with a TypeScript-native, Vitest-based eval runner. No API key required.
No description provided.
An interactive CLI to iterate on project specifications with AI
Evaluation suite for swarm-tools multi-agent coordination
A Vitest-like CLI for AI agent evaluations. Test your LLM apps with simple, declarative evals.
Command-line workflows for ViteHub projects.
a toy interpreter
Numerical evaluator for mathlex ASTs with broadcasting support