Execute a string of JavaScript using Node.js and return the global variable values and functions.
> Semantically a dialect of ClojureScript. Built with Rust. Compiles to JavaScript ES Modules.
TypeScript SDK for content evaluation
A verification toolchain for TypeScript — generates Lean 4 or Dafny from annotated TS
Structured logging, OpenTelemetry tracing, and metrics for RAG evaluations
A Webpack plugin to transpile async module output using Babel. Allows transpiling top level await to ES5.
Cost tracking, pricing, budgeting, and reporting for RAG evaluations
The command-line interface for Gadget
Fast, compiled, eval-free data validator/transformer
Much like tests in traditional software, evals are an important part of bringing LLM applications to production. The goal of this package is to help provide a starting point for you to write evals for your LLM applications, from which you can write more c
A virtual console for capturing and manipulating terminal output.
SWE-bench (Lite/Verified/Full) evaluation harness for nexus-agents — clean-room implementation, model-only baseline
sort ssb messages by cryptographic order
CLI entry point and commands for the RAG evaluation toolkit
Central evaluation orchestrator that ties metrics, judge, cost, gate, and dataset together
Trajectory loading, evaluation, and comparison for agent-eval-harness
Swagger 2.0 and OpenAPI 3.0/3.1 parser and validator for Node and browsers
LLM-as-judge with calibration, consensus voting, and cost tracking
Generic wave-based multi-agent orchestration for repository work.
Lezer-based Clojure Codemirror 6 extension with live evaluation
Statsig helps you move faster with feature gates (feature flags), and/or dynamic configs. It also allows you to run A/B/n tests to validate your new features and understand their impact on your KPIs. If you're new to Statsig, check out our product and cre
Evaluation toolkit for HazelJS AI apps — golden datasets, RAG metrics, agent trajectories, LLM-as-judge, CI reports
Evaluation CLI for AI Observability on Dynatrace
No description provided.