BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

cross-ecosystem search · live

Results for evals

Found in 5 of 7 ecosystemsnpm 1–24 of 547 · 2322 matches across other registries

npm547 PyPI1 crates.io2298 RubyGems12 NuGet11

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

npm matches

Showing 24 of 547 · JavaScript

Arize evals package

Maintained. Maintained, actively maintained.

axiomv0.52.2

npm

Axiom AI SDK provides - an API to wrap your AI calls with observability instrumentation. - offline evals - online evals

Maintained. Maintained, actively maintained.

@harnessio/react-ai-evals-service-clientv0.25.0

npm

Harness AI Evals Service APIs integrated with react hooks

Maintained. Maintained, actively maintained.

@hamming/hamming-sdkv1.0.27

npm

SDK for Hamming Evals Framework

Abandoned. Last published over a year ago.

@vitest-evals/harness-ai-sdkv0.11.0

npm

AI SDK harness adapter for vitest-evals.

Maintained. Maintained, actively maintained.

@runflow-ai/evalsv0.0.8

npm

Runflow Evals — project-local evals framework for Runflow agents (datasets, scorers, journey/conversation validation, LLM judge, viewer)

Maintained. Maintained, actively maintained.

@vitest-evals/harness-openai-agentsv0.11.0

npm

OpenAI Agents SDK harness adapter for vitest-evals.

Maintained. Maintained, actively maintained.

openevalsv0.2.0

npm

Much like tests in traditional software, evals are an important part of bringing LLM applications to production. The goal of this package is to help provide a starting point for you to write evals for your LLM applications, from which you can write more c

Maintained. Maintained, actively maintained.

@vitest-evals/harness-pi-aiv0.11.0

npm

pi-ai harness adapter with tool replay for vitest-evals.

Maintained. Maintained, actively maintained.

@vitest-evals/github-reporterv0.11.0

npm

GitHub Actions reporting internals for vitest-evals runs.

Maintained. Maintained, actively maintained.

@mcpjam/sdkv1.10.1

npm

MCP server unit testing, end to end (e2e) testing, and server evals

Maintained. Maintained, actively maintained.

mcp-evalsv2.0.1

npm

GitHub Action for evaluating MCP server tool calls using LLM-based scoring

Aging — last published 11 months ago — check before adopting.

evalzv0.2.2

npm

Model graded evals with typescript

Abandoned. Last published over a year ago.

@silvermine/undertemplatev1.0.2

npm

Replacement for _.template (underscore or lodash) without unsafe evals.

Abandoned. Last published 5 years ago.

@mastra/evalsv1.2.4

npm

No description provided.

Maintained. Maintained, actively maintained.

maestro-evalsv1.0.0

npm

Golden-prompt regression guard for the Maestro agent runtime. Static evals (mock-based, every CI) plus live evals (real Anthropic, scheduled) that catch the four Anthropic tool-calling traps before they ship.

Maintained. Maintained, actively maintained.

@viteval/uiv0.5.9

npm

Viteval UI - local UI for viewing the results of your evals