BETA · modules.com is in beta, open to partnerships & joint ventures. Build with us

cross-ecosystem search · live

Results for judges

Found in 5 of 7 ecosystems · npm 1–24 of 179 · 457 matches across other registries

npm 179 · PyPI 1 · crates.io 442 · RubyGems 12 · NuGet 2

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.
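The split between free-text and exact-name lookup can be illustrated with a minimal sketch. The ecosystem-to-mode mapping comes from the note above; everything else (the names `SearchMode`, `SEARCH_MODES` and `is_match`, and the choice of a case-insensitive substring test over name and description) is a hypothetical stand-in, since the page does not say how free-text matching is actually implemented.

```python
from enum import Enum


class SearchMode(Enum):
    FREE_TEXT = "free-text"
    EXACT_NAME = "exact-name"


# Mapping taken from the "How we search" note; the identifiers are illustrative.
SEARCH_MODES = {
    "npm": SearchMode.FREE_TEXT,
    "crates.io": SearchMode.FREE_TEXT,
    "RubyGems": SearchMode.FREE_TEXT,
    "NuGet": SearchMode.FREE_TEXT,
    "Maven": SearchMode.FREE_TEXT,
    "PyPI": SearchMode.EXACT_NAME,
    "Go": SearchMode.EXACT_NAME,
}


def is_match(ecosystem: str, query: str, name: str, description: str = "") -> bool:
    """Return True if a package would match the query in the given ecosystem.

    Assumption: free-text search is modelled as a case-insensitive substring
    test on name and description; the real registries may tokenize or rank
    differently.
    """
    mode = SEARCH_MODES[ecosystem]
    needle = query.casefold()
    if mode is SearchMode.EXACT_NAME:
        # Exact-name ecosystems only match when the whole name equals the query.
        return name.casefold() == needle
    return needle in name.casefold() or needle in description.casefold()
```

Under these assumptions, a package is found on npm when "judges" appears only in its description, while on PyPI a package named "judges-cli" would not be returned for the same query. That may be one reason the exact-name ecosystems report far fewer matches.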

npm matches

Showing 24 of 179 · JavaScript

@kevinrabun/judges v3.129.9

npm

45 specialized judges that evaluate AI-generated code for security, cost, and quality.

Maintained.

@kevinrabun/judges-cli v3.129.9

npm

CLI wrapper for the Judges code review toolkit.

Maintained.

Coming soon.

Aging — last published 6 months ago — check before adopting.

@agentv/eval v4.31.3

npm

Evaluation SDK for AgentV - build custom code judges

Maintained.

backant-kairos v1.0.2

npm

Autonomous AI engineer — observes, judges, builds, ships

Maintained.

@wifo/factory-spec-review v0.0.14

npm

LLM-judged spec quality reviewer — runs subscription-paid `claude -p` judges against software-factory specs and emits findings in factory-spec-lint's output format

Maintained.

ohbem v1.5.3

npm

Ohbem judges your Pokemon GO IVs.

Abandoned. Last published 2 years ago.

mahout-bench v1.0.1

npm

CLI benchmark for measuring and mitigating sycophancy in LLMs. Supports multi-provider execution, configurable judges, and long-running evaluation campaigns.

Maintained.

agent-skills-eval v0.1.1

npm

TypeScript SDK and CLI for evaluating agentskills.io-style AI agent skills with LLM judges, baseline comparison, YAML config, JSONL logs, and HTML reports.

Maintained.

splitifi-mcp v2.2.0

npm

Splitifi Intelligence MCP — outcome predictions, judge profiles, and legal workflows powered by 4,819 ML models trained on 102M+ court records. Serves litigants, attorneys, judges, mediators, CDFAs, and litigation funders.

Maintained.

@restormel/testing-runner v0.1.8

npm

Suite execution: browser goals, Keys-backed judges, retries, artefacts.

Maintained.

@iflow-mcp/kevinrabun-judges v3.38.0

npm

45 specialized judges that evaluate AI-generated code for security, cost, and quality.

Maintained.

@keptn/pitometer v1.1.0

npm

Collects metrics and judges the health of a deployment

Abandoned. Last published 6 years ago.

@jgsheppa/llm-as-judge-mcp-server v0.8.3

npm

An MCP server that uses large language models (LLMs) as judges to evaluate the responses of other LLMs.

Aging — last published 7 months ago — check before adopting.

judge-cli v0.1.6

npm

AI-Powered Code Quality Assistant utilizing parallel specialized expert judges.

Maintained.

@apps-machine/selection-agent v0.11.0

npm

Apps Machine — Selection Agent. Ranks app opportunities globally via dual-store (Apple App Store + Google Play) scraping, heuristic scoring, and Claude judges. Run `npx @apps-machine/selection-agent demo` for a 30s magical moment.

Maintained.

discord-osiris v2.0.2

npm

Judges suspicious Discord links

Abandoned. Last published 3 years ago.

how-good-is-your-film v1.3.1

npm

How good is your film? Sarah judges all

Abandoned. Last published 2 years ago.

pi-until-done v0.2.2

npm

Pi extension that brings Hermes Agent's /goal (Ralph loop with judge) to Pi as /until-done. Pi self-judges every turn, runs verifyCommand to confirm done, and routes all CI/CD through mise across 18 language profiles.

Maintained.

vitest-evals v0.11.0

npm

Harness-backed AI testing on top of Vitest.

Maintained.

@versuz/mcp v0.2.0

npm

MCP server for Claude Code — expose the Versuz marketplace as native tools. Search, inspect, install, and battle 100k+ ranked SKILL.md and CLAUDE.md files inline. Daily benchmark with 3 frontier judges.

Maintained.

online-judge-scraper v0.2.0

npm

A library to extract information easily from various online judges.

Aging — last published over a year ago — check before adopting.

@dvg-os/agency-os-core v0.1.5

npm

agency-os core: MCP server, orchestrator, Minerva KG, observability, quality-gate judges. Imported by the @dvg-os/agency-os Claude Code plugin.

Maintained.

@mp3wizard/agent-skills-eval v0.1.1

npm

TypeScript SDK and CLI for evaluating agentskills.io-style AI agent skills with LLM judges, baseline comparison, YAML config, JSONL logs, and HTML reports.

Maintained.