BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

Home Search Compare Equivalents

One search box and one honest, consistent read on every open-source library — across every ecosystem.

npmPyPIcrates.ioRubyGemsGoMavenNuGet

Discover

Tools

Compare Equivalents

Data

deps.dev OSV advisories npm registry PyPI

About

Methodology Partner with us

© 2026 Modules · A precision instrument for picking dependencies.Data refreshed continuously from public registries, deps.dev & OSV

cross-ecosystem search · live

Results for llm-test

Found in 5 of 7 ecosystemsnpm 1–24 of 346,392 · 72 matches across other registries

npm346392 crates.io3 RubyGems12 Maven1 NuGet56

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

npm matches

Showing 24 of 346,392 · JavaScript

See all npm →

llm-testv3.68.10

.. raw:: html <iframe width="560" height="315" src="ahttps://www.youtube.com/embed/17ozSeGw-fY?si=8vbGltLVhtoMYbCT" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-pictu

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

llm-testrunner-componentsv2.0.0

A Stencil web component library for LLM test runner functionality

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@akhilan-fluxon/llm-testrunner-componentsv1.1.4

A Stencil web component library for LLM test runner functionality

MaintenanceAging

PopularityUnknown

Aging — last published 7 months ago — check before adopting.

node-llm-testv0.18.6

Generate tests to evaluate the intelligence of large language models.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

playwrightv1.60.0

A high-level API to automate web browsers

MaintenanceHealthy

PopularityTop 1%

Safe default. Widely trusted across the ecosystem, actively maintained.

@playwright/testv1.60.0

A high-level API to automate web browsers

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

llm-test-pubv1.0.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

@llm-dev-ops/test-bench-cliv0.2.0

CLI wrapper for LLM Test Bench - A production-grade framework for testing and benchmarking Large Language Models

MaintenanceAging

PopularityUnknown

Aging — last published 6 months ago — check before adopting.

llm-spend-guardv2.0.6

Enforce real-time token budgets and spending limits for OpenAI, Anthropic Claude, and Google Gemini API calls in Node.js

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@clipboard-health/playwright-reporter-llmv2.4.6

Playwright reporter that outputs structured JSON for LLM agents. Minimal console output, flat schema, easy to filter to failures.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@typia/interfacev12.1.1

Superfast runtime validators with only one line

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

ai-cost-meterv1.0.0

Lightweight, zero-dependency LLM API cost & token usage tracker for OpenAI, Anthropic, Gemini, Mistral, Groq, and DeepSeek

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

langchainv1.4.4

Typescript bindings for langchain

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@typia/utilsv12.1.1

Superfast runtime validators with only one line

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@llm-ui/reactv0.13.3

Display language model outputs in your React project.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

@llm-ui/markdownv0.13.3

[llm-ui](https://llm-ui.com) markdown block.

MaintenanceAbandoned

PopularityRising

Abandoned. Last published 2 years ago.

@mlc-ai/web-llmv0.2.84

Hardware accelerated language model chats on browsers

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@llm-ui/codev0.13.3

[llm-ui](https://llm-ui.com) code block.

MaintenanceAbandoned

PopularityRising

Abandoned. Last published 2 years ago.

promptfoov0.121.13

LLM eval & testing toolkit

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

llm-chunkv0.0.1

A super simple text splitter for LLM

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

@llm-ui/jsonv0.13.3

[llm-ui](https://llm-ui.com) JSON blocks for building custom components.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

@earendil-works/pi-agent-corev0.78.0

General-purpose agent with transport abstraction, state management, and attachment support

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

Test your LLM-powered apps with a TypeScript-native, Vitest-based eval runner. No API key required.

MaintenanceAging

PopularityUnknown

Aging — last published 7 months ago — check before adopting.

@wix-pilot/detoxv1.0.13

Detox driver for Wix Pilot usage

MaintenanceAging

PopularityUnknown

Aging — last published 11 months ago — check before adopting.

1 2 3 4 5…14433

crates.io matches

3 matches · Rust

llm-test-benchv0.1.0

A production-grade CLI for testing and benchmarking LLM applications with support for GPT-5, Claude Opus 4, Gemini 2.5, and 65+ models

MaintenanceAging

PopularityNiche

Aging — last published 7 months ago — check before adopting.

llm-test-bench-corev0.1.0

Core library for LLM Test Bench - comprehensive testing framework for Large Language Models with 65+ supported models across 14+ providers

MaintenanceAging

PopularityNiche

Aging — last published 7 months ago — check before adopting.

llm-test-bench-datasetsv0.1.0

Dataset management and utilities for LLM Test Bench - load, validate, and manage test datasets

MaintenanceAging

PopularityNiche

Aging — last published 7 months ago — check before adopting.

RubyGems matches

Exact match · Ruby

ruby_llm-testv0.2.0

Provides a RubyLLM::Provider that allows you to stub responses for testing purposes. You can stub individual responses or a sequence of responses, and you can also temporarily stub responses within a block.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

TNG (Test Next Generation) is a Rails gem that automatically generates comprehensive test files by analyzing your Ruby code using static analysis and AI. It supports models, controllers, and services with intelligent test case generation.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

dotcodegenv0.1.5

Generate tests for your code using LLMs. This gem is a CLI tool that uses OpenAI to generate test code for your code. It uses a configuration file to match files with the right test code generation instructions. It is designed to be used with Ruby on Rails, but it can be used with any codebase. It is a work in progress.

MaintenanceArchived

PopularityNiche

Archived. Source repository is archived on GitHub.

Output test code using LLM agents.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 2 years ago.

Define qualitative evaluation criteria and let an LLM judge if responses pass. Perfect for testing AI agents, comparing models, and evaluating subjective qualities.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

The Prompt testing library for LLM that allows comparing patterns of prompts.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 2 years ago.

probatio_diabolicav0.4.5

Probatio Diabolica runs custom *_spec.rb files with a DSL inspired by RSpec and supports text/image/PDF reporting.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

minitest-promptfoov0.1.4

A thin Minitest wrapper around promptfoo that brings prompt testing to Ruby projects. Test LLM prompts with a familiar Minitest-like DSL, supporting multiple providers and assertion types.

MaintenanceAging

PopularityNiche

Aging — last published 7 months ago — check before adopting.

completion-kitv0.12.0

CompletionKit is a prompt testing platform that runs as a Rails engine or a standalone app. Run prompts against real datasets, score every output with an LLM judge against criteria you define, track prompt versions, and get AI-generated improvement suggestions grounded in your actual results. Includes a web UI, REST API, and a built-in MCP server with 34 tools.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

lex-validatorv0.1.0

Fleet pipeline validation: tests, lint, security scan, adversarial LLM review

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

rubyllm-observv0.6.9

A Rails engine providing comprehensive observability for LLM-powered applications. Features include session tracking, trace analysis, prompt management, cost monitoring, and optional chat/agent testing UI (with RubyLLM integration).

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

rubric_llmv0.1.2

Provider-agnostic LLM evaluation with pluggable metrics, statistical A/B comparison, and test framework integration. Ragas for Ruby, powered by RubyLLM.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

Maven matches

1 match · Java

tech.harmonysoft:mental-mate-llm-testv2.8.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

NuGet matches

Showing 12 of 56 · .NET

See all NuGet →

foundationallm.testsv0.9.7

No description provided.

MaintenanceAging

PopularityUnknown

Aging — last published 6 months ago — check before adopting.

okeydokey.test.apploggerv1.0.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

zeromcp.testkit.xunitv0.1.3.4

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

zeromcp.testkitv0.1.3.4

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

llmprompttestingv2.0.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

llmprompttesting.anthropicv2.0.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

achieveai.lmdotnettools.lmtestutilsv1.0.33

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

novacore.agents.testingv2.1.9

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

mapache.promptevaldotnetv0.0.2

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

cisharpai.testingv0.2.2

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

rca-cliv1.0.0.19

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.