BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

Home Search Compare Equivalents

One search box and one honest, consistent read on every open-source library — across every ecosystem.

npmPyPIcrates.ioRubyGemsGoMavenNuGet

Discover

Tools

Compare Equivalents

Data

deps.dev OSV advisories npm registry PyPI

About

Methodology Partner with us

© 2026 Modules · A precision instrument for picking dependencies.Data refreshed continuously from public registries, deps.dev & OSV

cross-ecosystem search · live

Results for chunker

Found in 6 of 7 ecosystemsnpm 1–24 of 178 · 155 matches across other registries

npm178 PyPI1 crates.io134 RubyGems7 Maven6 NuGet7

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

npm matches

Showing 24 of 178 · JavaScript

See all npm →

rabin-streamv2.0.0

Streaming Rabin chunker

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

Rabin chunker for IPFS implementation in Rust

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

Chunk/split your stream without eating the splitter char.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 12 years ago.

stream-chunkerv1.2.8

A transform stream which chunks incoming data into chunkSize byte chunks

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 10 years ago.

pos-chunkerv1.3.3

A parts-of-speech chunker.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 10 years ago.

object-chunkerv1.0.1

Chunk object-mode streams

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 8 years ago.

breadchunksv0.2.1

Heading-aware, token-budgeted semantic chunker for Markdown — for RAG and embedding pipelines.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@msbayindir/rag-chunkerv3.1.0

Mistral OCR + deterministic AST chunker for RAG pipelines

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@google-labs/chunkerv0.0.1

A simple chunker for breaking up structured data into chunks, suitable for RAG

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

@orama/chunkerv0.0.3

Split large texts into chunks with a maximum number of token. Split by fixed size or by sentence.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

ielts2go-chunkerv2.0.10

IELTS2GO Video Chunker - Professional video splitting tool for educational content

MaintenanceAging

PopularityUnknown

Aging — last published 10 months ago — check before adopting.

@askdb/ragv0.2.0-beta.13

AskDB RAG layer: deterministic chunker over Schema v2, BYO embedder + vector store (in-memory, file-backed, pgvector), and an optional retriever wired into @askdb/core ask().

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

Array chunker for JavaScript.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

@zooid/corev0.7.3

zooid core: SessionRunner, Chunker, hooks, config parsing, and the Runtime/Adapter/Transport interfaces.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@meddevkit/chunkerv0.1.2

PHI-aware medical text chunker for RAG applications

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@florexlabs/docs-to-mcp-chunkerv0.2.2

Heading-aware Markdown chunking for documentation embeddings

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@sivru/searchv0.8.0

Sivru search engine — gitignore-aware walker, code-aware chunker, BM25, cosine top-k, ranking signals, on-disk cache.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@tobinbc/chunkerv1.1.1

Tiny function to split an array into chunks, returned as an array of arrays

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 6 years ago.

scout-text-chunkerv0.1.0

Scout Text Chunker provides text chunking strategies for RAG pipelines.

MaintenanceAging

PopularityUnknown

Aging — last published 7 months ago — check before adopting.

chunker-encoderv1.0.0

Chunk buffers using an arbitrary chunker

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@akiroz/size-chunker-streamv0.0.1

A NodeJS transform stream for chunking raw data into constant-size chunks. Useful for consuming raw media streams where chunk size = 1 frame.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

chunkosaurusv0.0.2

Easy to understand arbitrary data chunker

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

varak-chunkerv1.1.0

Thai legal document processing — chunking, paragraph extraction, varak segmentation

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

n8n-nodes-markdown-chunkerv0.1.2

n8n node that splits Markdown into retrieval-ready chunks with heading-aware metadata for RAG and vector stores

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

1 2 3 4 5…8

PyPI matches

Exact match · Python

Easy Chunk-Based File Structure Parser

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 12 years ago.

crates.io matches

Showing 12 of 134 · Rust

See all crates.io →

Minimalistic parallel executor

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 3 years ago.

Fast text chunking for Rust

MaintenanceAging

PopularityNiche

Aging — last published 7 months ago — check before adopting.

vectradb-chunkersv0.1.0

Chunking utilities for VectraDB in Rust

MaintenanceAging

PopularityNiche

Aging — last published 7 months ago — check before adopting.

code-chunkerv0.2.0

AST-aware code chunking and late chunking for RAG

MaintenanceDeprecated

PopularityNiche

Deprecated. Don't start a new project on this.

A high-performance, deterministic, flexible and portable zero-copy streaming Content-Defined Chunking (CDC) and hashing infrastructure library. Bytes in → Chunks & hashes out

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

synwire-chunkerv0.1.0

Tree-sitter AST-aware code chunking for Synwire semantic search

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

cdc-chunkersv0.1.3

A collection of Content Defined Chunking algorithms

MaintenanceAging

PopularityNiche

Aging — last published over a year ago — check before adopting.

goxoy-file-chunkerv0.0.3

Goxoy File Chunker splits files into equal chunks

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 2 years ago.

Retrieval-Augmented Generation for Rust Agent Development Kit (ADK-Rust) agents

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

datagram-chunkerv0.0.2

Serialize and deserialize messages in datagrams

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published over a year ago.

regex-chunkerv0.3.0

Iterate over the data in a `Read` type in a regular-expression-delimited way.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 2 years ago.

vil_chunkerv0.4.0

VIL Advanced Semantic Chunker — SIMD-optimized, zero-alloc text chunking with sentence-boundary, sliding-window, code-aware, and table strategies

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

RubyGems matches

7 matches · Ruby

Embed arbitrary data and multiple, distinct documents within ruby files.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 15 years ago.

salesforce_chunkerv1.2.2

Salesforce client and extractor designed for handling large amounts of data

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 5 years ago.

rack-chunkerv1.0.0

Middleware for chunking the body of a response

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 11 years ago.

chunker-rubyv0.2.0

Multiple chunking strategies to split documents into optimal pieces for embedding and vector search. Supports character, recursive, sentence, markdown, HTML, code, token, and semantic splitting.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

semantic_chunkerv0.6.4

A powerful tool for RAG (Retrieval-Augmented Generation) that splits text into chunks based on semantic meaning rather than just character counts. Supports sliding windows, adaptive buffering, and dynamic percentile-based thresholding.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

semantic_text_chunkerv0.2.0

Detects topic boundaries using embedding similarity to produce semantically coherent chunks from books, articles, and documents. Supports Cohere, OpenAI, and OpenRouter embedders.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

pikuri-vectordbv0.0.4

pikuri-vectordb gives a pikuri-core agent a +vectordb_search+ tool over a local document corpus — agentic search, the agent decides when to retrieve. Ships a swappable backend (a pure-Ruby +Backend::InMemory+ for teaching and a thin +Backend::Chroma+ HTTP client for persistence), a chunker, an embedder wrapper over +RubyLLM.embed+, and an optional +Reranker::LlamaServer+ that speaks +/v1/rerank+ against a cross-encoder model. Text extraction goes through +Pikuri::FileType.read_as_text+ in pikuri-core, which handles plain text / Markdown / PDF; HTML extraction is a deferred follow-up. Hosts wire the feature via +c.add_extension Pikuri::VectorDb::Extension.new(...)+ inside the +Agent.new+ block — same opt-in shape as +pikuri-tasks+ / +pikuri-skills+. The bundled +Pikuri::VectorDb::LIBRARIAN+ persona is the privilege-separated sub-agent counterpart for hosts that want recall to flow through a child rather than the parent's context. Three model endpoints in the full setup — chat (via ruby_llm), an embedder (via +RubyLLM.embed+), and an optional reranker (HTTP +/v1/rerank+). A single +llama-server+ in router mode serves all three by default, loading each cached GGUF on demand; see the gem's README for details.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

Maven matches

6 matches · Java

org.apache.ctakes:ctakes-chunker-modelsv5.0.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

org.apache.ctakes:ctakes-chunker-resv4.0.0.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

org.apache.ctakes:ctakes-chunkerv6.0.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published over a year ago.

com.github.monnetproject:translation.chunkerv1.18.4

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 8 years ago.

eu.crydee.uima.opennlp.resources:en-chunkerv1.5

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 11 years ago.

org.apache.stanbol:org.apache.stanbol.enhancer.engines.opennlp.chunkerv1.0.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 9 years ago.

NuGet matches

7 matches · .NET

opennlp.netv1.9.4.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 3 years ago.

markdownstructurechunkerv1.0.7

No description provided.

MaintenanceAging

PopularityUnknown

Aging — last published 7 months ago — check before adopting.

documentchunkerv1.0.0

No description provided.

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

graphemechunker.diffplexv1.0.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 3 years ago.

vectorsharp.chunkingv1.0.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

levelup.strategos.ontologyv2.8.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.