One search box and one honest, consistent read on every open-source library — across every ecosystem.

npm · PyPI · crates.io · RubyGems · Go · Maven · NuGet

© 2026 Modules · A precision instrument for picking dependencies. Data refreshed continuously from public registries, deps.dev & OSV.

cross-ecosystem search · live

Results for sentence-tokenizer

Found in 3 of 7 ecosystems. npm: 1–24 of 15,398 · 12 matches across other registries

npm: 15,398 · RubyGems: 5 · NuGet: 7

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only.
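The split between free-text and exact-name ecosystems can be pictured with a small sketch. This is only an illustration of the note above, not the site's implementation: the `SearchMode` enum, the `SEARCH_MODES` table, and the `matches` helper are hypothetical names, and the "any query term appears in the name or description" rule is an assumed stand-in for whatever ranking each registry actually applies.

```python
from enum import Enum


class SearchMode(Enum):
    FREE_TEXT = "free-text"
    EXACT_NAME = "exact-name"


# Mapping taken from the "How we search" note; the structure itself is hypothetical.
SEARCH_MODES = {
    "npm": SearchMode.FREE_TEXT,
    "crates.io": SearchMode.FREE_TEXT,
    "RubyGems": SearchMode.FREE_TEXT,
    "NuGet": SearchMode.FREE_TEXT,
    "Maven": SearchMode.FREE_TEXT,
    "PyPI": SearchMode.EXACT_NAME,
    "Go": SearchMode.EXACT_NAME,
}


def matches(ecosystem: str, query: str, name: str, description: str = "") -> bool:
    """Return True if a package would count as a hit in this toy model."""
    mode = SEARCH_MODES[ecosystem]
    q = query.lower()
    if mode is SearchMode.EXACT_NAME:
        # Exact-name lookup: only a package with exactly this name is found.
        return name.lower() == q
    # Assumed free-text rule: any query term occurs in the name or description.
    haystack = f"{name} {description}".lower()
    terms = q.replace("-", " ").split()
    return any(term in haystack for term in terms)
```

Under this toy rule, a query such as `sentence-tokenizer` matches any npm package whose name or description mentions "tokenizer", which is consistent with the long npm result list, while PyPI and Go return a hit only when a package with that exact name exists.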

npm matches

Showing 24 of 15,398 · JavaScript

sentence-tokenizer v1.0.1

Tokenize paragraphs into sentences, and smaller tokens.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 7 years ago.

A port of NLTK's Punkt sentence tokenizer to JS.

Maintenance: Aging

Popularity: Unknown

Aging: last published 9 months ago; check before adopting.

wakachigaki v1.3.2

Minimal Japanese sentence tokenizer written in 100% pure TypeScript.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 4 years ago.

English word and sentence tokenizer, for natural language processing.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 9 years ago.

@csstools/css-tokenizer v4.0.0

Tokenize CSS

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

A promise-based streaming tokenizer

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

@tokenizer/inflate v0.4.1

Tokenized zip support

Maintenance: Aging

Popularity: Unknown

Aging: last published 7 months ago; check before adopting.

wink-tokenizer v5.3.0

Multilingual tokenizer that automatically tags each token with its type

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 4 years ago.

@tokenizer/token v0.3.0

TypeScript definition for strtok3 token

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 4 years ago.

@csstools/css-parser-algorithms v4.0.0

Algorithms to help you parse CSS from an array of tokens.

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

Split text into sentences with Sentence Boundary Detection (SBD).

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 5 years ago.

css-selector-tokenizer v0.8.0

Parses and stringifies CSS selectors

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 5 years ago.

gpt-tokenizer v3.4.0

A pure JavaScript implementation of a BPE tokenizer (Encoder/Decoder) for GPT-2 / GPT-3 / GPT-4 and other OpenAI models

Maintenance: Aging

Popularity: Unknown

Aging: last published 7 months ago; check before adopting.

@csstools/css-calc v3.2.1

Solve CSS math expressions

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

scss-tokenizer v0.4.3

A tokenizer for Sass' SCSS syntax

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 3 years ago.

simple-html-tokenizer v0.5.11

Simple HTML Tokenizer is a lightweight JavaScript library that can be used to tokenize the kind of HTML normally found in templates.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 5 years ago.

prosemirror-markdown v1.13.4

ProseMirror Markdown integration

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

@jscpd/tokenizer v4.2.4

tokenizer of source code for jscpd

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

token-types v6.1.2

Common token types for decoding and encoding numeric and string values

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

llama-tokenizer-js v1.2.2

JS tokenizer for LLaMA-based LLMs

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published over a year ago.

ja-sentence v1.0.2

Light-weight sentence tokenizer for Japanese.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 4 years ago.

glsl-tokenizer v2.1.5

r/w stream of glsl tokens

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 7 years ago.

@anthropic-ai/tokenizer v0.0.4

Claude tokenizer

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 2 years ago.

Tokenizes a string that represents a regular expression.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 3 years ago.

RubyGems matches

5 matches · Ruby

llt-tokenizer v0.0.8

LLT's Tokenizer

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 11 years ago.

tactful_tokenizer v0.0.5

TactfulTokenizer uses a naive Bayesian model trained on the Brown and WSJ corpora to provide high-quality sentence tokenization.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 12 years ago.

tokeneyes v0.1.1

A simple string tokenizer designed to capture punctuation and sentence flow information.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 6 years ago.

ms_paraphrase v0.0.1

Provides a connectivity wrapper around the Microsoft Paraphrase API. Handles token management and paraphrasing of sentences.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 11 years ago.

chunker-ruby v0.2.0

Multiple chunking strategies to split documents into optimal pieces for embedding and vector search. Supports character, recursive, sentence, markdown, HTML, code, token, and semantic splitting.

Maintenance: Healthy

Popularity: Niche

Niche but actively maintained.

NuGet matches

7 matches · .NET

sentencepiecetokenizer v0.1.6

No description provided.

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

allminilml6v2sharp v0.0.3

No description provided.

Maintenance: Aging

Popularity: Unknown

Aging: last published over a year ago; check before adopting.

opennlp.net v1.9.4.1

No description provided.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 3 years ago.

sentencetransformerscsharp v1.0.4

No description provided.

Maintenance: Aging

Popularity: Unknown

Aging: last published 8 months ago; check before adopting.

zemberekdotnet.tokenization v0.19.5

No description provided.

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

allmpnetbasev2sharp v0.1.3

No description provided.

Maintenance: Healthy

Popularity: Unknown

Actively maintained.

e5embedding.net v2.0.2

No description provided.

Maintenance: Healthy

Popularity: Unknown

Actively maintained.