🤗 Tokenizers.js: A pure JS/TS implementation of today's most used tokenizers
a lightweight no-dependency fork of transformers.js (only tokenizers)
Multi-arch builds of HuggingFace tokenizers
Multi-arch builds of HuggingFace tokenizers
Additional tokenizers for Orama
| [NPM Package](https://www.npmjs.com/package/@mlc-ai/web-tokenizers) | [WebLLM](https://github.com/mlc-ai/web-llm) |
This [remark][remark] plugin can disable any or all remark `blockTokenizers` and `inlineTokenizers`. It can not only disable the ones provided by remark core, but also any other tokenizer that has been added to the remark parser whether through plugins or
Multi-arch builds of HuggingFace tokenizers
Multi-arch builds of HuggingFace tokenizers
NodeJS implementation of @Qdrant/fastembed
Optimised tokenizer/lexer generator! 🐄 Much performance. Moo!
JavaScript port of tiktoken
gemma3 tokenizer for NodeJS/Browser
Template project for writing node package with napi-rs
Multi-arch builds of HuggingFace tokenizers
Fork of `tokenizers`
<!-- :begin use tokenizer/banner -->
<!-- :begin use tokenizer/banner -->
A promise based streaming tokenizer
<!-- :begin use tokenizer/banner -->
<!-- :begin use tokenizer/banner -->
<!-- :begin use tokenizer/banner -->
<!-- :begin use tokenizer/banner -->
<!-- :begin use tokenizer/banner -->