A lightweight implementation of the Unicode Text Segmentation (UAX #29)
Polyfill for Intl.Segmenter
This repo builds .wasm module using icu4c for breaking text into words, so that we can polyfill [Intl Segmenter Proposal](https://github.com/tc39/proposal-intl-segmenter) with full compatibility, even on browsers that do not expose v8BreakIterator api.
WebVTT parser, compiler, and segmenter with HLS support
Super compact Japanese tokenizer in Javascript. http://chasen.org/~taku/software/TinySegmenter/
segments Bluesky's rich text facets into tokens
TypeScript definitions for tiny-segmenter
Pretrained body segmentation model
A polyfill for Intl.Segmenter
Split a string in to sentences. Supports multiple languages.
[](https://www.npmjs.com/package/@knaw-huc/text-annotation-segmenter)
Lightweight Japanese word segmenter
Rule-based sentence segmentation library; TS port of sentencex
Identify objects in an image, additionally assigning each pixel of the image to a particular object.
A high-performance wrapper around `Intl.Segmenter` for efficient text segmentation. This class resolves memory handling issues seen with large strings and can enhance performance by 50-500x. Only ~70 loc (with comments) and no dependencies.
A small chunk segmenter.
Slice a line whenever it intersects other features
Clause segmentation extension for GLOST - segments sentences into clauses
`data-segmenter` is a tool that allows package consumers to define segments from their data regardless of data source like MongoDB or SQL in the backend and provide those segments to a client consumer or user in the frontend.
JavaScript version of GPAC's MP4Box tool
Extensible utilities for predictably bucketing data (A/B testing, etc)
Scaffold a ShaderPad starter project.
MP4 video file segmenter for MPEG-DASH usage, based on MP4Box
Work with grapheme, words, and sentences with small, simple, and fast API using Intl.Segmenter