A lightweight implementation of the Unicode Text Segmentation (UAX #29)
Polyfill for Intl.Segmenter
WebVTT parser, compiler, and segmenter with HLS support
This repo builds .wasm module using icu4c for breaking text into words, so that we can polyfill [Intl Segmenter Proposal](https://github.com/tc39/proposal-intl-segmenter) with full compatibility, even on browsers that do not expose v8BreakIterator api.
Super compact Japanese tokenizer in Javascript. http://chasen.org/~taku/software/TinySegmenter/
segments Bluesky's rich text facets into tokens
TypeScript definitions for tiny-segmenter
Pretrained body segmentation model
A polyfill for Intl.Segmenter
Split a string in to sentences. Supports multiple languages.
[](https://www.npmjs.com/package/@knaw-huc/text-annotation-segmenter)
Lao word segmenter using maximal matching with a 34k-word dictionary — works in Node.js and browsers
Lightweight Japanese word segmenter
Components ui
Rule-based sentence segmentation library; TS port of sentencex
Identify objects in an image, additionally assigning each pixel of the image to a particular object.
A small chunk segmenter.
A high-performance wrapper around `Intl.Segmenter` for efficient text segmentation. This class resolves memory handling issues seen with large strings and can enhance performance by 50-500x. Only ~70 loc (with comments) and no dependencies.
Slice a line whenever it intersects other features
Clause segmentation extension for GLOST - segments sentences into clauses
Extensible utilities for predictably bucketing data (A/B testing, etc)
leany-tunnel exposes your localhost to the world for easy testing and sharing! No need to mess with DNS or deploy just to have others test out your changes.
JavaScript version of GPAC's MP4Box tool
`data-segmenter` is a tool that allows package consumers to define segments from their data regardless of data source like MongoDB or SQL in the backend and provide those segments to a client consumer or user in the frontend.