A lightweight implementation of the Unicode Text Segmentation (UAX #29)
unicode-segmenter for miniprogram
Polyfill for Intl.Segmenter
This repo builds .wasm module using icu4c for breaking text into words, so that we can polyfill [Intl Segmenter Proposal](https://github.com/tc39/proposal-intl-segmenter) with full compatibility, even on browsers that do not expose v8BreakIterator api.
Detect whether the terminal supports Unicode
Compile regular expressions' unicodeSets (v) flag.
Compile ES2015 Unicode regex to ES5
Compile ES2015 Unicode escapes to ES5
Compile Unicode property escapes in Unicode regular expressions to ES5.
Match a Unicode property or property alias to its canonical property name per the algorithm used for RegExp Unicode property escapes in ECMAScript.
Regenerate sets for Unicode properties and values.
A JavaScript library that breaks strings into their individual user-perceived characters (including emojis!)
Unicode property alias mappings in JavaScript format for property names that are supported in ECMAScript RegExp property escapes.
Parse regular expressions' unicodeSets (v) flag.
Match a Unicode property or property alias to its canonical property name per the algorithm used for RegExp Unicode property escapes in ECMAScript.
WebVTT parser, compiler, and segmenter with HLS support
Super compact Japanese tokenizer in Javascript. http://chasen.org/~taku/software/TinySegmenter/
segments Bluesky's rich text facets into tokens
The set of canonical Unicode property names supported in ECMAScript RegExp property escapes.
TypeScript definitions for tiny-segmenter
Normalize unicode-range descriptors, and can convert to wildcard ranges.
Provides fast access to unicode character properties
The set of Unicode symbols that can serve as a base for emoji modifiers, i.e. those with the `Emoji_Modifier_Base` property set to `Yes`.
Webpack loader that resolves relative paths in url() statements based on the original source file
This crate provides Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules.
Provides alternatives to BufRead's read_line & lines that stop not on newlines
Validation library
Validation library
Derive macros for garde
A fun reimplementation of hangman in rust :)
A simple Trie implementation in Rust
Laravel/AdonisJS-inspired utility helpers for the Rok ecosystem
A comprehensive Rust library for advanced string manipulation and processing.
A comprehensive Rust library for advanced string manipulation and processing.
Terminal session manager for Claude Code — browse, preview, and resume sessions with per-model cost tracking.
Retained-mode, CSS-styled terminal UI framework
Unicode annex 29 compliant word segmentation
ICU4X provides Ruby bindings for the ICU4X library, offering Unicode internationalization support including locale handling, number formatting, date/time formatting, collation, segmentation, and more.