unicode-segmenter for miniprogram
A lightweight implementation of the Unicode Text Segmentation (UAX #29)
Polyfill for Intl.Segmenter
This repo builds .wasm module using icu4c for breaking text into words, so that we can polyfill [Intl Segmenter Proposal](https://github.com/tc39/proposal-intl-segmenter) with full compatibility, even on browsers that do not expose v8BreakIterator api.
Detect whether the terminal supports Unicode
Type definitions for APIs of Wechat Mini Program in TypeScript
Compile ES2015 Unicode escapes to ES5
Compile Unicode property escapes in Unicode regular expressions to ES5.
Match a Unicode property or property alias to its canonical property name per the algorithm used for RegExp Unicode property escapes in ECMAScript.
Regenerate sets for Unicode properties and values.
Compile ES2015 Unicode regex to ES5
A JavaScript library that breaks strings into their individual user-perceived characters (including emojis!)
Parse regular expressions' unicodeSets (v) flag.
Match a Unicode property or property alias to its canonical property name per the algorithm used for RegExp Unicode property escapes in ECMAScript.
Compile regular expressions' unicodeSets (v) flag.
WebVTT parser, compiler, and segmenter with HLS support
Super compact Japanese tokenizer in Javascript. http://chasen.org/~taku/software/TinySegmenter/
segments Bluesky's rich text facets into tokens
The set of canonical Unicode property names supported in ECMAScript RegExp property escapes.
Unicode property alias mappings in JavaScript format for property names that are supported in ECMAScript RegExp property escapes.
TypeScript definitions for tiny-segmenter
Normalize unicode-range descriptors, and can convert to wildcard ranges.
Provides fast access to unicode character properties
The set of Unicode symbols that can serve as a base for emoji modifiers, i.e. those with the `Emoji_Modifier_Base` property set to `Yes`.