Provides fast access to unicode character properties
Provides fast access to unicode character properties
Basic CLDR supplemental data
A regular expression to match all Emoji-only symbols as per the Unicode Standard.
Provides fast access to unicode character properties
Data for number formatting.
Data for date/time formatting, including data for Gregorian calendar.
regular expression for matching unicode category
unicode lookup table
Translated versions of locale display name elements: languages, scripts, territories, and variants.
a javascript porting of C's wcwidth()
Compile ES2015 Unicode regex to ES5
Data for units formatting.
Detect whether the terminal supports Unicode
Compile regular expressions' unicodeSets (v) flag.
Character annotation data
Compile Unicode property escapes in Unicode regular expressions to ES5.
Parse regular expressions' unicodeSets (v) flag.
Compile ES2015 Unicode escapes to ES5
Match a Unicode property or property alias to its canonical property name per the algorithm used for RegExp Unicode property escapes in ECMAScript.
Regenerate sets for Unicode properties and values.
Character annotation data, including derived data.
JavaScript-compatible Unicode data. Arrays of code points, arrays of symbols, and regular expressions for Unicode v17.0.0’s categories, scripts, blocks, bidi, and other properties.
A JavaScript library that breaks strings into their individual user-perceived characters (including emojis!)
Fast lookup of the Unicode General Category property for char
Fast, zero-allocation pretokenizers for BPE tokenizers
Native iXML (Invisible XML) parser with left-recursion support - 76.9% spec conformance, works in Rust and WebAssembly
Work with Unicode General Category.
[Unicode 17.0.0] Determine which Unicode "General Categories" a string belongs to
[Unicode 17.0.0] Determines the monospace display width of a string using EastAsianWidth.txt, Unicode general category, Emoji specification, and other data.