Remove diacritics from letters, for example when standardizing input for a search
A lightweight library for removing hebrew diacritics
Ultra-fast, zero-copy text normalization for Rust NLP pipelines & tokenizers
No Diacr!
A lightweight library for removing Arabic diacritics
A Rust library for Arabic text processing and manipulation
A Rust crate to facilitate working with Ancient Greek words
Person matcher for healthcare information exchange: deterministic and probabilistic matching with multinational national identifiers (UK United Kingdom National Health Service Number / FR NIR / ES TSI / IE IHI / UK NI H&C / US SSN), E.164 phone normalisation, address parsing, nickname dictionary, email scoring, and explainable per-field breakdowns.
A Rust crate to remove accents from strings, inspired by PostgreSQL's unaccent extension.
Worker matcher for healthcare information exchange: deterministic and probabilistic matching with multinational national identifiers (UK NHS / FR NIR / ES TSI / IE IHI / UK NI H&C / US SSN), E.164 phone normalisation, address parsing, nickname dictionary, email scoring, and explainable per-field breakdowns.
A library to convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones.
A Rust crate for converting Mandarin Chinese to Pinyin.
Replace characters in an existing string with similar-looking diacritical ones (e.g. vowels with dots over them, characters with accents on them, etc.). This allows you to create strings that have different characters in them, but still keep the string human readable.
Support diacritics in String
A small library for converting accented characters to their ASCII equivalents.
Jekyll plugin to substitute combining diacritical marks
Gem extends some of the Ruby String class methods, such as #downcase and #upcase, by handling Polish diacritics. It also adds #no_pl method.
Add "String#accent_buster" that replaces diacritics marks by their non-diacritic equivalents.
Detects profane words using multiple strategies, including similarities, diacritics(sound alterations), constructions (multi-part), injections and unicode.
Determines which letter in a Vietnamese word should receive accent marks, and provides IME input methods (VNI) for typing Vietnamese.
Polish extensions for Ruby 1.9 String #upcase(!), #downcase(!), #capitalize(!) and #<=> supporting polish diacritics
NoAccent is a Ruby gem designed to remove diacritic accents from text, providing cleaner and simpler text processing.