This crate detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39.
RFC 5321/5322/6531 email address parser, validator, and normalizer. Subaddress extraction, provider-aware normalization, PSL domain validation, anti-homoglyph protection.
Drop-in Rust sidecar that wraps any MCP server: scans tool calls for prompt injection (UTS-39 confusable-folding + Unicode strip + Aho+Regex), validates Ed25519 manifest signatures with TOFU keystore + Sigstore Rekor bridge, strips loader-class env keys (LD_PRELOAD, NODE_OPTIONS, ...) from spawned children, detects tools/list schema drift (Layer 7 closes rug-pull / silent-redefinition class with NFKC-canonical fingerprint + bidirectional inbound/outbound gate + BLAKE3 or SHA-256 backend for FIPS/PCI/HIPAA + optional JCS RFC 8785 canonicalisation + _meta.fingerprint cross-tool audit injection), serves a fully-wired rmcp 1.5 control-plane behind the rmcp-control feature (MCP spec 2025-11-25, closes CVE-2026-42559 transitively), exports OTLP gRPC telemetry, blocks marketplace-poisoning vectors. <5ms p99 overhead.
Text fingerprinting: MinHash + LSH, SimHash, and ONNX semantic embeddings
Fast string-distance CLI for typosquatting and homoglyph detection (Levenshtein, Damerau-Levenshtein, UTS#39 confusable-weighted).
Human-readable formatting of dates, times, and time zones in hundreds of locales
SIMD-accelerated Unicode normalization (NFC, NFD, NFKC, NFKD)
Reverse-engineers a diff on a rendered minijinja template back into template-level and variable-level changes via shadow rendering + LCS skeleton alignment.
CLI for burgertocow: reverse-engineers diffs on rendered minijinja templates back into template or variable changes.
Tiny transformer inference for the web. One file. No build step.
Unicode confusable character detection and canonicalization
A terminal typing game where the answer is never shown before you type it