A JavaScript implementation of Charikar's hash for identifying similar documents.
Calculate the simhash value for a list of tokens
Command-line tool that compares two text files using simhash
A TypeScript implementation of the Simhash algorithm for near-duplicate detection
Simhash implementation for detecting near-duplicate text using various hash functions like SipHash, MD5, and SHA256
SimHash implementation for detecting near-duplicate text using SipHash-2- function
Simhash implementation for detecting near-duplicate text using various hash functions like SipHash, MD5, and SHA256
Multi-layer text deduplication using SimHash, N-gram containment, and sentence-sequence LCS for Tiptap/ProseMirror documents
Complete string distance and similarity algorithms package with WebAssembly and JavaScript implementations
JavaScript implementation of the `simhash` algorithm, which Google has used to detect near-duplicates across massive collections of web pages
A TypeScript npm package for finding near duplicate string pairs
Information-theoretic context optimization MCP server with deterministic hallucination detection and suppression for AI coding agents.
Array.isArray for older browsers
CLI arguments parser. Native port of Python's argparse.
Get the command from a shebang
Callback wrapping utility
Resolve the path of a module like `require.resolve()` but from a given path
JSON parse & stringify that supports binary via bops & base64
Run a function exactly one time
Allows users to use generators in order to write common functions that can be both sync or async.
ECMAScript AST recursive visitor
deterministic JSON.stringify() with custom sorting to get deterministic hashes from stringified results, with no public domain dependencies
process.nextTick but always with args
deterministic `JSON.stringify()` - a faster version of substack's json-stable-stringify without jsonify