A module for Node.js and the browser that takes in text and returns it stripped of stopwords. Includes predefined stopword lists for 62 languages and also accepts custom stopword lists as input.
TypeScript definitions for stopword
A module for creating stopword lists for any language, based on a set of documents.
Sami stopword lists (North, Lule and South Sami) for natural language processing, with code to create and refine them. Example uses include search engines and machine learning.
No description provided.
Norwegian trimmer, stemmer and stopword filter for Lunr
A module for Node.js that takes in text and returns it stripped of stopwords. Includes predefined stopword lists for 19 languages and also accepts custom stopword lists as input.
Crawler for NRK Sapmi news bulletins that will be the basis for Sami stopword lists and an example search engine for content in Sami.
Slug is a custom field for Strapi v5 that generates clean, SEO-friendly slugs with stopword trimming.
A JavaScript implementation of the Rapid Automated Keyword Extraction (RAKE) algorithm. Forked from https://github.com/sleepycat/rapid-automated-keyword-extraction
A simple package to remove stop words from text.
Core
A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.
A package to remove common stopwords from an array. It covers most languages and is optimized primarily for WorldBrain.
A simple Node.js text utility library for word/char count, stopword removal, and more.
NLP (natural language processing) for server and the browser in TypeScript. All lightweight and super-fast.
Deny list dictionaries and config data for @stll/anonymize
Amharic Language Pre-processor toolkit
Stopwords for various languages in JSON format.
Textlint rule to ensure that titles are using AP/APA style
Stopwords from popular text processing frameworks.
This is a list of common stop words in both Chinese and English.
High-performance Indonesian stemmer (Nazief-Adriani + ECS). Zero-regex, FST-powered, Rust 2024.
An open source search engine for building delightful search experiences.
Types for Typesense, generated from the OpenAPI spec
Rust port of JusText — paragraph-level boilerplate removal for HTML
A package to assess the complexity of texts using a variety of readability formulas.
Fast document indexer for finding duplicates and searching content
Trainable, modular AI engine in Rust with compile-time knowledge
Command-line interface for the kham Thai word segmenter
Pure Rust Thai word segmentation engine — no_std compatible
Common stop words in many languages
A stopword library
Small library that allows you to create a simple stopwords filter or use some based on Snowball stopwords lists
Clarifier is a stopwords library for removing common words from text
List of stopwords, handy for removing words
Linnaeus provides a redis-backed Bayesian classifier. Words are stemmed, stopwords are stopped, and redis is used to allow for persistent and concurrent training and classification.
This gem removes punctuation and, optionally, digits; filters stopwords for the chosen language ('tr', 'en' or 'fr'); stems the words; and outputs an array of words with their frequencies.