Results for tokenize-words

Abandoned. Last published 3 years ago.

Break down text into array of words.

@alcalzone/ansi-tokenizev0.3.0

Efficiently modify strings containing ANSI escape codes

css-tokenizev1.0.1

Abandoned. Last published 11 years ago.

Transform stream that tokenizes CSS

@stdlib/string-base-format-tokenizev0.2.4

Tokenize a string into an array of string parts and format identifier objects.

string-punctuation-tokenizerv2.2.0

Small library that provides functions to tokenize a string into an array of words with or without punctuation

scss-tokenizerv0.4.3

Abandoned. Last published 3 years ago.

A tokenzier for Sass' SCSS syntax

html-tokenizev2.0.1

Abandoned. Last published 6 years ago.

transform stream to tokenize html

@babel/plugin-transform-reserved-wordsv7.29.7

Ensure that no reserved words are used.

wink-tokenizerv5.3.0

Multilingual tokenizer that automatically tags each token with its type

simple-html-tokenizerv0.5.11

Abandoned. Last published 5 years ago.

Simple HTML Tokenizer is a lightweight JavaScript library that can be used to tokenize the kind of HTML normally found in templates.

@stdlib/nlp-tokenizev0.2.3

Tokenize a string.

@csstools/css-color-parserv4.1.1

Parse CSS color values

@csstools/css-tokenizerv4.0.0

Tokenize CSS

change-casev5.4.4

Abandoned. Last published 2 years ago.

Transform a string between `camelCase`, `PascalCase`, `Capital Case`, `snake_case`, `kebab-case`, `CONSTANT_CASE` and others

tokenize-whitespacev0.0.1

Abandoned. Last published 10 years ago.

Tokenize a string into words and whitespace tokens

libmimev5.3.8

Encode and decode quoted printable and base64 strings

Tiny Casing utils

kuromojinv3.0.1

Aging — last published over a year ago — check before adopting.

Provide a high level wrapper for kuromoji.js

tokenize-commentv3.0.1

Abandoned. Last published 7 years ago.

Uses snapdragon to tokenize a single JavaScript block comment into an object, with description, tags, and code example sections that can be passed to any other comment parsers for further parsing.

excel-formula-tokenizerv3.0.0

Tokenize Excel formulas

toidentifierv1.0.1

Convert a string of words to a JavaScript identifier

@materializeinc/sql-lexerv26.27.0

The lexer for Materialize's SQL dialect, with wasm build targets.

@stdlib/number-float64-base-to-wordsv0.2.3

Split a double-precision floating-point number into a higher order word and a lower order word.

@stdlib/number-float64-base-from-wordsv0.2.3

Create a double-precision floating-point number from a higher order word and a lower order word.

RubyGems matches

9 matches · Ruby

word-tokensv0.0.1

Abandoned. Last published 15 years ago.

Generates tokens consisting of readable words from your system dictionary

freqlv0.1.0

Abandoned. Last published 2 years ago.

Right now all we do is convert fpmw to zipf and other units.

tokenkitv0.1.0.pre.1

Aging — last published 8 months ago — check before adopting.

TokenKit provides lightweight, Unicode-aware word-level tokenization with pattern preservation, backed by Rust for performance.

thailang4rv0.1.0

Abandoned. Last published 5 years ago.

Thai language tools for Ruby, i.e. a word tokenizer, a character level indentifier, and a romanization tool

filtrav0.0.2

Abandoned. Last published 10 years ago.

Filtra filters an array of tokens or words so they can be indexed by Busca, the simple redis search

textokenv1.2.1

Abandoned. Last published 7 years ago.

Textoken is a Ruby library for text tokenization. This gem extracts words from text with many customizations. It can be used in many fields like Web Crawling and Natural Language Processing.

ruby_tokenizerv0.1.3

Abandoned. Last published 10 years ago.

RubyTokenizer is a simple language processing command-line tool. It performs low-level tokenization and returns the top 10 most frequent words in a body of text. At the moment it's only available for English texts and it segments words by filtering whitespaces, punctuation marks, parantheses and other special characters.

dubsv0.1.5

Maintained. Niche but maintained, actively maintained.

Generate random names from themed word lists (Gundam, Star Trek, Star Wars, Transformers, and more) with configurable patterns and token formats.

jekyll-related-postsv0.1.2