Extract text from HTML. Excludes content from metadata tags by default.
A light-weight module that brings Fetch API to node.js
extracts CSS into separate files
A Webpack plugin to optimize \ minimize CSS assets.
Extract and inline critical css with emotion for server side rendering.
A light-weight module that brings window.fetch to node.js
webpack loader to extract HTML and CSS from the bundle
Advanced html to plain text converter
Extract the CSS from an HTML document.
PDF extraction and rendering across all JavaScript runtimes
Compare strings containing a mix of letters and numbers in the way a human being would in sort order.
Remove reply quotations from emails
A light-weight module that brings Fetch API to node.js
A collection of utilities for emojis
A CSS Modules transform to extract local aliases for inline imports
Vanilla-extract integration for capsize
unzip a zip file into a directory using 100% javascript
Highly configurable, well-tested, JavaScript-based HTML minifier.
HTTP content negotiation
Manipulate TopoJSON and convert it to GeoJSON.
TypeScript definitions for html-to-text
Extract urls from a string and returns an array
Visualize flow between nodes in a directed acyclic network.
Extract text from pdfs that contain searchable pdf text
Deba takes a HTML document or fragment and extracts the textual content into a plaintext format that is easy for humans to read.
This module is to extract the text from web page(html).
This is a ChupaText decomposer plugin for to extract text and meta-data from HTML. You can use `html` decomposer.
A tool for extracting and replacing URLs from inside a block of text or HTML.
A simple Ruby API for extracting contact data, such as emails, addresses, and phone numbers from text documents and hyperlinks. Also has the ability to save the extracted data as JSON objects and files. For more info, see https://github.com/jweinst1/ContactDetective
Provides methods to extract texts from various file formats like Microsoft Office (<= 2002, as well as >= 2007,) PDF and HTML.
Generate HTML tables which popular spreadsheet software packages know how to read
Given a HTML formatted string, escapement will extract descendant tags into a device agnostic attributes array that can be used for formatting the text anywhere.
Pismo extracts and retrieves content-related metadata from HTML pages - you can use the resulting data in an organized way, such as a summary/first paragraph, body text, keywords, RSS feed URL, favicon, etc.
Kreuzberg is a high-performance document intelligence library with a Rust core and native Ruby bindings via Magnus. Extract text, metadata, and structured data from 75+ file formats including PDF, DOCX, PPTX, XLSX, HTML, RTF, images (with OCR), email, archives, and more. Features async/sync APIs, text chunking, language detection, and keyword extraction.
Pismo extracts and retrieves content-related metadata from HTML pages - you can use the resulting data in an organized way, such as a summary/first paragraph, body text, keywords, RSS feed URL, favicon, etc.
Extract structured information from text with source grounding, deterministic serialization, and HTML visualization.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.