BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

Home Search Compare Equivalents

One search box and one honest, consistent read on every open-source library — across every ecosystem.

npmPyPIcrates.ioRubyGemsGoMavenNuGet

Discover

Tools

Compare Equivalents

Data

deps.dev OSV advisories npm registry PyPI

About

Methodology Partner with us

© 2026 Modules · A precision instrument for picking dependencies.Data refreshed continuously from public registries, deps.dev & OSV

cross-ecosystem search · live

Results for utf8-encoding

Found in 3 of 7 ecosystemsnpm 1–24 of 16,494 · 93 matches across other registries

npm16494 RubyGems6 NuGet87

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

npm matches

Showing 24 of 16,494 · JavaScript

See all npm →

utf8-encodingv0.1.2

utf8 encoder/decoder of whatwg Encoding Living Standard https://encoding.spec.whatwg.org/

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 11 years ago.

encode-utf8v2.0.0

Turn a string into an ArrayBuffer by using the UTF8 encoding.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

read-filev0.2.0

Thin wrapper around fs.readFile and fs.readFileSync that also strips byte order marks when `utf8` encoding is chosen. Also optionally replaces windows newlines with unix newlines.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 10 years ago.

@thi.ng/bencodev3.0.66

Bencode binary encoder / decoder with optional UTF8 encoding & floating point support

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

utf8-js-toolsv1.0.2

Encode/Decode text in utf8 encoding

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

unzip-mbcsv0.2.8

UnZip for non-UTF8 encoding such as cp949, sjis, gbk, euc-kr, euc-jp, and gb2312

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published over a year ago.

@nathanfaucett/utf8_encodingv0.0.1

utf8 encoding/decoding for the browser and node.js

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 9 years ago.

buffer operations

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 6 years ago.

write-file-atomicv8.0.0

Write files in an atomic fashion w/configurable ownership

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

dom-serializerv3.1.1

render domhandler DOM nodes to a string

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

A well-tested UTF-8 encoder/decoder written in JavaScript.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 8 years ago.

@protobufjs/utf8v1.1.1

A minimal UTF8 implementation for number arrays.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@borewit/text-codecv0.2.2

Text Decoder

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

text-decoderv1.2.7

Streaming text decoder that preserves multibyte Unicode characters

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

utf8-codecv1.0.0

utf8 to/from bytes codec (esm/cjs)

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

A high-performance string compression library

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 3 years ago.

pvtsutilsv1.3.6

pvtsutils is a set of common utility functions used in various Peculiar Ventures TypeScript based projects.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published over a year ago.

http-encodingv2.2.0

Everything you need to handle HTTP message body content-encoding

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@aws-sdk/util-utf8-browserv3.259.0

A browser UTF-8 string <-> UInt8Array converter

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 3 years ago.

@walletconnect/encodingv1.0.2

Byte encoding utils

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 3 years ago.

remove-bom-streamv2.0.0

Remove a UTF8 BOM at the start of the stream.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

fast-is-utf8v1.0.0

Fast check if buffer is UTF8 encoding

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 7 years ago.

tweetnacl-utilv0.15.1

String encoding utilitlies extracted from TweetNaCl.js

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 6 years ago.

detect-conflictv1.0.1

Small utility library that check if a new file content can be merged safely in the on-disk existing file.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 9 years ago.

1 2 3 4 5…688

RubyGems matches

6 matches · Ruby

string_utf8v0.1.1

Convert a string's encoding to utf8, whithout caring which encoding used before converting.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 12 years ago.

utf8_converterv0.1.1

This gem attempts to convert the received text to UTF8. It works by trying to convert the given text with a list of possible common encodings. This is useful if the developer knows the most common encodings that the application is going to be receiving, leaving the guessing work to this gem and by safely converting (without crash) the received text.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 9 years ago.

subconvtrv0.0.10

Detects and converts windows-1254 encoding .srt files to utf8 encoding in the folder subconvtr runs.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 9 years ago.

embulk-decoder-remove_nonstandard_utf8_bytesv0.1.0

Decodes Remove Nonstandard Utf8 Bytes-encoded files read by other file input plugins.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 9 years ago.

== ICU4R - ICU Unicode bindings for Ruby ICU4R is an attempt to provide better Unicode support for Ruby, where it lacks for a long time. Current code is mostly rewritten string.c from Ruby 1.8.3. ICU4R is Ruby C-extension binding for ICU library[1] and provides following classes and functionality: * UString: - String-like class with internal UTF16 storage; - UCA rules for UString comparisons (<=>, casecmp); - encoding(codepage) conversion; \ - Unicode normalization; - transliteration, also rule-based; Bunch of locale-sensitive functions: - upcase/downcase; - string collation; \ - string search; - iterators over text line/word/char/sentence breaks; \ - message formatting (number/currency/string/time); - date and number parsing. * URegexp - unicode regular expressions. * UResourceBundle - access to resource bundles, including ICU locale data. * UCalendar - date manipulation and timezone info. * UConverter - codepage conversions API * UCollator - locale-sensitive string comparison == Install and usage > ruby extconf.rb > make && make check > make install Now, in your scripts just require 'icu4r'. To create RDoc, run > sh tools/doc.sh == Requirements To build and use ICU4R you will need GCC and ICU v3.4 libraries[2]. == Differences from Ruby String and Regexp classes === UString vs String 1. UString substring/index methods use UTF16 codeunit indexes, not code points. 2. UString supports most methods from String class. Missing methods are: capitalize, capitalize!, swapcase, swapcase! %, center, ljust, rjust chomp, chomp!, chop, chop! \ count, delete, delete!, squeeze, squeeze!, tr, tr!, tr_s, tr_s! crypt, intern, sum, unpack dump, each_byte, each_line hex, oct, to_i, to_sym reverse, reverse! succ, succ!, next, next!, upto 3. Instead of String#% method, UString#format is provided. See FORMATTING for short reference. 4. UStrings can be created via String.to_u(encoding='utf8') or global u(str,[encoding='utf8']) calls. Note that +encoding+ parameter must be value of String class. 5. There's difference between character grapheme, codepoint and codeunit. See UNICODE reports for gory details, but in short: locale dependent notion of character can be presented using more than one codepoint - base letter and combining (accents) (also possible more than one!), and each codepoint can require more than one codeunit to store (for UTF8 codeunit size is 8bit, though \ some codepoints require up to 4bytes). So, UString has normalization and locale dependent break iterators. 6. Currently UString doesn't include Enumerable module. 7. UString index/[] methods which accept URegexp, throw exception if Regexp passed. 8. UString#<=>, UString#casecmp use UCA rules. === URegexp UString uses ICU regexp library. Pattern syntax is described in [./docs/UNICODE_REGEXPS] and ICU docs. There are some differences between processing in Ruby Regexp and URegexp: 1. When UString#sub, UString#gsub are called with block, special vars ($~, $&, $1, ...) aren't set, as their values are processed through deep ruby core code. Instead, block receives UMatch object, which is essentially immutable array of matching groups: "test".u.gsub(ure("(e)(.)")) do |match| \ puts match[0] # => 'es' <--> $& puts match[1] # => 'e' \ <--> $1 puts match[2] # => 's' <--> $2 end 2. In URegexp search pattern backreferences are in form \n (\1, \2, ...), in replacement string - in form $1, $2, ... NOTE: URegexp considers char to be a digit NOT ONLY ASCII (0x0030-0x0039), but any Unicode char, which has property Decimal digit number (Nd), e.g.: a = [?$, 0x1D7D9].pack("U*").u * 2 puts a.inspect_names <U000024>DOLLAR SIGN <U01D7D9>MATHEMATICAL DOUBLE-STRUCK DIGIT ONE <U000024>DOLLAR SIGN <U01D7D9>MATHEMATICAL DOUBLE-STRUCK DIGIT ONE puts "abracadabra".u.gsub(/(b)/.U, a) abbracadabbra \ 3. One can create URegexp using global Kernel#ure function, Regexp#U, Regexp#to_u, or from UString using URegexp.new, e.g: /pattern/.U =~ "string".u 4. There are differences about Regexp and URegexp multiline matching options: t = "text\ntest" # ^,$ handling : URegexp multiline <-> Ruby default t.u =~ ure('^\w+$', URegexp::MULTILINE) => #<UMatch:0xf6f7de04 @ranges=[0..3], @cg=[\u0074\u0065\u0078\u0074]> t =~ /^\w+$/ => 0 # . matches \n : URegexp DOTALL <-> /m t.u =~ ure('.+test', URegexp::DOTALL) \ => #<UMatch:0xf6fa4d88 ... t.u =~ /.+test/m 5. UMatch.range(idx) returns range for capturing group idx. This range is in codeunits. === References 1. ICU Official Homepage http://ibm.com/software/globalization/icu/ 2. ICU downloads \ http://ibm.com/software/globalization/icu/downloads.jsp 3. ICU Home Page http://icu.sf.net 4. Unicode Home Page http://www.unicode.org ==== BUGS, DOCS, TO DO The code is slow and inefficient yet, is still highly experimental, so can have many security and memory leaks, bugs, inconsistent documentation, incomplete test suite. Use it at your own risk. Bug reports and feature requests are welcome :) === Copying This extension module is copyrighted free software by Nikolai Lugovoi. You can redistribute it and/or modify it under the terms of MIT License. Nikolai Lugovoi <meadow.nnick@gmail.com>

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 14 years ago.

has_unique_three_letter_codev0.0.1

Assigns a case-insensitive unique three-letter code to each record in a scope, based loosely on some other attribute of the record

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 12 years ago.

NuGet matches

Showing 12 of 87 · .NET

See all NuGet →

system.text.encoding.extensionsv4.3.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 9 years ago.

runtime.any.system.text.encoding.extensionsv4.3.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 9 years ago.

runtime.aot.system.text.encoding.extensionsv4.3.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 9 years ago.

s22.imapwithutf8v3.6.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 12 years ago.

unicode.netv2.0.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

imageresizer.plugins.prettygifsv4.2.8

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

magicfileencodingv4.0.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

retyped.text-encoding-utf-8v1.0.6733

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 7 years ago.

utf8-stringv0.1.2

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

nstack.corev1.1.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

lindexi.src.encodingutf8andgbkdifferentiaterv1.0.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 7 years ago.

myitian.text.modifiedutf8encodingv1.2.2

No description provided.

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.