A rule-based sentence_segmenter, inspired by the Ruby pragmatic_segmenter by diasks2 (repo: https://github.com/diasks2/pragmatic_segmenter). Now with optional AI-based Thai support.
Sentence segmentation library with wide language support, optimized for speed and utility.
Sentence segmentation and word tokenization tools
Rust port of pySBD v3.1.0.
Vietnamese NLP library — tokenization, normalization, segmentation
Rule-based sentence segmentation library.
Unicode line breaking and text segmentation algorithms for text boundary analysis
Vietnamese sentence segmentation for vn-nlp
Litsea is an extremely compact word segmentation and model training tool implemented in Rust.
Litsea is an extremely compact word segmentation and model training tool implemented in Rust.
Command-line interface for the kham Thai word segmenter
Pure Rust Thai word segmentation engine — no_std compatible
No description provided.