Polyglot is a natural language pipeline that supports massive multilingual applications.
Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation (ACL 2020 Findings) Stacked Ense...
Thai Word Segmentation using TCC + Bidirectional RNNs
Python binding for nlpO3 Thai language processing library
Standalone Dictionary-based, Maximum Matching + Thai Character Cluster (newmm) tokenizer extracted f...
A library for searching and analyzing Thai data
Thai abbreviation to full text library
Tokenizer POS-tagger and Dependency-parser with BERT/RoBERTa/DeBERTa models for Japanese and other l...
Fast and Reasonably Accurate Word Tokenizer for Thai
Provides script conversion (a.k.a transliteration) between various scripts