Software

[WIP]

Word Segmentation

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| ICU | ICU - International Components for Unicode | active | C/C++/Java | Unicode License |
| libthai | A set of Thai language support routines intended to make it easier for developers to incorporate Thai language support in their applications. | active | C/C++ | LGPL-2.1 License |
| SWATH | Smart Word Analysis for THai | active | C/C++ | GPL-2.0 License |
| AttaCut | Fast and Reasonably Accurate Word Tokenizer for Thai. | active | Python 3.X | MIT License |
| PyThaiNLP | It's part of PyThaiNLP. | active | Python 3.X | Apache License 2.0 |
| PyWordCut | wordcutpy is a simple Thai word breaker written in Python 3+ | active | Python 3.X | LGPLv3 |
| DeepCut | A Thai word tokenization library using Deep Neural Network. | active | Python 3.X | MIT License |
| TLTK | Thai Language Toolkit | active | Python 3.X | BSD License (BSD-3-Clause) |
| KUCut | A Thai word segmenter that differs from existing segmenters such as CTTEX or SWATH. | inactive | Python 2.4-2.5 | GPL-2.0 License |
| SEFR CUT | Stacked Ensemble Filter and Refine for Word Segmentation | active | Python 3.X | MIT License |
| CutKum | Thai Word-Segmentation with LSTM in Tensorflow | - | Python 3.X | MIT License |
| ThaiLMCut | Word Tokenizer for Thai Language based on Transfer Learning and bidirectional-LSTM | active | Python 3.X | MIT License |
| LexTo | Thai word segmentation (Longest Matching) | - | Java | LGPLv2.1 |
| sertiscorp/thai-word-segmentation | Thai word segmentation with bi-directional RNN | - | Python 3.X | MIT License |
| Thai Analysis Plugin for Elasticsearch | The Thaichub2 (thai-chub-chub) Analysis Plugin integrates the Thai word segmentation modules into Elasticsearch. | active | Java | Apache-2.0 License |
| Wordcut | Thai word breaker for Node.js | active | JavaScript, Node.js | LGPLv3 |
| newmm-tokenizer | Standalone Dictionary-based, Maximum Matching + Thai Character Cluster (newmm) tokenizer extracted from PyThaiNLP. | active | Python 3.X | Apache License 2.0 |
| Stanza | Official Stanford NLP Python Library for Many Human Languages | active | Python 3.X | Apache License 2.0 |
| Multi Candidate Thai Word Segmentation | Most existing word segmentation methods output one single segmentation solution. | active | Python 3.X | MIT License |
| PhlongTaIam | PHP Thai word breaker | active | PHP | LGPL-2.1 License |
| Chamkho | Rust Thai word breaker | active | Rust | LGPL-3 License |
| oxidized-thainlp | Thai Natural Language Processing in Rust, with Python-binding. | active | Python & Rust | Apache License 2.0 |
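
Several entries above describe themselves as dictionary-based "longest matching" or "maximum matching" segmenters. As a rough illustration of that general idea only, here is a minimal sketch in plain Python. It is not the implementation of LexTo, newmm, or any other tool listed here; the function name `longest_matching` and the four-word toy dictionary are assumptions made up for this example.

```python
def longest_matching(text, dictionary):
    """Greedy left-to-right longest-match segmentation (toy sketch)."""
    # Longest dictionary entry bounds how far ahead we need to look.
    max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until a dictionary hit.
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in dictionary:
                tokens.append(text[i:j])
                i = j
                break
        else:
            # No dictionary word starts here: emit one character as a fallback.
            tokens.append(text[i])
            i += 1
    return tokens


# Hypothetical toy dictionary, chosen to show a classic ambiguity.
toy_dictionary = {"ตา", "กลม", "ตาก", "ลม"}
print(longest_matching("ตากลม", toy_dictionary))
```

With this toy dictionary the greedy strategy commits to the longest prefix "ตาก" and then "ลม", even though "ตา" + "กลม" is also a valid split. Ambiguities of this kind are one reason the table also lists statistical and neural segmenters.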

Syllable Segmentation

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| ssg | CRF syllable segmenter for Thai | active | Python 3.X | Apache License 2.0 |
| TLTK | Thai Language Toolkit | active | Python 3.X | BSD License (BSD-3-Clause) |

Sentence Segmentation

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| PyThaiNLP | It's part of PyThaiNLP. | active | Python 3.X | Apache License 2.0 |
| TLTK | Thai Language Toolkit | active | Python 3.X | BSD License (BSD-3-Clause) |
| BoydCut | Bidirectional LSTM-CNN Model for Thai Sentence Segmenter | active | Python 3.X | MIT License |

Part Of Speech

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| PyThaiNLP | It's part of PyThaiNLP. | active | Python 3.X | Apache License 2.0 |
| TLTK | Thai Language Toolkit | active | Python 3.X | BSD License (BSD-3-Clause) |

OCR

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| Tesseract OCR | Tesseract Open Source OCR Engine | active | C/C++ | Apache License 2.0 |
| Easy OCR | Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai. | active | Python 3.X | Apache License 2.0 |

Spell Check

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| Hunspell | Hunspell is the spell checker of LibreOffice, OpenOffice.org, Mozilla Firefox 3 & Thunderbird, Google Chrome. | active | C/C++ | GNU Lesser General Public License and Mozilla Public License |
| PyThaiNLP | It's part of PyThaiNLP. | active | Python 3.X | Apache License 2.0 |

Dependency parser

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| spaCy-Thai | Tokenizer, POS-tagger, and dependency-parser for Thai language, working on Universal Dependencies. | active | Python 3.X | MIT License |

Grapheme to Phoneme

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| PyThaiNLP | It's part of PyThaiNLP. | active | Python 3.X | Apache License 2.0 |
| TLTK | Thai Language Toolkit | active | Python 3.X | BSD License (BSD-3-Clause) |
| thpronun | thpronun is a program for analyzing pronunciation of Thai words. The output can be in Thai pronunciation, Romanization, or in any other phonetic systems. It is designed to be extensible. | active | C/C++ | GPL-3.0 License |
| Thai G2P | Grapheme to phoneme: dictionary-based conversion + BiLSTM seq2seq model (under construction) | active | Python 3.X | |

Named Entity Recognition

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| PyThaiNLP | It's part of PyThaiNLP. | active | Python 3.X | Apache License 2.0 |
| TLTK | Thai Language Toolkit | active | Python 3.X | BSD License (BSD-3-Clause) |

Soundex

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| PyThaiNLP | It's part of PyThaiNLP. | active | Python 3.X | Apache License 2.0 |

Text Generator

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| TTG | Thai Text Generator | active | Python 3.X | Apache License 2.0 |

Text-to-Speech

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| Thai TTS Tacotron | Thai_TTS is a project for training "Text to Speech in Thai" using Tacotron2 by NVIDIA. | active | Python 3.X | Apache License 2.0 |

Speech Emotion Recognition

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| Vistec-AIS Speech Emotion Recognition | Speech Emotion Recognition Model and Inferencing using PyTorch | active | Python 3.X | Apache License 2.0 |

Sentiment Analysis

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| thai_sentiment | A naive sentiment classification function based on NBSVM, trained on wisesight_sentiment | active | Python 3.X | Apache License 2.0 |

Machine Translation

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| Lalita Chinese-Thai Machine Translation | Chinese-Thai Machine Translation by AI Builder | active | Python 3.X | Apache License 2.0 |
| English-Thai Machine Translation Models | English-Thai Machine Translation Models by VISTEC-depa Thailand Artificial Intelligence Research Institute | active | Python 3.X | Apache License 2.0 |

Image Captioning

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| Image Captioning in Thai: AI ช่วยผู้พิการทางสายตา (AI to help the visually impaired) | Image Captioning in Thai from AI Builder https://www.facebook.com/aibuildersx/posts/175053151329799 | | Python 3.X | ? |
