Spell Correct

Corpus

Name Description Size License Creator Download
VISTEC-TP-TH-21 The largest social media domain datasets for Thai text processing (word segmentation, misspell correction and detection, and named-entity boundary) called "VISTEC-TP-TH-2021" or VISTEC-2021. 49,997 sentences with 3.39M words CC-BY-SA 3.0 VISTEC & Chiang Mai University GitHub

Software

Name Description Status Language License
Hunspell Hunspell is the spell checker of LibreOffice, OpenOffice.org, Mozilla Firefox 3 & Thunderbird, Google Chrome. active C/C++ GNU Lesser General Public License and Mozilla Public License
PyThaiNLP It's part of PyThaiNLP. active Python 3.X Apache License 2.0
khanaa Khanaa is a tool to make spelling Thai more convenient. active Python 3.X MIT license