Spell Correct
Corpus
Name | Description | Size | License | Creator | Download |
---|---|---|---|---|---|
VISTEC-TP-TH-21 | The largest social media domain datasets for Thai text processing (word segmentation, misspell correction and detection, and named-entity boundary) called "VISTEC-TP-TH-2021" or VISTEC-2021. | 49,997 sentences with 3.39M words | CC-BY-SA 3.0 | VISTEC & Chiang Mai University | GitHub |
Software
Name | Description | Status | Language | License |
---|---|---|---|---|
Hunspell | Hunspell is the spell checker of LibreOffice, OpenOffice.org, Mozilla Firefox 3 & Thunderbird, Google Chrome. | active | C/C++ | GNU Lesser General Public License and Mozilla Public License |
PyThaiNLP | It's part of PyThaiNLP. | active | Python 3.X | Apache License 2.0 |
khanaa | Khanaa is a tool to make spelling Thai more convenient. | active | Python 3.X | MIT license |