Optical Character Recognition
Corpus
Name | Description | Size | License | Creator | Download |
---|---|---|---|---|---|
KVIS Thai OCR Dataset | Offline Thai Handwritten Character Dataset | CC BY 4.0 | John Joseph, Ferdin Joe | Website | |
Thai OCR | Thai ocr dataset from NECTEC | Training set: 81,100 image | CC BY-SA-NC 3.0 | NECTEC | aiforthai (registration required) |
Thai handwriting number dataset | Create Thai handwriting number dataset | MIT | @kittinan | GitHub |
Software
Name | Description | Status | Language | License |
---|---|---|---|---|
Tesseract OCR | Tesseract Open Source OCR Engine | active | C/C++ | Apache License 2.0 |
Easy OCR | Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai. | active | Python 3.X | Apache License 2.0 |
Thai National Document Optical Character Recognition (THND OCR) | Tesseract OCR tools for read Thai National Document used TH Sarabun National Font trained and fine-tuned. Read README.md to see about my process. | active | Python 3.X |