Parser
Corpus
Name | Description | Size | License | Creator | Download |
---|---|---|---|---|---|
UD Thai PUD | This is a part of the Parallel Universal Dependencies (PUD) treebanks created for the CoNLL 2017 shared task on Multilingual Parsing from Raw Text to Universal Dependencies. | 1,000 sentences | CC BY-SA 3.0 | Universal Dependencies | GitHub |
Thai Discourse Treebank | The Thai Discourse Treebank (TDTB) at Chulalongkorn University annotates 180 documents from the LST20 corpus with 10,868 discourse relations. | 6,534 sentences | Prasertsom, P., Jaroonpol, A., & Rutherford, A. T. | Github | |
TUD Treebank | Thai Universal Dependency Treebank, annotating TNC | 3,627 sentences | nlp-chula | Github |
Software
Name | Description | Status | Language | License |
---|---|---|---|---|
spaCy-Thai | Tokenizer, POS-tagger, and dependency-parser for Thai language, working on Universal Dependencies. | active | Python 3.X | MIT License |
Link Grammar Parser | A syntactic parser based on link grammar | active | Python 3.X | LGPL |