Parser

Corpus

| Name | Description | Size | License | Creator | Download |
| --- | --- | --- | --- | --- | --- |
| UD Thai PUD | This is a part of the Parallel Universal Dependencies (PUD) treebanks created for the CoNLL 2017 shared task on Multilingual Parsing from Raw Text to Universal Dependencies. | 1,000 sentences | CC BY-SA 3.0 | Universal Dependencies | GitHub |
| Thai Discourse Treebank | The Thai Discourse Treebank (TDTB) at Chulalongkorn University annotates 180 documents from the LST20 corpus with 10,868 discourse relations. | 6,534 sentences | | Prasertsom, P., Jaroonpol, A., & Rutherford, A. T. | GitHub |
| TUD Treebank | Thai Universal Dependency Treebank, annotating TNC | 3,627 sentences | | nlp-chula | GitHub |
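
Universal Dependencies treebanks such as UD Thai PUD are distributed in the CoNLL-U format: one token per line with ten tab-separated columns, `#` comment lines, and a blank line between sentences. The following is a minimal sketch of reading that format with the Python standard library only. The function names `read_conllu` and `head_pairs` are hypothetical helpers introduced here, and the sample sentence is hand-written for illustration; it is not taken from any of the treebanks listed above.

```python
from typing import Dict, Iterator, List, Tuple

# The ten CoNLL-U columns, in order.
FIELDS = ["id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc"]


def read_conllu(text: str) -> Iterator[List[Dict[str, str]]]:
    """Yield each sentence as a list of token dicts (hypothetical helper)."""
    sentence: List[Dict[str, str]] = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Sentence-level comments such as "# text = ..." are skipped.
            continue
        cols = line.split("\t")
        if len(cols) != len(FIELDS):
            raise ValueError(f"expected {len(FIELDS)} columns, got {len(cols)}")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in cols[0] or "." in cols[0]:
            continue
        sentence.append(dict(zip(FIELDS, cols)))
    if sentence:
        yield sentence


def head_pairs(sentence: List[Dict[str, str]]) -> List[Tuple[str, str, str]]:
    """Return (dependent, relation, head) triples; head "0" maps to ROOT."""
    forms = {tok["id"]: tok["form"] for tok in sentence}
    return [(tok["form"], tok["deprel"], forms.get(tok["head"], "ROOT"))
            for tok in sentence]


# Hand-written illustrative sentence, NOT taken from a real treebank.
SAMPLE = "\n".join([
    "# text = ฉันกินข้าว",
    "\t".join(["1", "ฉัน", "_", "PRON", "_", "_", "2", "nsubj", "_", "SpaceAfter=No"]),
    "\t".join(["2", "กิน", "_", "VERB", "_", "_", "0", "root", "_", "SpaceAfter=No"]),
    "\t".join(["3", "ข้าว", "_", "NOUN", "_", "_", "2", "obj", "_", "_"]),
    "",
])

if __name__ == "__main__":
    for sent in read_conllu(SAMPLE):
        for dependent, relation, head in head_pairs(sent):
            print(f"{dependent} --{relation}--> {head}")
```

To read a downloaded treebank file, one would pass the file contents (opened with `encoding="utf-8"`) to `read_conllu` in place of `SAMPLE`. Annotation files that are not in CoNLL-U, such as discourse-relation annotations, would need a different reader.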

Software

| Name | Description | Status | Language | License |
| --- | --- | --- | --- | --- |
| spaCy-Thai | Tokenizer, POS tagger, and dependency parser for Thai, based on Universal Dependencies. | active | Python 3.X | MIT License |
| Link Grammar Parser | A syntactic parser based on link grammar. | active | Python 3.X | LGPL |