Coreference resolution

Corpus

Name Description Size License Creator Download
Han-Coref 🪿 Han-Coref: Thai Coreference resolution by PyThaiNLP 1,338 documents CC BY 3.0 PyThaiNLP GitHub
ThaiCoref ThaiCoref, a dataset for Thai coreference resolution. Our dataset comprises 777,271 tokens, 44,082 mentions and 10,429 entities across four text genres: university essays, newspapers, speeches, and Wikipedia. Chulalongkorn University GitHub
Z-coref Z-coref: Thai Coreference and Zero Pronoun Resolution 1,338 documents Apache-2.0 license Poomphob Suwannapichat and Sansiri Tarnpradab and Santitham Prom-on GitHub