GrETEL 4 Afrikaans

This website hosts GrETEL for Afrikaans, a search engine for the Afrikaans NCHLT treebank. The construction of the treebank and the adaptation of GrETEL were part of the AfriBooms project.

What is GrETEL?

GrETEL stands for Greedy Extraction of Trees for Empirical Linguistics. It is a user-friendly search engine for the exploitation of treebanks. It comes in two formats:

Example-based search

In this search mode you can use a natural language example as a starting point for searching a treebank [?] with limited knowledge about tree representations and formal query languages. [?]

XPath search

In this search mode you have to build the XPath query yourself. We strongly recommend to use the XPath search tool only when you are an experienced XPath user!

Please cite the following paper if you are using GrETEL for Afrikaans for your research:

Liesbeth Augustinus, Peter Dirix, Daniel van Niekerk, Ineke Schuurman, Vincent Vandeghinste, Frank Van Eynde, and Gerhard van Huyssteen (2016). "AfriBooms: An Online Treebank for Afrikaans." In: Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis (eds.) Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC-2016). Portoro┼ż, Slovenia. pp. 677-682.


The purpose of this website is to provide access to annotated text corpora for linguistic research.
GrETEL is developed at the Centre for Computational Linguistics (CCL). For questions or comments on the search tool, or if you want to become a member of our user group, you can contact us at .
The Afrikaans NCHLT treebank and the parser are developed at the Centre for Text Technology (CTexT). For comments on the linguistic annotations and/or the Afrikaans NCHLT corpus, contact Daniel van Niekerk.



Related tools



The AfriBooms project was funded by the Nederlandse Taalunie and the Department of Arts and Culture of the Government of the Republic of South Africa in the context of collaboration in the domain of speech and language technology.