Skip to content
/ UDante Public

Data related to the development of a new Universal Dependencies treebank containing all of Dante Alighieri's Latin works.

Notifications You must be signed in to change notification settings

CIRCSE/UDante

Repository files navigation

UDante

This repository contains the Gold Standard (GS), annotated by an UD expert, created in the context of the UDante project that has the aim of developing a new treebank containing all of Dante Alighieri’s Latin works. Texts were taken from the DanteSearch corpus: the original TEI-XML files, which were already manually lemmatised and morphologically tagged, were converted into the CoNLL-U format and then syntactically annotated using ConlluEditor.

The GS includes:

  • 33 sentences of increasing complexity used to train four annotators
  • 10 sentences per work

How to cite

Cecchini, F. M., Sprugnoli, R., Moretti, G., & Passarotti, M. (2020). UDante: First Steps Towards the Universal Dependencies Treebank of Dante’s Latin Works. In Seventh Italian Conference on Computational Linguistics (pp. 1-7). CEUR-WS. org. PDF

Sprugnoli, Rachele, Passarotti, Marco, Cecchini, Flavio Massimiliano, Pedonese, Giulia, & Moretti, Giovanni. (2023). CIRCSE/UDante: UDante in LiLa (v1.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.8435313

Funding

The LiLa: Linking Latin project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme – Grant Agreement No. 769994.

Copyright

Creative Commons Licence
UDante is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License.

About

Data related to the development of a new Universal Dependencies treebank containing all of Dante Alighieri's Latin works.

Resources

Stars

Watchers

Forks

Packages

No packages published